10% de réduction sur vos envois d'emailing --> CLIQUEZ ICI
Retour à l'accueil, cliquez ici
DOCUMENTATION BULL
http://support.bull.com/documentation/byproduct/soldoc_popupList?k=servers/escala&fromsso=y
http://support.bull.com/documentation/byproduct/soldoc_popupList?k=servers/ns&fromsso=y
86A101FH01.pdf 16-Dec-2011 19:33 15M
86A102ET05.pdf 16-Dec-2011 17:07 6.7M
86A102FA01.pdf 16-Dec-2011 20:45 81M
86A102FH01.pdf 16-Dec-2011 19:37 16M
86A114FG01.pdf 16-Dec-2011 19:28 20M
86A116EF01.pdf 16-Dec-2011 19:55 20M
86A117EF01.pdf 16-Dec-2011 20:03 11M
86A121EG01.pdf 16-Dec-2011 18:26 11M
86A121EM01.pdf 16-Dec-2011 16:49 9.8M
86A125EM01.pdf 16-Dec-2011 16:52 5.5M
86A129FG01.pdf 16-Dec-2011 19:14 14M
86A134EM00.pdf 16-Dec-2011 17:14 6.7M
86A134ER00.pdf 16-Dec-2011 17:04 5.8M
86A138EG01.pdf 16-Dec-2011 19:07 13M
86A141EM06.pdf 16-Dec-2011 16:49 9.5M
86A142EG00.pdf 16-Dec-2011 18:33 12M
86A150FD06.pdf 16-Dec-2011 17:18 3.8M
86A154FF03.pdf 16-Dec-2011 18:14 45M
86A156EG01.pdf 16-Dec-2011 18:38 11M
86A169FG02.pdf 16-Dec-2011 18:50 10M
86A171FA10.pdf 16-Dec-2011 17:16 4.7M
86A172EG01.pdf 16-Dec-2011 17:01 5.9M
86A182EF03.pdf 16-Dec-2011 18:04 26M
86A185ER00.pdf 16-Dec-2011 17:11 6.7M
86A189EF02.pdf 16-Dec-2011 16:57 5.0M
86A190FF03.pdf 16-Dec-2011 19:44 16M
86A194EF01.pdf 16-Dec-2011 18:20 11M
86A194EM01.pdf 16-Dec-2011 17:03 11M
86A196KX00.pdf 16-Dec-2011 20:04 3.3M
86A197KX00.pdf 16-Dec-2011 20:05 4.7M
86A198KX00.pdf 16-Dec-2011 20:02 4.6M
86A235ER00.pdf 16-Dec-2011 20:50 34M
86A254FA04.pdf 16-Dec-2011 17:19 1.4M
86A255FA04.pdf 16-Dec-2011 17:17 2.5M
86A256FA04.pdf 16-Dec-2011 17:19 3.4M
86A257FA04.pdf 16-Dec-2011 17:18 831K
86A258FA05.pdf 16-Dec-2011 17:18 464K
86A259FA04.pdf 16-Dec-2011 17:18 1.7M
86F102ET05.pdf 16-Dec-2011 17:07 8.5M
86F102FA01.pdf 16-Dec-2011 20:39 68M
86F114FG01.pdf 16-Dec-2011 19:07 33M
86F129FG01.pdf 16-Dec-2011 18:45 14M
86F134EM00.pdf 16-Dec-2011 17:10 6.2M
86F141EM06.pdf 16-Dec-2011 16:54 9.7M
86F150FD06.pdf 16-Dec-2011 17:16 3.8M
86F189FF03.pdf 16-Dec-2011 19:45 24M
86F190FF03.pdf 16-Dec-2011 19:26 28M
86F194EM01.pdf 16-Dec-2011 16:58 11M
86X116EM00.pdf 16-Dec-2011 19:00 108M
Bull ESCALA
Expandable Storage Plus 2104 Models DS4 and TS4
Translated Safety Notices
86 X1 16EM 00
ORDER REFERENCE
Bull ESCALA
Expandable Storage Plus 2104 Models DS4 and TS4
Translated Safety Notices
86 X1 16EM 00
ORDER REFERENCE
Bull ESCALA
Expandable Storage Plus 2104 Models DS4 and TS4
Translated Safety Notices
Hardware
November 2003
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 X1 16EM 00
ORDER REFERENCE
The following copyright notice protects this book under the Copyright laws of the United States of America
and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and
making derivative works.
Copyright Bull S.A. 1992, 2003
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of
this book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
AIX is a registered trademark of International Business Machines Corporation, and is being used under
licence.
UNIX is a registered trademark in the United States of America and other countries licensed exclusively through
the Open Group.
Linux is a registered trademark of Linus Torvalds.
The information in this document is subject to change without notice. Bull will not be liable for errors contained
herein, or for incidental or consequential damages in connection with the use of this material.
Contents
About
this
document
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. v
Who
should
read
this
document
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. v
Additional
information
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. v
2104
Model
DS4
and
Model
TS4
library
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. v
Related
documents
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. v
Web
sites
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. vi
How
to
send
your
comments
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. vi
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
.
.
. 1
Bulgarian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 2
Brazilian-Portuguese
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 24
Chinese,
simplified
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 47
Chinese,
traditional
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 70
Croatian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 93
Czech
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 116
Danish
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 139
Dutch
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 162
English
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 185
Finnish
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 208
French
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 231
German
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 254
Greek
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 277
Hebrew
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 300
Hungarian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 323
Japanese
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 346
Korean
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 369
Lithuanian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 392
Macedonia
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 415
Norwegian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 438
Polish
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 461
Portuguese
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 484
Romanian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 507
Russian
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 530
Slovak
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 553
Slovenia
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 576
Spanish
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 599
Swedish
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 622
Notices
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 645
Trademarks
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 646
Electronic
emission
statements
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 646
Federal
Communications
Commission
(FCC)
statement
.
.
.
.
.
.
.
. 646
Industry
Canada
compliance
statement
.
.
.
.
.
.
.
.
.
.
.
.
.
. 646
Chinese
Class
A
warning
statement
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 646
European
Community
compliance
statement
.
.
.
.
.
.
.
.
.
.
.
. 647
Germany
compliance
statement
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 647
Japanese
Voluntary
Control
Council
for
Interference
(VCCI)
class
1
statement
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 648
Korean
Government
Ministry
of
Communication
(MOC)
statement
.
.
.
. 648
Taiwan
class
A
compliance
statement
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 648
©
Copyright
IBM
Corp.
2003
iii
iv
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
About
this
document
This
publication
contains
translated
safety
notices
and
safety
inspection
procedures
for
the
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
(hereafter
referred
to
as
the
2104
Model
DS4
and
Model
TS4).
Important:
The
installation
of
this
product
is
the
responsibility
of
the
customer.
Who
should
read
this
document
This
publication
is
intended
for
people
who
operate
a
system
that
connects
one
or
more
2104
Model
DS4s
and
Model
TS4s
to
it.
Throughout
this
publication,
the
term
service
representative
applies
to
the
person
that
has
been
authorized
by
your
organization
to
maintain
your
2104
Model
DS4
and
Model
TS4.
Additional
information
This
section
contains
the
following
information:
v
A
list
of
the
documents
in
the
2104
Model
DS4
and
Model
TS4
library
v
A
list
of
the
related
documents
v
The
available
Web
sites
v
Information
about
how
to
send
your
comments
2104
Model
DS4
and
Model
TS4
library
The
following
documents
contain
information
related
to
this
product:
v
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
Hardware
Technical
Information,
86 A1 19EM
v
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
Operator’s
Guide,
86 A1 17EM
v
Expandable
Storage
Plus
2104
Model
DS4
Installation
Guide,
86 A1 14EM
v
Expandable
Storage
Plus
2104
Model
TS4
Installation
Guide,
86 A1 15EM
v
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
Service
Guide,
86 A1 18EM
v
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices,
86 X1 16EM
Related
documents
The
following
documents
contain
information
related
to
this
product:
v
The
operator’s
guide
for
your
system
v
The
user’s
guide
for
your
using
system
SCSI
attachment
(for
example,
your
SCSI
adapter)
v
The
site
and
hardware
planning
information
for
your
system
v
The
problem
solving
guide
and
reference
for
your
system
v
The
7014
Model
T00
and
T42
Rack
Installation
and
Service
Guide,
86 A1 94KX
The
following
documents
contain
information
related
to
the
Expandable
Storage
Plus
disk
enclosures
that
attach
to
the
RISC
systems:
©
Copyright
IBM
Corp.
2003
v
v
Diagnostic
Information
for
Multiple
Bus
Systems,
86 A1 26HX
v
Site Preparation Guide for Rack Systems, 86 A1 30PX
v
Adapters,
Devices
and
Cable
Information
for
Multiple
Bus
Systems,
86 A1 27HX
vi
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
Following
are
the
translated
safety
notices
and
environmental
notices
for
the
2104
Model
DS4
and
Model
TS4.
©
Copyright
IBM
Corp.
2003
1
Bulgarian
2
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
3
4
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
5
6
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
7
8
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
9
10
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
11
12
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
13
14
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
15
16
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
17
18
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
19
20
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
21
22
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
23
Brazilian-Portuguese
24
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
25
26
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
27
28
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
29
30
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
31
32
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
33
34
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
35
36
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
37
38
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
39
40
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
41
42
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
43
44
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
45
46
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Chinese,
simplified
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
47
48
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
49
50
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
51
52
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
53
54
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
55
56
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
57
58
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
59
60
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
61
62
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
63
64
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
65
66
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
67
68
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
69
Chinese,
traditional
70
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
71
72
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
73
74
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
75
76
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
77
78
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
79
80
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
81
82
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
83
84
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
85
86
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
87
88
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
89
90
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
91
92
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Croatian
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
93
94
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
95
96
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
97
98
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
99
100
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
101
102
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
103
104
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
105
106
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
107
108
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
109
110
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
111
112
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
113
114
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
115
Czech
116
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
117
118
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
119
120
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
121
122
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
123
124
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
125
126
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
127
128
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
129
130
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
131
132
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
133
134
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
135
136
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
137
138
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Danish
Sikkerhedsmæssige og miljømæssige forskrifter
Oversatte sikkerhedsforskrifter
SCDAN901
Dette afsnit indeholder følgende oplysninger:
Oversatte sikkerhedsforskrifter
Retningslinjer for sikkerhedstjek af dette produkt
Miljømæssige forskrifter
Denne vejledning indeholder udelukkende sikkerhedsforskrifter.
Sikkerhedsforskrifter mærket advarer om forhold, der kan være livsfarlige
eller forårsage alvorlige ulykker.
Sikkerhedsforskrifter mærket advarer om forhold, der kan være farlige.
Sikkerhedsforskrifter mærket advarer om forhold eller procedurer, der
kan forårsage skade på maskiner, udstyr, programmer eller data.
Sikkerhedsforskrifterne er opstillet i numerisk rækkefølge efter deres id, der vises
i parentes efter hver forskrift. Nedenfor vises eksempler på id-nummeret på
sikkerhedsforskrifter mærket Fare! og Pas på!.
Fare!
Pas på!
Bemærk
Fare!
En forkert forbundet stikkontakt kan give livsfarlig elektrisk spænding på
metaldele i systemet eller på enheder, der er tilsluttet systemet. Det er
kundens ansvar at sikre, at stikkontakten er korrekt forbundet til jord. (1)
Pas på!
Placér ikke hænder eller værktøj i den åbning, hvor ventilatoren har
været monteret. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
139
Statisk elektricitet
Retningslinjer for sikkerhedstjek
Tjek af rack
SCDAN902
Bemærk: Tag følgende forholdsregler for at undgå statisk elektricitet, når du
håndterer udskiftelige enheder (FRU) og andre computerdele:
Begræns dine bevægelser. Bevægelse kan skabe statisk elektricitet omkring dig.
Vær altid forsigtig, når du rører ved computerdele. Hold på kanten af kortene
eller på metaldækslet. Rør aldrig ved et udækket kredsløb.
Personer, der ikke har en antistatisk beskyttelsesstrop om håndleddet, skal
undgå at røre ved computerdele.
Gør følgende, inden du installerer en ny del: Sørg for, at transportposen med
delen rører en ikke-malet metaldel på 2104 Model DS4, 2104 Model TS4 eller
værtssystemet i mindst 2 sekunder. Herved reduceres den statiske elektricitet
i pakken og i din krop.
Tag delen ud af emballagen, og installér den om muligt direkte i 2104 Model
DS4 eller Model TS4, uden først at lægge den fra dig. Gør følgende, hvis
du skal lægge delen fra dig:
1. Placér den antistatiske emballage, delen blev leveret i, på en plan flade.
2. Læg delen oven på den antistatiske emballage. Læg ikke delen direkte
på en metalflade.
Vejledningen beskriver, hvordan du udfører et sikkerhedstjek af 2104 Model DS4
eller Model TS4.
Der er oplysninger om udførelse af sikkerhedstjek af racket i
installationsvejledningen til racket.
Pas på!
2104 Model DS4 og Model TS4 er designet, så de kan installeres af kunden,
og certificerede som "kundeinstallation". Kontrollér, at det system eller det
rack, som 2104 Model DS4 eller Model TS4 skal installeres i, også er
designet og de er certificeret til kundeinstallation. Er det ikke tilfældet, skal
2104 Model DS4 eller Model TS4 installeres af en servicetekniker. (22)
140
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Kontrol af 2104 Model DS4 eller Model TS4
Udvendigt maskintjek
Indvendigt maskintjek
SCDAN903
Foretag følgende indvendige kontrol af maskinen:
1. Kontrollér, om der er foretaget ikke-IBM-ændringer af maskinen.
Hvis der er foretaget sådanne ændringer, skal du bestille formularen
"Non-IBM Alteration Attachment Survey" nummer R009 via IBM's
bestillingssystem - Publications Ordering System. Udfyld formularen,
og send den.
2. Kontrollér maskinen indvendigt for følgende:
Løse metaldele eller andre forurenende stoffer
Tegn på vand eller anden væske
Ild
Røgskade
3. Kontrollér, om der er tydelige mekaniske problemer, f.eks. løse komponenter.
4. Kontrollér ikke-beskyttede kabler og stik for slid, revner eller klemskader.
Foretag følgende udvendige kontrol af maskinen:
1. Kontrollér kabinettet for evt. skader (løse, afbrækkede eller skarpe kanter)
2. Kontrollér netledningerne, og tjek, at isoleringen ikke er slidt eller beskadiget.
3. Kontrollér, om der er foretaget åbenlyse ikke-standardændringer. Brug sund
fornuft ved vurderingen af sikkerheden i forbindelse med sådanne ændringer.
4. Kontrollér, at alle udvendige dæksler er på plads og ikke er beskadiget.
5. Kontrollér, at alle låse og hængsler er i orden.
6. Kontrollér, at netledningen ikke er beskadiget.
7. Kontrollér, at ledningerne ikke er slidte, beskadigede eller sidder i klemme.
8. Tjek ventilator- og strømforsyningsenhederne. Kontrollér, at skruer eller
nitter i dækslet til strømforsyningsenheden ikke er fjernet, og at de ikke
sidder forkert.
9. Kontrollér, at det eksterne signalkabel ikke er beskadiget.
10. Kontrollér, at der ikke er nogen skarpe kanter på dækslet, at det ikke er
beskadiget, og at der ikke er foretaget ændringer, som gør, at man kan
komme til at berøre enhedens indvendige dele.
11. Afhjælp eventuelle problemer.
Udfør nedenstående sikkerhedstjek for at klarlægge eventuelle farlige forhold.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
141
Kontrol af sikkerhedsmærkater
SCDAN904
Kontrollér følgende i forbindelse med mærkaterne:
1. Kontrollér, at der er sikkerhedsmærkater på 2104 Model DS4 eller Model TS4.
Se Figur 1.
2. Kontrollér, at advarselsmærkaten angående netledningen er placeret på
strømforsyningen i 2104 Model DS4 eller Model TS4. Se Figur 2.
SC000274
Figur 1. 2104 Model DS4- eller Model TS4-mærkat
Figur 2. Advarselsmærkat angående netledning
SC000325
Pas på!
Enheden kan have to netledninger. Tag begge ledninger ud for at fjerne al
strøm til maskinen. (1)
142
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN905
3. Kontrollér, at advarselsmærkaten angående strømforsyningsdækslet er
placeret på strømforsyningen til 2104 Model DS4 eller Model TS4. Se Figur 3.
4. Kontrollér, at advarselsmærkaten angående sikringen er placeret på
strømforsyningen til 2104 Model DS4 eller Model TS4. Se Figur 4.
Figur 3. Advarselsmærkat angående strømforsyningsdæksel
Figur 4. Advarselsmærkat angående sikring
SC000326
SC000327
Pas på!
Fjern ikke dækslet. Der er ingen dele, der kan repareres. (2)
Pas på!
Dobbeltpolet/neutral sikring. (3)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
143
SCDAN905A
5. Kontrollér, at mærkaten med vægtangivelse er placeret på 2104 Model DS4.
Se Figur 5.
Pas på!
Denne enhed vejer over 32 kg. Der er flere oplysninger i manualerne. (4)
Figur 5. 2104 Model DS4 - mærkat med vægtangivelse
SC000220
Der er flere
oplysninger I
manualerne
32 kg
Bemærk:
Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner
dem. Når du monterer diskene igen, skal du montere hver disk i samme bås,
som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data.
> 18 kg > 32 kg
Pas på!
Hvis der er installeret alt, hvad der er muligt i en 2104 Model DS4, det vil
sige 14 diske og 2 strømforsyninger, er den samlede vægt 32 kg. Du skal
fjerne mindst 6 diske fra 2104 Model DS4, før du løfter den eller installerer
den i et rack eller i 2104 Model TS4. Derved bliver den samlede vægt mindre
end 32 kg, og 2104 Model DS4 kan håndteres af to personer. Hvis vægten
ikke reduceres, kan det resultere i personskade.
144
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Pas på!
Enheden vejer mellem 32 kg og 55 kg. (5)
Figur 6. 2104 Model TS4 - mærkat med vægtangivelse
32 - 55 kg
SC000219
6. Kontrollér, at mærkaten med vægtangivelse er placeret på 2104 Model TS4.
Se Figur 6.
Pas på!
Det medfølgende løfteværktøj skal bruges ved flytning og montering af
produktet, når alt ekstraudstyr er installeret i det. Der skal 3 personer til
at flytte det på forsvarlig måde.
Hvis løfteværktøjet ikke er ved hånden, skal vægten reduceres til 32 kg
eller derunder, ved at du fjerner alle de tunge komponenter (diske og
strømforsyninger) fra produktet. Der skal så kun 2 personer til at flytte
eller montere produktet. (6)
Bemærk:
Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner
dem. Når du monterer diskene igen, skal du montere hver disk i samme bås,
som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data.
> 18 kg > 32 kg
SCDAN906
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
145
SCDAN907
7. Kontrollér, at den mærkat, der angiver standbytilstand, er placeret på 2104
Model DS4 eller Model TS4. Se Figur 7.
Pas på!
Standbytilstand angives ved symbolet til højre for jævnstrøm direkte
over kontakten, SW1. Når du placerer SW1 i højre position direkte under
standbysymbolet, slukker du ikke for vekselstrømmen til enheden. (7)
Figur 7. Mærkat angående standbytilstand
SC000326
146
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN908
Dette afsnit beskriver, hvordan du kontrollerer jordforbindelsen til 2104 Model
DS4 og Model TS4.
Kontrol af jordforbindelse til 2104 Model DS4
Gør følgende for at kontrollere jordforbindelsen til 2104 Model DS4:
3. Bemærk: Elektriske kredsløb kan blive beskadiget, hvis de eksterne
SCSI-kabler tilsluttes 2104 Model DS4, mens du tjekker jordforbindelsen.
Kontrollér, at der ikke er tilsluttet eksterne SCSI-kabler til stikkene .
4. Kontrollér jordforbindelsen til 2104 Model DS4:
a. Kig på selve de stik, som du fjernede fra strømfordelingsenheden eller
backupbatterienheden i trin 2, og kontrollér, at det stikben, der giver
jordforbindelsen, er i god stand.
b. Brug et ohmmeter, når du forbinder den ene leder til 2104 Model
DS4-rammen og den anden leder til det tredje stikben til jordforbindelse
foran på de stik, som du fjernede i trin 2. Der skal være 0,1 ohm eller
derunder mellem stikbenet til jordforbindelse på stikket og 2104
Model DS4-rammen.
Figur 8. Strømforsyningskabler og SCSI-stik på 2104 Model DS4
Kontrol af jordforbindelse til 2104 Model DS4 og Model TS4
1. Sørg for, at der ikke er strøm til racket. Der er flere oplysninger i
vejledningen til racket eller værtssystemet.
2. Kontrollér, at netledningerne er tilsluttet hver enkelt ventilator- og
strømforsyningsenhed. Se Figur 8. Kontrollér også, at netledningerne
er taget ud af strømfordelingsenheden eller backupbatterienheden i racket.
Der er flere oplysninger i vejledningen til racket eller værtssystemet.
2
1
SC000210
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
147
SCDAN909
Fortsæt med trin 5, hvis du udfører service på en 2104 Model DS4 (220 volt).
Fortsæt med tirn 7, hvis du udfører service på en 2104 Model DS4 (-48 volt).
5. Kontrollér, at der er forbindelse mellem 2104 Model DS4-rammen og
stikbenet til jord på netledningen på hver enkelt ventilator- og
strømforsyningsenhed. Se Figur 9.
6. Fortsæt med trin 9.
7. Kontrollér, at der er forbindelse mellem 2104 Model DS4-rammen og
stikbenene 1 og 7 til jord på hvert hovednetledningsstik. Se Figur 10.
8. Fortsæt med trin 9.
9. Udskift en ventilator- og strømforsyningsenhed, hvis forbindelserne ikke er
i orden. Der er flere oplysninger om ventilator- og strømforsyningsenheder i
.
Kontrollér jordforbindelsen igen. Hvis forbindelserne er i orden i en ventilatorog
strømforsyningsenhed, kan problemet skyldes netledningen eller
værtssystemets jordforbindelse.
Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide
Figur 9. Stikben til jord (220 V 2104 Model DS4 )
Figur 10. Stikben til jord (-48 V 2104 Model DS4)
1
SC000329
Kontrollen af jordforbindelsen er afsluttet, hvis jordforbindelsen er i orden.
Fjern netledningerne fra ventilator- og strømforsyningsenheden i 2104 Model
DS4, hvis jordforbindelsen ikke er i orden.
SC000330
Ben 6 Ben 1
Ben 12 Ben 7
148
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN910
Fortsæt med trin 10, hvis du udfører service på en 2104 Model DS4 (220 volt).
Læs i installationsvejledningen til racket eller værtssystemet for at isolere
fejlen, hvis du udfører service på en 2104 Model DS4 (-48 volt). Kontrollér
jordforbindelsen igen.
10. Kontrollér forbindelserne i netledningen.
Hvis forbindelserne i netledningen ikke er i orden, skal du udskifte den med
en ny, og derefter kontrollere jordforbindelsen igen.
Installationsvejledningen til racket eller værtssystemet indeholder oplysninger
om, hvordan du isolerer fejlen, hvis forbindelsen i netledningen er i orden.
Gør følgende for at kontrollere jordforbindelsen til 2104 Model TS4:
2. Kontrollér, at den anden ende af netledningerne ikke er tilsluttet stikkontakten.
Tag om nødvendigt ledningerne ud.
Kontrol af jordforbindelse til 2104 Model TS4
1. Kontrollér, at der er tilsluttet en netledning i hvert stik 1 . Se Figur 11.
Figur 11. Netledninger og SCSI-stik (2104 Model TS4)
SC000230
1
2
3
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
149
SCDAN911
3. Elektriske kredsløb kan beskadiges, hvis de eksterne SCSI-kabler
tilsluttes 2104 Model TS4, mens du tjekker jordforbindelsen.
Kontrollér, at der ikke er tilsluttet eksterne SCSI-kabler til SCSI-stikkene .
Se Figur 11.
4. Kontrollér, at der er forbindelse mellem 2104 Model TS4-rammen og
stikbenet til jord på hver netledning.
a. Tjek netledningerne, og kontrollér, at stikbenet til jord er i god stand.
b. Brug et ohmmeter til at kontrollere, at forbindelsen mellem stikbenet til
jord og jordforbindelsen til rammen er på 0,1 ohm eller derunder.
5. Kontrollen af jordforbindelsen er afsluttet, hvis forbindelsen er i orden.
6. Tag netledningerne ud af stikkontakterne, hvis forbindelsen til jord ikke er
i orden.
a. Kontrollér forbindelserne i hver netledning.
b. Udskift en netledning, hvis den er fejlbehæftet.
c. Kontrollér, at der er forbindelse mellem 2104 Model TS4-rammen og
stikbenet til jord på netledningen til hver enkelt ventilator- og
strømforsyningsenhed. Se Figur 12.
Bemærk:
d Udskift en ventilator- og strømforsyningsenhed, hvis forbindelsen ikke
er i orden. Der er flere oplysninger i
i afsnittet om ventilator- og
strømforsyningsenheder.
Fortsæt med "Kontrol af jordforbindelse til 2104 Model TS4", og udfør trin 1
til og med 6d for at udføre kontrollen af jordforbindelsen igen.
Expandable Storage Plus 2104
Model DS4 and Model TS4 Service Guide
Figur 12. Stikben til jord (2104 Model TS4)
1
2
3
SC000331
150
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN912
Figur 13. Fjernelse af strøm fra 2104 Model DS4
Bemærk:
Bemærk:
Medmindre du har en særlig grund til at gøre det, skal du ikke fjerne
strømmen fra værtssystemet eller fra 2104 Model DS4, medmindre
du får besked på at gøre det i de vejledninger, du følger.
Gør følgende for at fjerne strømmen fra 2104 Model DS4:
1. Kontrollér med kunden, at al kommunikation mellem 2104 Model DS4 og
værtssystemet er standset.
2. Indstil kontakten dc on/standby på hver ventilator- og strømforsyningsenhed
på standby. Se Figur 13.
a. Nogle 2104 Model DS4-enheder har en ventilator- og strømforsyningsenhed
og en ventilatorenhed. Ventilatorenheden har ingen dc on/standbykontakt.
b. På en ventilator- og strømforsyningsenhed kan CHK-lampen lyse, selvom
dc on/standbykontakten er sat til standby.
Fjern strømmen fra 2104 Model DS4 eller Model TS4
1
Fjern strømmen fra 2104 Model DS4
SC000206
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
151
SCDAN913
3. Fare!
Når du udfører næste trin, skal du fjerne netledningerne. Ledningerne
er strømførende, når strømfordelingsenheden i racket eller UPSenheden
(Uninterruptable Power Supply) er tændt. (1)
1
1
2
2
Bemærk:
Bemærk:
Medmindre du har en særlig grund til at gøre det, skal du ikke fjerne
strømmen fra værtssystemet eller fra 2104 Model TS4, medmindre du
får besked på at gøre det i de vejledninger, du følger.
Gør følgende for at fjerne strømmen fra 2104 Model TS4:
1. Kontrollér med kunden, at al kommunikation mellem 2104 Model TS4 og
værtssystemet er standset.
2. Indstil dc on/standbykontakten på hver ventilator- og strømforsyningsenhed
bag på 2104 Model TS4 på standby. Se Figur 14.
a. Nogle 2104 Model TS4-enheder har en ventilator- og strømforsyningsenhed
og en ventilatorenhed. Ventilatorenheden har ingen dc on/standbykontakt.
b. På en ventilator- og strømforsyningsenhed kan CHK-lampen lyse, selvom
dc on/standbykontakten er sat til standby.
Fjern strømmen fra 2104 Model TS4
Fjern netledningerne fra bagsiden af 2104 Model DS4. Se Figur 13.
4. Gør følgende for at slutte strømmen til 2104 Model DS4 igen:
a. Montér netledningerne igen.
b. Indstil dc on/standbykontakten på hver ventilator- og
strømforsyningsenhed til on.
Du kan konfigurere motorstartsekvensen på 2104 Model DS4.
Der er flere oplysninger i afsnittene om kontakterne til automatisk start af disk og
til forsinket motorstart i
.
Bemærk:
Expandable Storage Plus 2104 Model DS4 and
Model TS4 Service Guide
152
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN914
1
2
2
3. Fjern netledningerne fra bagsiden af 2104 Model TS4. Se Figur 14.
4. Gør følgende for at slutte strømmen til 2104 Model TS4 igen:
a. Montér netledningerne igen.
b. Indstil dc on/standbykontakten på hver ventilator- og
strømforsyningsenhed på on.
Du kan konfigurere motorstartsekvensen på 2104 Model TS4.
Der er flere oplysninger i afsnittene om kontakterne til automatisk
start af disk og til forsinket motorstart i
.
Bemærk:
Expandable Storage
Plus 2104 Model DS4 and Model TS4 Service Guide
Figur 14. Fjern strømmen fra 2104 Model TS4
SC000207
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
153
SCDAN915
Sikkerhedsforskrifter mærket Fare! (DANGER)
Fare!
Fare!
Fare!
Fare!
Fare!
Når du udfører næste trin, skal du fjerne netledningerne. Ledningerne
er strømførende, når strømfordelingsenheden i racket eller UPS-enheden
(Uninterruptable Power Supply) er tændt. (1)
Sæt ikke en netledning i ventilator- og strømforsyningsenheden,
før enheden er helt på plads, og fingerskruerne er skruet helt i. (3)
Kabler til skærmterminaler, printere, telefoner eller overspændingsafledere
til kommunikationsudstyr må ikke afmonteres i tordenvejr. (5)
En forkert forbundet stikkontakt kan give livsfarlig elektrisk spænding
på metaldele i systemet eller på udstyr, der er tilsluttet systemet. Det er
kundens ansvar at sikre, at stikkontakten er korrekt forbundet til jord.
Kabler til skærmterminaler, printere, telefoner eller overspændingsafledere
til kommunikationsudstyr må ikke afmonteres i tordenvejr. (4)
Forsøg ikke at åbne dækslerne til ventilator- og strømforsyningsenheden.
(2)
154
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN916
Sikkerhedsforskrifter mærket Pas på! (CAUTION)
Pas på!
Enheden kan have to netledninger. Tag begge ledninger ud for at fjerne al
strøm til maskinen. (1)
Pas på!
Fjern ikke dækslet. Der er ingen dele, der kan repareres. (2)
Pas på!
Dobbeltpolet/neutral sikring. (3)
Figur 15. Advarselsmærkat angående netledning
Figur 16. Advarselsmærkat angående dæksel til strømforsyning
Figur 17. Advarselsmærket angående sikring
SC000327
SC000326
SC000325
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
155
SCDAN916A
Pas på!
Enheden vejer over 32 kg. Der er flere oplysninger i manualerne. (4)
Figur 18. 2104 Model DS4 - mærkat med vægtangivelse
SC000220
Der er flere
oplysninger
i manualerne
32 kg
Bemærk:
Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem.
Når du monterer diskene igen, skal du montere hver disk i samme bås, som du
fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data.
> 18 kg > 32 kg
Pas på!
Hvis der er installeret alt, hvad der er muligt i en 2104 Model DS4, det vil sige
14 diske og 2 strømforsyninger, er den samlede vægt 32 kg. Du skal fjerne
mindst 6 diske fra 2104 Model DS4, før du løfter den eller monterer den i et
rack eller i 2104 Model TS4. Derved bliver den samlede vægt under 32 kg,
og 2104 Model DS4 kan håndteres af to personer. Hvis vægten ikke reduceres,
kan det resultere i personskade.
156
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Pas på!
Enheden vejer mellem 32 kg og 55 kg. (5)
Pas på!
Det medfølgende løfteværktøj skal bruges ved flytning eller montering af
produktet, når alt ekstraudstyr er installeret i produktet. Der skal 3 personer
til at flytte det på forsvarlig måde. Hvis løfteværktøjet ikke er ved hånden,
skal vægten reduceres til 32 kg eller derunder, ved at du fjerner alle de
tunge komponenter (diske og strømforsyninger) fra produktet. Der skal
så kun 2 personer til at flytte eller montere produktet. (6)
Figur 19. 2104 Model TS4 - mærkat med vægtangivelse
SC000219
Bemærk:
Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem.
Når du monterer diskene igen, skal du montere hver disk i samme bås, som du
fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data.
> 18 kg > 32 kg
32 - 55 kg
SCDAN917
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
157
SCDAN918
Pas på!
Standbytilstand angives ved symbolet til højre for jævnstrøm direkte over
kontakten, SW1. Når du placerer SW1 i højre position direkte under
standbysymbolet, slukker du ikke for vekselstrømmen til enheden. (7)
Pas på!
Støttepladen skal fastgøres korrekt foran i bunden af racket for at forhindre
det i at vælte, når du fjerner 2104 Model DS4 fra racket. Du må ikke fjerne eller
installere enheder, medmindre der er monteret en støtteplade på racket. (8)
Pas på!
Støttepladen skal fastgøres korrekt foran i bunden af racket for at forhindre
det i at vælte, når du monterer 2104 Model DS4 i racket. Du må ikke montere
eller fjerne enheder, medmindre der er monteret en støtteplade på racket. (9)
Figur 20. Angivelse af standbytilstand
SC000328
158
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN319
Pas på!
Placér ikke hænder eller værktøj i den åbning, hvor ventilatoren har
været monteret. (10)
Pas på!
Placér ikke hænder eller værktøj i den åbning, hvor ventilator- og
strømforsyningsenheden har været monteret. (11)
Pas på!
Placér ikke hænder eller værktøj i den åbning, hvor SCSI-interfacekortet
har været monteret. (12)
Pas på!
Dette produkt er forsynet med en 3-polet netledning og et trebensstik.
Af sikkerhedshensyn skal netledningen altid bruges sammen med en
korrekt jordforbundet stikkontakt. (13)
Pas på!
Rør ikke stikkontakten eller dens dækplade med andet end testpindene
(probe), før du har udført dette sikkerhedstjek. (14)
Pas på!
Hvis aflæsningen ikke er uendelig, må du ikke fortsætte. Få det
elektriske system bragt i orden, inden du fortsætter. Strømafbryderen
må ikke slås til, før ovennævnte trin er udført tilfredsstillende. (15)
Pas på!
Brug ikke håndtagene på ventilatorenheden eller ventilator- og
strømforsyningsenheden til at løfte 2104 Model DS4. Håndtagene er
ikke konstrueret til at bære enhedens vægt. (16)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
159
SCDAN920
Pas på!
Når du skubber enheden helt ind, flyttes grebet automatisk mod dets
låste position. Pas på, at dine fingre ikke kommer i klemme mellem
grebet og enheden. (17)
Pas på!
Sørg for, at hovednetledningen er fjernet fra den fejlbehæftede
ventilator- og strømforsyningsenhed, før du fortsætter .(18)
Pas på!
Placér ikke hænder eller værktøj i åbningen oven over ventilator- og
strømforsyningsenheden. (19)
Pas på!
Placér ikke hænder eller værktøj i åbningen mellem
strømforsyningsenhederne. (20)
Pas på!
Du skal også overholde sikkerhedsforskrifterne for rackmodellerne
T00 og T42, før du begynder at fjerne rammeenheden, hvis 2104 Model DS4
er installeret i en Model T00 eller T42. Sikkerhedsforskrifterne findes i
afsnittet om systeminstallation i
, SA38-0577 eller på webstedet:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
Pas på!
2104 Model DS4 og Model TS4 er designet, så de kan installeres af kunden,
og de er certificerede som "kundeinstallation". Kontrollér, at det system eller
det rack, som 2104 Model DS4 eller Model TS4 skal installeres i, også er
designet og certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104
Model DS4 eller Model TS4 installeres af en servicetekniker. (22)
7014 Model T00 and T42 Rack Installation
and Service Guide
160
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCDAN921
Pas på!
Placér ikke hænder eller værktøj i den åbning, hvor kortet har været
monteret. (23)
Pas på!
Der skal tre personer til at løfte 2104 Model TS4. Forsøg ikke at løfte
2104 Model TS4 alene. (24)
Pas på!
Brug ikke håndtagene på ventilatorenheden eller ventilator- og
strømforsyningsenheden til at løfte 2104 Model TS4. Håndtagene er
ikke konstrueret til at bære enhedens vægt. (25)
2104 Model DS4 er designet, så den kan installeres af kunden, og den
er certificeret som "kundeinstallation". Kontrollér, at det system eller
det rack, som 2104 Model DS4 skal installeres i, også er designet og
certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104 Model
DS4 installeres af en servicetekniker. (26)
Pas på!
Pas på!
2104 Model TS4 er designet, så den kan installeres af kunden, og den
er certificeret som "kundeinstallation". Kontrollér, at det system, som
2104 Model TS4 skal installeres i, også er designet og certificeret til
kundeinstallation. Er det ikke tilfældet, skal 2104 Model TS4 installeres
af en servicetekniker. (27)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
161
Dutch
162
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
163
164
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
165
166
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
167
168
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
169
170
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
171
172
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
173
174
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
175
176
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
177
178
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
179
180
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
181
182
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
183
184
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
English
Safety and environmental notices
Safety notices and their translations
SC000901
This section contains the following information:
Safety notices and their translations
Safety inspection procedures for this product
Environmental notices and statements
Safety notices are printed throughout this document.
A notice warns you of conditions or procedures that can result in death or
severe personal injury.
A warns you of conditions or procedures that can cause personal injury
that is neither lethal nor extremely hazardous.
An notice warns you of conditions or procedures that can cause damage
to machines, equipment, programs, or data.
The notices are listed in numeric order based on their IDs, which are displayed in
parentheses, at the end of each notice. See the following examples of danger and
caution notices for the location of the ID number.
Danger
Caution
Attention
DANGER
An electrical outlet that is not correctly wired could place a hazardous
voltage on the metal parts of the system or the products that attach to the
system. It is the customer’s responsibility to ensure that the outlet is
correctly wired and grounded to prevent an electrical shock. (1)
CAUTION:
Do not insert hands or tools into the opening of the empty space that
contained the fan assembly. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
185
Electrostatic discharge (ESD)
Safety inspection procedures
Inspecting the rack
SC000902
Attention When you handle field-replaceable units (FRUs) and other computer
parts, take these precautions to avoid static damage:
Limit your movement. Movement can cause static electricity to build up around
you.
Always touch the computer parts carefully. Hold the cards by their edges or
metal cover. Never touch any exposed circuits.
Prevent people who are not correctly grounded from touching computer parts.
Before you install a new part, touch the static-protective package that contains
the part against an unpainted metal part of the 2104 Model DS4, 2104 Model
TS4, or host system for at least 2 seconds. This reduces the static electricity
in the package and in your body.
Remove the part from its package and, if possible, install it directly into the
2104 Model DS4 or Model TS4 without putting the part down. If you must put
the part down, follow these steps:
1. Place the static-protective package that contained the part onto a smooth,
level surface.
2. Place the part onto the static-protective package. Do not place the part
directly onto any metal surface.
The safety inspection procedures describe how to inspect the 2104 Model DS4 or
Model TS4.
Refer to your rack installation manual for information about performing a safety
inspection on the rack.
CAUTION:
The 2104 Model DS4 and Model TS4 are designed to be installed by the
customer and are certified as customer setup. Make sure that the system or
rack into which the 2104 Model DS4 or Model TS4 will be installed is
also designed and certified for customer setup. If they are not, then the 2104
Model DS4 or Model TS4 must be installed by a CE. (22)
186
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Inspecting the 2104 Model DS4 or Model TS4
External machine checks
Internal machine checks
SC000903
Perform the following internal machine checks:
1. Check for any non-IBM changes that might have been made to the machine.
If any are present, obtain the “Non-IBM Alteration Attachment Survey” form,
number R009, from the IBM branch office. Complete the form, and return it to
the branch office.
2. Check the condition of the inside of the machine for:
Metal or other contaminants
Indications of water or other fluid
Fire
Smoke damage
3. Check for any obvious mechanical problems, such as loose components.
4. Check any exposed cables and connectors for wear, cracks, or pinching.
Perform the following external machine checks:
1. Check the chassis for damage (loose, broken, or sharp edges).
2. Check the power cables and ensure that the insulation is not worn or
damaged.
3. Check for any obvious nonstandard changes. Use good judgment about the
safety of any such changes.
4. Verify that all external covers are present and are not damaged.
5. Ensure that all latches and hinges are in correct operating condition.
6. Check the power cable for damage.
7. Check for worn, damaged, or pinched cables.
8. Inspect the fan-and-power-supply assemblies. Check that the fasteners in the
cover of the power-supply unit (screws or rivets) have not been removed or
disturbed.
9. Check the external signal cable for damage.
10. Check the cover for sharp edges, damage, or alterations that expose the
internal parts of the device.
11. Correct any problems that you find.
Perform the following safety checks to identify unsafe conditions.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
187
Safety label checks
SC000904
Perform the following safety label checks:
1. Verify that the label is installed on the 2104 Model DS4 or Model TS4.
See Figure 1.
2. Verify that the linecord caution label is installed on the 2104 Model DS4 or
Model TS4 power supply. See Figure 2.
SC000274
Figure 1. 2104 Model DS4 or Model TS4 label
Figure 2. Linecord caution label
SC000325
CAUTION:
This unit might have two linecords. To remove all power, disconnect both
linecords. (1)
188
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000905
3. Verify that the power supply cover caution label is installed on the 2104 Model DS4
or Model TS4 power supply. See Figure 3.
4. Verify that the fusing caution label is installed on the 2104 Model DS4 or Model TS4
power supply. See Figure 4.
Figure 3. Power supply cover caution label
Figure 4. Fusing caution label
SC000326
SC000327
CAUTION:
Do not remove cover, do not service, no serviceable parts. (2)
CAUTION:
Double Pole/Neutral Fusing. (3)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
189
SC000905A
5. Verify that the weight label is installed on the 2104 Model DS4. See Figure 5.
CAUTION:
This unit weighs over 32 Kg (70.5 lbs). Refer to manuals. (4)
Figure 5. 2104 Model DS4 weight label
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Attention:
If you have data stored on the drives, label the drives before you remove
them. When you replace the drives, install each one in the same drive bay
from which you removed it. Failure to do so could result in a loss of data.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
CAUTION:
If a 2104 Model DS4 is fully loaded with 14 drives and two power supplies, its
total weight exceeds 32 Kg (70.5 lbs). You must remove at least six drives from
the 2104 Model DS4 before you lift it or install it into a rack or 2104 Model TS4.
This reduces the total weight to less than 32 Kg (70.5 lbs) and the
can then be safely handled by two people. Failure to do so can result in
injury.
2104 Model
DS4
190
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
CAUTION:
This unit weighs between 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5)
Figure 6. 2104 Model TS4 weight label
32 - 55 Kg (70.5 - 121.2 lbs)
SC000219
6. Verify that the weight label is installed on the 2104 Model TS4. See Figure 6.
CAUTION:
)
The provided cardboard lifting tool is required for moving, installing, and
relocating the product when fully populated. Three people are required to
safely move the product. Failure to do so might result in injury.
In case the lifting tool is not readily available, you must reduce the weight
to 32 Kg or less by removing all of the heavy components (disk drives
and power supplies) from the product. Then only two people are required to
move, install, and relocate the product. (6
Attention:
If you have data stored on the drives, label the drives before you remove them.
When you replace the drives, install each one in the same drive bay
from which you removed it. Failure to do so could result in a loss of data.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
SC000906
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
191
SC000907
7. Verify that the standby condition label is installed on the 2104 Model DS4 or
Model TS4. See Figure 7.
CAUTION:
A standby condition is indicated by the symbol to the right of the DC
directly above the switch, SW1. When you toggle SW1 to the right position
directly under the standby symbol, the ac-power to the unit is not shut off. (7)
Figure 7. Standby condition label
SC000326
192
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000908
This section contains instructions for checking the grounding of the 2104 Model
DS4 or Model TS4.
Checking the grounding of the 2104 Model DS4
Perform the following steps to check the grounding of the 2104 Model DS4:
3. Some electrical circuits could be damaged if the external SCSI
cables are connected to the 2104 Model DS4 while the grounding check is
being done.
Ensure that no external SCSI cables are plugged at the connectors .
4. Check the grounding of the 2104 Model DS4:
Attention:
a. At the connector ends of the power plugs that you unplugged from
the power distribution unit or battery-backup unit in step 2, do a visual
inspection and ensure that the third wire ground pin is in good condition.
b. Use an ohm meter to connect one lead to the 2104 Model DS4 chassis
and the other lead to the third wire ground pin at the face of the power
connectors that you unplugged in step 2. You should read 0.1 ohm o
Model DS4
r
less between the ground pin on the power connector and the 2104
chassis.
Figure 8. Power cables and SCSI connectors on the 2104 Model DS4
Checking the grounding of the 2104 Model DS4 or Model TS4
1. Ensure that all power is removed from the rack. See the installation document
for the rack or host system.
2. Ensure that the power cables are plugged into each fan-and-power-supply
assembly. See Figure 8. Also ensure that the other ends of the power cables
are unplugged from the power distribution unit or battery-backup unit in the rack.
See the installation document for the rack or host system.
2
1
SC000210
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
193
SC000909
If you are servicing a 220-volt 2104 Model DS4, go to step 5.
If you are servicing a -48 volt 2104 Model DS4, go to step 7.
5. Check for continuity between the chassis of the 2104 Model DS4 and the
ground pin of the power connector on each fan-and-power-supply
assembly. See Figure 9.
6. Go to step 9.
7. Check for continuity between the chassis of the 2104 Model DS4 and the
ground pins 1 and 7 of each mainline-power connector. See Figure 10.
8. Go to step 9.
9. If any fan-and-power-supply assembly has no continuity, exchange that
assembly for a new one. See the section about fan and power supply
assemblies in the
. Perform the complete grounding check again.
If each fan-and-power-supply assembly has continuity, you might have a
problem with the power cable or with the grounding of the host system.
Expandable Storage Plus 2104 Model DS4 and Model TS4
Service Guide
Figure 9. Ground pin (220 V 2104 Model DS4 )
Figure 10. Ground pins ( -48 V 2104 Model DS4)
1
SC000329
If the grounding is correct, the grounding check is complete.
If the grounding is not correct, unplug the power cables from the
fan-and-power-supply assemblies in the 2104 Model DS4.
SC000330
Pin 6 Pin 1
Pin 12 Pin 7
194
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000910
If you are servicing a 220-volt 2104 Model DS4, go to step 10.
If you are servicing a -48 volt 2104 Model DS4, see the installation document
for the rack or host system to isolate the fault. Perform the complete grounding
check again.
10. Check the power cable for continuity.
If the power cable does not have continuity, exchange it for a new one, then
perform the complete grounding check again.
If the power cable does have continuity, see the installation document for the
rack or host system to isolate the fault.
Perform the following steps to check the grounding of the 2104 Model TS4:
2. Ensure that the other ends of the power cables are not plugged into electrical
power outlets. Unplug the cables if necessary.
Checking the grounding of the 2104 Model TS4
1. Ensure that a power cable is plugged into each power socket . See
Figure 11.
1
Figure 11. Power cables and SCSI connectors (2104 Model TS4)
SC000230
1
2
3
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
195
SC000911
3. : Some electrical circuits could be damaged if the external SCSI
cables are connected to the 2104 Model TS4 while the grounding check is
being done.
Ensure that no external SCSI cables are connected at the SCSI connectors .
See Figure 11.
4. Check for continuity between the chassis of the 2104 Model TS4 and the
ground pin of each power cable.
a. Check the power cables and ensure that the third-wire ground connector is
in good condition.
b. Use a meter to check that the third-wire ground continuity is 0.1 ohm or less
between the external ground pin and the chassis ground.
5. If the continuity is good, the grounding check is complete.
6. If the grounding is not correct, unplug the power cables from the mainline power
connectors.
a. Check each power cable for continuity.
b. If either power cable is failing, exchange it for a new one.
c. Check for continuity between the chassis of the 2104 Model TS4 and the
ground pin of the power connector on each fan-and-power-supply
assembly. See Figure 12.
Attention
d If either fan-and-power-supply assembly does not have continuity, exchange
that assembly for a new one. (See the section about fan and power supply
assemblies in the
Go to “Checking the grounding of the 2104 Model TS4” and perform step 1
through step 6d to complete the grounding check again.
Expandable Storage Plus 2104 Model DS4 and Model TS4
Service Guide.)
Figure 12. Ground pin (2104 Model TS4)
1
2
3
SC000331
196
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000912
Figure 13. Removing the power from a 2104 Model DS4
Unless you have a particular reason to do so, do not remove power from the
host system or from the 2104 Model DS4 unless the instructions that you are
following tell you to.
Perform the following steps to remove the power from a 2104 Model DS4:
1. Verify with the customer that all operations between the 2104 Model DS4 and
the host system have stopped.
2. Set the dc on/standby switch of each fan-and-power-supply assembly to
standby. See Figure 13.
a. Some 2104 Model DS4s have a fan-and-power-supply assembly and a fan
assembly. The fan assembly has no dc on/standby switch.
b. A fan-and-power-supply assembly might have its CHK light on although its
dc on/standby switch is set to standby.
Note:
Notes:
Removing power from the 2104 Model DS4 or Model TS4
1
Removing power from a 2104 Model DS4
SC000206
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
197
SC000913
3. DANGER
In the following step you are going to remove the power cables. These
cables are live if the rack power distribution unit or uninterruptible
power supply (UPS) unit is still switched on. (1)
1
1
2
2
Note:
Notes:
Unless you have a particular reason to do so, do not remove power from the
host system or from the 2104 Model TS4 unless the instructions that you are
following tell you to.
Perform the following steps to remove the power from a 2104 Model TS4:
1. Verify with the customer that all operations between the 2104 Model TS4 and
the host system have stopped.
2. At the back of the 2104 Model TS4, set the dc on/standby switch of each
fan-and-power-supply assembly to standby. See Figure 14.
a. Some 2104 Model TS4s have a fan-and-power-supply assembly and a fan
assembly. The fan assembly has no dc on/standby switch.
b. A fan-and-power-supply assembly might have its CHK light on although its
dc on/standby switch is set to standby.
Removing power from a 2104 Model TS4
Remove the power cables from the back of the 2104 Model DS4. See
Figure 13.
4. Perform the following steps to return power to the 2104 Model DS4:
a. Reinstall the power cables .
b. Set the dc on/standby switch of each fan-and-power-supply assembly to
on.
You can configure the motor-start sequencing of the 2104 Model DS4.
For more information, see the sections about the drive autostart switch
and the delay motor start mode switch in the
.
Note:
Expandable Storage Plus 2104
Model DS4 and Model TS4 Service Guide
198
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000914
1
2
2
3. Remove the power cables from the electrical power outlet and from the
back of the 2104 Model TS4. See Figure 14.
4. Perform the following steps to return power to the 2104 Model TS4:
a. Reinstall the power cables .
b. Set the dc on/standby switch of each fan-and-power-supply assembly to
on.
You can configure the motor-start sequencing of the 2104 Model TS4.
For more information, see the sections about the drive autostart switch
and the delay motor start mode switch in the
Note:
Expandable Storage Plus 2104
Model DS4 and Model TS4 Service Guide.
Figure 14. Removing power from a 2104 Model TS4
SC000207
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
199
SC000915
Danger notices
DANGER
DANGER
DANGER
DANGER
DANGER
In the following step you are going to remove the power cables. These
cables are live if the rack power distribution unit or uninterruptible power
supply (UPS) unit is still switched on. (1)
Do not plug a power cable into the fan-and-power-supply assembly until
the assembly is fully home and its thumbscrews are fully tightened. (3)
During an electrical storm, do not disconnect cables for display stations,
printers, telephones, or station protectors for communication lines. (5)
An electrical outlet that is not correctly wired could place hazardous
voltage on metal parts of the system or the devices that attach to that
system. It is the customer’s responsibility to ensure that the outlet is
correctly wired and grounded to prevent an electrical shock. During an
electrical storm, do not disconnect cables for display stations, printers,
telephones, or station protectors for communication lines. (4)
Do not try to open the covers of the fan-and-power-supply assembly. (2)
200
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000916
Caution notices
CAUTION:
This unit might have two linecords. To remove all power, disconnect both
linecords. (1)
CAUTION:
Do not remove cover, do not service, no serviceable parts. (2)
CAUTION:
Double Pole/Neutral Fusing. (3)
Figure 15. Linecord caution label
Figure 16. Power supply cover caution label
Figure 17. Fusing caution label
SC000327
SC000326
SC000325
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
201
SC000916A
CAUTION:
This unit weighs over 32 Kg (70.5 lbs). Refer to manuals. (4)
Figure 18. 2104 Model DS4 weight label
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Attention:
If you have data stored on the drives, label the drives before you remove
them. When you replace the drives, install each one in the same drive bay
from which you removed it. Failure to do so could result in a loss of data.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
CAUTION:
If a 2104 Model DS4 is fully loaded with 14 drives and two power supplies, its
total weight exceeds 32 Kg (70.5 lbs). You must remove at least six drives from
the 2104 Model DS4 before you lift it or install it into a rack or 2104 Model TS4.
This reduces the total weight to less than 32 Kg (70.5 lbs) and the
can then be safely handled by two people. Failure to do so can result in
injury.
2104 Model
DS4
202
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
CAUTION:
This unit weighs between 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5)
CAUTION:
(6)
The provided cardboard lifting tool is required for moving, installing, and
relocating the product when fully populated. Three people are required to
safely move the product. Failure to do so might result in injury.
In case the lifting tool is not readily available, you must reduce the weight
to 32 Kg or less by removing all of the heavy components (disk drives
and power supplies) from the product. Then only two people are required to
move, install, and relocate the product.
Figure 19. 2104 Model TS4 weight label
SC000219
Attention:
If you have data stored on the drives, label the drives before you remove them.
When you replace the drives, install each one in the same drive bay
from which you removed it. Failure to do so could result in a loss of data.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
32 - 55 Kg (70.5 - 121.2 lbs)
SC000917
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
203
SC000918
CAUTION:
A standby condition is indicated by the symbol to the right of the DC directly
above the switch, SW1. When you toggle SW1 to the right position directly
under the standby symbol, the ac-power to the unit is not shut off. (7)
CAUTION:
The stabilizer must be correctly attached to the bottom front of the rack to
prevent the rack from tipping forward while the 2104 Model DS4 is being
removed from the rack. Do not pull out or install any unit if a stabilizer is not
attached to the rack. (8)
CAUTION:
The stabilizer must be correctly attached to the bottom front of the rack to
prevent the rack from tipping forward while the 2104 Model DS4 is being
installed into the rack. Do not remove or install any unit if a stabilizer is not
attached to the rack. (9)
Figure 20. Location of standby condition
SC000328
204
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000319
CAUTION:
Do not insert hands or tools into the empty space that contained the fan
assembly. (10)
CAUTION:
Do not insert hands or tools into the empty space that contained the
fan-and-power-supply assembly. (11)
CAUTION:
Do not insert hands or tools into the empty space that contained the SCSI
interface card assembly. (12)
CAUTION:
This product is equipped with a 3-wire power cable and plug for the user’s
safety. Use this power cable in conjunction with a correctly grounded
electrical outlet to avoid an electrical shock. (13)
CAUTION:
Do not touch the power outlet or the power outlet face plate with anything
other than test probes before you complete this safety check. (14)
CAUTION:
If the reading is not infinity, do not proceed. Make the necessary corrections
to the wiring before you continue. Do not switch on the branch circuit CB
until you satisfactorily complete the previous steps. (15)
CAUTION:
Do not use the handles of the fan or fan-and-power-supply assemblies to
carry the 2104 Model DS4. These handles are not intended to support the
weight of the unit. (16)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
205
SC000920
CAUTION:
As you push the assembly fully home, the lever automatically moves toward
its closed position. Ensure that your fingers do not become pinched between
the lever and the assembly. (17)
CAUTION:
Ensure that the mainline power cable has been removed from the failing
fan-and-power supply before you continue. (18)
CAUTION:
Do not insert hands or tools into the empty space above the fan-and-power
supply assembly. (19)
CAUTION:
Do not insert hands or tools into the empty space between the power supply
assemblies. (20)
CAUTION:
If the 2104 Model DS4 is installed in a Model T00 or T42 rack, you must also
observe the safety notices for those racks before you start to remove the
frame assembly. You can find the safety notices in the section about system
installation in the
SA38-0577, or at the following Web site:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
CAUTION:
The 2104 Model DS4 and Model TS4 are designed to be installed by the
customer and are certified as customer setup. Make sure that the system or
rack into which the 2104 Model DS4 or Model TS4 will be installed is also
designed and certified for customer setup. If they a the 2104
Model DS4 or Model TS4 must be installed by a CE. (22)
7014 Model T00 and T42 Rack Installation and Service Guide,
re not, then
206
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000921
CAUTION:
Do not insert hands or tools into the space that contained the card assembly.
(23)
CAUTION:
It takes three people to lift the 2104 Model TS4. Do not attempt to lift the 2104
Model TS4 by yourself. Do not attempt to lift it without help from two other
people. (24)
CAUTION:
Do not use the handles of the fan or fan-and-power-supply assemblies to
carry the 2104 Model TS4. These handles are not intended to support the
weight of the unit. (25)
The 2104 Model DS4 is designed to be installed by the customer and is
certified as customer setup. Make sure that the system or rack into which the
2104 Model DS4 will be installed is also designed and certified for customer
setup. If they are not, then the 2104 Model DS4 must be installed by a CE. (26)
CAUTION:
CAUTION:
The 2104 Model TS4 is designed to be installed by the customer and is
certified as customer setup. Make sure that the system into which the
2104 Model TS4 will be installed is also designed and certified for customer
setup. If it is not, then the 2104 Model TS4 must be installed by a CE. (27)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
207
Finnish
208
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
209
210
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
211
212
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
213
214
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
215
216
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
217
218
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
219
220
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
221
222
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
223
224
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
225
226
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
227
228
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
229
230
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
French
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
231
232
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
233
234
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
235
236
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
237
238
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
239
240
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
241
242
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
243
244
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
245
246
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
247
248
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
249
250
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
251
252
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
253
German
Sicherheitshinweise und Hinweise zum Umweltschutz
Sicherheitshinweise und ihre Übersetzungen
SCGER901
Dieser Abschnitt enthält die folgenden Informationen:
Sicherheitshinweise und ihre Übersetzungen
Prozeduren zur Sicherheitsüberprüfung für dieses Produkt
Hinweise zur Wiederverwertbarkeit und Entsorgung
Dieses Buch enthält Sicherheitshinweise.
Die Nichtbeachtung der Vorschriften in den mit gekennzeichneten
Sicherheitshinweisen kann zu schweren Verletzungen oder sogar zum Tod führen.
Die Nichtbeachtung der Vorschriften in den mit gekennzeichneten
Sicherheitshinweisen kann zu leichten Verletzungen bzw. zu Schäden an Hardware
oder Software führen.
Die Sicherheitshinweise sind in numerischer Reihenfolge anhand ihrer IDs
aufgeführt, die am Ende der einzelnen Sicherheitshinweise in Klammern stehen.
Die folgenden Beispiele zeigen die Position der ID-Nummern in den Sicherheitshinweisen.
Vorsicht
Achtung
Vorsicht
Bei nicht ordnungsgemäß angeschlossener Netzsteckdose können an offen
liegenden Metallteilen des Systems oder an angeschlossenen Einheiten
gefährliche Berührungsspannungen auftreten. Für den ordnungsgemäßen
Zustand der Steckdose ist der Betreiber verantwortlich. (1)
Achtung:
Keine Hände oderWerkzeuge in die Öffnung des freien Bereichs einführen,
in dem sich die Ventilatorbaugruppe befand. (1)
254
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Elektrostatische Entladung (ESE)
Prozeduren zur Sicherheitsüberprüfung
Rack überprüfen
SCGER902
Achtung: Bei der Handhabung von durch den Kundendienst austauschbaren
Funktionseinheiten (FRUs) oder anderen Computerteilen die folgenden
Vorsichtsmaßnahmen ergreifen, um Schäden durch statische Entladung zu
vermeiden:
Bewegungen einschränken. Durch Bewegung kann statische Aufladung aufgebaut
werden.
Computerteile vorsichtig anfassen. Karten an den Rändern oder an der
Metallabdeckung anfassen. Niemals offen liegende Schaltkreise berühren.
Vermeiden, dass Personen, die nicht korrekt geerdet sind, Computerteile berühren.
Vor der Installation eines neuen Teils die antistatische Verpackung, in der sich
das Teil befindet, mit einem unlackierten Metallteil des 2104 Modells DS4 oder
Ts4 oder des Hostsystems mindestens zwei Sekunden lang berühren. Hierdurch
wird die statische Elektrizität abgeleitet.
Das Teil aus der Verpackung nehmen und (wenn möglich) direkt in das 2104
Modell DS4 oder TS4 installieren, ohne das Teil abzusetzen. Muss das Teil
abgesetzt werden, wie folgt vorgehen:
1. Die antistatische Verpackung, in der sich das Teil befindet, auf eine glatte,
ebene Oberfläche legen.
2. Das Teil auf die antistatische Verpackung und nicht direkt auf eine
Metalloberfläche legen.
In den Prozeduren zur Sicherheitsüberprüfung wird beschrieben, wie
das 2104 Modell DS4 oder TS4 überprüft wird.
Informationen über die Ausführung einer Sicherheitsüberprüfung für das Rack
enthält das Installationshandbuch für das Rack.
Achtung:
Die 2104 Modelle DS4 und TS4 sind für die Installation durch Kunden konzipiert
und zertifiziert. Achten Sie darauf, dass das System oder Rack, in das das
2104 Modell DS4 oder TS4 eingebaut wird, ebenfalls für die Installation
durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall,
muss das 2104 Modell DS4 oder TS4 vom Kundendienst installiert werden. (22)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
255
2104 Modell DS4 oder TS4 überprüfen
Externe Maschinenprüfungen
Interne Maschinenprüfungen
SCGER903
Führen Sie die folgenden internen Maschinenüberprüfungen aus:
1. Überprüfen Sie, ob Änderungen an der Maschine vorgenommen wurden,
die nicht von IBM ausgeführt wurden. Ist dies der Fall, besorgen
Sie sich das Formular für nicht von IBM ausgeführte Änderungen
(Non-IBM Alteration Attachment Survey, Formnummer R009) bei der
zuständigen Geschäftsstelle. Füllen Sie das Formular aus und
geben Sie es an die Geschäftsstelle zurück.
2. Überprüfen Sie das Innere der Maschine auf:
Metallteile oder andere Verschmutzungen
Anzeichen von Wasser oder Feuchtigkeit
Brandschäden
Rauchschäden
3. Überprüfen Sie, ob offensichtliche mechanische Probleme wie z. B.
lose Komponenten vorliegen.
4. Überprüfen Sie offen liegende Kabel auf Abnutzung oder Beschädigung.
Führen Sie die folgenden externen Maschinenüberprüfungen aus:
1. Überprüfen Sie das Gehäuse auf Beschädigungen (lockere, abgebrochene
oder scharfe Kanten).
2. Überprüfen Sie die Netzkabel. Die Isolierung darf nicht abgenutzt oder
beschädigt sein.
3. Überprüfen Sie, ob vom Standard abweichende Änderungen vorgenommen
wurden. Überprüfen Sie die Sicherheit solcher Änderungen mit besonderer
Sorgfalt.
4. Überprüfen Sie, ob alle externen Abdeckungen vorhanden und unbeschädigt
sind.
5. Überprüfen Sie, ob alle Verriegelungen und Scharniere in der korrekten
Betriebsposition sind.
6. Überprüfen Sie das Netzkabel auf Beschädigungen.
7. Überprüfen Sie, ob Kabel abgenutzt oder beschädigt sind.
8. Überprüfen Sie die aus Ventilator und Netzteil bestehenden Baugruppen.
Stellen Sie sicher, dass die Befestigungen in der Abdeckung des Netzteils
(Schrauben oder Nieten) nicht entfernt wurden oder beschädigt sind.
9. Überprüfen Sie das externe Signalkabel auf Beschädigungen.
10. Überprüfen Sie die Abdeckung auf scharfe Kanten, Beschädigungen oder
Veränderungen, durch die interne Teile der Einheit freigelegt werden.
11. Beheben Sie alle gefundenen Probleme.
Führen Sie die folgenden Sicherheitsüberprüfungen aus, um Gefahrenquellen zu
erkennen.
256
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Sicherheitsaufkleber überprüfen
SCGER904
Führen Sie die folgenden Überprüfungen für Sicherheitsaufkleber durch:
1. Überprüfen Sie, ob der Aufkleber am 2104 Modell DS4 oder TS4 angebracht ist.
Siehe Abbildung 1.
2. Überprüfen Sie, ob der Sicherheitsaufkleber für Netzkabel am Netzteil für
das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 2.
SCGER274
Abbildung 1. Aufkleber für 2104 Modell DS4 oder TS4
Abbildung 2. Sicherheitsaufkleber für Netzkabel
SCGER325
Achtung:
Diese Einheit könnte zwei Netzkabel haben. Um die Einheit ganz von der
Netzspannung zu trennen, beide Netzkabel lösen. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
257
SCGER905
3. Überprüfen Sie, ob der Sicherheitsaufkleber für die Netzteilabdeckung am
Netzteil für das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 3.
4. Überprüfen Sie, ob der Sicherheitsaufkleber für die Sicherung am Netzteil
für das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 4.
Abbildung 3. Sicherheitsaufkleber für Netzteilabdeckung
Abbildung 4: Sicherheitsaufkleber für Sicherung
SCGER326
SCGER327
Achtung:
Abdeckung nicht entfernen. Nicht reparieren. Keine wartungsbedürftigen
Teile. (2)
Achtung:
Zweipolige bzw. Neutralleiter-Sicherung. (3)
258
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER905A
5. Überprüfen Sie, ob der Gewichtsaufkleber am 2104 Modell DS4 angebracht ist.
Siehe Abbildung 5.
Achtung:
Diese Einheit wiegt über 32 kg. Siehe Handbücher. (4)
Abbildung 5. Gewichtsaufkleber für 2104 Modell DS4
SCGER220
Refer to manuals 32 Kg (70.5 lbs)
Achtung:
Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau
mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in
die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau
entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
Achtung:
Ist ein 2104 Modell DS4 mit 14 Laufwerken und 2 Netzteilen ausgestattet,
überschreitet das Gesamtgewicht 32 kg. Es müssen mindestens 6 Laufwerke
aus dem 2104 Modell DS4 entfernt werden, bevor es angehoben oder in ein
Rack oder 2104 Modell TS4 installiert werden kann, damit das Gesamtgewicht
auf unter 32 kg reduziert und das 2104 Modell DS4 sicher von zwei Personen
gehandhabt werden kann. Andernfalls können Verletzungen auftreten.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
259
Diese Einheit wiegt zwischen 32 und 55 kg. (5)
Achtung:
Abbildung 6. Gewichtsaufkleber für 2104 Modell TS4
32 - 55 Kg (70.5 - 121.2 lbs)
SCGER219
6. Überprüfen Sie, ob der Gewichtsaufkleber am 2104 Modell TS4 angebracht ist.
Siehe Abbildung 6.
Achtung:
Wird ein vollständig ausgestattetes Produkt versetzt, installiert oder
transportiert, ist das mitgelieferte Anhebewerkzeug erforderlich.
Zum sicheren Versetzen, Installieren oder Transportieren des Produkts
sind drei Personen erforderlich. Andernfalls können Verletzungen auftreten.
Ist das Anhebewerkzeug nicht verfügbar, muss das Gewicht durch das Entfernen
aller schweren Komponenten (Plattenlaufwerke und Netzteile) aus dem Produkt
auf unter 32 kg reduziert werden, da in diesem Fall nur zwei Personen erforderlich
sind, um das Produkt zu versetzen, zu installieren oder zu transportieren. (6)
Achtung:
Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau
mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in
die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau
entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
SCGER906
260
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER907
7. Überprüfen Sie, ob der Aufkleber für Bereitschaftsbedingung am 2104 Modell DS4
oder TS4 angebracht ist. Siehe Abbildung 7.
Achtung:
Durch das Symbol rechts von "DC", direkt über dem Schalter SW1, wird eine
Bereitschaftsbedingung angegeben. Wird der Schalter SW1 nach rechts
direkt unter das Bereitschaftssymbol gelegt, wird dieWechselstromversorgung
der Einheit nicht ausgeschaltet. (7)
Abbildung 7. Aufkleber für Bereitschaftsbedingung
SCGER326
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
261
SCGER908
Dieser Abschnitt enthält Anweisungen zur Überprüfung der Erdung des
2104 Modells DS4 oder TS4.
Erdung des 2104 Modells DS4 überprüfen
Gehen Sie wie folgt vor, um die Erdung des 2104 Modells DS4 zu überprüfen:
3. Einige elektrische Schaltkreise können beschädigt werden, wenn
die externen SCSI-Kabel an das 2104 Modell DS4 angeschlossen sind,
während die Erdung überprüft wird. Achten Sie darauf, dass keine externen
SCSI-Kabel an die Anschlüsse angeschlossen sind.
4. Überprüfen Sie die Erdung des 2104 Modells DS4:
a. Stellen Sie an den Anschlussenden der Netzkabel, die in Schritt 2 aus der
Stromversorgungseinheit oder der Notstromversorgung herausgezogen
wurden, durch eine Sichtprüfung fest, ob die Schutzleiterverbindung in
Ordnung ist.
b. Schließen Sie eine Messleitung eines Ohm-Meters an das Gehäuse des
2104 Modells DS4 und die andere Messleitung an die Schutzleiterverbindung
an der Vorderseite der Netzteilanschlüsse an, die in Schritt 2
herausgezogen wurden. Der zwischen dem Schutzleiterkontakt am
Netzteilanschluss und dem Gehäuse des 2104 Modells DS4 gemessene
Wert darf maximal 0,1 Ohm betragen.
Achtung:
Abbildung 8. Netzkabel und SCSI-Anschlüsse auf dem 2104 Modell DS4
Erdung des 2104 Modells DS4 oder TS4 überprüfen
1. Achten Sie darauf, dass die Stromzufuhr zum Rack ausgeschaltet ist.Weitere
Informationen enthält das Installationsdokument für das Rack oder Hostsystem.
2. Achten Sie darauf, dass die Netzkabel in die aus Ventilator und Netzteil
bestehenden Baugruppen eingesteckt sind. Siehe Abbildung 8. Achten Sie
zudem darauf, dass die anderen Enden der Netzkabel aus der Stromversorgungseinheit
oder der Notstromversorgung im Rack herausgezogen sind.
Weitere Informationen enthält das Installationsdokument für das Rack oder
Hostsystem.
2
1
SCGER210
262
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER909
Wird ein 220 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 5 weiter.
Wird ein -48 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 7 weiter.
5. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells
DS4 und dem Schutzleiterkontakt des Netzteilanschlusses der einzelnen aus
Ventilator und Netzteil bestehenden Baugruppen aus. Siehe Abbildung 9.
6. Machen Sie mit Schritt 9 weiter.
7. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells
DS4 und den Schutzleiterkontakten 1 und 7 der einzelnen Hauptstromanschlüsse
aus. Siehe Abbildung 10.
8. Machen Sie mit Schritt 9 weiter.
9. Wird für eine aus Ventilator und Netzteil bestehende Baugruppe kein Durchgang
gemessen, tauschen Sie die Baugruppe aus.Weitere Informationen enthält der
Abschnitt über aus Ventilator und Netzteil bestehende Baugruppen im
.
Führen Sie die gesamte Erdungsprüfung erneut aus. Wird für jede aus Ventilator
und Netzteil bestehende Baugruppe Durchgang gemessen, liegt der
Fehler möglicherweise beim Netzkabel oder bei der Erdung des Hostsystems.
Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide
Abbildung 9. Schutzleiterkontakt (220 V 2104 Modell DS4)
Abbildung 10. Schutzleiterkontakte (-48 V 2104 Modell DS4)
1
SCGER329
Ist die Erdung korrekt, ist die Erdungsüberprüfung abgeschlossen.
Ist die Erdung nicht korrekt, ziehen Sie die Netzkabel aus den aus
Ventilator und Netzteil bestehenden Baugruppen im 2104 Modell DS4.
SCGER330
Kontaktstift 6 Kontaktstift 1
Kontaktstift 12 Kontaktstift 7
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
263
SCGER910
Wird ein 220 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 10 weiter.
Wird ein -48 Volt 2104 Modell DS4 gewartet, können Sie dem Installationsdokument
für das Rack oder das Hostsystem Informationen zur Fehlerbestimmung
entnehmen. Führen Sie die gesamte Erdungsprüfung erneut aus.
10. Überprüfen Sie das Netzkabel auf Durchgang.
Wird am Netzkabel kein Durchgang gemessen, tauschen Sie das Netzkabel aus.
Führen Sie dann die gesamte Erdungsprüfung erneut aus.
Wird am Netzkabel Durchgang gemessen, können Sie dem Installationsdokument
für das Rack oder das Hostsystem Informationen zur Fehlerbestimmung
entnehmen.
Gehen Sie wie folgt vor, um die Erdung des 2104 Modells TS4 zu überprüfen:
2. Achten Sie darauf, dass das andere Ende der Netzkabel nicht an eine
Netzsteckdose angeschlossen ist. Ziehen Sie die Kabel gegebenenfalls heraus.
Erdung des 2104 Modells TS4 überprüfen
1. Achten Sie darauf, dass an jeden Stromversorgungsanschluss ein
Netzkabel abgeschlossen ist. Siehe Abbildung 11.
1
Abbildung 11. Netzkabel und SCSI-Anschlüsse (2104 Modell TS4)
SCGER230
1
2
3
264
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER911
3. Einige elektrische Schaltkreise können beschädigt werden, wenn
die externen SCSI-Kabel an das 2104 Modell TS4 angeschlossen sind,
während die Erdung überprüft wird. Achten Sie darauf, dass keine externen
SCSI-Kabel an die SCSI-Anschlüsse angeschlossen sind.
Siehe Abbildung 11.
4. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des
2104 Modells TS4 und dem Schutzleiterkontakt der einzelnen Netzkabel aus.
a. Überprüfen Sie die Netzkabel. Achten Sie darauf, dass die Schutzleiterverbindung
in Ordnung ist.
b. Messen Sie den Widerstand der Schutzleiterverbindung zwischen dem
externen Schutzleiterkontakt und dem Gehäuse. Der gemessene Wert darf
maximal 0,1 Ohm betragen.
5. Ist der Durchgang korrekt, ist die Erdungsüberprüfung abgeschlossen.
6. Ist die Erdung nicht korrekt, ziehen Sie die Netzkabel aus den Hauptstromanschlüssen.
a. Überprüfen Sie die einzelnen Netzkabel auf Durchgang.
b. Tauschen Sie die Netzkabel aus, bei denen der Durchgang nicht korrekt ist.
c. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des
2104 Modells TS4 und dem Schutzleiterkontakt des Netzteilanschlusses
der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen aus.
Siehe Abbildung 12.
Achtung:
d. Wird für eine aus Ventilator und Netzteil bestehende Baugruppe kein
Durchgang gemessen, tauschen Sie die Baugruppe aus. (Weitere
Informationen enthält der Abschnitt über aus Ventilator und Netzteil
bestehende Baugruppen im
.) Machen Sie mit "Erdung des 2104 Modells TS4
überprüfen" weiter und führen Sie die Schritte 1 bis 6d aus, um die
Erdungsprüfung erneut auszuführen.
Expandable Storage Plus 2104 Model DS4 and
Model TS4 Service Guide
Abbildung 12. Schutzleiterkontakt (2104 Modell TS4)
1
2
3
SCGER331
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
265
SCGER912
Abbildung 13. 2104 Modell DS4 von der Stromversorgung trennen
Anmerkung:
Anmerkungen:
Liegt kein besonderer Grund vor, das Hostsystem oder das
2104 Modell Ds4 nur dann von der Stromversorgung trennen, wenn
in den auszuführenden Anweisungen dazu aufgefordert wird.
Gehen Sie wie folgt vor, um das 2104 Modell DS4 von der Stromversorgung zu
trennen:
1. Überprüfen Sie zusammen mit dem Kunden, ob alle Operationen zwischen dem
2104 Modell DS4 und dem Hostsystem gestoppt wurden.
2. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator und
Netzteil bestehenden Baugruppen auf "Bereitschaft". Siehe Abbildung 13.
a. Einige 2104 Modelle DS4 haben eine aus Ventilator und Netzteil bestehende
Baugruppe und eine Ventilatorbaugruppe. Die Ventilatorbaugruppe hat keinen
DC-Schalter Ein/Bereitschaft.
b. Bei einer aus Ventilator und Netzteil bestehenden Baugruppe kann die
Anzeige CHK an sein, obwohl der DC-Schalter Ein/Bereitschaft auf
“Bereitschaft" steht.
2104 Modell DS4 oder TS4 von der Stromversorgung trennen
1
2104 Modell DS4 von der Stromversorgung trennen
SCGER206
266
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER913
3. Vorsicht
Im folgenden Schritt werden die Netzkabel entfernt. Ist die Stromversorgungseinheit
des Gehäuses oder die unterbrechungsfreie Stromversorgung
noch eingeschaltet, führen diese Kabel Strom. (1)
1
1
2
2
Anmerkung:
Anmerkungen:
Liegt kein besonderer Grund vor, das Hostsystem oder das 2104
Modell TS4 nur dann von der Stromversorgung trennen, wenn in den
auszuführenden Anweisungen dazu aufgefordert wird.
Gehen Sie wie folgt vor, um das 2104 Modell TS4 von der Stromversorgung zu
trennen:
1. Überprüfen Sie zusammen mit dem Kunden, ob alle Operationen zwischen dem
2104 Modell TS4 und dem Hostsystem gestoppt wurden.
2. Setzen Sie den DC-Schalter Ein/Bereitschaft (er befindet sich an der
Rückseite des 2104 Modells TS4) der einzelnen aus Ventilator und Netzteil
bestehenden Baugruppen auf "Bereitschaft". Siehe Abbildung 14.
a. Einige 2104 Modelle TS4 haben eine aus Ventilator und Netzteil bestehende
Baugruppe und eine Ventilatorbaugruppe. Die Ventilatorbaugruppe hat keinen
DC-Schalter Ein/Bereitschaft.
b. Bei einer aus Ventilator und Netzteil bestehenden Baugruppe kann die
Anzeige CHK an sein, obwohl der DC-Schalter Ein/Bereitschaft auf
“Bereitschaft" steht.
2104 Modell TS4 von der Stromversorgung trennen
Trennen Sie die Netzkabel von der Rückseite des 2104 Modells DS4.
Siehe Abbildung 13.
4. Gehen Sie wie folgt vor, um das 2104 Modell DS4 wieder an die Stromversorgung
anzuschließen:
a. Schließen Sie die Netzkabel wieder an.
b. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator
und Netzteil bestehenden Baugruppen auf "Ein".
Sie können die Motorstartfolge des 2104 Modells DS4 konfigurieren.
Weitere Informationen enthalten die Abschnitte über den Schalter
für automatisches Starten des Laufwerks und für den Modus für
verzögerten Motorstart im
.
Anmerkung:
Expandable Storage Plus 2104
Model DS4 and Model TS4 Service Guide
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
267
SCGER914
1
2
2
3. Trennen Sie die Netzkabel von der Netzsteckdose und von der Rückseite
des 2104 Modells TS4. Siehe Abbildung 14.
4. Gehen Sie wie folgt vor, um das 2104 Modell TS4 wieder an die Stromversorgung
anzuschließen:
a. Schließen Sie die Netzkabel wieder an.
b. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator
und Netzteil bestehenden Baugruppen auf "Ein".
Sie können die Motorstartfolge des 2104 Modells TS4
konfigurieren.Weitere Informationen enthalten die Abschnitte
über den Schalter für automatisches Starten des Laufwerks
und für den Modus für verzögerten Motorstart im
.
Anmerkung:
Expandable Storage Plus 2104 Model DS4 and Model TS4
Service Guide
Abbildung 14. 2104 Modell TS4 von der Stromversorgung trennen
SCGER207
268
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER915
Sicherheitshinweise "Vorsicht"
Vorsicht
Vorsicht
Vorsicht
Vorsicht
Vorsicht
Im folgenden Schritt werden die Netzkabel entfernt. Ist die Stromversorgungseinheit
des Gehäuses oder die unterbrechungsfreie Stromversorgung
noch eingeschaltet, führen diese Kabel Strom. (1)
Kein Netzkabel in die aus Ventilator und Netzteil bestehende Baugruppe
einstecken, bis die Baugruppe vollständig in Ausgangsposition ist und die
Rändelschrauben angezogen sind. (3)
Während eines Gewitters keine Kabel von Bildschirmeinheiten, Druckern,
Telefonapparaten oder Blitzschutzeinheiten lösen. (5)
Bei nicht ordnungsgemäß angeschlossener Netzsteckdose können an
Metallteilen des Gerätes und an angeschlossenen Einheiten gefährliche
Berührungsspannungen auftreten. Für den ordnungsgemäßen Zustand der
Steckdose ist der Betreiber verantwortlich.Während eines Gewitters
keine Kabel von Bildschirmeinheiten, Druckern,Telefonapparaten oder
Blitzschutzeinheiten lösen. (4)
Die Abdeckungen der aus Ventilator und Netzteil bestehenden Baugruppe
nicht öffnen. (2)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
269
SCGER916
Sicherheitshinweise "Achtung"
Achtung:
Diese Einheit könnte zwei Netzkabel haben. Um die Einheit ganz von der
Netzspannung zu trennen, beide Netzkabel lösen. (1)
Achtung:
Abdeckung nicht entfernen. Nicht reparieren. Keine wartungsbedürftigen
Teile. (2)
Achtung:
Zweipolige bzw. Neutralleiter-Sicherung. (3)
Abbildung 15. Sicherheitsaufkleber für Netzkabel
Abbildung 16. Sicherheitsaufkleber der Netzteilabdeckung
Abbildung 17. Sicherheitsaufkleber für Sicherung
SCGER327
SCGER326
SCGER325
270
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER916A
Achtung:
Diese Einheit wiegt über 32 kg. Siehe Handbücher. (4)
Abbildung 18. Gewichtsaufkleber für 2104 Modell DS4
SCGER220
Refer to manuals 32 Kg (70.5 lbs)
Achtung:
Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau
mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in
die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau
entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
Achtung:
Ist ein 2104 Modell DS4 mit 14 Laufwerken und 2 Netzteilen ausgestattet,
überschreitet das Gesamtgewicht 32 kg. Es müssen mindestens 6 Laufwerke
aus dem 2104 Modell DS4 entfernt werden, bevor es angehoben oder in ein
Rack oder 2104 Modell TS4 installiert werden kann, damit das Gesamtgewicht
auf unter 32 kg reduziert und das 2104 Modell DS4 sicher von zwei Personen
gehandhabt werden kann. Andernfalls können Verletzungen auftreten.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
271
Achtung:
Diese Einheit wiegt zwischen 32 und 55 kg. (5)
Achtung:
Wird ein vollständig ausgestattetes Produkt versetzt, installiert oder
transportiert, ist das mitgelieferte Anhebewerkzeug erforderlich.
Zum sicheren Versetzen, Installieren oder Transportieren des Produkts
sind drei Personen erforderlich. Andernfalls können Verletzungen auftreten.
Ist das Anhebewerkzeug nicht verfügbar, muss das Gewicht durch das
Entfernen aller schweren Komponenten (Plattenlaufwerke und Netzteile)
aus dem Produkt auf unter 32 kg reduziert werden, da in diesem Fall nur
zwei Personen erforderlich sind, um das Produkt zu versetzen, zu installieren
oder zu transportieren. (6)
Abbildung 19. Gewichtsaufkleber für 2104 Modell TS4
SCGER219
Achtung:
Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau
mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in
die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau
entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
32 - 55 Kg (70.5 - 121.2 lbs)
SCGER917
272
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER918
Achtung:
Durch das Symbol rechts von "DC", direkt über dem Schalter SW1, wird eine
Bereitschaftsbedingung angegeben. Wird der Schalter SW1 nach rechts direkt
unter das Bereitschaftssymbol gelegt, wird dieWechselstromversorgung der
Einheit nicht ausgeschaltet. (7)
Achtung:
Der Stabilisator muss ordnungsgemäß unten an der Vorderseite des Racks
angebracht werden, um ein Vorkippen beim Ausbau des 2104 Modells DS4
aus dem Rack zu verhindern. Keine Einheiten installieren oder entfernen,
wenn kein Stabilisator am Rack montiert ist. (8)
Achtung:
Der Stabilisator muss ordnungsgemäß unten an der Vorderseite des Racks
angebracht werden, um ein Vorkippen beim Einbau des 2104 Modells DS4
in das Rack zu verhindern. Keine Einheiten installieren oder entfernen,
wenn kein Stabilisator am Rack montiert ist. (9)
Abbildung 20. Position einer Bereitschaftsbedingung
SCGER328
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
273
SCGER919
Achtung:
Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem sich
die Ventilatorbaugruppe befand. (10)
Achtung:
Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem sich
die aus Ventilator und Netzteil bestehende Baugruppe befand. (11)
Achtung:
Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem
sich die SCSI-Schnittstellenkarten-Baugruppe befand. (12)
Achtung:
Netzkabel und Netzstecker dieses Gerätes entsprechen den einschlägigen
Sicherheitsbestimmungen. Dieses Gerät nur an eine Schutzkontaktsteckdose
mit ordnungsgemäßer Schutzleiterverbindung anschließen. (13)
Achtung:
Die Netzsteckdose und die Abdeckplatte der Netzsteckdose nicht mit etwas
anderem als den Prüfspitzen berühren, bevor diese Sicherheitsüberprüfung
beendet ist. (14)
Achtung:
Wird nicht 'Unendlich' gemessen, nicht weiterarbeiten. Die entsprechenden
Korrekturen an der Verkabelung vornehmen, bevor weitergearbeitet wird.
Netzstromkreis erst einschalten, wenn alle genannten Schritte richtig
ausgeführt wurden. (15)
Achtung:
Nicht die Griffe des Ventilators oder der aus Ventilator und Netzteil
bestehenden Baugruppen benutzen, um das 2104 Modell DS4 zu tragen.
Diese Griffe sind nicht dafür vorgesehen, das Gewicht der Einheit zu
tragen. (16)
274
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCGER920
Achtung:
Wird die Baugruppe vollständig in Ausgangsposition gebracht, bewegt sich
der Hebel automatisch zur geschlossenen Position hin. Darauf achten, dass
die Finger nicht zwischen dem Hebel und der Baugruppe eingeklemmt
werden. (17)
Achtung:
Das Hauptnetzkabel muss von der fehlerhaften, aus Ventilator und Netzteil
bestehenden Baugruppe entfernt werden, bevor weitergearbeitet wird. (18)
Achtung:
Keine Hände oderWerkzeuge in den freien Bereich über der aus Ventilator
und Netzteil bestehenden Baugruppe einführen. (19)
Achtung:
Keine Hände oderWerkzeuge in den freien Bereich zwischen den Netzteilbaugruppen
einführen. (20)
Achtung:
Wird das 2104 Modell DS4 in ein Rack-Modell T00 oder T42 installiert,
müssen zudem die Sicherheitshinweise für diese Racks beachtet werden,
bevor mit dem Ausbau des Rahmens begonnen wird. Die Sicherheitshinweise
befinden sich in dem Abschnitt über die Systeminstallation im
, IBM Form SA38-0577,
oder auf der folgendenWeb-Site:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
Achtung:
Die 2104 Modelle DS4 und TS4 sind für die Installation durch Kunden
konzipiert und zertifiziert. Achten Sie darauf, dass das System oder Rack,
in das das 2104 Modell DS4 oder TS4 eingebaut wird, ebenfalls für die
Installation durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht
der Fall, muss das 2104 Modell DS4 oder TS4 vom Kundendienst installiert
werden. (22)
7014 Model T00
and T42 Rack Installation and Service Guide
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
275
SCGER921
Achtung:
Keine Hände oderWerkzeuge in den Bereich einführen, in dem sich die
Kartenbaugruppe befand. (23)
Achtung:
Zum Anheben des 2104 Modells TS4 sind drei Personen erforderlich.
Nicht versuchen,das 2104 Modell TS4 alleine anzuheben. Zum Anheben
sind zwei weitere Personen erforderlich. (24)
Achtung:
Nicht die Griffe des Ventilators oder der aus Ventilator und Netzteil
bestehenden Baugruppen benutzen, um das 2104 Modell TS4 zu tragen.
Diese Griffe sind nicht dafür vorgesehen, das Gewicht der Einheit zu
tragen. (25)
Das 2104 Modell DS4 ist für die Installation durch Kunden konzipiert und
zertifiziert. Achten Sie darauf, dass das System oder Rack, in das das
2104 Modell DS4 eingebaut wird, ebenfalls für die Installation durch
den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall,
muss das 2104 Modell DS4 vom Kundendienst installiert werden. (26)
Achtung:
Achtung:
Das 2104 Modell TS4 ist für die Installation durch Kunden konzipiert und
zertifiziert. Achten Sie darauf, dass das System, in das das 2104 Modell TS4
eingebaut wird, ebenfalls für die Installation durch den Kunden konzipiert
und zertifiziert ist. Ist dies nicht der Fall, muss das 2104 Modell TS4
vom Kundendienst installiert werden. (27)
276
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Greek
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
277
278
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
279
280
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
281
282
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
283
284
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
285
286
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
287
288
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
289
290
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
291
292
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
293
294
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
295
296
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
297
298
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
299
Hebrew
300
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
301
302
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
303
304
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
305
306
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
307
308
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
309
310
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
311
312
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
313
314
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
315
316
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
317
318
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
319
320
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
321
322
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Hungarian
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
323
324
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
325
326
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
327
328
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
329
330
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
331
332
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
333
334
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
335
336
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
337
338
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCHUN916
Óvatosságra intõ figyelmeztetések
VIGYÁZAT:
Ennek az egységnek lehet, hogy két hálózati csatlakozóvezetéke van.
Áramtalanításkor húzza ki mindkét hálózati csatlakozóvezetéket! (1)
VIGYÁZAT:
Ne vegye le a burkolatot, és ne kísérelje meg a javítást!
Nem tartalmaz javítható alkatrészt. (2)
VIGYÁZAT:
Kétpólusú nullavezetõs biztosítás. (3)
15. ábra Hálózati csatlakozóvezeték figyelmeztetõ címke
16. ábra Tápegységburkolat figyelmeztetõ címke
17. ábra Biztosító figyelmeztetõ címkéje
SC000327
SC000326
SC000325
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
339
340
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
341
342
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
343
344
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
345
Japanese
346
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
347
348
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
349
350
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
351
352
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
353
354
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
355
356
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
357
358
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
359
360
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
361
362
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
363
364
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
365
366
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
367
368
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Korean
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
369
370
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
371
372
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
373
374
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
375
376
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
377
378
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
379
380
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
381
382
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
383
384
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
385
386
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
387
388
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
389
390
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
391
Lithuanian
Saugos ir aplinkosaugos informacija
Saugos áspëjimai ir jø vertimas
SCLIT901
Ðiame skyriuje pateikiama tokia informacija:
Saugos áspëjimai ir jø vertimas
Ðio produkto saugos patikrinimo instrukcijos
Aplinkosaugos informacija ir nuostatos.
Ðiame dokumente pateikiami saugos patarimai.
Áspëjimas informuoja jus apie sàlygas arba veiksmus, kurie gali
bûti mirtinai pavojingi arba dël kuriø galite rimtai susiþaloti.
Áspëjimas informuoja jus apie sàlygas arba veiksmus, dël kuriø galite
susiþeisti, bet ne mirtinai ir ne per daug pavojingai.
Praneðimas informuoja apie sàlygas ir veiksmus, dël kuriø gali sugesti
maðinos áranga, neveikti programa bei gali bûti paþeisti duomenys.
Praneðimai yra pateikiami eilës tvarka pagal jø ID, kurie rodomi skliausteliuose
kiekvieno praneðimo pabaigoje. Perþiûrëkite toliau pateikiamus pavojaus ir perspëjimo
praneðimus, susietus su tam tikru ID numeriu.
Pavojus
Áspëjimas
Dëmesio
PAVOJUS
Dël neteisingai prijungto elektros tinklo lizdo metalinës sistemos dalys
arba prie sistemos prijungti prietaisai gali bûti labai aukðtos átampos.
Pirkëjas yra atsakingas ir turi uþtikrinti, kad tinklo lizdas yra teisingai
ájungtas ir áþemintas, ir tokiu bûdu apsaugo nuo elektros ðoko. (1)
ÁSPËJIMAS:
Nekiðkite rankø arba árankiø á tuðèià ertmæventiliatoriø rinkinyje. (1)
392
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Elektros krûvio paðalinimas
Saugos tikrinimo veiksmai
Korpuso patikrinimas
SCLIT902
Dëmesio Kai lieèiate pakeièiamus elementus ir kitas kompiuterio dalis, imkitës
ðiø priemoniø, kad iðvengtumëte paþeidimo statiniu krûviu:
Apribokite savo judesius. Dël judesiø aplink jus gali susikaupti statinis krûvis.
Visuomet bûkite atidûs liesdami kompiuterio dalis. Plokðtes laikykite uþ briaunø
arba metalinës dangos. Niekada nelieskite neapsaugotø grandiniø.
Neleiskite liesti kompiuterio daliø kitiems þmonëms nepasirûpinus áþeminimu.
Prieð ádiegdami naujà dalá modeliams 2104 Model DS4, 2104 Model TS4 arba
valdanèiajai sistemai, palieskite paketo apsauganèio nuo statinës iðkrovos
nedaþytà metalinædalá ir laikykite palietæmaþiausiai 2 sekundes. Tai sumaþins
pakuotës ir jûsø kûno statiná krûvá.
Árenginio dalá iðimkite ið pakuotës ir, jei ámanoma, nepadëdami ámontuokite jà
tiesiai á modelá 2104 DS4 arba TS4. Jei árenginio dalá turite kur nors padëti, atlikite
ðiuos veiksmus:
1. Nuo statiðkumo apsauganèià pakuotæ, kurioje buvo árenginio dalis, padëkite
ant lygaus ir horizontalaus pavirðiaus.
2. Padëkite detalæant nuo statinio krûvio apsauganèios pakuotës. Árenginio
dalies negalima dëti tiesiai ant metalinio pavirðiaus.
Saugos tikrinimo procedûrose nurodoma, kaip tikrinti modelá 2104 DS4 arba Ts4.
Informacijos apie korpuso saugos patikrinimà ieðkokite korpuso montavimo
ir diegimo vadove.
ÁSPËJIMAS:
Modeliai 2104 DS4 ir TS4 yra sukurti taip, kad juos ádiegti galëtø vartotojas,
ir yra sertifikuoti kaip vartotojo sàranka.
Ásitikinkite, kad sistema arba korpusas, á kurá bus diegiamas modelis
2104 DS4 arba TS4, yra pritaikytas ir sertifikuotas vartotojo sàrankai.
Jeigu ne, tuomet modelá 2104 DS4 arba TS4 turi ádiegti techninës
prieþiûros specialistai. (22)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
393
Modelio 2104 DS4 arba TS4 patikrinimas
Iðoriniai árenginio patikrinimai
Vidiniai árenginio patikrinimai
SCLIT903
Atlikite ðiuos vidinius árenginio patikrinimus:
1. Patikrinkite, ar nebuvo kokiø nors, ne ámonës IBM atliktø pakeitimø.
Jei tokiø pakeitimø radote, ið IBM filialo bûstinës paimkite blankà "Non-IBM
Alteration Attachment Survey", Nr. R009. Jà uþpildægràþinkite á filialo
bûstinæ.
2. Patikrinkite, ar árenginio viduje nëra:
Metalo arba kitokiø neðvarumø
Vandens arba kito skysèio pëdsakø
Degimo poþymiø
Dûmø
3. Patikrinkite, ar nëra matomø mechaniniø nesklandumø, pvz., atsilaisvinusiø
komponentø.
4. Patikrinkite, ar matomi laidai ir jungtys nesusidëvëjæ, nesuskilæ,
nesuspausti.
Atlikite ðiuos iðorinius árenginio patikrinimus:
1. Patikrinkite, ar korpusas nepaþeistas (sulauþytas, nesutvirtintas arba
aðtriomis briaunomis).
2. Patikrinkite el. maitinimo laidus ir ásitikinkite, kad izoliacija nesusidëvëjusi ir
nepaþeista.
3. Patikrinkite, ar nëra kokiø nors matomø neáprastø pakitimø. Bûkite labai
atidûs vertindami ðiø pakitimø saugà.
4. Patikrinkite, ar yra visi iðoriniai dangalai, ir, ar jie nesugadinti.
5. Ásitikinkite, kad visi sklàsèiai ir vyriai yra tinkamoje eksploatavimui bûklëje.
6. Patikrinkite, ar nepaþeistas el. maitinimo laidas.
7. Patikrinkite, ar laidai nesusidëvëjæ, nepaþeisti arba neprispausti.
8. Patikrinkite ventiliatoriaus ir el. maitinimo tiekimo árenginius. Patikrinkite, ar
el. maitinimo tiekimo árenginio dangèio sàvarþos (varþtai arba kniedës) neiðimti
arba neatsilaisvinæ.
9. Patikrinkite, ar nepaþeistas iðorinis ryðio laidas.
10. Patikrinkite, ar dangtis neturi aðtriø briaunø, nesugadintas, ar nëra pakitimø,
kurie atidengia vidines árenginio dalis.
11. Paðalinkite rastus nesklandumus.
Norëdami nustatyti, kokios sàlygos yra nesaugios, atlikite ðiuos saugos
patikrinimus.
394
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Saugos korteliø patikrinimai
SCLIT904
1. Patikrinkite, ar kortelë pritvirtinta ant modelio 2104 DS4 arba TS4.
Þr. 1 paveikslëlá.
2. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio pritvirtinta
kortelë su áspëjimu apie laidà. Þr. 2 paveikslëlá.
SCLIT274
1 paveikslëlis. Kortelë, skirta modeliui 2104 DS4 arba Ts4
2 paveikslëlis. Áspëjimo apie laidà kortelë.
SCLIT325
ÁSPËJIMAS:
Ðiame árenginyje gali bûti du laidai. Norëdami nutraukti el. maitinimà,
atjunkite abu laidus. (1)
Atlikite saugos korteliø patikrinimà:
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
395
SCLIT905
3. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio danèio yra privirtinta
kortelë su áspëjimu. Þr. 3 paveikslëlá.
4. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio yra pritvirtinta kortelë
su áspëjimu apie saugiklá. Þr. 4 paveikslëlá.
3 paveikslëlis. El. maitinimo ðaltinio dangèio áspëjimo kortelë
4 paveikslëlis. Lentelë su áspëjimu apie saugiklá
SCLIT326
SCLIT327
ÁSPËJIMAS:
Nenuimkite dangèio, nieko nekeiskite, nëra keièiamø daliø. (2)
ÁSPËJIMAS:
Du poliai/ Neutralë su saugikliu. (3)
396
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT905A
5. Patikrinkite, ar ant modelio 2104 DS4 yra svorio kortelë. Þr. 5 paveikslëlá.
ÁSPËJIMAS:
Ðis árengimas sveria daugiau nei 32 kg (70,5 svaro). Skaitykite vartotojo
instrukcijose. (4)
5 paveikslëlis. Modelio 2104 DS4 svorio kortelë
SCLIT220
Refer to manuals 32 Kg (70.5 lbs)
Dëmesio:
Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes.
Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte.
Antraip prarasite duomenis.
> 18 kg (37 svaro) > 32 kg (70,5 svaro)
PERSPËJIMAS:
Jei modelyje 2104 DS4 yra visi 14 diskø árenginiø ir du el.maitinimo ðaltiniai, visas
jo svoris virðija 32 kg (70.5 svaro). Ið modelio 2104 DS4 reikia paðalinti maþiausiai
ðeðis diskus, jei norite já pakelti arba já ámontuoti á korpusà arba 2104 modelá TS4.
Bendras svoris bus maþesnis nei 32 kg (70,5 svaro) ir modelá 2104 DS4 du þmonës galës
saugiai paimti. Antraip galima susiþeisti.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
397
ÁSPËJIMAS:
Ðis árenginys sveria nuo 32 iki 55 kg (70.5 - 121.2 svarai). (5)
6 paveikslëlis. Modelio 2104 TS4 svorio etiketë.
32 - 55 Kg (70.5 - 121.2 lbs)
SCLIT219
6. Patikrinkite, ar ant modelio 2104 TS4 pritvirtinta svorio etiketë. Þr. 6 paveikslëlá.
DËMESIO:
Pridëto dëþës pakëlimo árankio reikia perstumiant, diegiant ir perkeliant
árenginá, kai jis visai uþpildytas. Norint árenginá perkelti saugiai, reikia trijø
þmoniø. Antraip galite susiþeisti.
Jei këlimo priemonës neturite, árenginio svorá privalote sumaþinti iki 32 kg
arba daugiau nuimdami visas sunkias dalis (diskus ir el. maitinimo ðaltinius).
Tuomet reikës tik dviejø þmoniø árenginiui pastumti, ádiegti arba perkelti. (6)
Dëmesio:
Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø
etiketes.
Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já
iðëmëte. Antraip prarasite duomenis.
> 18 kg (37 svarai) > 32 kg (70.5 svaro)
SCLIT906
398
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT907
7. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 pritvirtinta budëjimo reþimo
etiketë. Þr. 7 paveikslëlá.
ÁSPËJIMAS:
Budëjimo reþimas paþymëtas simboliu, esanèiu deðinëje tiesioginës srovës
pusëje, tiesiai virð jungiklio SW1. Kai SW1 perjungiate á deðinæpozicijà
tiesiai po budëjimo reþimo simboliu, árenginio kintamos srovës el.
Maitinimas neiðjungiamas. (7)
7 paveikslëlis. Budëjimo reþimo etiketë
SCLIT326
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
399
SCLIT908
This section contains instructions for checking the grounding of the 2104 Model
DS4 or Model TS4.
Modelio 2104 DS4 áþeminimo patikrinimas
Norëdami patikrinti modelio 2104 DS4 áþeminimà atlikite ðiuos veiksmus:
3. Kai kurios el. grandinës gali bûti paþeistos, jei iðoriniai SCSI
kabeliai prijungti prie modelio DS4 tuo metu, kai atliekamas áþeminimo
patikrinimas.
Ásitikinkite, kad iðoriniai SCSI kabeliai neájungti á jungtis .
4. Patikrinkite modelio DS4 áþeminimà:
ø akumuliatoriø árenginio antrajame þingsnyje, kiðtukai ir treèiojo laido
kontaktas yra geros bûklës
à ommetro kontaktà prie 2104 modelio DS4 korpuso iðvado ir kità
prie treèiojo laido áþeminimo kontakto. vienà jungiamàjá laidà prie 2104
modelio DS4 korpuso ir kità jungiamàjá laidà prie treèiojo laido áþeminimo kontakto, esanèio
prieð maitinimo jungtis, kurias atjungëte antrajame þingsnyje. Ommetro parodymai turi bûti
0,1 omo ar maþiau tarp áþeminimo kontakto, esanèio maitinimo laido jungtyje, ir 2104
modelio DS4 korpuso.
Dëmesio:
a. Apþiûrëkite ir ásitikinkite, kad maitinimo jungèiø, kurias iðjungëte ið maitimo tiekimo
ar atsargini
.
2104
2104
b. Pajunkite vien
Prijunkite ommetro
8 paveikslëlis. El. maitinimo kabeliai ir SCSI jungtys modelyje 2104 DS4
Modelio 2104 DS4 arba TS4 áeminimo patikrinimas
1. Ásitikinkite, kad visi el. maitinimo ðaltiniai atjungti nuo korpuso. Þr. montavimo diegimo
dokumente, skirtame korpusui arba valdanèiajai sistemai.
2. Ásitikinkite, kad el. maitinimo kabeliai yra ájungti á kiekvienà ventiliatoriaus ir el.
maitinimo ðaltinio deriná. Þr. 8 paveikslëlá. Taip pat patikrinkite, ar el. maitinimo kabeliø
galai atjungti nuo el. maitinimo paskirstymo arba atsarginio akumuliatoriø árenginio.
Þr. montavimo diegimo dokumente, skirtame korpusui arba valdanèiajai sistemai.
2
1
SCLIT210
400
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT909
Jei dirbate su 220 voltø 2104 modeliu DS4, rinkitës 5 þingsná.
Jei dirbate su -48 voltø 2104 modeliu DS4, atlikite 7 þingsná.
5. Patikrinkite modelio DS4 korpuso ir kiekvieno ventiliatoriaus el. maitinimo ðaltinio
árenginio el. maitinimo jungties áþeminimo kontakto tolydumà. Þr. 9 paveikslëlá.
2104
6. Pereikite prie 9 þingsnio.
7. Patikrinkite modelio DS4 korpuso ir kiekvieno pagrindinio el. maitinimo ðaltinio
1 ir 7 áþeminimo kontaktø tolydumà. Þr. 10 paveikslëlá.
2104
8. Pereikite prie 9 þingsnio.
9. Jei kuris nors ventiliatoriaus el. tiekimo árenginys netolydus, pakeiskite já nauju.
Þr. skyriø, kuriame raðoma apie ventiliatoriaus ir el. maitinimo tiekimo árenginius, esantá
vadove
. Dar kartà iðsamiai patikrinkite áþeminimà.
Jei kiekvienas ventiliatoriaus el. maitinimo ðaltinio árenginys tolydus, gali bûti, kad
nesklandumø iðkyla dël el. maitinimo kabelio arba valdanèiosios sistemos áþeminimo.
Expandable Storage Plus 2104 Model DS4 and Model TS4
Service Guide
9 paveikslëlis. Áþeminimo kontaktas (220 V 2104 modelis DS4 )
10 paveikslëlis. Áþeminimo kontaktai (48 V modelis 2104 DS4)
1
SCLIT329
Jei áþeminimas tinkamas, áþeminimo patikrinimas uþbaigtas.
Jei áþeminimas netinkamas, iðjunkite 2104 DS4
maitinimo kabelius ið ventiliatoriaus - el. maitinimo ðaltinio árenginio.
modelio
SCLIT330
6 kontaktas 1 kontaktas
12 kontaktas 7 kontaktas
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
401
SCLIT910
Jei dirbate su 220 voltø modeliu DS4, pereikite prie 10. Jei dirbate su 48 voltø
modeliu DS4, norëdami paðalinti gedimà þr. korpuso arba valdanèiosios
sistemos diegimo dokumentà.
Dar kartà iðsamiai patikrinkite áþeminimà.
2104
2104
10. Patikrinkite, ar tolydus el. maitinimo kabelis.
Jei kabelis netolydus, pakeiskite já nauju, tuomet dar kartà iðsamiai patikrinkite
áþeminimà. Jei el. maitinimo kabelis netolydus, skaitykite korpuso arba
valdanèiosios sistemos diegimo ir montavimo instrukcijas ir paðalinkite gedimà.
Norëdami patikrinti 2104 modelio TS4 áþeminimà, atlikite ðiuos veiksmus:
2. Patikrinkite, ar el. maitinimo kabelio galai neájungti á el. maitinimo lizdus. Jei reikia,
kabelius atjunkite.
Modelio 2104 TS4 áþeminimo patikrinimas
1. Patikrinkite, ar el. maitinimo kabelis ájungtas á kiekvienà el. maitinimo lizdà .
Þr. 11 paveikslëlá.
1
11 paveikslëlis. El. maitinimo kabeliai ir SCSI jungtys (2104 modelis Ts4)
SCLIT230
1
2
3
402
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT911
3. Kai kurios el. grandinës gali bûti paþeistos, jei iðoriniai SCSI kabeliai
jungiami prie modelio TS4 tuo metu, kai atliekamas áþeminimo
patikrinimas. Ásitikinkite, kad nëra jokiø iðoriniø SCSI kabeliø, ájungtø á jungtis. .
Þr. 11 paveikslëlá.
4. Patikrinkite modelio TS4 korpuso ir kiekvieno pagrindinio el. Maitinimo
ðaltinio áþeminimo kontaktø tolydumà.
a. Patikrinkite el. maitinimo kabelius ir paþiûrëkite, ar treèiojo laido áþeminimo
jungtis yra geros bûklës.
b. Norëdami patikrinti, ar treèiojo laido áþeminimo tolydumas tarp iðorinio
áþeminimo kontakto pagrindo ir korpuso pagrindo yra 0,1 omø arba maþiau,
naudokite matavimo prietaisà.
5. Jei áþeminimas tinkamas, jo patikrinimas baigtas.
6. Jei áþeminimas netinkamas, iðjunkite modelio DS4 el. maitinimo kabelius
ið ventiliatoriaus el. maitinimo ðaltinio árenginio.
a. Patikrinkite, ar el. maitinimo kabelis tolydus.
b. Jei kuris nors el. maitinimo kabelis yra paþeistas, pakeiskite nauju.
c. Patikrinkite modelio TS4 korpuso ir kiekvieno ventiliatoriaus el.
maitinimo ðaltinio árenginio el. maitinimo jungties áþeminimo kontakto
tolydumà. Þr. 12 paveikslëlá.
Dëmesio:
2104
2104
2104
2104
d Jei kuris nors ventiliatoriaus el. maitinimo ðaltinio árenginys netolydus,
pakeiskite já nauju. (Þr. vadovo
dalá apie ventiliatoriaus ir el. maitinimo ðaltiniø
árengimus.) Pasirinkite " odelio TS4 áþeminimo patikrinimas" ir atlikite
1-6 veiksmus, kad dar kartà iðsamiai patikrintumëte áþeminimà.
Expandable Storage Plus 2104 Model DS4
and Model TS4 Service Guide
2104 m
12 paveikslëlis. Áþeminimo kontaktas (2104 modelis Ts4)
1
2
3
SCLIT331
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
403
SCLIT912
13 paveikslëlis. El. maitinimo ðaltinio atjungimas nuo 2104 modelio DS4
Neatjunkite srovës ið valdanèiosios sistemos arba modelio DS4, jei tam
nëra konkreèiø prieþasèiø, nebent taip nurodo intrukcijos, kuriomis vadovaujatës.
Atlikite ðiuos veiksmus ir atjunkite el. srovës maitinimà nuo modelio DS4:
1. Pasiteiraukite kliento, ar visos operacijos, vykdomos tarp modelio DS4 ir
valdanèiosios sistemos, yra sustabdytos.
2. Kiekvieno ventiliatoriaus el. maitinimo árenginio tiesioginës srovës jungiklius
ájungta/ budëjimas perjunkite á budëjimo reþimà. Þr. 13 paveikslëlá.
a. Kai kuriuose modeliuose DS4 yra ventiliatoriaus el. maitinimo árenginys ir
ventiliatoriø árenginys. Ventiliatoriø árenginyje nëra tiesioginës srovës jungiklio
ájungta/ budëjimas.
b. Ventiliatoriaus - el. maitinimo árenginyje CHK lemputë gali ðviesti, nors jo tiesioginës
srovës jungiklis ájungta/ budëjimas yra budëjimo reþime.
Pastaba:
Pastabos:
2104
2104
2104
2104
El. srovës atjungimas nuo 2104 modelio DS4 arba TS4
El. srovës atjungimas nuo 2104 modelio DS4
SCLIT206
404
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT913
3. PAVOJUS
Ðiais kabeliais teka srovë, jei korpuso el. maitinimo paskirstymo
árenginys arba nepertraukiamos srovës árenginys yra dar ájungti. (1)
1
1
2
2
2104
2104
2104
2104
2104
Pastaba:
Pastabos:
Neatjunkite el. maitinimo ðaltinio nuo valdanèiosios sistemos arba
modelio TS4, jei tam nëra konkreèiø prieþasèiø, nebent taip nurodo
instrukcijos, kuriomis vadovaujatës.
Atlikite ðiuos veiksmus ir atjunkite el. srovës maitinimà nuo modelio TS4:
1. Pasiteiraukite kliento, ar visos operacijos, vykdomos tarp modelio TS4 ir
valdanèiosios sistemos, yra sustabdytos.
2. Kiekvieno ventiliatoriaus el. maitinimo árenginio tiesioginës srovës jungiklius
ájungta/ budëjimas, esanèius modelio TS4 galinëje dalyje, perjunkite á
budëjimo reþimà. Þr. 14 paveikslëlá.
a. Kai kuriuose modeliuose DS4 yra ventiliatoriaus el. maitinimo
árenginys ir ventiliatoriø árenginys. Ventiliatoriø árenginyje nëra tiesioginës
srovës jungiklio ájungta/ budëjimas.
b. Ventiliatoriaus - el. maitinimo árenginyje CHK lemputë gali ðviesti, nors jo
tiesioginës srovës mygtukas ájungta/ budëjimas yra budëjimo reþime.
El. maitinimo ðaltinio atjungimas nuo modelio 2104 TS4
2104 m
2104
Maitinimo kabelius išjunkite iš 2104 modelio DS4 upakalin s dalies. r.
13 paveiksl .
ë
ëlá
4. odelio DS4 el. maitinimui ájungti atlikite ðiuos veiksmus:
a. Vël ájunkite el. maitinimo kabelius .
b. Kiekviename ventiliatoriaus el. maitinimo árenginyje ájunkite mygtukà
ájungta/ budëjimas.
Galite konfigûruoti modelio DS4 variklio ájungimo eigà.
Jei norite suþinoti daugiau, skaitykite skyrius apie variklio automatinio ájungimo
jungiklá ir uþdelsto ájungimo reþimo jungiklá, esanèius vadove
.
Pastaba:
Expandable
Storage Plus 2104 Model DS4 and Model TS4 Service Guide
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
405
SCLIT914
1
2
2
3. Iðtraukite el. maitinimo kabelius ið el. srovës iðvadø ir ið galinës 2104
modelio 104 TS4 dalies. Þr. 14 paveikslëlá.
4. odelio TS4 el. maitinimui ájungti atlikite ðiuos veiksmus:
a. Vël ájunkite el. maitinimo kabelius .
b. Kiekviename ventiliatoriaus el. maitinimo árenginyje ájunkite mygtukà
ájungta/ budëjimas.
Galite konfigûruoti modelio TS4 variklio ájungimo eigà.
Jei norite suþinoti daugiau, skaitykite skyrius apie variklio automatinio
ájungimo jungiklá ir uþdelsto ájungimo reþimo jungiklá, esanèius vadove
Pastaba:
Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide.
2104 m
2104
14 paveikslëlis. El. maitinimo ðaltinio atjungimas nuo 2104 modelio TS4
SCLIT207
406
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT915
Praneðimai apie pavojø
PAVOJUS
PAVOJUS
PAVOJUS
PAVOJUS
PAVOJUS
Kitu þingsniu atjungsite el. maitinimo kabelius. Ðiais
kabeliais teka srovë, jei korpuso el. maitinimo paskirstymo árenginys arba
nepertraukiamos srovës árenginys yra dar ájungti. (1)
Nejunkite el. maitinimo kabelio á ventiliatoriaus el. maitinimo ðaltinio
árenginá, kol árenginys nëra baigtas montuoti ir verþlës nëra visiðkai
priverþtos. (3)
Perkûnijos metu nereikia atjungti kabeliø nuo ekranø, spausdintuvø,
telefonø arba ryðiø linijø apsaugos stoèiø. (5)
Dël neteisingai prijungto elektros srovës iðvado metalinës sistemos dalys
arba prie sistemos prijungti prietaisai gali bûti labai aukðtos átampos.
Pirkëjas yra atsakingas ir turi uþtikrinti, kad tinklo lizdas yra teisingai
ájungtas ir áþemintas, ir tokiu bûdu apsaugo nuo elektros ðoko. Perkûnijos
metu nereikia atjungti kabeliø nuo ekranø, spausdintuvø, telefonø arba
ryðiø linijø apsaugos stoèiø. (4)
Nebandykite atidaryti ventiliatoriaus el. maitinimo ðaltinio árenginio
dangèiø. (2)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
407
SCLIT916
Áspëjamieji praneðimai
ÁSPËJIMAS:
Ðiame árenginyje gali bûti du laidai. Norëdami nutraukti el. maitinimà, atjunkite
abu laidus. (1)
ÁSPËJIMAS:
Nenuimkite dangèio, netaisykite, nëra keièiamø daliø. (2)
ÁSPËJIMAS:
Du poliai/ Neutralë su saugikliu. (3)
15 paveikslëlis. Áspëjimo apie laidà kortelë
16 paveikslëlis. El. maitinimo ðaltinio dangèio áspëjimo kortelë
17 paveikslëlis. Lentelë su áspëjimu apie saugiklá
SCLIT327
SCLIT326
SCLIT325
408
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT916A
ÁSPËJIMAS:
Ðis árenginys sveria daugiau nei 32 kg (70.5 svaro).
Skaitykite vartotojo instrukcijose. (4)
18 paveikslëlis. 2104 modelio DS4 svorio kortelë
SCLIT220
Refer to manuals 32 Kg (70.5 lbs)
Dëmesio:
Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø
etiketes. Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio
já iðëmëte. Antraip prarasite duomenis.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
PERSPËJIMAS:Jei modelyje 2104 DS4 yra visi 14 diskø árenginiø ir du el.
maitinimo ðaltiniai, visas jo svoris virðija 32 Kg (70.5 svaro).Ið modelio 2104
DS4 reikia paðalinti maþiausiai ðeðis diskus, jei norite já pakelti arba já
ámontuoti á korpusà arba 2104 modelá TS4. Bendras svoris bus maþesnis ne
32 Kg (70.5 lbs) ir modelá 2104 DS4 du þmonës galës saugiai paimti.
Antraip galima susiþeisti.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
409
ÁSPËJIMAS:
Ðis árenginys sveria nuo 32 iki 55 kg (70.5 - 121.2 svarai). (5)
ÁSPËJIMAS:
Pridëto dëþës pakëlimo árankio reikia perstumiant, diegiant ir perkeliant
árenginá, kai jis visai uþpildytas. Norint árenginá perkelti saugiai, reikia trijø
þmoniø. Antraip galite susiþeisti.
Jei këlimo priemonës neturite, árenginio svorá privalote sumaþinti iki 32 kg
arba maþesnio nuimdami visas sunkias dalis (diskus ir el. maitinimo
ðaltinius). Tuomet reikës tik dviejø þmoniø árenginiui pastumti, ádiegti arba
perkelti. (6)
19 paveikslëlis. 2104 modelio TS4 svorio kortelë
SCLIT219
Dëmesio:
Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes.
Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte.
Antraip prarasite duomenis.
> 18 kg (37 svarai) > 32 kg (70.5 svaro)
32 - 55 Kg (70.5 - 121.2 lbs)
SCLIT917
410
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT918
Budëjimo reþimas paþymëtas simboliu, esanèiu deðinëje tiesioginës srovës
pusëje, tiesiai virð jungiklio SW1. Kai SW1 pakeièiate á deðinæpozicijà tiesiai
po budëjimo reþimo simboliu, árenginio kintamosios srovës el. maitinimas
neiðjungiamas. (7)
ÁSPËJIMAS:
Stabilizatorius turi bûti teisingai prijungtas prie apatinës priekinës korpuso
dalies, kad apsaugotø korpusà nuo pasvirimo á prieká tuo metu, kai
modelis DS4 yra iðimamas ið korpuso. Neiðimkite ir nemontuokite jokiø
árenginiø, jei stabilizatorius nëra pritaisytas prie korpuso. (8)
Stabilizatorius turi bûti teisingai prijungtas prie apatinës priekinës korpuso
dalies, kad apsaugotø korpusà nuo pasvirimo á prieká tuo metu, kai
modelis DS4 yra iðimamas ið korpuso. Neiðimkite ir nemontuokite jokiø
árenginiø, jei stabilizatorius nëra pritaisytas prie korpuso. (9)
2104
ÁSPËJIMAS:
2104
20 paveikslëlis. Budëjimo reþimo pozicija
SCLIT328
ÁSPËJIMAS:
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
411
SCLIT919
DËMESIO:
Nekiðkite rankø arba árankiø á ertmæventiliatoriø rinkinyje. (10)
DËMESIO:
Nekiðkite rankø arba árankiø á ertmæventiliatoriø rinkinyje. (11)
DËMESIO:
Nekiðkite rankø arba árankiø á ertmæ, esanèià SCSI sàsajø kortelës árenginyje.
(12)
DËMESIO:
Dël vartotojø saugumo ðiame produkte yra 3-laidþiai el. maitinimo kabeliai ir
kiðtukai. Naudokite ðá el. maitinimo kabelá kartu su teisingai áþemintu
elektros iðvadu, kad iðvengtumëte elektros ðoko. (13)
DËMESIO:
Nelieskite el. maitinimo iðvadø arba el. maitinimo iðvadø skydeliø ne su
tikrinimo liestukais, kol nebaigsite ðio saugos patikrinimo. (14)
DËMESIO:
Jei matavimo parodymas nëra begalybë, negalima nieko daryti. Atlikite
reikiamus elektros laidø sistemos pataisymus prieð tæsdami. Neájunkite
atðakinës grandinës pertraukiklio, kol tinkamai neatliksite ankstesniø
þingsniø. (15)
DËMESIO:
Nenaudokite ventiliatoriaus arba ventiliatoriaus el. maitinimo árenginio
rankenø neðdami modelá DS4. Ðios rankenos gali neiðlaikyti árenginio
svorio. (16)
2104
412
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCLIT920
ÁSPËJIMAS:
Kai árengys paruoðtas, svirtis automatiðkai pajuda link savo uþdaros
padëties. Pasisaugokite, kad jûsø pirðtai nebûtø suspausti tarp svirties ir
árenginio. (17)
Prieð tæsdami toliau, patikrinkite, ar pagrindinio el. maitinimo ðaltinio kabelis
buvo iðjungtas ið sugedusio ventiliatoriaus el. maitinimo. (18)
Nekiðkite rankø arba árankiø á tuðèià ertmævirð ventiliatoriaus el. maitinimo
ðaltinio árenginio. (19)
Nekiðkite rankø arba árankiø á tuðèià ertmætarp el. maitinimo árenginiø. (20)
Jei modelis DS4 yra ámontuotas modelio T00 arba T42 korpuse, prieð
nuimdami karkasà turite atkreipti dëmesá á pastabas apie tø korpusø saugà.
Saugos nurodymus galite rasti skyriuje apie sistemos diegimà vadove
, SA38-0577 arba
ðiame tinklapyje:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
odelis DS4 ir modelis TS4 yra sukurti taip, kad juos galëtø
ámontuoti pirkëjas, ir yra sertifikuoti kaip vartotojo sàranka. Ásitikinkite, kad
sistema arba korpusas, á kurá bus diegiamas modelis DS4 arba TS4, yra
pritaikytas ir sertifikuotas vartotojo sàrankai. Jeigu ne, tuomet
modelá DS4 arba TS4 turi ádiegti techninës prieþiûros specialistai. (22)
7014
Model T00 and T42 Rack Installation and Service Guide
ÁSPËJIMAS:
ÁSPËJIMAS:
ÁSPËJIMAS:
ÁSPËJIMAS:
2104
ÁSPËJIMAS:
2104 m 2104
2104
2104
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
413
SCLIT921
ÁSPËJIMAS:
Nekiðkite rankø arba árankiø á ertmæ, esanèià kortø árenginyje. (23)
odelio TS4 pakelti reikia trijø þmoniø. Nebandykite jo kelti vienas.
Nebandykite jo kelti be kitø dviejø þmoniø pagalbos. (24)
Nenaudokite ventiliatoriaus arba ventiliatoriaus el. maitinimo árenginio
rankenø neðdami modelá TS4. Ðios rankenos gali neiðlaikyti árenginio
svorio. (25)
ÁSPËJIMAS:
2104 m
ÁSPËJIMAS:
2104
odelis DS4 ir modelis TS4 yra sukurti taip, kad juos galëtø
ámontuoti pirkëjas, ir yra sertifikuoti kaip vartotojo sàranka. Patikrinkite ar
sistema arba korpusas, á kurá bus ádiegtas modelis 2104 DS4, yra taip pat
sukurti ir sertifikuoti vartotojo sàrankai. Jeigu ne, tuomet modelá DS4
turi ádiegti techninës prieþiûros specialistai. (26)
2104 m 2104
2104
ÁSPËJIMAS:
ÁSPËJIMAS:
2104 m
2104
2104
odelis TS4 yra sukurtas taip, kad já galëtø ámontuoti vartotojas, ir yra
sertifikuotas kaip vartotojo sàranka. Patikrinkite ar sistema á kurià bus
ádiegtas modelis TS4 yra taip pat sukurta ir sertifikuota vartotojo
sàrankai. Jeigu ne, tuomet modelá TS4 turi ádiegti techninës
prieþiûros specialistai. (27)
414
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Macedonia
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
415
416
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
417
418
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
419
420
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
421
422
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
423
424
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
425
426
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
427
428
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
429
430
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
431
432
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
433
434
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
435
436
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
437
Norwegian
438
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
439
440
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
441
442
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
443
444
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
445
446
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
447
448
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
449
450
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
451
452
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
453
454
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
455
456
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
457
458
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
459
460
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Polish
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
461
462
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
463
464
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
465
466
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
467
468
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
469
470
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
471
472
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
473
474
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
475
476
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
477
478
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
479
480
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
481
482
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
483
Portuguese
Avisos de segurança e ambientais
Avisos de segurança e respectivas traduções
SC000901
Esta secção contém as seguintes informações:
Avisos de segurança e respectivas traduções
Procedimentos de inspecção de segurança para este produto
Avisos e declarações ambientais
Os avisos de segurança estão impressos ao longo deste documento.
Um aviso de Perigo informa-o de condições ou procedimentos que podem resultar
em morte ou ferimentos pessoais graves.
Um aviso de informa-o de condições ou procedimentos que podem resultar
em ferimentos pessoais que não são nem letais nem extremamente perigosos.
Um aviso de informa-o de condições ou procedimentos que podem provocar
danos em má quinas, equipamento, programas ou dados.
Os avisos estão listados por ordem numérica com base nos seus IDs, que são
apresentados entre parentesis, no fim de cada aviso. Consulte os seguintes exemplos
de avisos de perigo e cuidado para a localização do número de ID.
Cuidado
Atenção
PERIGO
Uma tomada eléctrica que não esteja correctamente ligada pode passar
corrente perigosa para as peças metá licas do sistema ou para os produtos
que ligam ao sistema. É da responsabilidade do cliente assegurar que a
tomada está correctamente ligada e com ligação à terra de modo a evitar
um choque eléctrico. (1)
CUIDADO:
Não insira as mãos ou ferramentas na abertura do espaço vazio que continha
o conjunto da ventoinha. (1)
484
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Descarga electroestá tica (ESD)
Procedimentos de inspecção de segurança
Inspeccionar o bastidor
SC000902
Atenção Ao manusear unidades substituíveis localmente (FRUs) e outras peças
de computador, tome estas precauções de modo a evitar danos devidos à está tica:
Limite os movimentos. O movimento pode criar electricidade está tica à sua volta.
Toque sempre com cuidado nas peças do computador. Segure nas placas pelas
extremidades ou pela cobertura metá lica. Nunca toque em circuitos expostos.
Evite que pessoas que não estejam correctamente protegidas com terra toquem
em peças do computador.
Antes de instalar uma nova peça, toque com a embalagem de protecção contra
está tica que contém a peça numa parte metá lica não pintada do 2104 Modelo
DS4, 2104 Modelo TS4, ou sistema central durante pelo menos 2 segundos.
Isto reduz a electricidade está tica da embalagem e do corpo do utilizador.
Remova a peça da embalagem e, se possível, instale-a directamente no 2104
Modelo DS4 ou Modelo TS4 sem a pousar. Se tiver de pousar a peça, siga estes
passos:
1. Coloque a embalagem de protecção contra está tica que continha a peça
numa superfície lisa e nivelada.
2. Coloque a peça numa embalagem de protecção contra está tica. Não
coloque a peça directamente em cima de qualquer superfície metá lica.
Os procedimentos de inspecção de segurança descrevem o modo de inspecção do
2104 Modelo DS4 ou Modelo TS4.
Consulte o manual de instalação do bastidor para obter informações sobre como
efectuar a inspecção no bastidor.
CUIDADO:
O 2104 Modelo DS4 e Modelo TS4 estão concebidos sendo para instalação
pelo cliente e estão certificados como para instalação pelo cliente.
Certifique-se que o sistema ou bastidor no qual o 2104 Modelo DS4 / Modelo
TS4 será instalado também está concebido e certificado para configuração
pelo cliente. Se não, o 2104 Modelo DS4 / Modelo TS4 terá então que ser
instalado por um CE. (22)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
485
Inspeccionar o 2104 Modelo DS4 ou Modelo TS4
Verificações externas na má quina
Verificações internas na má quina
SC000903
Efectue as seguintes verificações internas na má quina:
1. Verifique se existem quaisquer alterações alheias à IBM que possam ter
sido feitas na má quina.
Se existir alguma, peça o formulá rio “Non-IBM Alteration Attachment Survey”,
número R009, ao representante da IBM. Preencha o formulá rio e devolva-o
ao representante.
2. Verifique o estado do interior da má quina e veja se não existem:
Contaminantes metá licos ou outros
Vestigios de á gua ou outro fluido
Fogo
Danos provocados por fumo
3. Verifique se existem problemas mecânicos óbvios, como componentes soltos.
4. Verifique se existem cabos e conectores expostos com desgaste, rachas ou
trincados.
Efectue as seguintes verificações externas na má quina:
1. Verifique danos no chassis (arestas soltas, partidas ou cortantes).
2. Verifique os cabos de alimentação e certifique-se que o isolamento não
está gasto ou danificado.
3. Verifique quaisquer alterações óbvias fora do comum. Faça uma boa
avaliação sobre a segurança de tais alterações.
4. Verifique que todas as tampas exteriores existem e não estão danificadas.
5. Certifique-se que todos os trincos e dobradiças estão no correcto estado de
funcionamento.
6. Verifique se existem danos no cabo de alimentação.
7. Verifique se existem cabos gastos, danificados ou trincados.
8. Inspeccione os conjuntos da vetoínha e fonte de alimentação. Verifique se
os grampos de fixação da cobertura da unidade da fonte de alimentação
(parafusos ou rebites) não foram removidos ou alterados.
9. Verifique se existem danos no cabo de sinal externo.
10. Verifique se existem arestas vivas, danos ou alterações que exponham as
peças internas do dispositivo.
11. Corrija quaisquer problemas que possa encontrar.
Efectue as seguintes verificações de segurança para identificar condições inseguras.
486
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Verificações na etiqueta de segurança
SC000904
Efectue as seguintes verificações na etiqueta de segurança:
1. Certifique-se de que a etiqueta está instalada no 2104 Modelo DS4 ou Modelo TS4.
Consulte a Figura 1.
2. Certifique-se que a etiqueta de cuidado do cabo eléctrico está instalada na
fonte de alimentação 2104 Modelo DS4 ou Modelo TS4. Consulte a Figura 2.
SC000274
Figura 1. Etiqueta do 2104 Modelo DS4 ou Modelo TS4
Figura 2. Etiqueta de cuidado do cabo eléctrico
SC000325
CUIDADO:
Esta unidade pode ter dois cabos eléctricos. Para toda a alimentação,
desligue ambos os cabos eléctricos. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
487
SC000905
3. Verifique se a etiqueta de cuidado para colocar em coberturas se encontra na
da fonte de alimentação do 2104 Modelo DS4 ou do Modelo Ts4.
Consulte a Figura 3.
4. Verifique se a etiqueta de cuidado com fusíveis está colocada na fonte de
alimentação do 2104 Modelo DS4 ou do Modelo TS4. Consulte a Figura 4.
Figura 3. Etiqueta de cuidado com a cobertura da fonte de alimentação
Figura 4. Etiqueta de cuidado com fusíveis
SC000326
SC000327
CUIDADO:
Não retire a cobertura, não tente reparações, as peças não
necessitam de manutenção. (2)
CUIDADO:
Bipolaridade/Fusíveis Neutros. (3)
488
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000905A
5. Verifique se a etiqueta do peso está instalada no 2104 Modelo DS4. Ver Figura 5.
CUIDADO:
Este unidade pesa mais de 32 Kg (70.5 lbs). Consulte os manuais. (4)
Figura 5. Etiqueta de peso do 2104 Modelo DS4
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Atenção:
Caso tenha dados armazenados nas unidades, identifique as mesmas antes de as
remover. Ao voltar a colocar as unidades, instale-as no compartimento de unidade
de onde as removeu. De contrá rio, pode ter como resultado a perda de dados.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
CUIDADO:
Se o 2104 Modelo DS4 estiver completamente carregado com 14 unidades e
duas fontes de alimentação, o seu peso total excede os 32 Kg (70.5 lbs). Tem de
remover pelo menos seis unidades do 2104 Modelo DS4 antes de o levantar ou
instalar num bastidor ou no 2104 Modelo TS4.
Assim, reduz o peso total para menos de 32 Kg (70.5 lbs) e o
. Se não o fizer, corre
o risco de provocar ferimentos.
2104 Modelo
DS4 pode ser manuseado de forma segura por duas pessoas
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
489
CUIDADO:
Esta unidade pesa entre 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5)
Figura 6. 2104 Etiqueta de peso do Modelo TS4
32 - 55 Kg (70.5 - 121.2 lbs)
SC000219
6. Verifique se a etiqueta de peso está instalada no 2104 Modelo TS4.
Consulte a Figura 6.
CUIDADO:
A ferramenta de elevação de embalagens fornecida é necessá ria para mover,
instalar e reposicionar o produto quando totalmente carregado. São
necessá rias três pessoas para mover o produto em segurança. Uma falha
neste procedimento pode provocar lesões.
No caso da ferramenta de elevação não estar prontamente disponível, deve
reduzir o peso para 32 Kg ou menos removendo todos os componentes
pesados (unidades de disco e fontes de alimentação) do produto. Sendo
então necessá rias apenas duas pessoas para mover, instalar e reposicionar
o produto. (6)
Atenção:
Se tiver dados armazenados nas unidades, identifique as unidades com
etiquetas antes de as remover.
Quando substituir as unidades, instale cada uma destas no compartimento
do qual as removeu. Uma falha neste processo pode resultar numa perda
de dados.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
SC000906
490
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000907
7. Verifique se a etiqueta de condição de espera está instalada no 2104 Modelo
DS4 ou Modelo TS4. Consulte a Figura 7.
CUIDADO:
Uma condição de espera é indicada pelo símbolo à direita de DC logo acima
do interruptor, SW1. Quando muda o SW1 para a posição direita logo abaixo
do símbolo de espera, a corrente-ca para a unidade não é cortada. (7)
Figura 7. Etiqueta da condição de espera
SC000326
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
491
SC000908
Esta secção contém instruções para verificar a ligação à terra do 2104 Modelo
DS4 ou Modelo TS4.
Verificar a ligação à terra do 2104 Modelo DS4
Execute os seguintes passos para verificar a ligação à terra do 2104 Modelo DS4:
3. Parte dos circuitos eléctricos pode ser danificada se os cabos
SCSI externos estiverem ligados ao 2104 Modelo DS4 enquanto a verificação
de ligação à terra estiver a ser efectuadas.
Certifique-se de que não existem cabos externos SCSI ligados aos conectores .
4. Verifique a ligação à terra do 2104 Modelo DS4:
unidade de distribuição de alimentação ou de baterias de reserva
Atenção:
a. Nas extremidades do conector das fichas de alimentação que desligou da
no passo 2,
efectue uma inspecção visual e certifique-se de que o pino de ligação à terra
do terceiro fio está em boas condições.
b. Utilize um ohmímetro e ligue um fio ao chassis d 2104 Modelo DS4 e o outro
ao pino de ligação à terra do terceiro fio na face dos conectores de alimentação
que desligou no passo 2. Deve ler 0.1 ohm ou
Modelo DS4
menos entre o pino de ligação
à terra do conector de alimentação e o chassis do 2104 .
Figura 8. Cabos de alimentação e conectores SCSI no 2104 Modelo DS4
Verificar a ligação à terra do 2104 Modelo DS4 ou Modelo TS4
1. Certifique-se de que toda a alimentação é removida do bastidor. Consulte o
documento de instalação do bastidor ou sistema central.
2. Certifique-se de que os cabos de alimentação estão ligados a cada conjunto
de ventoínha e fonte de alimentação. Consulte a Figura 8. Certifique-se também
de que as outras extremidades dos cabos de alimentação estão desligados da
unidade de distribuição de alimentação ou de baterias de reserva do bastidor.
Consulte o documento de instalação do bastidor ou sistema central.
2
1
SC000210
492
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000909
Se estiver a efectuar a manutenção num 2104 Modelo DS4 de 220-volt,
avance para o passe 5.
Se estiver a efectuar a manutenção num 2104 Modelo DS4 de -48 volts,
avance para o passo 7.
5. Verifique a continuidade entre o chassis do 2104 Modelo DS4 e o pino
da terra do conector de alimentação de cada conjunto de ventoínha e
fonte de alimentação. Consulte a Figura 9.
6. Avance para o passo 9.
7. Verifique a continuidade entre o chassis do 2104 Modelo DS4 e os pinos
de terra 1 e 7 de cada conector de alimentação principal. Consulte a
Figura 10.
8. Avance para o passo 9.
9. Se algum dos conjuntos de ventoínha e fonte de alimentação não tiver
continuidade, troque-o por um novo. Consulte a secção sobre conjuntos
de ventoínha e fonte de alimentação em
. Efectue de novo toda a
verificação de ligação à terra.
Se todos os conjuntos de ventoínha e fonte de alimentação tiverem
continuidade, pode ter um problema no cabo de alimentação ou com a
ligação à terra do sistema central.
Expandable Storage Plus 2104
Model DS4 and Model TS4 Service Guide
Figura 9. Pino de ligação à terra (2104 Modelo DS4 de 220 V)
Figura 10. Pinos de ligação à terra (2104 Model DS4 de -48 V)
1
SC000329
Se a ligação à terra estiver correcta, a verificação da mesma está completa.
Se a ligação à terra não estiver correcta, desligue os cabos de alimentação
dos conjuntos de ventoínha e fonte de alimentação do 2104 Modelo DS4.
SC000330
Pino 6 Pino 1
Pino 12 Pino 7
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
493
SC000910
Se estiver a efectuar manutenção no 2104 Modelo DS4 de 220-volts, avance para
o passo 10.
Se estiver a efectuar manutenção no 2104 Modelo DS4 de -48 volts, consulte o
documento de instalação do bastidor ou sistema central para isolar a falha.
Efectue novamente toda a verificação de ligação à terra.
10. Verifique a continuidade do cabo de alimentação.
Se o cabo de alimentação não tiver continuidade, troque-o por um novo, efectue
então novamente toda a verificação de ligação à terra.
Se o cabo de alimentação não tiver continuidade, consulte o documento de instalação
do bastidor ou sistema central para isolar a falha.
Efectue os seguintes passos para verificar a ligação à terra do 2104 Modelo TS4:
2. Certifique-se de que as outras extremidades dos cabos de alimentação não
estão ligados a tomadas eléctricas. Desligue os cabos se necessá rio.
Verificar a ligação à terra do 2104 Modelo TS4
1. Certifique-se de que têm um cabo de alimentação ligado a cada uma das tomadas de
alimentação 1 . Consulte a Figura 11.
Figura 11. Cabos de alimentação e conectores SCSI (2104 Modelo TS4)
SC000230
1
2
3
494
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000911
3. : Alguns circuitos eléctricos podem ser danificados se os cabos SCSI
externos estiverem ligados ao 2104 Modelo TS4 enquanto a verificação da
ligação à terra estiver a ser efectuada.
Certifique-se que não existem cabos SCSI externos ligados nos conectores
SCSI . Consulte a Figura 11.
4. Verifique a continuidade entre o chassis do 2104 Modelo TS4 e o pino de
ligação à terra de cada cabo de alimentação.
a. Verifique os cabos de alimentação e certifique-se de que o conector de
ligação à terra do terceiro fio está em perfeitas condições.
b. Utilize um medidor para verificar que a continuidade da ligação à terra do
terceiro fio é de 0.1 ohm ou inferior entre o pino externo de ligação à terra
e a terra do chassis.
5. Se a continuidade for boa, a verificação de ligação à terra está completa.
6. Se a ligação à terra não estiver correcta, desligue os cabos de alimentação
dos conectores de alimentação da linha principal.
a. Verifique a continuidade de cada um dos cabos de alimentação.
b. Se algum dos cabos de alimentação estiver a falhar, troque-o por um novo.
c. Verifique a continuidade entre o chassis do 2104 Modelo TS4 e o pino de
ligação à terra do conector de alimentação em cada conjunto de
ventoínha e fonte de alimentação. Consulte a Figura 12.
Atenção
d Se o conjunto de ventoínha ou de fonte de alimentação não tiver
continuidade, troque esse conjunto por um novo. (Consulte a secção sobre
conjuntos de ventoínha e fonte de alimentação em
Avance para “Verificar a ligação à terra do 2104 Modelo TS4” e execute os
passos desde o 1 até ao 6d para completar de novo a verificação de ligação
à terra.
Expandable Storage Plus
2104 Model DS4 and Model TS4 Service Guide.)
Figura 12. Pino de ligação à terra (2104 Modelo TS4)
1
2
3
SC000331
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
495
SC000912
Figura 13. Remover a alimentação de um 2104 Modelo DS4
A não ser que tenha uma razão especial para o fazer, não remova a alimentação
do sistema central ou do 2104 Modelo DS4 se as instruções que estiver a seguir
não o indicarem.
Efectue os seguintes passos para remover a alimentação de um 2104 Modelo DS4:
1. Verifique junto do cliente que todas as operações entre o 2104 Modelo DS4 e
o sistema central foram paradas.
2. Coloque o interruptor cc on/standby de cada conjunto de ventoínha e fonte
de alimentação na posição standby. Consulte a Figura 13.
a. Alguns 2104 Modelo DS4 têm um conjunto de ventoínha e fonte de alimentação
e outro de ventoínha.O conjunto de ventoínha não tem interruptor cc on/standby.
b. É possível que um conjunto de ventoínha e fonte de alimentação possa ter a luz
CHK ligada embora o interruptor cc on/standby esteja na posição standby.
Nota:
Notas:
Remover a alimentação do 2104 Modelo DS4 ou Modelo TS4
1
Remover a alimentação de um 2104 Modelo DS4
SC000206
496
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000913
3. PERIGO
No passo seguinte irá remover os cabos de alimentação. Estes cabos
estão alimentados se a unidade de distribuição de alimentação ou fonte
de alimentação ininterruptível (UPS) ainda estiver ligada. (1)
1
1
2
2
Nota:
Notes:
A não ser que tenha uma razão especial para o fazer, não remova a
alimentação do sistema central ou do 2104 Modelo TS4 se as instruções
que estiver a seguir não o indicarem.
Efectue os seguintes passos para remover a alimentação de um 2104 Modelo TS4:
1. Verifique junto do cliente que todas as operações entre o 2104 Modelo TS4 e
o sistema central foram paradas.
2. Na parte posterior do 2104 Modelo TS4, Coloque o interruptor cc on/standby
de cada conjunto de ventoínha e fonte de alimentação na posição standby.
Consulte a Figura 14.
a. Alguns 2104 Modelo TS4 têm um conjunto de ventoínha e fonte de
alimentação e outro de ventoínha.O conjunto de ventoínha não tem
interruptor cc on/standby.
b. É possível que um conjunto de ventoínha e fonte de alimentação possa ter
a luz CHK ligada embora o interruptor cc on/standby esteja na posição
standby.
Remover a alimentação de um 2104 Modelo TS4
Remova os cabos de alimentação da parte posterior do 2104 Modelo DS4.
Consulte a Figura 13.
4. Efectue os seguintes passos para devolver a alimentação ao 2104 Modelo DS4:
a. Volte a instalar os cabos de alimentação .
b. Coloque o interruptor cc on/standby de cada conjunto de ventoínha e
fonte de alimentação para on.
É possível configurar a sequência de arranque do motor do 2104 Modelo
DS4. Para obter mais informações, consulte as secções sobre os interruptores
de arranque automá tico e de modo de arranque retardado do motor em
.
Nota:
Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
497
SC000914
1
2
2
3. Remover os cabos de alimentação da tomada eléctrica e da parte posterior
do 2104 Modelo TS4. Consulte a Figura 14.
4. Execute os seguintes passos para realimentar o 2104 Modelo TS4:
a. Reinstale os cabos de alimentação .
b. Mude o interruptor ca on/standby de cada conjunto de ventoínha e fonte
de alimentação para on.
É possível configurar a sequência de arranque do motor do 2104 Modelo
TS4. Para obter mais informações, consulte as secções sobre o
interruptor de arranque automá tico da unidade e o de modo de arranque
retardado do motor em
Nota:
Expandable Storage Plus 2104 Model DS4 and
Model TS4 Service Guide.
Figura 14. Remover a alimentação de um 2104 Modelo TS4
SC000207
498
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000915
Avisos de perigo
PERIGO
PERIGO
PERIGO
PERIGO
PERIGO
No passo seguinte irá remover os cabos de alimentação. Estes cabos
estão se a unidade de distribuição de alimentação ou fonte de alimentação
ininterruptível (UPS) ainda está ligada. (1)
Não ligue um cabo de alimentação ao conjunto de ventoínha e fonte de
alimentação até que o conjunto esteja perfeitamente encaixado e as
porcas de orelhas devidamente apertadas. (3)
Durante uma trovoada, não desligue os cabos de estações de visualização,
impressoras, telefones ou protectores de estação para linhas de
comunicação. (5)
Uma tomada eléctrica que não esteja correctamente ligada pode passar
uma voltagem perigosa às peças metá licas do sistema ou dispositivos
que estejam ligados ao sistema. É da responsabilidade do cliente
certificar-se de que a tomada está correctamente ligada e com ligação à
terra de modo a prevenir um choque eléctrico. Durante uma trovoada, não
desligue os cabos das estações de visualização, impressoras, telefones
ou protectores de estação para linhas de comunicação. (4)
Não tente abrir as tampas do conjunto de ventoínha e fonte de
alimentação. (2)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
499
SC000916
Avisos de segurança
CUIDADO:
Esta unidade poderá ter dois cabos eléctricos. Para cortar a alimentação,
desligue ambos os cabos. (1)
CUIDADO:
Não retire a cobertura, não tente reparações, as peças não
necessitam de manutenção. (2)
CUIDADO:
Bipolaridade/Fusíveis Neutros. (3)
Figura 15. Etiqueta de cuidado com cabos eléctricos
Figura 16. Etiqueta de cuidado com a cobertura da fonte de alimentação.
Figura 17. Etiqueta de cuidado com fusíveis
SC000327
SC000326
SC000325
500
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000916A
CUIDADO:
Esta unidade pesa mais de 32 Kg (70.5 lbs). Consulte os manuais. (4)
Figura 18. Etiqueta de peso do 2104 Modelo DS4
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Atenção:
Caso tenha dados armazenados nas unidades, identifique as mesmas antes de
as remover. Ao voltar a colocar as unidades, instale-as no compartimento de
unidade de onde as removeu. De contrá rio, pode ter como resultado a perda
de dados.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
CUIDADO:
Se o 2104 Modelo DS4
, 32 Kg (70.5 lbs).
2104 Model DS4
2104 Modelo TS4.
32 Kg (70.5 lbs) e o
estiver completamente carregado com 14 unidades e
duas fontes de alimentação o seu peso total excede os Tem
de remover pelo menos seis unidades do antes de o levantar ou
instalar num bastidor ou no Assim, reduz o peso total para
menos de 2104 Modelo DS4 pode ser manuseado de forma
segura por duas pessoas. Se não o fizer, correo risco de provocar ferimentos.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
501
CUIDADO:
Esta unidade pesa entre 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5)
CUI O:
(6)
DAD
A ferramenta de elevação de embalagens fornecida é necessá ria para mover,
instalar e reposicionar o produto quando totalmente carregado. São
necessá rias três pessoas para mover o produto em segurança. Uma falha
neste procedimento pode provocar lesões.
No caso da ferramenta de elevação não estar prontamente disponível, deve
reduzir o peso para 32 Kg ou menos removendo todos os componentes
pesados (unidades de disco e fontes de alimentação) do produto. Sendo
então necessá rias apenas duas pessoas para mover, instalar e reposicionar
o produto.
Figura 19. Etiqueta de peso 2104 Modelo TS4
SC000219
Atenção:
Se tiver dados armazenados nas unidades, identifique as unidades com etiquetas
antes de as remover.
Quando substituir as unidades, instale cada uma destas no compartimento do
qual as removeu. Uma falha neste processo pode resultar numa perda de dados.
> 18 Kg (37 lbs) > 32 Kg (70.5 lbs)
32 - 55 Kg (70.5 - 121.2 lbs)
SC000917
502
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000918
CUIDADO:
Uma condição de espera é indicada pelo símbolo à direita de CA logo acima
do interruptor, SW1. Quando muda SW1 para a posição da direita logo abaixo
do símbolo standby, a alimentação ca para a unidade não é cortada. (7)
CUIDADO:
O estabilizador tem de estar correctamente montado na parte inferior frontal
do bastidor de modo a evitar que este tombe para a frente aquando da
remoção do 2104 Modelo DS4 do bastidor. Não remova nem instale qualquer
unidade se não houver um estabilizador montado no bastidor. (8)
CUIDADO:
O estabilizador tem de estar correctamente montado na parte inferior frontal
do bastidor de modo a evitar que este tombe para a frente aquando da
instalação do 2104 Model DS4 no bastidor. Não remova nem instale qualquer
unidade se não houver um estabilizador montado no bastidor. (9)
Figura 20. Localização da condição de espera
SC000328
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
503
SC000319
CUIDADO:
Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto
de ventoínha. (10)
CUIDADO:
Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto
de ventoínha e fonte de alimentação. (11)
CUIDADO:
Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto
da placa da interface SCSI. (12)
CUIDADO:
Este produto está equipado com um cabo de alimentação e uma ficha de 3
fios para segurança do utilizador. Utilize este cabo de alimentação
juntamente com uma tomada eléctrica devidamente ligada à terra de modo a
evitar um choque eléctrico. (13)
CUIDADO:
Não toque na tomada de alimentação ou na chapa da mesma com nada que
não sondas de teste antes de completar esta verificação de segurança. (14)
CUIDADO:
Se a leitura não for infinito, não avance. Faça as correcções necessá rias
na ligação dos fios antes de continuar. Não ligue o circuito de derivação CB
até que conclua satisfatoriamente os passos anteriores. (15)
CUIDADO:
Não utilize as pegas dos conjuntos de ventoínha e fonte de alimentação para
transportar o 2104 Modelo DS4. Estas pegas não foram concebidas para
suportar o peso da unidade. (16)
504
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000920
CUI O:
,
.
. (17)
CU :
. (18)
CUI O:
no e
. (19)
CUI O:
no e
. (20)
CUI O:
2104 Model DS4 n Model T00 o T42,
observ
. e
s e
SA38-0577, o Web site:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
CUI O:
2104 Model DS4 Model TS4 nstal d
c e certifi .
no 2104 Model DS4 o
Model TS4 i
. 2104 Model DS4 o Model TS4
instal d CE. (22)
DAD
À medida que encaixa o conjunto a alavanca move-se automaticamente para
a posição de fechada Assegure-se que não entala os dedos entre a alavanca
e o conjunto
IDADO
Certifique-se de que os cabo de alimentação da linha principal foi removido
da ventoínha e fonte de alimentação com falhas antes de continuar
DAD
Não inserir as mãos ou ferramentas spaço vazio por cima do conjunto de
ventoínha e fonte de alimentação
DAD
Não inserir as mãos ou ferramentas spaço vazio entre os conjuntos de
fonte de alimentação
DAD
Se o o estiver instalado um bastidor o u
deve também ar os avisos de segurança para esses bastidores antes de
começar a remover o conjunto da moldura Pode encontrar os avisos d
egurança na secção sobre instalação do sistema m
u no seguinte
DAD
O o e o foram concebidos para serem i a os
pelo liente estão cados como para instalação pelo cliente
Certifique-se de que o sistema ou bastidor qual o o u
o vai ser nstalado também está concebido e certificado como para
instalação pelo cliente Caso contrá rio
7014 Model T00 and
T42 Rack Installation and Service Guide,
, o o u o
terá que ser a o por um
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
505
SC000921
CUIDADO:
Não inserir as mãos ou ferramentas no espaço que continha o conjunto da
placa. (23)
CUIDADO:
São necessá rias três pessoas para levantar o 2104 Modelo TS4. Não tente
levantar o 2104 Modelo TS4 sózinho. Não tente levantá -lo sem a ajuda de
outras duas pessoas. (24)
CUIDADO:
Não utilize as pegas da ventoínha ou conjuntos de ventoínha e fonte de
alimentação para transportar o 2104 Modelo TS4. Estas pegas não foram
concebidas para suportar o peso da unidade. (25)
O 2104 Modelo DS4 está concebido como sendo para instalação pelo cliente
e está certificado como para instalação pelo cliente. Certifique-se de que o
sistema ou bastidor no qual o 2104 Modelo DS4 irá ser instalado também está
concebido e certificado como para instalação pelo cliente. Caso contrá rio,
o 2104 Modelo DS4 terá então que ser instalado por um CE. (26)
CUIDADO:
CUIDADO:
O o está concebido como sendo para instalação pelo liente
e está certificado como para instalação pelo cliente Certifique-se de que o
sistema qual o o irá ser instalado também está concebido
e certificado como para instalação pelo cliente Caso contrá rio o 2104
Modelo TS4 terá então que ser instalado por um
2104 Model TS4 c
.
no 2104 Model TS4
. ,
CE. (27)
506
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Romanian
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
507
508
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
509
510
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
511
512
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
513
514
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
515
516
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
517
518
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
519
520
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
521
522
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
523
524
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
525
526
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
527
528
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
529
Russian
530
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
531
532
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
533
534
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
535
536
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
537
538
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
539
540
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
541
542
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
543
544
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
545
546
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
547
548
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
549
550
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
551
552
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Slovak
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
553
554
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
555
556
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
557
558
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
559
560
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
561
562
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
563
564
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
565
566
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
567
568
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
569
570
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
571
572
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
573
574
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
575
Slovenia
576
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
577
578
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
579
580
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
581
582
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
583
584
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
585
586
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
587
588
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
589
590
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
591
592
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
593
594
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
595
596
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
597
598
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Spanish
Avisos de seguridad y medioambientales
Avisos de seguridad y sus traducciones
SC000901
Esta sección contiene la siguiente información:
Avisos de seguridad y sus traducciones
Procedimientos de inspección de la seguridad para este producto
Avisos y declaraciones medioambientales
Los avisos de seguridad aparecen impresos a lo largo de este documento.
Los avisos de advierten de condiciones o procedimientos que pueden
causar la muerte o daños personales graves.
Los avisos de advierten de condiciones o procedimientos que pueden
causar daños personales no letales ni extremadamente peligrosos.
Los avisos de advierten de condiciones o procedimientos que pueden
causar daños en má quinas, equipos, programas o datos.
Los avisos se listan en orden numérico según sus ID, que se muestran entre
paréntesis al final de cada uno de ellos. Compruebe la ubicación del número de
ID en los siguientes ejemplos de mensajes de peligro y precaución.
Peligro
Precaución
Atención
PELIGRO
Una toma de corriente que no esté correctamente conectada podría dar
lugar un voltaje peligroso en las piezas metá licas del sistema o en los
productos conectados al mismo. Es responsabilidad del cliente
asegurarse de que la toma esté correctamente cableada y conectada
a tierra para evitar descargas eléctricas. (1)
PRECAUCIÓN:
No introduzca las manos ni herramientas en la abertura del espacio vacío que
aloja el conjunto del ventilador. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
599
Descarga electroestá tica (ESD)
Procedimientos de inspección de la seguridad
Inspección del bastidor
SC000902
Atención Cuando manipule unidades sustituibles localmente (FRU) y otras piezas,
tome las siguientes precauciones para evitar daños producidos por la electricidad
está tica:
Limite sus movimientos. El movimiento puede atraer la electricidad está tica a
su alrededor.
Manipule siempre con cuidado las piezas informá ticas. Sostenga las tarjetas por
sus bordes o su cubierta metá lica. No toque nunca circuitos que estén al
descubierto.
Evite que toquen las piezas informá ticas personas que no estén debidamente
en conctacto a tierra.
Antes de instalar una pieza nueva, tome el paquete protector de electricidad
está tica que contiene la pieza y póngalo en contacto con una pieza sin pintar del
sistema 2104 Modelo DS4, 2104 Modelo TS4 o sistema principal durante 2
segundos como mínimo. De este modo se reducirá la electricidad está tica del
paquete y de su cuerpo.
Extraiga la pieza de su paquete y, si es posible, instá lela en el sistema 2104
Modelo DS4 o Modelo TS4 sin depositarla sobre ninguna superficie. Si debe
depositar la pieza, siga los siguientes pasos:
1. Coloque el paquete protector de electricidad está tica que contenía la pieza
sobre una superficie lisa y plana.
2. Deposite la pieza sobre el paquete protector de electricidad está tica. No
la deposite directamente sobre ninguna superficie metá lica.
Los procedimientos de inspección de la seguridad describen el modo de
inspeccionar el sistema 2104 Modelo DS4 o Modelo TS4.
Consulte el manual de instalación del bastidor para obtener información sobre el
modo de realizar una inspección segura del mismo.
PRECAUCIÓN:
El sistema 2104 Modelo DS4 y Modelo TS4 se han diseñado para que los
instale el cliente y está n certificados como configurables por el cliente.
Asegúrese de que el sistema o el bastidor en el que se vaya a instalar el
sistema 2104 Modelo DS4 o Modelo TS4 esté también diseñado y
certificado como configurable por el cliente. De lo contrario, el sistema
2104 Modelo DS4 o Modelo TS4 deberá instalarlo un técnico de sistemas. (22)
600
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Inspección del sistema 2104 Modelo DS4 o Modelo TS4
Comprobaciones externas de la má quina
Comprobaciones internas de la má quina
SC000903
Realice las siguientes comprobaciones internas de la má quina:
1. Compruebe si se ha realizado alguna modificación en la má quina que no
haya hecho IBM.
En caso afirmativo, obtenga el formulario “Non-IBM Alteration Attachment
Survey” (Inspección anexa de alteración no realizada por IBM), número R009,
de la sucursal de IBM. Rellene el formulario y devuélvalo a la sucursal.
2. Compruebe las condiciones del interior de la má quina, por si hubiese:
Metales u otros contaminantes
Indicaciones de agua u otros fluidos
Fuego
Daños por humo
3. Compruebe cualquier problema mecá nico obvio, tal como la pérdida de
componentes.
4. Compruebe el desgaste, roturas o pinzamientos de los cables y conectores
expuestos.
Realice las siguientes comprobaciones externas de la má quinas:
1. Compruebe los daños en el chasis (pérdidas, roturas o bordes cortantes).
2. Compruebe los cables de alimentación y asegúrese de que el material
aislante no esté desgastado o dañado.
3. Compruebe cualquier cambio no está ndar que sea obvio. Realice una
valoración de las implicaciones en la seguridad de cualquiera de tales cambios.
4. Verifique que todas las cubiertas externas estén presentes y no estén dañadas.
5. Compruebe que todos los pestillos y bisagras estén en condiciones de
funcionamiento correctas.
6. Compruebe si existen daños en el cable de alimentación
7. Compruebe si los cables está n desgastados, dañados o pinzados.
8. Inspeccione los conjuntos de ventilador y fuente de alimentación. Compruebe
que el mecanismo de cierre de la cubierta de la unidad de fuente de
alimentación (tornillos o remaches) no se haya desmontado o alterado.
9. Compruebe si existen daños en el cable de señal externo.
10. Compruebe si la cubierta tiene bordes cortantes, daños o alteraciones que
expongan las piezas internas del dispositivo.
11. Corrija cualquier problema que encuentre.
Realice las siguientes comprobaciones de seguridad para identificar las
condiciones en las que no exista seguridad.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
601
Comprobaciones de las etiquetas de seguridad
SC000904
Realice las siguientes comprobaciones de las etiquetas de seguridad:
1. Verifique que la etiqueta esté instalada en el sistema 2104 Modelo DS4
o Modelo TS4. Vea la figura 1.
2. Verifique que la etiqueta de precaución de la línea conductora esté instalada en
la fuente de alimentación del sistema 2104 Modelo DS4 o Modelo TS4.
Vea la figura 2.
SC000274
Figura 1. Etiqueta del sistema 2104 Modelo DS4 o Modelo TS4
Figura 2. Etiqueta de precaución de cable conductor
SC000325
PRECAUCIÓN:
Esta unidad puede tener dos cables conductores. Para eliminar toda la
alimentación, desconecte ambos cables conductores. (1)
602
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000905
3. Verifique que la etiqueta de precaución de la cubierta de la fuente de
alimentación esté instalada en la fuente de alimentación del sistema 2104
Modelo DS4 o Modelo TS4. Vea la figura 3.
4. Verifique que la etiqueta de precaución del fusible esté instalada en la fuente
de alimentación del sistema 2104 Modelo DS4 o Modelo TS4. Vea la figura 4.
Figura 3. Etiqueta de precaución de la cubierta de la fuente de alimentación
Figura 4. Etiqueta de precaución del fusible
SC000326
SC000327
PRECAUCIÓN:
No desmonte la cubierta, no proporcione servicio, no hay piezas
susceptibles de recibir servicio. (2)
PRECAUCIÓN:
Fusible de doble polo/neutro. (3)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
603
SC000905A
5. Verifique que la etiqueta del peso esté instalada en el sistema 2104 Modelo DS4.
Vea la figura 5.
PRECAUCIÓN:
Esta unidad pesa sobre 32 Kg (70,5 lbs). Consulte los manuales. (4)
Figura 5. Etiqueta de peso del sistema 2104 Modelo DS4
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Atención:
Si tiene datos guardados en la unidades, etiquete las unidades antes de extraerlas.
Cuando sustituya las unidades, instale cada una en la misma bahía de unidad
de la que la ha extraído.
El no hacerlo de esta manera, puede producir una pérdida de datos.
> 18 Kg (37 lbs) > 32 Kg (70,5 lbs)
PRECAUCIÓN
2104 Modelo DS4 está completamente cargado con 14 unidades y
dos fuentes de alimentación, su peso total excede de 32 Kg (70,5 lbs).
Debe extraer al menos seis unidades del sistema 2104 Modelo DS4 antes de
levantarlo o instalarlo en un bastidor o sistema 2104 Modelo TS4.
Esta acción reduce el peso total a menos de 32 Kg (70,5 lbs) con lo que dos
personas pueden manejar el sistema
Si no se hace de está forma se pueden producir lesiones físicas.
:
Si un sistema
2104 Modelo DS4.
604
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
PRECAUCIÓN:
Esta unidad pesa entre 32 y 55 Kg (70,5 lbs y 121,2 lbs). (5)
Figura 6. Información de peso del sistema 2104 Modelo TS4
32 - 55 Kg (70.5 - 121.2 lbs)
SC000219
6. Verifique que la etiqueta del peso esté instalada en el sistema 2104 Modelo TS4.
Vea la Figura 6.
PRECAUCIÓN:
Para trasladar, instalar y reubicar el producto cuando esté completamente
lleno se requiere la herramienta para levantar el embalaje que se
proporciona. Para mover el producto con seguridad son necesarias tres
personas. De no hacerse así podrían producirse lesiones personales
)
.
32 Kg o menos extrayendo todos los
componentes pesados (unidades de disco y fuentes de alimentación)
del producto. Entonces se necesitará n solo dos personas para trasladar,
instalar y reubicar el producto. (6
En caso de que la herramienta para levantar el producto no esté disponible
fá cilmente, puede reducir el peso a
Atención:
Si tiene datos almacenados en las unidades, etiquételas antes de extraerlas.
Cuando sustituya las unidades, instale cada una en la misma bahía de
unidad de la que la ha extraído. Si no se hiciese así, se podría producir
una pérdida de datos.
> 18 Kg (37 lbs) > 32 Kg (70,5 lbs) SC000906
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
605
SC000907
7. Verifique que la etiqueta de condición de en espera esté instalada en
el sistema 2104 Modelo DS4 o Modelo TS4. Vea la Figura 7.
PRECAUCIÓN:
Una condición de en espera se indica mediante el símbolo a la derecha
de DC directamente por encima del interruptor, SW1.
Cuando se conmuta SW1 a la posición derecha directamente bajo el
símbolo de en espera, la alimentación de ca para la unidad no se apaga. (7)
Figura 7. Etiqueta de condición de En espera
SC000326
606
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000908
Esta sección contiene instrucciones para comprobar la toma de tierra
del sistema 2104 Modelo DS4 o Modelo TS4.
Comprobación de la toma de tierra del sistema 2104 Modelo DS4
Siga los pasos siguientes para comprobar la toma de tierra del sistema 2104 Modelo
DS4:
3. Algunos circuitos eléctricos se podrían dañar si los cables SCSI
externos se conectan al sistema 2104 Modelo DS4 mientras se está
realizando la comprobación de la toma de tierra.
Asegúrese de que no haya cables SCSI externos enchufados a los
conectores .
4. Compruebe la toma de tierra del sistema 2104 Modelo DS4:
Atención:
a. En los extremos de conexión de los cables de alimentación que ha
desenchufado de la unidad de distribución de alimentación o de la unidad de
respaldo en el paso 2, realice una inspección visual y asegúrese de que la
clavija de toma de tierra de tres cables esté en buenas condiciones.
b. Utilice un medidor de ohmios y conecte un extremo al chasis del sistema 2104
Modelo DS4 y el otro extremo a la clavija de toma de tierra de tres cables en la
cara de los conectores de alimentación que ha desenchufado en el paso 2. La
lectura debería ser igual o inferior a 0,1 ohmios entre la clavija de toma de
tierra del conector de alimentación y el chasis del sistema 2104 Modelo DS4.
Figura 8. Cables de alimentación y conectores SCSI en el sistema 2104 Modelo DS4
Comprobación de la toma de tierra del sistema 2104 Modelo DS4 o
Modelo TS4
1. Asegúrese de que toda la alimentación eléctrica se haya desconectado del
bastidor. Vea el bastidor o sistema principal en el documento de instalación.
2. Asegúrese de que los cables de alimentación estén enchufados en cada
conjunto de ventilador-fuente de alimentación. Vea la figura 8. Asegúrese también
de que los otros extremos de los cables de alimentación en el bastidor estén
desconectados de la unidad de distribución de alimentación o de la unidad de
respaldo por batería.
Vea el bastidor o sistema principal en el documento de instalación.
2
1
SC000210
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
607
SC000909
Si está proporcionando servicio técnico a un sistema 2104 Modelo DS4 de
220 voltios, vaya al paso 5.
Si está proporcionando servicio técnico a un sistema 2104 Modelo DS4 de
-48 voltios, vaya al paso 7.
5. Compruebe la continuidad entre el chasis del sistema 2104 Modelo DS4 y
la clavija de toma de tierra del conector de alimentación de cada conjunto
de ventilador-fuente de alimentación. Vea la figura 9.
6. Vaya al paso 9.
7. Compruebe la continuidad entre el chasis del sistema 2104 Modelo DS4 y las
clavijas de toma de tierra 1 y 7 de cada conector de línea principal. Vea la
figura 10.
8. Vaya al paso 9.
9. Si alguno de los conjuntos de ventilador-fuente de alimentación no tiene
continuidad, cambie dicho conjunto por uno nuevo. Vea la sección sobre
conjuntos de ventilador y fuente de alimentación en el manual
. Realice de nuevo
la comprobación completa de la toma de tierra. Si cada conjunto de ventiladorfuente
de alimentación tienen continuidad, es posible que exista un problema con
el cable de alimentación o con la toma de tierra del sistema principal.
Expandable
Storage Plus 2104 Model DS4 and Model TS4 Service Guide
Figura 9. Clavija de toma de tierra (Sistema 2104 Modelo DS4 de 220 voltios)
Figura 10. Clavijas de toma de tierra (Sistema 2104 Modelo DS4 de -48 voltios)
1
SC000329
Si la toma de tierra es correcta, la comprobación de la misma habrá finalizado.
Si la conexión a tierra no es correcta, desenchufe los cables de alimentación
de los conjuntos de ventilador-fuente de alimentación del sistema 2104
Modelo DS4.
SC000330
Clavija 6
Clavija 1
Clavija 7
Clavija12
608
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000910
Si está proporcionando servicio a un sistema 2104 Modelo DS4 de 220
voltios, vaya al paso 10. Si está proporcionando servicio a un sistema
2104 Modelo DS4 de -48 voltios, vea la documentación de instalación del
del bastidor o sistema principal para determinar la anomalía. Realice la
comprobación completa de la conexión de tierra de nuevo.
10. Compruebe la continuidad del cable de alimentación. Si el cable no tiene
continuidad, cá mbielo por uno nuevo y, a continuación, realice la
comprobación completa de la conexión de tierra de nuevo. Si el cable tiene
continuidad, vea la documentación de instalación para el bastidor o sistema
principal para determinar la anomalía.
Para comprobar la toma de tierra del sistema 2104 Modelo Ts4, realice los
pasos siguientes:
2. Asegúrese de que los otros extremos de los cables de alimentación no estén
conectados a las tomas de alimentación eléctrica. Desenchufe los cables
si fuese necesario.
Comprobación de la toma de tierra del sistema 2104 Modelo TS4
1. Asegúrese de que haya conectado un cable de alimentación en cada toma
de alimentación 1 . Vea la figura 11.
Figura 11. Cables de alimentación y conectores SCSI (Sistema 2104 Modelo TS4)
SC000230
1
2
3
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
609
SC000911
3. : Algunos circuitos eléctricos se podrían dañar si los cables SCSI
externos se conectan al sistema 2104 Modelo DS4 mientras se está
realizando la comprobación de la toma de tierra.
Asegúrese de que no haya cables SCSI externos conectados a los
conectores . Vea la figura 11.
4. Compruebe la continuidad entre el chasis del sistema 2104 Modelo TS4
y la clavija de toma de tierra de cada cable de alimentación.
a. Compruebe los cables de alimentación y asegúrese de que el conector
de tierra del tercer cable se encuentre en buenas condiciones.
b. Utilice un medidor para comprobar que la continuidad de la toma de
tierra del tercer cable es de 0,1 ohmios o menor entre la clavija de toma
de tierra externa y la toma de tierra del chasis.
5. Si la continuidad es buena, la comprobación de la toma de tierra se ha
completado.
6. Si la toma de tierra no es correcta, desenchufe los cables de alimentación
de los conectores de la línea principal de alimentación.
a. Compruebe la continuidad de cada cable de alimentación.
b. Si el cable de alimentación falla, cá mbielo por uno nuevo.
c. Compruebe la continuidad entre el chasis del sistema 2104 Modelo TS4
y la clavija de toma de tierra del conector de alimentación en cada
conjunto de ventilador y fuente de alimentación. Vea la figura 12.
Atención
d Si el conjunto de ventilador y fuente de alimentación no tiene continuidad,
cambie el conjunto por uno nuevo. (Vea la sección sobre conjuntos de
ventilador y fuente de alimentación en el manual
Vaya a “Comprobación de la toma de tierra del sistema 2104 Modelo TS4”
y realice los pasos 1 al 6d para completar la comprobación de la toma
de tierra de nuevo.
Expandable Storage
Plus 2104 Model DS4 and Model TS4 Service Guide.)
Figura 12. Clavija de toma de tierra (Sistema 2104 Modelo TS4)
1
2
3
SC000331
610
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000912
Figura 13. Eliminación de la alimentación de un sistema 2104 Modelo DS4
Salvo que tenga una razón particular para hacerlo, no elimine la alimentación
del sistema principal o del sistema 2104 Modelo DS4 a menos que las
instrucciones que está siguiendo así se lo indiquen.
Siga los pasos siguientes para eliminar la alimentación de un sistema 2104
Modelo DS4:
1. Compruebe con el cliente que todas las operaciones entre el sistema 2104
Modelo DS4 y el sistema principal se hayan detenido.
2. Posicione el interruptor de encendido/espera de cc de cada conjunto de
ventilador-fuente de alimentación en posición de espera. Vea la figura 13.
a. Algunos sistemas 2104 Modelo DS4 tienen un conjunto de ventilador-fuente
de alimentación y un conjunto de ventilador. El conjunto de ventilador no tiene
interruptor de encendido/espera de cc.
b. Un conjunto de ventilador-fuente de alimentación puede tener su indicador de
comprobación (CHK) encendido aunque su interruptor de encendido/espera de
cc esté en posición de espera.
Nota:
Notas:
Eliminación de la alimentación del sistema 2104 Modelo DS4 o
Modelo TS4
1
Eliminación de la alimentación de un sistema 2104 Modelo DS4
SC000206
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
611
SC000913
3. PELIGRO
En el paso siguiente se van a extraer los cables de alimentación. Estos
cables tienen corriente si la unidad de distribución de alimentación del
bastidor a la unidad de fuente de alimentación ininterrumpida (UPS)
está n aún encendidas. (1)
1
1
2
2
Note:
Notas:
Salvo que tenga una razón particular para hacerlo, no elimine la alimentación
del sistema principal o del sistema 2104 Modelo TS4 a menos que las
instrucciones que está siguiendo le indiquen que lo haga.
Siga los pasos siguientes para eliminar la alimentación de un sistema 2104
Modelo TS4:
1. Compruebe con el cliente que todas las operaciones entre el sistema 2104
Modelo TS4 y el sistema principal se hayan detenido.
2. En la parte posterior del sistema 2104 Modelo TS4, posicione el interruptor de
encendido/espera de cc de cada conjunto de ventilador-fuente de
alimentación en posición de espera. Vea la figura 14.
a. Algunos sistemas 2104 Modelo TS4 tienen un conjunto de ventiladorfuente
de alimentación y un conjunto de ventilador. El conjunto de
ventilador no tiene interruptor de encendido/espera de cc.
b. Un conjunto de ventilador-fuente de alimentación puede tener su indicador
de comprobación (CHK) encendido aunque su interruptor de encendido/
espera de cc esté en posición de espera.
Eliminación de la alimentación de un sistema 2104 Modelo TS4
Extraiga los cables de alimentación de la parte posterior del sistema 2104
Modelo DS4. Vea la figura 13.
4. Siga los pasos siguientes para devolver la alimentación al sistema 2104
Modelo DS4:
a. Vuelva a instalar los cables de alimentación .
b. Posicione el interruptor de encendido/espera de cc de cada conjunto de
ventilador-fuente de alimentación en posición de encendido.
Puede configurar la secuenciación de arranque del motor del sistema
2104 Modelo DS4.
Para obtener má s información, vea las secciones sobre el conmutador de
arranque automá tico de la unidad y el conmutador de modalidad de arranque
del motor de retardo en el manual
.
Nota:
Expandable Storage Plus 2104 Model DS4
and Model TS4 Service Guide
612
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000914
1
2
2
3. Extraiga los cables de alimentación de la toma de alimentación eléctrica
y de la parte posterior del sistema 2104 Modelo TS4. Vea la figura 14.
4. Realice los pasos siguientes para devolver la alimentación al sistema
2104 Modelo TS4:
a. Vuelva a instalar los cables de alimentación .
b. Posicione el interruptor de encendido/espera de cc de cada conjunto
de ventilador y fuente de alimentación en la posición de encendido.
Es posible configurar la secuenciación del motor de arranque del
2104 Modelo TS4. Para obtener má s información, vea las secciones
sobre el interruptor de arranque automá tico de la unidad y el interruptor
de modo de retardo de arranque del motor en el manual
Nota:
Expandable
Storage Plus 2104 Model DS4 and Model TS4 Service Guide.
Figura 14. Eliminación de la alimentación de un sistema 2104 Modelo TS4
SC000207
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
613
SC000915
Avisos de peligro
PELIGRO
PELIGRO
PELIGRO
PELIGRO
PELIGRO
En el paso siguiente se van a extraer los cables de alimentación. Estos
cables pueden tener corriente si la unidad de distribución de alimentación
del bastidor o la unidad de fuente de alimentación ininterrumpida (UPS)
aún se encuentran encendidas. (1)
No enchufe ningún cable de alimentación en el conjunto de ventilador y
fuente de alimentación hasta que el conjunto esté completamente en su
lugar y sus tornillos estén completamente apretados. (3)
Durante una tormenta con aparato eléctrico, no desconecte los cables de
estaciones de pantalla, impresoras, teléfonos o protectores de estación
para líneas de comunicaciones. (5)
Una toma de alimentación eléctrica que no esté correctamente cableada
podría dar lugar a un voltaje peligroso en las piezas metá licas del sistema
o en los dispositivos que estén conectados al mismo. Es responsabilidad
del cliente asegurarse de que la toma de alimentación esté correctamente
cableada y conectada a tierra para evitar una descarga eléctrica.
Durante una tormenta con aparato eléctrico, no desconecte los cables de
estaciones de pantalla, impresoras, teléfonos o protectores de estación
para líneas de comunicaciones. (4)
No intente abrir las cubiertas del conjunto de ventilador y fuente de
alimentación. (2)
614
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000916
Avisos de precaución
PRECAUCIÓN:
Es posible que esta unidad tenga dos cables de línea. Para suprimir toda
la alimentación, desconecte ambos cables de línea. (1)
PRECAUCIÓN:
No desmonte la cubierta, no precisa servicio, no contiene piezas que se
puedan reparar. (2)
PRECAUCIÓN:
Polo doble/Fusible neutro. (3)
Figura 15. Etiqueta de precaución del cable de línea
Figura 16. Etiqueta de precaución de la cubierta de la fuente de alimentación
Figura 17. Etiqueta de precaución del fusible
SC000327
SC000326
SC000325
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
615
SC000916A
PRECAUCIÓN
32 Kg (70,5 lbs). Consulte los manuales.
:
Esta unidad pesa má s de (4)
Figura 18. Etiqueta de peso del sistema 2104 Modelo DS4
SC000220
Refer to manuals 32 Kg (70.5 lbs)
Atención:
Si existen datos almacenados en las unidades, etiquételas antes de extraerlas.
Cuando sustituya las unidades, instale cada una de ellas en la misma bahía
de unidad de la que la ha extraído. De no hacerlo de así, se puede producir
una pérdida de datos.
> 18 Kg (37 lbs) > 32 Kg (70,5 lbs)
PRECAUCIÓN
2104 Modelo DS4 con 14 unidades y dos
fuentes de alimentación, su peso total superará los 32 Kg (70,5 lbs). Deberá
extraer seis unidades como mínimo del sistema 2104 Modelo DS4 antes de
levantarlo o instalarlo en un bastidor o sistema 2104 Modelo TS4. Esto reducirá
el peso total a menos de 32 Kg (70,5 lbs) y hará que el sistema
. No hacerlo así
puede producir lesiones.
:
Si se carga por completo un sistema
2104 Modelo
DS4 pueda ser manejado con seguridad por dos personas
616
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
PRECAUCIÓN:
Esta unidad pesa entre 32 Kg - 55 Kg (70,5 lbs - 121,2 lbs). (5)
PRECAUCIÓN
es necesaria
para transportar, instalar y reubicar el producto cuando esté completamente
lleno. Para transportar el producto con seguridad será n necesarias tres
personas. No hacerlo así podría producir lesiones.
En el caso de que la herramienta de levantamiento no esté disponible, deberá
reducir el peso a 32 Kg o menos extrayendo todos los componentes pesados
(unidades de disco duro y fuentes de alimentación) del producto. Sólo se
necesitará n dos personas para transportar, instalar y reubicar el
producto.
:
La herramienta de levantamiento del embalaje proporcionada
(6)
Figura 19. Etiqueta del peso del sistema 2104 Modelo TS4
SC000219
Atención:
Si existen datos almacenados en las unidades, etiquételas antes de extraerlas.
Cuando sustituya las unidades, instale cada una de ellas en la misma bahía de
unidad de donde las ha extraido. No hacerlo así podría resultar en una pérdida de
datos.
> 18 Kg (37 lbs) > 32 Kg (70,5 lbs)
32 - 55 Kg (70.5 - 121.2 lbs)
SC000917
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
617
SC000918
PRECAUCIÓN:
Una condición de en espera viene indicada por el símbolo a la derecha de la
CC justo encima del interruptor, SW1. Cuando conmute SW1 a la posición
de la derecha justo debajo del símbolo de en espera, la alimentación de CA
de la unidad no estará apagada. (7)
PRECAUCIÓN:
El estabilizador debe estar correctamente sujeto a la parte frontal inferior del
bastidor para evitar que éste se vuelque hacia delante cuando se extraiga el
sistema 2104 Modelo DS4 del bastidor. No extraiga ni instale ninguna unidad
si no existe un estabilizador sujeto al bastidor. (8)
PRECAUCIÓN:
El estabilizador debe estar correctamente sujeto a la parte frontal inferior del
bastidor para evitar que éste se vuelque hacia delante cuando se instale el
sistema 2104 Modelo DS4 en el bastidor. No extraiga ni instale ninguna unidad
si no existe un estabilizador sujeto al bastidor. (9)
Figura 20. Ubicación de la condición de en espera
SC000328
618
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000319
PRECAUCIÓN:
No introduzca las manos ni herramientas en el espacio vacío que albergaba
al conjunto del ventilador. (10)
PRECAUCIÓN:
No introduzca las manos ni herramientas en el espacio vacío que albergaba
al conjunto del ventilador y de la fuente de alimentación. (11)
PRECAUCIÓN:
No introduzca las manos ni herramientas en el espacio vacío que albergaba
al conjunto de la tarjeta de interfaz SCSI. (12)
PRECAUCIÓN:
Este producto está equipado con un enchufe y un cable de alimentación de
3 hilos como medida de seguridad para el usuario. Utilice este cable de
alimentación junto con una toma eléctrica correctamente conectada a tierra
para evitar descargas eléctricas. (13)
PRECAUCIÓN:
No toque la toma de alimentación ni la toma de alimentación eléctrica con
nada que no sean puntas de prueba antes de completar esta comprobación
de seguridad. (14)
PRECAUCIÓN:
Si la lectura no es infinito, no continúe. Efectúe las correcciones necesarias
en los cables antes de continuar. No conmute la ramificación del circuito CB
hasta que finalice satisfactoriamente los pasos anteriores. (15)
PRECAUCIÓN:
No utilice las asas del ventilador o de los conjuntos del ventilador y de la
fuente de alimentación para transportar el sistema 2104 Modelo Ds4.
Estas asas no está n diseñadas para soportar el peso de la unidad. (16)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
619
SC000920
PRECAUCIÓN:
Cuando empuje la unidad para colocarla totalmente en su sitio, la palanca
se desplazará automá ticamente hacia su posición de cierre. Procure no
pellizcarse los dedos entre la palanca y la unidad. (17)
Asegúrese de que el cable de alimentación de la línea principal se haya
extraído del conjunto de ventilador y unidad de alimentación defectuoso
antes de continuar. (18)
No introduzca las manos ni herramientas en el espacio vacío situado sobre
la unidad de alimentación y ventilación. (19)
No introduzca las manos ni herramientas en el espacio vacío situado entre
entre las unidades de alimentación. (20)
Si el sistema 2104 Modelo DS4 se instala en el bastidor de un sistema Modelo
T00 o T42, deberá tener en cuenta también los avisos de seguridad de dicho
bastidor antes de proceder a extraer el armazón de la unidad. Encontrará
los avisos de seguridad en la sección dedicada a la instalación del sistema
de la publicación
SA38-0577, o en el siguiente sitio Web:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
Los sistemas 2104 Modelo DS4 y Modelo TS4 se han diseñado para que los
instale el cliente y está n certificados como configurables por el cliente.
Asegúrese de que el sistema o el bastidor en el que se vaya a instalar el
sistema 2104 Modelo DS4 o Modelo TS4 esté también diseñado y certificado
como configurable por el cliente. De lo contrario, el sistema 2104 Modelo DS4
o Modelo TS4 deberá instalarlo un técnico de sistemas. (22)
PRECAUCIÓN:
PRECAUCIÓN:
PRECAUCIÓN:
PRECAUCIÓN:
PRECAUCIÓN:
7014 Model T00 and T42 Rack Installation and Service Guide,
620
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SC000921
PRECAUCIÓN:
No introduzca las manos ni herramientas en el espacio que albergaba el
conjunto de la tarjeta. (23)
PRECAUCIÓN:
Se necesitan tres personas para levantar el sistema 2104 Modelo TS4. No
intente levantar el sistema 2104 Modelo TS4 usted solo. No intente levartarlo
sin ayuda de otras dos personas. (24)
PRECAUCIÓN:
No utilice las asas del ventilador o de los conjuntos del ventilador y de la
fuente de alimentación para transportar el sistema 2104 Modelo TS4. Estas
asas no está n diseñadas para soportar el peso de la unidad. (25)
El sistema 2104 Modelo DS4 está diseñado para que lo instale el cliente y
está certificado como configurado por el cliente. Asegúrese de que el sistema
o bastidor en el que se va a instalar el sistema 2104 Modelo DS4 también
esté diseñado y certificado para la configuración del cliente. Si no lo está ,
un CE deberá instalar el sistema 2104 Modelo DS4. (26)
PRECAUCIÓN:
PRECAUCIÓN:
El sistema 2104 Modelo TS4 está diseñado para que lo instale el cliente y
está certificado como configurado por el cliente. Asegúrese de que el sistema
en el que se va a instalar el sistema 2104 Modelo TS4 también esté diseñado
y certificado para la configuración del cliente. Si no lo está , un CE deberá
instalar el sistema 2104 Modelo TS4. (27)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
621
Swedish
Upplysningar om säkerhet och miljö
Säkerhetsanvisningar med översättning
SCSVE901
Det här avsnittet innehåller följande information:
Säkerhetsanvisningar med översättning
Procedurer för säkerhetsinspektion av produkten
Miljöanvisningar och annan miljöinformation
Det här dokumentet innehåller säkerhetsinformation.
Meddelanden som föregås av ordet Fara varnar för tillstånd eller procedurer som kan orsaka
död eller allvarlig personskada.
Meddelanden som föregås av ordet Varning varnar för tillstånd eller procedurer som kan orsaka
personskada som varken är dödlig eller extremt hälsovådlig.
Meddelanden som föregås av ordet Viktigt varnar för tillstånd eller procedurer som kan orsaka
skada på datorer, utrustning, program eller data.
Meddelandena är förtecknade i nummerordning efter ID, som står inom parentes efter varje
meddelande. Följande exempel visar var ID-numren är placerade i meddelanden av typen
Fara och Varning.
FARA
Ett eluttag som inte är korrekt installerat kan göra att det går livsfarlig
spänning genom metalldelar i systemet eller produkter som är anslutna
till systemet. Det är kundens ansvar att se till att alla eluttag är korrekt
installerade och jordade. (1)
VARNING:
Stoppa inte in händerna eller verktyg i öppningen till det tomma utrymmet där
fläkten var placerad. (1)
622
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Statisk elektricitet
Procedurer för säkerhetsinspektion
Inspektera racket
SCSVE902
Viktigt: När du hanterar FRU-enheter (Field Replaceable Units) och andra
datordelar ska du vidta följande försiktighetsåtgärder för att undvika skador av
statisk elektricitet:
Rör dig så lite som möjligt. När du rör dig kan statisk elektricitet byggas upp
omkring dig.
Var försiktig när du rör vid datorns delar. Håll i kortens kanter eller
metallhölje. Rör aldrig vid frilagda kretsar.
Förhindra att personer som inte är jordade rör vid datorns delar.
Innan du installerar en ny del håller du dess antistatiska förpackning mot en
omålad metalldel i 2104-enheten eller värdsystemet i minst 2 sekunder.
På så sätt minskar du den statiska elektriciteten i förpackningen och i din egen
kropp.
Ta ut delen ur förpackningen och installera den omedelbart i 2104-enheten
(modell DS4 eller TS4) utan att lägga ifrån dig delen. Om du måste lägga ifrån dig
delen gör du så här:
1. Placera den antistatiska förpackningen på ett plant och jämnt underlag.
2. Lägg delen ovanpå den antistatiska förpackningen. Lägg inte delen direkt
på en metallyta.
I anvisningarna för säkerhetsinspektionen finns en beskrivning av hur du kontrollerar
2104 modell DS4 eller TS4.
I installationshandboken till ditt rack finns mer information om hur du gör en
säkerhetsinspektion.
VARNING:
Produkten 2104, modell DS4 och TS4, är utformad för att installeras av
kunden och är certifierad som kundinstallerad. Kontrollera att den
enhet (dator eller rack) där 2104 modell DS4 eller TS4 ska installeras också är
utformad och certifierad för att installeras av kunden. I annat fall måste
2104 modell DS4 eller TS4 installeras av en servicetekniker. (22)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
623
Inspektera 2104 - modell DS4 eller modell TS4
Externa maskinkontroller
Interna maskinkontroller
SCSVE903
Utför följande kontroller av maskinens inre:
1. Kontrollera att inte några andra ändringar än IBMs egna har gjorts på
maskinen. Om du hittar sådana ändringar skaffar du formuläret "Non-IBM
Alteration Attachment Survey", nr R009, från IBM-återförsäljaren. Fyll i
formuläret och lämna det till IBM-återförsäljaren.
2. Kontrollera maskinens inre skick och se efter om det finns tecken på
metallföroreningar eller annan nedsmutsning
vatten eller annan vätska
brand
rökskada
3. Se efter om det finns uppenbara mekaniska fel, t.ex. lösa komponenter.
4. Kontrollera att det inte finns förslitningar, sprickor eller klämskador på
exponerade kablar och kontakter.
Utför följande kontroller av maskinens utsida:
1. Kontrollera att det inte finns skador (lösa, skadade eller vassa kanter) på
chassit.
2. Kontrollera nätkablarna och se efter att isoleringen inte är sliten eller skadad.
3. Se efter att inga ändringar gjorts som avviker från standard. Använd sunt
förnuft när du bedömer säkerheten hos sådana ändringar.
4. Kontrollera att alla yttre luckor finns på plats och inte är skadade.
5. Förvissa dig om att alla spärrar och gångjärn fungerar som de ska.
6. Kontrollera att nätkabeln inte är skadad.
7. Se efter om det finns några slitna, skadade eller ihopklämda kablar.
8. Inspektera fläkt-/nätaggregatenheterna. Se efter att fästanordningarna för
nätaggregatets hölje (skruvar eller nitar) inte har avlägsnats eller rubbats.
9. Kontrollera att den yttre signalkabeln inte är skadad.
10. Kontrollera kåpan och se efter att det inte finns några vassa kanter, skador
eller förändringar som gör att enhetens inre delar exponeras.
11. Rätta till alla fel som du hittar.
Utför följande säkerhetskontroller.
624
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Kontroll av säkerhetsetikett
SCSVE904
Utför följande kontroll av säkerhetsetiketten:
1. Kontrollera att etiketten är fastsatt på 2104 modell DS4 eller modell TS4.
Se figur 1.
2. Kontrollera att nätkabelns varningsetikett är fastsatt på nätaggregatet i 2104 modell DS4
eller TS4. Se figur 2.
SC000274
Figur 1. 2104-etikett - modell DS4 eller modell TS4
Figur 2. Varningsetikett om nätkabel
SC000325
VARNING:
Enheten kan ha två nätkablar. Koppla ur båda nätkablarna så att all ström bryts. (1)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
625
SCSVE905
3. Kontrollera att varningsetiketten om nätaggregatets hölje är fastsatt på nätaggregatet i
2104 modell DS4 eller TS4. Se figur 3.
4. Kontrollera att varningsetiketten om säkring är fastsatt på nätaggregatet i 2104 modell DS4
eller TS4. Se figur 4.
Figur 3. Varningsetikett om nätaggregat
Figur 4. Varningsetikett om säkring
SC000326
SC000327
VARNING:
Avlägsna inte höljet. Försök inte reparera. Ingen av delarna kan repareras. (2)
VARNING:
Dubbelpolig/neutral säkring. (3)
626
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE905A
5. Kontrollera att viktetiketten är fastsatt på 2104 modell DS4. Se figur 5.
VARNING:
Enheten väger över 32 kg. Se handböckerna. (4)
Figur 5. Viktetikett för 2104 modell DS4
SC000220
32 kg (se handböckerna)
Viktigt:
Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem. När du
sätter tillbaka enheterna installerar du dem i samma enhetsfack där de tidigare var
installerade. I annat fall finns det risk för dataförluster.
> 18 kg > 32 kg
VARNING:
Om 2104 modell DS4 är fullbestyckad med 14 enheter och två nätaggregat väger
den över 32 kg. Du måste ta ut minst sex enheter ur 2104 modell DS4 innan du kan lyfta
eller installera den i ett rack eller i 2104 modell Ts4. På så sätt minskar du vikten till under
32 kg, och 2104 modell DS4 kan utan risk hanteras av två personer. I annat fall finns det
risk för skador.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
627
VARNING:
Enheten väger mellan 32 och 55 kg. (5)
Figur 6. Viktetikett för 2104 modell TS4
32 - 55 kg
SC000219
6. Kontrollera att viktetiketten är fastsatt på 2104 modell TS4. Se figur 6.
VARNING:
Lyftredskapet av kartong (följer med) krävs för att flytta och
installera produkten när den är fullbestyckad. Det behövs minst tre personer
när den ska flyttas. I annat fall finns det risk för skador.
Om lyftredskapet inte är tillgängligt måste du minska vikten till 32 kg eller
mindre genom att ta bort alla tunga komponenter (diskenheter och
nätaggregat) från produkten. Sedan krävs det bara två personer för att flytta
och installera produkten. (6)
Viktigt:
Om du har data lagrade i enheterna bör du märka enheterna innan du
tar bort dem. När du sätter tillbaka enheterna installerar du dem i samma
enhetsfack där de tidigare var installerade. I annat fall finns det risk för
dataförluster.
> 18 kg > 32 kg
SCSVE906
628
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE907
7. Kontrollera att etiketten om vänteläge är fastsatt på 2104 modell DS4 eller
modell Ts4. Se figur 7.
VARNING:
Vänteläge anges av symbolen till höger om DC, direkt ovanför Sw1-
omkopplaren. När du ställer SW1-omkopplaren i det högra läget (direkt
under väntelägessymbolen) är växelströmmen till enheten inte
avstängd. (7)
Figur 7. Etikett - vänteläge
SC000326
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
629
SCSVE908
Avsnittet innehåller anvisningar för kontroll av jordningen av 2104 modell DS4 eller modell TS4.
Kontrollera jordningen av 2104 modell DS4
Så här kontrollerar du jordningen av 2104 modell DS4:
3. Viktigt: Vissa elektriska kretsar kan skadas om de externa SCSI-kablarna
ansluts till 2104 modell DS4 medan jordningstestet utförs.
Se till att inga externa SCSI-kablar är anslutna till kontakterna .
4. Så här kontrollerar du jordningen av 2104 modell DS4:
a. Granska stickkontakterna som du kopplade bort från strömfördelningsenheten
eller reservbatteriet i steg 2 och försäkra dig om att det tredje jordstiftet är i gott skick.
b. Använd en ohmmeter och anslut den ena ledaren till chassit på 2104 modell DS4
och den andra ledaren till det tredje jordstiftet på stickkontakterna som du kopplade ur
i steg 2. Avläsningen bör ge högst 0,1 ohm mellan jordstiftet och chassit.
Figur 8. Nätkablar och SCSI-kontakter på 2104 modell DS4
Kontrollera jordningen av 2104 modell DS4 eller modell TS4
1. Se till att all strömförsörjning är bortkopplad från racket. Läs installationsanvisningarna för
racket eller värdsystemet.
2. Kontrollera att nätkablarna är anslutna till vardera fläkt-/nätaggregatenheten. Se figur 8.
Se också till att nätkablarnas motsatta ände är bortkopplade från strömfördelningsenheten
och rackets reservbatteri. Läs installationsanvisningarna för racket eller värdsystemet.
2
1
SC000210
630
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE909
Om du arbetar med en 220 volts 2104 modell DS4 går du till steg 5.
Om du arbetar med en -48 volts 2104 modell DS4 går du till steg 7.
5. Kontrollera ledningssammanhanget mellan chassit på 2104 modell DS4 och jordstift på
strömkontakten på fläkt-/nätaggregatenheterna. Se figur 9.
6. Gå till steg 9.
7. Kontrollera ledningssammanhanget mellan chassit på 2104 modell DS4 och jordstiften
1 och 7 för varje nätkabelkontakt. Se figur 10.
8. Gå till steg 9.
9. Om en fläkt-/nätaggregatenhet inte har ledningssammanhang byter du ut den mot en ny.
Se vidare avsnittet om fläkt-/nätaggregatenheter i servicehandboken till Expandable
Storage Plus 2104 modell DS4 och modell TS4. Utför hela jordningstestet på nytt.
Om alla fläkt-/nätaggregatenheter har ledningssammanhang, kanske felet beror på
nätkabeln eller jordningen av värdsystemet.
Figur 9. Jordstift (220 V 2104 modell DS4)
Figur 10. Jordstift (-48 V 2104 modell DS4)
1
SC000329
Om jordningen är korrekt är jordningstestet klart.
Om jordningen inte är korrekt kopplar du bort nätkablarna från
fläkt-/nätaggregatenheten 2104 modell DS4.
SC000330
Stift 6 Stift 1
Stift 12 Stift 7
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
631
SCSVE910
Om du arbetar med en 220 volts 2104 modell DS4 går du till steg 10.
Om du arbetar med en -48 volts 2104 modell DS4, försöker du lokalisera felet med hjälp av
installationsanvisningarna till racket eller värdsystemet. Utför hela jordningstestet på nytt.
10. Kontrollera nätkabelns ledningssammanhang.
Om nätkabeln inte har ledningssammanhang byter du ut den mot en ny. Utför sedan hela
jordningstestet på nytt. Om nätkabeln har ledningssammanhang försöker du lokalisera
felet med hjälp av installationsanvisningarna för racket eller värdsystemet.
Så här kontrollerar du jordningen av 2104 modell TS4:
2. Försäkra dig om att nätkablarnas motsatta ände inte är anslutna till eluttag.
Koppla loss kablarna om så behövs.
Kontrollera jordningen av 2104 modell TS4
1. Kontrollera att en nätkabel är ansluten till varje eluttag (1). Se
figur 11.
1
Figur 11. Nätkablar och SCSI-kontakter (2104 modell TS4)
SC000230
1
2
3
632
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE911
3. Viktigt: Vissa elektriska kretsar kan skadas om de externa SCSI-kablarna
ansluts till 2104 modell TS4 medan jordningstestet utförs.
Se till att inga externa SCSI-kablar är anslutna till SCSI-kontakterna .
Se figur 11.
4. Kontrollera ledningssammanhanget mellan chassit på 2104 modell TS4
och jordstiftet på varje nätkabelkontakt.
a. Kontrollera nätkablarna och se efter att den tredje jordledaren är i gott skick.
b. Använd en mätare och kontrollera att jordledarens ledningssammanhang är
högst 0,1 ohm mellan det externa jordstiftet och chassits jordning.
5. Om ledningssammanhanget är bra är jordningstestet klart.
6. Om jordningen inte är korrekt kopplar du bort nätkablarna från eluttagen.
a. Testa varje nätkabels jordningssammanhang.
b. Om det är fel på någon nätkabel byter du ut den mot en ny.
c. Kontrollera ledningssammanhanget mellan chassit på 2104 modell TS4
och jordstift på strömkontakten på fläkt-/nätaggregatenheterna.
Se figur 12.
d Omfläkt-/nätaggregatenheten inte har ledningssammanhang byter du ut
den mot en ny. (Se vidare avsnittet om fläkt-/nätaggregatenheter i
servicehandboken till Expandable Storage Plus 2104 modell DS4 och
modell TS4.) Gå till "Kontrollera jordningen av 2104 modell TS4" och
utför hela jordningstestet på nytt enligt steg 1 till och med steg 6d.
Figur 12. Jordstift (2104 modell TS4)
1
2
3
SC000331
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
633
SCSVE912
Figur 13. Koppla bort strömmen från 2104 modell DS4
Anm: Koppla inte bort strömmen från värdsystemet eller från 2104 modell DS4
annat än om det står i instruktionerna att du ska göra det, eller du behöver
göra det av någon annan anledning.
Så här kopplar du bort strömmen från 2104 modell DS4:
1. Kontrollera med kunden att alla operationer mellan 2104 modell DS4 och
värdsystemet har stoppats.
2. Ställ likströmsomkopplaren på alla fläkt-/nätaggregatenheter i vänteläge.
Se figur 13.
Anm:
a. Vissa versioner av 2104 modell DS4 har en fläkt-/nätaggregatenhet och
en fläktenhet. Fläktenheten har inte någon likströmsomkopplare.
b. På en fläkt-/nätaggregatenhet kan kontrollampan lysa trots att
likströmsomkopplaren är satt i vänteläge.
Koppla bort strömmen från 2104 modell DS4 eller modell TS4
1
Koppla bort strömmen från 2104 modell DS4
SC000206
634
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE913
3. FARA
I följande steg kopplar du bort nätkablarna. Det går ström genom
kablarna så länge rackets strömfördelnings- eller UPS-enhet
(Uninterruptable Power Supply) inte stängts av. (1)
1
1
2
2
Anm: Koppla inte bort strömmen från värdsystemet eller från 2104 modell TS4
annat än om det står i instruktionerna att du ska göra det, eller du behöver
göra det av någon annan anledning.
Så här kopplar du bort strömmen från 2104 modell TS4:
1. Kontrollera med kunden att alla operationer mellan 2104 modell TS4 och
värdsystemet har stoppats.
2. På baksidan av 2104 modell TS4 ställer du likströmsomkopplaren på
alla fläkt-/nätaggregatenheter i vänteläge. Se figur 14.
Anm:
a. Vissa versioner av 2104 modell TS4 har en fläkt-/nätaggregatenhet och
en fläktenhet. Fläktenheten har inte någon likströmsomkopplare.
b. På en fläkt-/nätaggregatenhet kan kontrollampan lysa trots att
likströmsomkopplaren är satt i vänteläge.
Koppla bort strömmen från 2104 modell TS4
Koppla loss nätkablarna från baksidan av 2104 modell DS4. Se figur 13.
4. Så här kopplar du på strömmen igen till 2104 modell DS4:
a. Anslut nätkablarna igen.
b. Ställ varje fläkt-/nätaggregatenhets likströmsomkopplare i påslaget läge ("On").
Anm: Du kan konfigurera motorstartordningen för 2104 modell DS4.
Mer information finns i avsnittet om omkopplare för autostart och fördröjd
motorstart i servicehandboken till Expandable Storage Plus 2104
modell DS4 och modell TS4.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
635
SCSVE914
1
2
2
3. Koppla loss nätkablarna från eluttaget och från baksidan av 2104
modell TS4. Se figur 14.
4. Så här kopplar du på strömmen igen till 2104 modell TS4:
a. Anslut nätkablarna igen.
b. Ställ varje fläkt-/nätaggregatenhets likströmsomkopplare i påslaget
läge ("On").
Anm: Du kan konfigurera motorstartordningen för 2104 modell TS4.
Mer information finns i avsnittet om omkopplare för autostart
och fördröjd motorstart i servicehandboken till Expandable Storage Plus 2104
modell DS4 och modell Ts4.
Figur 14. Koppla bort strömmen från 2104 modell TS4
SC000207
636
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE915
Fara
FARA
FARA
FARA
FARA
FARA
I följande steg kopplar du bort nätkablarna. Det går ström genom kablarna
så länge rackets strömfördelnings- eller UPS-enhet (Uninterruptable Power
Supply) inte stängts av. (1)
Anslut inte en nätkabel till fläkt-/nätaggregatenheten förrän enheten är helt
installerad och tumskruvarna åtdragna. (3)
Undvik att koppla bort kablar till bildskärmar, skrivare, telefoner eller
åskskydd för kommunikationslinjer vid åskväder. (5)
Ett felaktigt installerat eluttag kan göra metalldelar i systemet eller på
anslutna enheter strömförande. Det är kundens ansvar att se till att
alla eluttag är korrekt installerade och jordade. Undvik att koppla bort
kablar till bildskärmar, skrivare, telefoner eller åskskydd för
kommunikationslinjer vid åskväder. (4)
Försök inte öppna fläkt-/nätaggregatenhetens kåpa. (2)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
637
SCSVE916
Varningsmeddelanden
VARNING:
Enheten kan ha två nätkablar. Koppla ur båda nätkablarna så att all
ström bryts. (1)
VARNING:
Avlägsna inte höljet. Försök inte reparera. Ingen av delarna kan repareras. (2)
VARNING:
Dubbelpolig/neutral säkring. (3)
Figur 15. Varningsetikett om nätkabel
Figur 16. Varningsetikett om nätaggregat
Figur 17. Figur 4. Varningsetikett om säkring
SC000327
SC000326
SC000325
638
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE916A
VARNING:
Enheten väger över 32 kg. Se handböckerna. (4)
Figur 18. Viktetikett för 2104 modell DS4
SC000220
32 kg (se handböckerna)
Viktigt:
Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort
dem. När du sätter tillbaka enheterna installerar du dem i samma enhetsfack
där de tidigare var installerade. I annat fall finns det risk för dataförluster.
> 18 kg > 32 kg
VARNING:
Om 2104 modell DS4 är fullbestyckad med 14 enheter och två nätaggregat väger
den över 32 kg. Du måste ta ut minst sex enheter ur 2104 modell DS4 innan du
kan lyfta eller installera den i ett rack eller i 2104 modell Ts4. På så sätt minskar
du vikten till under 32 kg och 2104 modell DS4 kan utan risk hanteras av två
personer. I annat fall finns det risk för skador.
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
639
VARNING:
Enheten väger mellan 32 och 55 kg. (5)
VARNING:
Lyftredskapet av kartong (följer med) krävs för att flytta och installera
produkten när den är fullbestyckad. Det behövs minst tre personer när den
ska flyttas. I annat fall finns det risk för skador.
Om lyftredskapet inte är tillgängligt måste du minska vikten till 32 kg eller
mindre genom att ta bort alla tunga komponenter (diskenheter och
nätaggregat) från produkten. Sedan krävs det bara två personer för att flytta
och installera produkten. (6)
Figur 19. Viktetikett för 2104 modell TS4
SC000219
Viktigt:
Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem.
När du sätter tillbaka enheterna installerar du dem i samma enhetsfack där de
tidigare var installerade. I annat fall finns det risk för dataförluster.
> 18 kg > 32 kg
32 - 55 kg
SCSVE917
640
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE918
VARNING:
Vänteläge anges av symbolen till höger om DC, direkt ovanför Sw1-
omkopplaren. När du ställer SW1-omkopplaren i det högra läget (direkt under
väntelägessymbolen) är växelströmmen till enheten inte avstängd. (7)
VARNING:
Tippskyddet måste vara fäst vid rackets nedre framkant för att förhindra
att racket tippar framåt när 2104 modell DS4 tagits ut ur racket. Ta inte bort
eller installera enheter förrän ett tippskydd har fästs vid racket. (8)
VARNING:
Tippskyddet måste vara fäst vid rackets nedre framkant för att förhindra
att racket tippar framåt när 2104 modell DS4 installeras i racket. Ta inte bort
eller installera enheter förrän ett tippskydd har fästs vid racket. (9)
Figur 20. Placering av etikett om vänteläge
SC000328
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
641
SCSVE319
VARNING:
Stoppa inte in händerna eller verktyg i det tomma utrymmet där fläkten var
placerad. (10)
VARNING:
Stoppa inte in händerna eller verktyg i det tomma utrymmet där fläkt-
/nätaggregatenheten var placerad. (11)
VARNING:
Stoppa inte in händerna eller verktyg i det tomma utrymmet där SCSI-kortet
var placerat. (12)
VARNING:
Av säkerhetsskäl är produkten utrustad med en nätkabel med tre ledare och
kontakt. Du undviker risken att datorns hölje blir strömförande vid en
eventuell kortslutning genom att ansluta nätkabeln till ett skyddsjordat
nätuttag. (13)
VARNING:
Rör inte vid eluttaget eller eluttagets täckplatta med något annat än
testsonder innan du har genomfört hela säkerhetskontrollen. (14)
VARNING:
Om det avlästa värdet inte är oändligt ska du inte fortsätta. Gör de ändringar
som behövs innan du fortsätter. Slå inte på kretsens strömbrytare förrän
samtliga ovanstående steg har genomförts på rätt sätt. (15)
VARNING:
Lyft inte 2104 modell DS4 genom att ta tag i handtagen på fläkten eller fläkt-
/nätaggregatenheten. Handtagen är inte tänkta att bära enhetens vikt. (16)
642
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
SCSVE920
VARNING:
När du skjutit in enheten helt och hållet stängs spaken automatiskt. Var
försiktig så att du inte klämmer fingrarna mellan spaken och enheten. (17)
VARNING:
Förvissa dig om att huvudströmkabeln har kopplats loss från den icke
fungerande fläkt-/nätaggregatenhet innan du fortsätter. (18)
VARNING:
Stoppa inte in händerna eller verktyg i det tomma utrymmet ovanför fläkt-
/nätaggregatenheten. (19)
VARNING:
Stoppa inte in händerna eller verktyg i det tomma utrymmet mellan
nätaggregaten. (20)
VARNING:
Om 2104 modell DS4 installeras i ett rack av modell T00 eller T42 måste du
också följa säkerhetsanvisningarna för dessa rack innan du börjar ta bort
ramen. Säkerhetsanvisningarna finns i avsnittet om systeminstallation i
7014 Model T00 and T42 Rack Installation and Service Guide, SA38-0577 och
på följande webbadress:
www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html
(21)
VARNING:
Produkten 2104, modell DS4 och TS4, är utformad för att installeras av kunden
och är certifierad som kundinstallerad. Kontrollera att den enhet (dator eller
rack) där 2104 modell DS4 eller TS4 ska installeras också är utformad och
certifierad för att installeras av kunden. I annat fall måste 2104 modell
DS4 eller TS4 installeras av en servicetekniker. (22)
Expandable
Storage
Plus
2104
Model
DS4
and
Model
TS4
translations
643
SCSVE921
VARNING:
Stoppa inte in händerna eller verktyg i det tomma uttrymme där kortet var
placerat. (23)
VARNING:
Det krävs tre personer för att lyfta 2104 modell TS4. Försök inte lyfta 2104
modell TS4 själv. Försök inte att lyfta den utan att ha hjälp av två andra
personer. (24)
VARNING:
Lyft inte 2104 modell TS4 genom att ta tag i handtagen på fläkten eller fläkt-
/nätaggregatenheten. Handtagen är inte tänkta att bära enhetens vikt. (25)
2104 modell DS4 är utformad för att installeras av kunden och är certifierad
som kundinstallerad. Kontrollera att den enhet (dator eller rack) där 2104
modell DS4 ska installeras också är utformad och certifierad för att
installeras av kunden. I annat fall måste 2104 modell DS4 installeras av en
servicetekniker. (26)
VARNING:
VARNING:
2104 modell TS4 är utformad för att installeras av kunden och är certifierad
som kundinstallerad. Kontrollera att det system där 2104 modell TS4 ska
installeras också är utformad och certifierad för att installeras av kunden.
I annat fall måste 2104 modell TS4 installeras av en servicetekniker. (27)
644
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Notices
This
information
was
developed
for
products
and
services
offered
in
the
U.
S.
A.
IBM
may
not
offer
the
products,
services,
or
features
discussed
in
this
document
in
other
countries.
Consult
your
local
IBM
representative
for
information
on
the
products
and
services
currently
available
in
your
area.
Any
reference
to
an
IBM
product,
program,
or
service
is
not
intended
to
state
or
imply
that
only
that
IBM
product,
program,
or
service
may
be
used.
Any
functionally
equivalent
product,
program,
or
service
that
does
not
infringe
on
any
IBM
intellectual
property
right
may
be
used
instead.
However,
it
is
the
user’s
responsibility
to
evaluate
and
verify
the
operation
of
any
non-IBM
product,
program,
or
service.
IBM
may
have
patents
or
pending
patent
applications
covering
subject
matter
described
in
this
document.
The
furnishing
of
this
document
does
not
give
you
any
license
to
these
patents.
You
can
send
license
inquiries,
in
writing
to:
IBM
Director
of
Licensing
IBM
Corporation
North
Castle
Drive
Armonk,
N.Y.
10504–1785
U.S.A.
The
following
paragraph
does
not
apply
to
the
United
Kingdom
or
any
other
country
where
such
provisions
are
inconsistent
with
local
law:
INTERNATIONAL
BUSINESS
MACHINES
CORPORATION
PROVIDES
THIS
PUBLICATION
“AS
IS”
WITHOUT
WARRANTY
OF
ANY
KIND,
EITHER
EXPRESS
OR
IMPLIED,
INCLUDING,
BUT
NOT
LIMITED
TO,
THE
IMPLIED
WARRANTIES
OF
NON-INFRINGEMENT,
MERCHANTABILITY
OR
FITNESS
FOR
A
PARTICULAR
PURPOSE.
Some
states
do
not
allow
disclaimer
of
express
or
implied
warranties
in
certain
transactions,
therefore,
this
statement
may
not
apply
to
you.
This
information
could
include
technical
inaccuracies
or
typographical
errors.
Changes
are
periodically
made
to
the
information
herein;
these
changes
will
be
incorporated
in
new
editions
of
the
publication.
IBM
may
make
improvements
and/or
changes
in
the
product(s)
and/or
the
program(s)
described
in
this
publication
at
any
time
without
notice.
Any
references
in
this
information
to
non-IBM
Web
sites
are
provided
for
convenience
only
and
do
not
in
any
manner
serve
as
an
endorsement
of
those
Web
sites.
The
materials
at
those
Web
sites
are
not
part
of
the
materials
for
this
IBM
product
and
use
of
those
Web
sites
is
at
your
own
risk.
IBM
may
use
or
distribute
any
of
the
information
you
supply
in
any
way
it
believes
appropriate
without
incurring
any
obligation
to
you.
©
Copyright
IBM
Corp.
2003
645
Trademarks
The
following
terms
are
trademarks
of
the
International
Business
Machines
Corporation
in
the
United
States,
other
countries,
or
both:
AIX
IBM
RS/6000
Eserver
Intel
is
a
registered
trademark
of
Intel
Corporation
in
the
United
States,
other
countries,
or
both.
Other
company,
product,
or
service
names
may
be
trademarks
or
service
marks
of
others.
Electronic
emission
statements
This
section
gives
the
electronic
emission
notices
or
statements
for
the
United
States
and
other
countries.
Federal
Communications
Commission
(FCC)
statement
This
equipment
has
been
tested
and
found
to
comply
with
the
limits
for
a
class
A
digital
device,
pursuant
to
Part
15
of
the
FCC
Rules.
These
limits
are
designed
to
provide
reasonable
protection
against
harmful
interference
when
the
equipment
is
operated
in
a
commercial
environment.
This
equipment
generates,
uses,
and
can
radiate
radio
frequency
energy
and,
if
not
installed
and
used
in
accordance
with
the
instruction
manual,
may
cause
harmful
interference
to
radio
communications.
Operation
of
this
equipment
in
a
residential
area
is
likely
to
cause
harmful
interference,
in
which
case
the
user
will
be
required
to
correct
the
interference
at
his
own
expense.
Properly
shielded
and
grounded
cables
and
connectors
must
be
used
in
order
to
meet
FCC
emission
limits.
IBM
is
not
responsible
for
any
radio
or
television
interference
caused
by
using
other
than
recommended
cables
and
connectors
or
by
unauthorized
changes
or
modifications
to
this
equipment.
Unauthorized
changes
or
modifications
could
void
the
user’s
authority
to
operate
the
equipment.
This
device
complies
with
Part
15
of
the
FCC
Rules.
Operation
is
subject
to
the
following
two
conditions:
(1)
this
device
may
not
cause
harmful
interference,
and
(2)
this
device
must
accept
any
interference
received,
including
interference
that
may
cause
undesired
operation.
Industry
Canada
compliance
statement
Avis
de
conformite
a
la
reglementation
d’Industrie
Canada:
Cet
appareil
numerique
de
la
classe
A
est
conform
a
la
norme
NMB-003
du
Canada.
Chinese
Class
A
warning
statement
Attention:
This
is
a
Class
A
product.
In
a
domestic
environment,
this
product
may
cause
radio
interference
in
which
case
the
user
may
be
required
to
take
adequate
measures.
646
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
European
Community
compliance
statement
This
product
is
in
conformity
with
the
protection
requirements
of
EC
Council
Directive
89/336/EEC
on
the
approximation
of
the
laws
of
the
Member
States
relating
to
electromagnetic
compatibility.
IBM
cannot
accept
responsibility
for
any
failure
to
satisfy
the
protection
requirements
resulting
from
a
non-recommended
modification
of
the
product,
including
the
fitting
of
non-IBM
option
cards.
This
product
is
in
conformity
with
the
EU
council
directive
73/23/EEC
on
the
approximation
of
the
laws
of
the
Member
States
relating
to
electrical
equipment
designed
for
use
within
certain
voltage
limits.
This
conformity
is
based
on
compliance
with
the
following
harmonized
standard:
EN60950.
This
product
has
been
tested
and
found
to
comply
with
the
limits
for
class
A
Information
Technology
Equipment
according
to
European
Standard
EN
55022.
The
limits
for
class
A
equipment
were
derived
for
commercial
and
industrial
environments
to
provide
reasonable
protection
against
interference
with
licensed
communication
equipment.
Attention:
This
is
a
class
A
product.
In
a
domestic
environment,
this
product
may
cause
radio
interference
in
which
case
the
user
may
be
required
to
take
adequate
measures.
Where
shielded
or
special
cables
(for
example,
cables
fitted
with
ferrites)
are
used
in
the
test
to
make
the
product
comply
with
the
limits:
Properly
shielded
and
grounded
cables
and
connectors
must
be
used
in
order
to
reduce
the
potential
for
causing
interference
to
radio
and
TV
communications
and
to
other
electrical
or
electronic
equipment.
Such
cables
and
connectors
are
available
from
IBM
authorized
dealers.
IBM
cannot
accept
responsibility
for
any
interference
caused
by
using
other
than
recommended
cables
and
connectors.
Germany
compliance
statement
Zulassungsbescheinigung
laut
Gesetz
ueber
die
elektromagnetische
Vertraeglichkeit
von
Geraeten
(EMVG)
vom
30.
August
1995.
Dieses
Geraet
ist
berechtigt,
in
Uebereinstimmung
mit
dem
deutschen
EMVG
das
EG-Konformitaetszeichen
-
CE
-
zu
fuehren.
Der
Aussteller
der
Konformitaetserklaeung
ist
die
IBM
Deutschland.
Informationen
in
Hinsicht
EMVG
Paragraph
3
Abs.
(2)
2:
Das
Geraet
erfuellt
die
Schutzanforderungen
nach
EN
50082-1
und
EN
55022
Klasse
A.
Notices
647
EN
55022
Klasse
A
Geraete
beduerfen
folgender
Hinweise:
Nach
dem
EMVG:_|t¹|
“Geraete
duerfen
an
Orten,
fuer
die
sie
nicht
ausreichend
entstoert
sind,
nur
mit
besonderer
Genehmigung
des
Bundesministeriums
fuer
Post
und
Telekommunikation
oder
des
Bundesamtes
fuer
Post
und
Telekommunikation
betrieben
werden.
Die
Genehmigung
wird
erteilt,
wenn
keine
elektromagnetischen
Stoerungen
zu
erwarten
sind.”
(Auszug
aus
dem
EMVG,
Paragraph
3,
Abs.4)
Dieses
Genehmigungsverfahren
ist
nach
Paragraph
9
EMVG
in
Verbindung
mit
der
entsprechenden
Kostenverordnung
(Amtsblatt
14/93)
kostenpflichtig.
Nach
der
EN
55022:
“Dies
ist
eine
Einrichtung
der
Klasse
A.
Diese
Einrichtung
kann
im
Wohnbereich
Funkstoerungen
verursachen.
in
diesem
Fall
kann
vom
Betreiber
verlangt
werden,
angemessene
Massnahmen
durchzufuehren
und
dafuer
aufzukommen.”
Anmerkung:
Um
die
Einhaltung
des
EMVG
sicherzustellen,
sind
die
Geraete
wie
in
den
Handbuechern
angegeben
zu
installieren
und
zu
betreiben.
Japanese
Voluntary
Control
Council
for
Interference
(VCCI)
class
1
statement
Korean
Government
Ministry
of
Communication
(MOC)
statement
Please
note
that
this
device
has
been
approved
for
business
purposes
with
regard
to
electromagnetic
interference.
If
you
find
that
this
is
not
suitable
for
your
use,
you
may
exchange
it
for
one
with
a
non-business
use.
Taiwan
class
A
compliance
statement
VS07171L
648
Expandable
Storage
Plus:
2104
Model
DS4
and
Model
TS4
Translated
Safety
Notices
Vos remarques sur ce document / Technical publication remark form
Titre / Title : Bull ESCALA Expandable Storage Plus 2104 Models DS4 and TS4 Translated
Safety Notices
Nº Reférence / Reference Nº : 86 X1 16M 00 Daté / Dated : November 2003
ERREURS DETECTEES / ERRORS IN PUBLICATION
AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION
Vos remarques et suggestions seront examinées attentivement.
Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète.
Your comments will be promptly investigated by qualified technical personnel and action will be taken as required.
If you require a written reply, please furnish your complete mailing address below.
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
Remettez cet imprimé à un responsable BULL ou envoyez-le directement à :
Please give this technical publication remark form to your BULL representative or mail to:
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Technical Publications Ordering Form
Bon de Commande de Documents Techniques
To order additional publications, please fill up a copy of this form and send it via mail to:
Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à :
BULL CEDOC
ATTN / Mr. L. CHERUBIN
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Phone / Téléphone : +33 (0) 2 41 73 63 96
FAX / Télécopie +33 (0) 2 41 73 60 19
E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr
Or visit our web sites at: / Ou visitez nos sites web à:
http://www.logistics.bull.net/cedoc
http://www–frec.bull.com http://www.bull.com
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
[ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
PHONE / TELEPHONE : FAX :
E–MAIL :
For Bull Subsidiaries / Pour les Filiales Bull :
Identification:
For Bull Affiliated Customers / Pour les Clients Affiliés Bull :
Customer Code / Code Client :
For Bull Internal Customers / Pour les Clients Internes Bull :
Budgetary Section / Section Budgétaire :
For Others / Pour les Autres :
Please ask your Bull representative. / Merci de demander à votre contact Bull.
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 X1 16M 00
ORDER REFERENCE
PLACE BAR CODE IN LOWER
LEFT CORNER
Utiliser les marques de découpe pour obtenir les étiquettes.
Use the cut marks to get the labels.
86 A1 15EM 00
Expandable Storage
Plus 2104 Models
DS4 and TS4
Translated Safety
Notices
ESCALA
86 A1 15EM 00
Expandable Storage
Plus 2104 Models
DS4 and TS4
Translated Safety
Notices
ESCALA
86 A1 15EM 00
Expandable Storage
Plus 2104 Models
DS4 and TS4
Translated Safety
PL160R/T, PL260R/T and PL460R/T
Service Guide
ESCALA
REFERENCE
86 A1 02FA 01
ESCALA
PL160R/T, PL260R/T and PL460R/T
Service Guide
Hardware
July 2008
BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE
REFERENCE
86 A1 02FA 01
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works.
Copyright © IBM, 2007-2008 Copyright © Bull SAS 2007-2008
Printed in France
Trademarks and Acknowledgements
We acknowledge the rights of the proprietors of the trademarks mentioned in this manual.
All brand names and software and hardware product names are subject to trademark and/or patent protection.
Quoting of brand and product names is for information purposes only and does not represent trademark misuse.
The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material.
Contents
Safety and environmental notices . . . . . . . . . . . . . . . . . . . . . . . .xiii
World Trade safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii
German safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii
Laser safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii
Laser compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii
Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE . . . . . .xiv
Product recycling and disposal . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv
Battery return program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv
About this publication . . . . . . . . . . . . . . . . . . . . . . . . . . . .xvii
How to send your comments . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xvii
Chapter 1. Starting a Service Call . . . . . . . . . . . . . . . . . . . . . . . . .1
Chapter 2. Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . . .5
Bus, high-speed link (HSL/RIO/12X) isolation information . . . . . . . . . . . . . . . . . . . .7
PCI bus isolation using AIX, Linux, or the HMC . . . . . . . . . . . . . . . . . . . . . .8
Isolating a PCI bus problem while running AIX or Linux . . . . . . . . . . . . . . . . . .8
Isolating a PCI bus problem from the HMC . . . . . . . . . . . . . . . . . . . . . . .8
Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair . . . . . . . . . . . .9
Analyzing a RIO/HSL/12X or PCI bus reference code . . . . . . . . . . . . . . . . . . . .11
DSA translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12
Card positions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12
Converting the loop number to RIO/HSL/12X port location labels . . . . . . . . . . . . . . . .14
Fiber optic cleaning procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . .15
Status indications for RIO/HSL . . . . . . . . . . . . . . . . . . . . . . . . . . . .15
RIO/HSL/12X connection problems . . . . . . . . . . . . . . . . . . . . . . . . . .22
Additional RIO/HSL FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . .24
Powering on and powering off an I/O expansion unit . . . . . . . . . . . . . . . . . . .24
HSL loop configuration and status form . . . . . . . . . . . . . . . . . . . . . . . . .25
Installed features in a PCI bridge set form . . . . . . . . . . . . . . . . . . . . . . . .25
RIO/HSL/12X link status diagnosis form . . . . . . . . . . . . . . . . . . . . . . . .26
HSL/RIO 12X Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . . . . .27
CONSL01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .27
RIOIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28
Main task . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28
The ports on both ends of the failed link are in different system units on the loop . . . . . . . . . .29
The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit . .30
The ports on both ends of the failed link are in an I/O unit . . . . . . . . . . . . . . . . .31
Cannot power on unit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .33
Manually detecting the failed link . . . . . . . . . . . . . . . . . . . . . . . . . .33
Refresh the port status . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .33
RIOIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .34
RIOIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .34
RIOIP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .35
RIOIP10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .36
RIOIP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .36
RIOIP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .38
RIOIP56 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39
Multi-adapter bridge (MAB) isolation procedures . . . . . . . . . . . . . . . . . . . . . . .41
MABIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41
MABIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41
MABIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41
MABIP50 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42
MABIP51 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42 © Copyright IBM Corp. 2008 iii
MABIP52 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42
MABIP53 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43
MABIP54 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43
MABIP55 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43
Communication isolation procedure . . . . . . . . . . . . . . . . . . . . . . . . . . .47
COMIP01, COMPIP1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .47
Intermittent isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . .50
INTIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .52
INTIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .53
INTIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .53
INTIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .54
INTIP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .55
INTIP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57
INTIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57
INTIP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57
INTIP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57
INTIP24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58
I/O processor (IOP) isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . .60
IOPIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62
IOPIP13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64
IOPIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66
IOPIP17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .69
IOPIP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .71
IOPIP19 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .72
IOPIP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .73
IOPIP21 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75
IOPIP22 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75
IOPIP23 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76
IOPIP25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76
IOPIP26 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .78
IOPIP27 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .79
IOPIP28 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81
IOPIP29 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81
IOPIP30 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81
IOPIP31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84
IOPIP32 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .86
IOPIP33 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87
IOPIP34 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87
IOPIP40 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88
IOPIP41 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .89
Licensed internal code (LIC) isolation procedures . . . . . . . . . . . . . . . . . . . . . . .90
LICIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .91
LICIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .92
LICIP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .93
LICIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .93
LICIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .96
LICIP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .96
How to find the cause code . . . . . . . . . . . . . . . . . . . . . . . . . . . .97
0001 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .97
0002 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .97
0004 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .100
0005 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
0006 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
0007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
0008 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
0009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
000A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101
000B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102
000C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102
000D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102
000E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102 iv Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
0010 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0011 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0012 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0015 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0016 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0017 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0018 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
0019 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103
001A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104
001C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104
001D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104
001E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104
001F . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104
0020 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105
0021 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105
0022 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105
0023 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105
0024 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105
0025 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0026 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0027 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0031 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0033 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0034 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106
0035 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107
0037 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107
0099 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107
LICIP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107
How to find the cause code . . . . . . . . . . . . . . . . . . . . . . . . . . .107
0002 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .108
0004 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .109
0007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .109
0008 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
0009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
000A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
000B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
000D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
000E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
002C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110
002D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
002E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
002F . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
0030 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
0032 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
0099 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
LICIP13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111
LICIP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .116
LICIP15 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .117
LICIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .119
Logical partition (LPAR) isolation procedure . . . . . . . . . . . . . . . . . . . . . . . .120
LPRIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .120
Operations Console isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . .124
OPCIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .125
Power isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .128
Power problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .129
Cannot power on system unit . . . . . . . . . . . . . . . . . . . . . . . . . . .129
Cannot power off system or SPCN-controlled I/O expansion unit . . . . . . . . . . . . . . .132
Cannot power on SPCN-controlled I/O expansion unit . . . . . . . . . . . . . . . . . .136
PWR1900 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .141
PWR1905 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142
Instructions for Models 8203-E4A, 9407-M15, or 9408-M25 . . . . . . . . . . . . . . . . .142 Contents v
PWR1906 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .144
PWR1906 Instructions for 5094 and 5294 expansion units . . . . . . . . . . . . . . . . . .144
PWR1907 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .146
PWR1908 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .148
PWR1908 instructions for 5088 or 0588 expansion units . . . . . . . . . . . . . . . . . .148
PWR1909 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .150
Instructions for 5095, 0595, 5790, 5796, 7311-D11, 7311-D20 and 7314-G30 . . . . . . . . . . . .150
PWR1911 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .152
PWR1917 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .157
PWR1918 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .159
PWR1920 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .160
Router isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .162
RTRIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163
RTRIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163
RTRIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163
RTRIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163
RTRIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .164
RTRIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .165
RTRIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .165
SAS isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .166
SIP3110 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .168
SIP3111 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .169
SIP3112 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .171
SIP3113 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .172
SIP3120 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .174
SIP3121 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .175
SIP3130 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .175
SIP3131 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .177
SIP3132 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .180
SIP3134 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .181
SIP3140 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .183
SIP3141 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .184
SIP3142 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .185
SIP3143 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .187
SIP3144 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .188
SIP3145 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .191
SIP3146 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .193
SIP3147 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .194
SIP3148 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .195
SIP3149 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .197
SIP3150 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .197
SIP3152 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .199
SIP3153 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .201
Service processor isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . .202
FSPSP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .205
FSPSP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206
FSPSP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206
FSPSP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206
FSPSP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206
FSPSP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207
FSPSP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207
FSPSP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207
FSPSP10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207
FSPSP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207
FSPSP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208
FSPSP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208
FSPSP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208
FSPSP17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208
FSPSP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208
FSPSP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209
FSPSP22 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 vi Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
FSPSP23 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209
FSPSP24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209
FSPSP25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209
FSPSP27 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210
FSPSP28 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210
FSPSP29 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210
FSPSP30 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211
FSPSP31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211
FSPSP32 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211
FSPSP33 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .212
FSPSP34 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .212
FSPSP35 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213
FSPSP38 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213
FSPSP42 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213
FSPSP45 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214
FSPSP46 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214
FSPSP47 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214
FSPSP48 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214
FSPSP49 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214
FSPSP50 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215
FSPSP51 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215
FSPSP52 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215
FSPSP54 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215
FSPSP55 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216
FSPSP63 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216
FSPSP64 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216
FSPSPC1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .217
FSPSPD1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .218
Tape unit isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . .219
TUPIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .221
TUPIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .223
TUPIP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .224
TUPIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .228
Tape unit self-test procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . .229
Running the self-test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .230
Interpreting the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . .230
Tape device ready conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . .231
Twinaxial workstation I/O processor isolation procedure . . . . . . . . . . . . . . . . . . . .233
TWSIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .234
Workstation adapter isolation procedure . . . . . . . . . . . . . . . . . . . . . . . . .240
WSAIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .241
Workstation adapter console isolation procedure . . . . . . . . . . . . . . . . . . . . . .243
Isolating problems on servers that run the AIX or Linux operating systems . . . . . . . . . . . . . .245
MAP 0210: General problem resolution . . . . . . . . . . . . . . . . . . . . . . . . .246
Problems with loading and starting the operating system (AIX and Linux) . . . . . . . . . . . . .246
Storage I/O card modes and jumpers . . . . . . . . . . . . . . . . . . . . . . . . .250
Setting or changing the mode of an I/O card . . . . . . . . . . . . . . . . . . . . . .250
How to set or change the mode of an I/O card . . . . . . . . . . . . . . . . . . . .250
Mode jumper has been overridden . . . . . . . . . . . . . . . . . . . . . . . . .251
Mode jumper is missing . . . . . . . . . . . . . . . . . . . . . . . . . . . . .252
SCSI service hints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .252
General SCSI Configuration Checks . . . . . . . . . . . . . . . . . . . . . . . . .252
High Availability or Multiple SCSI System Checks . . . . . . . . . . . . . . . . . . . .252
SCSI-2 Single-Ended Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . . . .252
Determining Where to Start . . . . . . . . . . . . . . . . . . . . . . . . . . .253
External SCSI-2 Single-Ended Bus PTC Isolation Procedure . . . . . . . . . . . . . . . . .253
External SCSI-2 Single-Ended Bus Probable Tripped PTC Causes . . . . . . . . . . . . . . .254
Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure . . . . . . . . . . . . . . . . .254
Internal SCSI-2 Single-Ended Bus Probable Tripped PTC Resistor Causes . . . . . . . . . . . .256
SCSI-2 Differential Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . . . . .256
External SCSI-2 Differential Adapter Bus PTC Isolation Procedure . . . . . . . . . . . . . . .256 Contents vii
SCSI-2 Differential Adapter Probable Tripped PTC Causes . . . . . . . . . . . . . . . . .258
Dual-Channel Ultra SCSI Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . .258
64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure . . . . . . . . . . . .259
MAP 0020: Problem determination procedure . . . . . . . . . . . . . . . . . . . . . .259
MAP 0030: Additional Problem Determination . . . . . . . . . . . . . . . . . . . . . .266
MAP 0040: Intermittent Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .267
MAP 0050: SCSI bus problems . . . . . . . . . . . . . . . . . . . . . . . . . . . .269
Preparing for hot-plug SCSI device or cable deconfiguration . . . . . . . . . . . . . . . . .274
Purpose of this procedure . . . . . . . . . . . . . . . . . . . . . . . . . . .274
After hot-plug SCSI device or cable deconfiguration . . . . . . . . . . . . . . . . . . .275
MAP 0054: PCI-X SCSI Bus Problems . . . . . . . . . . . . . . . . . . . . . . . . .276
MAP 0070: 888 sequence in operator panel display . . . . . . . . . . . . . . . . . . . . .278
MAP 0220: Hot-Swap FRU problem resolution . . . . . . . . . . . . . . . . . . . . . .279
MAP 0230: Platform error problem resolution . . . . . . . . . . . . . . . . . . . . . .284
MAP 0260: System hangs during resource configuration . . . . . . . . . . . . . . . . . . .288
MAP 0270: SCSI RAID problem resolution and verification . . . . . . . . . . . . . . . . . .291
MAP 0280: Console and Keyboard Problem Resolution . . . . . . . . . . . . . . . . . . .295
MAP 0285: Multipath I/O (MPIO) Problem Resolution . . . . . . . . . . . . . . . . . . .296
MAP 0291: Missing Device or Bus Problem Resolution . . . . . . . . . . . . . . . . . . .299
PFW1540: Problem isolation procedures . . . . . . . . . . . . . . . . . . . . . . . .301
PFW1542: I/O problem isolation procedure . . . . . . . . . . . . . . . . . . . . . . .302
PFW1548: Memory and processor subsystem problem isolation procedure . . . . . . . . . . . . .316
PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached . . .317
PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached . . .324
Collect serviceable events in Integrated Virtualization Manager . . . . . . . . . . . . . . . . . .333
Chapter 3. Locating FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . .335
Using location codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .336
Physical Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .339
Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .339
Addresses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .347
Chapter 4. Removing and replacing FRUs . . . . . . . . . . . . . . . . . . . .349
External cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .352
Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .353
Cache battery pack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .366
Control panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .375
Disk Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .379
Replace the disk drive using AIX . . . . . . . . . . . . . . . . . . . . . . . . . . .379
Replace the disk drive using Linux . . . . . . . . . . . . . . . . . . . . . . . . . .382
Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . .382
Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . .385
Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . .386
Replace the disk drive nonconcurrently . . . . . . . . . . . . . . . . . . . . . . . . .388
Rebuild data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . .390
Disk unit backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .391
Ethernet card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .403
Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .409
GX adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .415
Media device . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .421
Memory DIMM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .430
PCI adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .438
Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX . . . .443
Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux . . . .457
Prerequisites for hot-plugging PCI adapters in Linux . . . . . . . . . . . . . . . . . . .469
Verify that the Linux, hot-plug PCI tools are installed . . . . . . . . . . . . . . . . . . .470
Power supply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .471
RAID enablement card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .477
Auxiliary cache card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .483
Slimline media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .489 viii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
System backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .493
Thermal power management device (TPMD) card . . . . . . . . . . . . . . . . . . . . . .525
Voltage regulator module pair . . . . . . . . . . . . . . . . . . . . . . . . . . . . .531
Voltage regulator module - single . . . . . . . . . . . . . . . . . . . . . . . . . . . .539
Vital product data (VPD) card . . . . . . . . . . . . . . . . . . . . . . . . . . . . .547
Chapter 5. Verifying a repair . . . . . . . . . . . . . . . . . . . . . . . . . .557
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .559
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . .563
Verifying a repair using an i5/OS system or logical partition . . . . . . . . . . . . . . . . . .564
Chapter 6. Closing a service call . . . . . . . . . . . . . . . . . . . . . . . .565
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . .571
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . .577
Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . .583
Deactivate a system attention LED or partition LED using the HMC . . . . . . . . . . . . . . .583
Activate or deactivate identify LED using the HMC . . . . . . . . . . . . . . . . . . . .583
Deactivate a system attention LED or partition LED using the ASMI . . . . . . . . . . . . . . .584
Activate or deactivate identify LED using the ASMI . . . . . . . . . . . . . . . . . . . .584
Chapter 7. Common Service Procedures . . . . . . . . . . . . . . . . . . . . .587
Powering on and powering off . . . . . . . . . . . . . . . . . . . . . . . . . . . .588
Determining a primary or alternative console . . . . . . . . . . . . . . . . . . . . . . . .591
Determining if the system has logical partitions . . . . . . . . . . . . . . . . . . . . . . .593
Separating the 571F/575B card set and moving the cache directory card . . . . . . . . . . . . . . .594
Determining which Processor is the Secondary Service Processor . . . . . . . . . . . . . . . . .600
The System Reference Code (SRC) Format Description . . . . . . . . . . . . . . . . . . . .601
System Reference Code (SRC) Information . . . . . . . . . . . . . . . . . . . . . . . .601
System reference code (SRC) address formats . . . . . . . . . . . . . . . . . . . . . . .601
Hardware SRC formats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .605
Logical address format . . . . . . . . . . . . . . . . . . . . . . . . . . . . .607
Using the Service Action Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . .609
Problems with noncritical resources . . . . . . . . . . . . . . . . . . . . . . . . . .610
System Service Tools (SST) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .611
Work with communications trace . . . . . . . . . . . . . . . . . . . . . . . . . .611
Using the product activity log . . . . . . . . . . . . . . . . . . . . . . . . . . . .617
Hexadecimal Product Activity Log data . . . . . . . . . . . . . . . . . . . . . . . .617
More information from hexadecimal reports . . . . . . . . . . . . . . . . . . . . . . .620
Reclaiming IOP cache storage . . . . . . . . . . . . . . . . . . . . . . . . . . . .636
Working with Storage Dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . .636
Dedicated Service Tools (DST) . . . . . . . . . . . . . . . . . . . . . . . . . . . .637
Accessing Dedicated Service Tools . . . . . . . . . . . . . . . . . . . . . . . . .637
Selecting function 21 from the control panel . . . . . . . . . . . . . . . . . . . . .638
Control panel function codes on the HMC . . . . . . . . . . . . . . . . . . . . . .638
Performing an IPL to DST . . . . . . . . . . . . . . . . . . . . . . . . . . .640
Performing an alternate IPL to DST (type D IPL) . . . . . . . . . . . . . . . . . . .641
Start a service tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .642
IPL type, mode, and speed options . . . . . . . . . . . . . . . . . . . . . . . . .642
Printing the System Configuration List . . . . . . . . . . . . . . . . . . . . . . . . .643
Print the system configuration list . . . . . . . . . . . . . . . . . . . . . . . . .643
Details on the system bus, main storage, or processor . . . . . . . . . . . . . . . . . . .644
Hardware Service Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . . .644
Managing the Advanced System Management Interface (ASMI) . . . . . . . . . . . . . . . .645
Accessing the ASMI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .646
Accessing the ASMI using a Web browser . . . . . . . . . . . . . . . . . . . . . .646
Accessing the ASMI using an ASCII terminal . . . . . . . . . . . . . . . . . . . . .648
Accessing the ASMI using the HMC . . . . . . . . . . . . . . . . . . . . . . . .648
Displaying error and event logs . . . . . . . . . . . . . . . . . . . . . . . . . .649
Setting the system enclosure type . . . . . . . . . . . . . . . . . . . . . . . . . .650
Setting the system identifiers . . . . . . . . . . . . . . . . . . . . . . . . . . .650 Contents ix
Viewing vital product data . . . . . . . . . . . . . . . . . . . . . . . . . . . .651
Clearing all deconfiguration errors . . . . . . . . . . . . . . . . . . . . . . . . .651
Service functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .652
Managing dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .653
Performing dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .653
Performing a platform system or service processor dump . . . . . . . . . . . . . . . . .653
Using the HMC to perform a system dump . . . . . . . . . . . . . . . . . . . . .654
Using the control panel to perform a system dump . . . . . . . . . . . . . . . . . . .654
Using the ASMI to perform a system dump . . . . . . . . . . . . . . . . . . . . .655
Performing a service processor dump . . . . . . . . . . . . . . . . . . . . . . . .655
Using the HMC to perform a service processor dump . . . . . . . . . . . . . . . . . .655
Using the control panel to perform a service processor dump . . . . . . . . . . . . . . .656
Using the ASMI to perform a service processor dump . . . . . . . . . . . . . . . . . .656
Copying a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .656
Using an HMC to copy a dump . . . . . . . . . . . . . . . . . . . . . . . . . .657
Using the AIX command line to copy a dump . . . . . . . . . . . . . . . . . . . . .657
Using the Linux command line to copy a dump . . . . . . . . . . . . . . . . . . . . .657
Using the IBM i5/OS service tools to copy a dump . . . . . . . . . . . . . . . . . . . .658
Copying a platform system or SP dump . . . . . . . . . . . . . . . . . . . . . .658
Copying a main storage dump . . . . . . . . . . . . . . . . . . . . . . . . .658
Reporting a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .659
Using an HMC to report a dump . . . . . . . . . . . . . . . . . . . . . . . . . .659
Using the AIX command line to report a dump . . . . . . . . . . . . . . . . . . . . .660
Using the Linux command line to report a dump . . . . . . . . . . . . . . . . . . . .660
Using the i5/OS service tools to report a main storage or platform system or SP dump . . . . . . .660
Deleting a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .661
Using an HMC to delete a dump . . . . . . . . . . . . . . . . . . . . . . . . . .661
Using the AIX command line to delete a dump . . . . . . . . . . . . . . . . . . . . .662
Using the Linux command line to delete a dump . . . . . . . . . . . . . . . . . . . .662
Using the i5/OS service tools to delete a dump . . . . . . . . . . . . . . . . . . . . .662
Performing an IBM i5/OS main storage dump . . . . . . . . . . . . . . . . . . . . . . .663
Using the HMC to perform an IBM i5/OS main storage dump . . . . . . . . . . . . . . . . .663
Using the control panel to perform an IBM i5/OS main storage dump . . . . . . . . . . . . . .663
Performing a slow boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .665
Using the HMC to perform a slow boot . . . . . . . . . . . . . . . . . . . . . . . .665
Using the control panel to perform a slow boot . . . . . . . . . . . . . . . . . . . . . .665
Using the ASMI menus to perform a slow boot . . . . . . . . . . . . . . . . . . . . . .666
Determining the primary or alternate console . . . . . . . . . . . . . . . . . . . . . . .667
Primary console workstation requirements . . . . . . . . . . . . . . . . . . . . . . .667
Find the primary console when the system is operational . . . . . . . . . . . . . . . . . . .668
Find the primary console when the system power is off . . . . . . . . . . . . . . . . . . .668
Resetting the service processor . . . . . . . . . . . . . . . . . . . . . . . . . . . .669
Resetting the managed system connection from the HMC . . . . . . . . . . . . . . . . . . .670
Checking for a duplicate IP address . . . . . . . . . . . . . . . . . . . . . . . . . . .671
Running the online and eServer stand-alone diagnostics . . . . . . . . . . . . . . . . . . . .672
Running the online diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . .672
Running the online diagnostics in service mode . . . . . . . . . . . . . . . . . . . . .673
Running the online diagnostics in service mode with an HMC attached . . . . . . . . . . . .673
Running the online diagnostics in service mode without an HMC attached . . . . . . . . . . .673
Running the online diagnostics in maintenance mode . . . . . . . . . . . . . . . . . . .674
Running the online diagnostics in concurrent mode . . . . . . . . . . . . . . . . . . .674
Running the eServer stand-alone diagnostics from CD-ROM . . . . . . . . . . . . . . . . . .675
Running the eServer stand-alone diagnostics from CD on a server with an HMC attached . . . . . . .676
Selecting testing options . . . . . . . . . . . . . . . . . . . . . . . . . . . .676
Running the eServer stand-alone diagnostics from CD on a server without an HMC attached . . . . .677
Selecting testing options . . . . . . . . . . . . . . . . . . . . . . . . . . . .678
Running the eServer stand-alone diagnostics from a Network Installation Management server . . . . . .679
Concurrent PCI adapter removal and replacement using AIX . . . . . . . . . . . . . . . . . .682
Concurrent PCI adapter removal and replacement using Linux . . . . . . . . . . . . . . . . . .686
Prerequisites for hot-plugging PCI adapters in Linux . . . . . . . . . . . . . . . . . . . .689
Verify that the Linux, hot-plug PCI tools are installed . . . . . . . . . . . . . . . . . . . .689 x Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 8. Part Information . . . . . . . . . . . . . . . . . . . . . . . . . .691
Appendix. Accessibility features . . . . . . . . . . . . . . . . . . . . . . . .701
Accessibility features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .701
IBM and accessibility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .701
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .703
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .705
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . .707
Class A Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .707
Terms and conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . .711
Contents xi
xii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Safety and environmental notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data.World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure.(C026) CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
© Copyright IBM Corp. 2008 xiii
CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. Product recycling and disposal This unit must be recycled or discarded according to applicable local and national regulations. IBM encourages owners of information technology (IT) equipment to responsibly recycle their equipment when it is no longer needed. IBM offers a variety of product return programs and services in several countries to assist equipment owners in recycling their IT products. Information on IBM product recycling offerings can be found on IBM’s Internet site at http://www.ibm.com/ibm/environment/products/prp.shtml. Esta unidad debe reciclarse o desecharse de acuerdo con lo establecido en la normativa nacional o local aplicable. IBM recomienda a los propietarios de equipos de tecnología de la información (TI) que reciclen responsablemente sus equipos cuando éstos ya no les sean útiles. IBM dispone de una serie de programas y servicios de devolución de productos en varios países, a fin de ayudar a los propietarios de equipos a reciclar sus productos de TI. Se puede encontrar información sobre las ofertas de reciclado de productos de IBM en el sitio web de IBM http://www.ibm.com/ibm/environment/products/prp.shtml. Note: This mark applies only to countries within the European Union (EU) and Norway.
xiv Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Appliances are labeled in accordance with European Directive 2002/96/EC concerning waste electrical and electronic equipment (WEEE). The Directive determines the framework for the return and recycling of used appliances as applicable throughout the European Union. This label is applied to various products to indicate that the product is not to be thrown away, but rather reclaimed upon end of life per this Directive. In accordance with the European WEEE Directive, electrical and electronic equipment (EEE) is to be collected separately and to be reused, recycled, or recovered at end of life. Users of EEE with the WEEE marking per Annex IV of the WEEE Directive, as shown above, must not dispose of end of life EEE as unsorted municipal waste, but use the collection framework available to customers for the return, recycling, and recovery of WEEE. Customer participation is important to minimize any potential effects of EEE on the environment and human health due to the potential presence of hazardous substances in EEE. For proper collection and treatment, contact your local IBM representative. Battery return program This product may contain sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries outside the United States, go to http://www.ibm.com/ibm/environment/products/batteryrecycle.shtml or contact your local waste disposal facility. In the United States, IBM has established a return process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and other battery packs from IBM Equipment. For information on proper disposal of these batteries, contact IBM at 1-800-426-4333. Please have the IBM part number listed on the battery available prior to your call. For Taiwan: Please recycle batteries. For the European Union:
Safety and environmental notices xv
Note:
This mark applies only to countries within the European Union (EU). Batteries or packaging for batteries are labeled in accordance with European Directive 2006/66/EC concerning batteries and accumulators and waste batteries and accumulators. The Directive determines the framework for the return and recycling of used batteries and accumulators as applicable throughout the European Union. This label is applied to various batteries to indicate that the battery is not to be thrown away, but rather reclaimed upon end of life per this Directive. In accordance with the European Directive 2006/66/EC, batteries and accumulators are labeled to indicate that they are to be collected separately and recycled at end of life. The label on the battery may also include a chemical symbol for the metal concerned in the battery (Pb for lead, Hg for mercury and Cd for cadmium). Users of batteries and accumulators must not dispose of batteries and accumulators as unsorted municipal waste, but use the collection framework available to customers for the return, recycling, and treatment of batteries and accumulators. Customer participation is important to minimize any potential effects of batteries and accumulators on the environment and human health due to the potential presence of hazardous substances. For proper collection and treatment, contact your local IBM representative. For California: Perchlorate Material - special handling may apply. See www.dtsc.ca.gov/hazardouswaste/perchlorate. The foregoing notice is provided in accordance with California Code of Regulations Title 22, Division 4.5 Chapter 33. Best Management Practices for Perchlorate Materials. This product, part, or both may include a lithium manganese dioxide battery which contains a perchlorate substance. xvi Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
About this publication This publication provides service providers with maintenance information for IBM® System i™ (9407-M15 and 9408-M25) and System p™ 520 (8203-E4A). This publication includes start-of-call procedures, isolation procedures, and removal and replacement procedures. For information about the accessibility features of this product, for users who have a physical disability, see “Accessibility features,” on page 701. How to send your comments Your feedback is important in helping to provide the most accurate and highest quality information. If you have any comments about this publication, use the Feedback button at http://www.ibm.com/systems/infocenter. Alternatively, you can send your comments to pubsinfo@us.ibm.com. Be sure to include the name of the book, the form number of the book, and the specific location of the text you are commenting on (for example, a page number or table number). © Copyright IBM Corp. 2008 xvii
xviii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 1. Starting a Service Call A service call is a starting point for service actions. You should begin all service actions with this procedure. From this point, you will be guided to the appropriate information to help you perform the necessary steps to repair the server. Note: In this topic, control panel and operator panel are synonymous. Before beginning, record information to help you return the server to the same state that the customer typically uses, such as: v The IPL type that the customer typically uses for the server v The IPL mode that is used by the customer on this server v The way in which the server is configured or partitioned 1.
Has problem analysis been performed using the procedures in Problem Analysis, SA76-0111?
Yes
No
?
Perform problem analysis using the procedures in Problem Analysis, SA76-0111. 2.
Is the failing server managed by a Hardware Management Console (HMC)?
No
Yes
?
Continue with step 7 on page 2. 3.
Is the failing server managed by Integrated Virtualization Manager (IVM)?
No
Yes
?
Go to “Collect serviceable events in Integrated Virtualization Manager” on page 333, and then return here and continue with next step. 4.
Do you have a Field Replaceable Unit (FRU), location code, and an action plan to replace a failing FRU? © Copyright IBM Corp. 2008 1
No
Yes
?
Go to Chapter 3, “Locating FRUs,” on page 335. 5.
Do you have an action plan to perform an isolation procedure?
No
Yes
?
Go to Chapter 2, “Isolation Procedures,” on page 5. 6.
Choose an action from the following symptom list: v You have a reference code but no action plan: go to Problem Analysis, SA76-0111. v There is a hardware problem that does not generate a reference code: use the Table of Contents to find an appropriate isolation procedure or contact your next level of support. v To check your server’s firmware level or to update it: see the system’s operation guide. v You have an Engineering Change (EC) or a hardware upgrade to apply: use the instructions provided with the change or upgrade. v You suspect a software problem: contact software support v To perform diagnostics or another service function: use the Table of Contents to locate the common service procedure.This ends the procedure. 7.
Is the Hardware Management Console (HMC) connected and functional?
Yes
No
?
Start the HMC and attach it to the server. If you experience a problem with the HMC, refer to the Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. When the HMC is connected and functional, continue with the next step. 2 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
8.
Perform the following steps from the HMC that is used to manage the server. During these steps, refer to the service data that was gathered earlier: 1. In the Navigation Area, open Service Management. 2. Select Manage Events. The Manage Serviceable Events - Select Serviceable Events panel will be displayed. 3. Select the status of Open. 4. Select all other selections to ALL and click OK. 5. Scroll through the list and verify that there is a problem with the status of Open to correspond with the customer’s reported problem.Do you find the reported problem, or an open problem near the time of the reported problem?
Note: If you are unable to locate the reported problem, and there is more than one open problem near the time of the reported failure, use the earliest problem in the list.
Yes
No
?
Go to step 4 on page 1. 9.
Select the serviceable event you want to repair, and select Repair from the Selected menu. Follow the instructions displayed on the HMC. After you complete the repair procedure, the system automatically closes the serviceable event. This ends the procedure.
Note: If the Repair procedures are not available, go to step 4 on page 1 Chapter 1. Starting a Service Call 3
4 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 2. Isolation Procedures Isolation procedures are used together with diagnostic programs that are part of server firmware. If a server is connected to an HMC, these procedures are available on the HMC. Use the HMC procedures to continue isolating the problem. If the server does not have an HMC and you are directed to perform an isolation procedure, the procedures documented here are needed to continue isolating a problem. “Bus, high-speed link (HSL/RIO/12X) isolation information” on page 7Symbolic FRUs, failing items, and bus isolation procedures use the terms “partition” and “logical partition” to indicate any single partition in a system that has multiple partitions. If the system you are working on does not have multiple partitions, then the terms refer to the primary partition. “HSL/RIO 12X Isolation Procedures” on page 27Use RIO/HSL/12X isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Multi-adapter bridge (MAB) isolation procedures” on page 41Use multi-adapter bridge (MAB) isolation procedures if there is no HMC attached to the server. If the server is connected to the HMC, use the procedures that are available on the HMC to continue FRU isolation. “Communication isolation procedure” on page 47This topic contains the procedure necessary to isolate a communications failure. Please read and observe the following warnings when using this procedure. “Intermittent isolation procedures” on page 50These procedures help you to correct an intermittent problem. “I/O processor (IOP) isolation procedures” on page 60Use these procedures to isolate a failure in the multiple function I/O card. “Licensed internal code (LIC) isolation procedures” on page 90Use this section to isolate licensed internal code (LIC) problems. “Logical partition (LPAR) isolation procedure” on page 120These procedures help you to identify logical partition (LPAR) configuration conditions and the associated corrective actions. “Operations Console isolation procedures” on page 124These procedures help you to isolate a failure with the Operations Console. “Power isolation procedures” on page 128Use power isolation procedures for isolating a problem in the power system. Use isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Router isolation procedures” on page 162These procedures serve as a guide to the correct isolation procedures from the reference code tables. “SAS isolation procedures” on page 166Use the following Serial Attached SCSI (SAS) isolation procedures if an HMC is not attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Service processor isolation procedures” on page 202Use service processor isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Tape unit isolation procedures” on page 219This topic contains the procedures necessary to isolate a failure in a tape device. © Copyright IBM Corp. 2008 5
“Isolating problems on servers that run the AIX or Linux operating systems” on page 245Use the procedures for AIX or Linux servers if there is not an HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Collect serviceable events in Integrated Virtualization Manager” on page 333Use this procedure when using the Integrated Virtualization Manager (IVM) to check for serviceable events on your system. 6 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Bus, high-speed link (HSL/RIO/12X) isolation information Symbolic FRUs, failing items, and bus isolation procedures use the terms “partition” and “logical partition” to indicate any single partition in a system that has multiple partitions. If the system you are working on does not have multiple partitions, then the terms refer to the primary partition. Read all safety notices below before servicing the system and while performing a procedure. Note: Unless instructed otherwise, always power off the system before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “PCI bus isolation using AIX, Linux, or the HMC” on page 8Use this procedure if you are isolating a PCI bus problem from the HMC or while running AIX® or Linux®. “Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair” on page 9Use this procedure to verify a repair for the high-speed link, a system PCI bus, or for a multi-adapter bridge. Chapter 2. Isolation Procedures 7
“Analyzing a RIO/HSL/12X or PCI bus reference code” on page 11Use Word 7 of the reference code to determine the bus number, bus type, multi-adapter bridge number, multi-adapter bridge function number, and logical card number from the direct select address (DSA). “DSA translation” on page 12The Direct Select Address (DSA) may be coded in word 7 of the reference code. “Card positions” on page 12This information specifies card positions for the various server models. “Converting the loop number to RIO/HSL/12X port location labels” on page 14Use this table to convert the RIO/HSL/12X loop number to port location labels. “Fiber optic cleaning procedures” on page 15Use the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures? for all fiber channel connections such as those used in optical high speed link (HSL) connections or fibre channel attached devices. “Status indications for RIO/HSL” on page 15A status indication for a RIO/HSL loop is identified in the reference code. “RIO/HSL/12X connection problems” on page 22The failing component is the RIO/HSL/12X connection “HSL loop configuration and status form” on page 25Use this HSL loop configuration and status form to record the status of the HSL ports in the loop. “Installed features in a PCI bridge set form” on page 25Use this form to record the PCI bridge set card positions, and multi-adapter bridge function numbers. “RIO/HSL/12X link status diagnosis form” on page 26Use this form to record the status of the RIO/HSL/12X links.PCI bus isolation using AIX, Linux, or the HMC Use this procedure if you are isolating a PCI bus problem from the HMC or while running AIX or Linux. If you have an HMC, then this procedure should be performed from the HMC as part of the HMC directed service. If you do not have an HMC, then you should perform this procedure when directed by the maintenance package. Isolating a PCI bus problem while running AIX or Linux Can an IPL be run on the operating system? v No: Perform “MABIP52” on page 42. This ends the procedure. v Yes: Choose from the following: – If you are running AIX, go to “Running the online and eServer stand-alone diagnostics” on page 672 to isolate the PCI bus failure with online diagnostics in concurrent mode. – If you are running Linux, go to “Running the online and eServer stand-alone diagnostics” on page 672 to isolate the PCI bus failure with stand-alone diagnostics.This ends the procedure.Isolating a PCI bus problem from the HMC To isolate a PCI bus problem from the HMC, check the serviceable event view for the server for FRU part locations associated with the serviceable event, then continue with this procedure: 1. Did the serviceable event view provide the locations for the failing FRUs? Yes: Use those locations to exchange the given FRUs one at a time until the problem is resolved. This ends the procedure. No: Continue with the next step.
8 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
2. Go to “DSA translation” on page 12 to determine the Direct Select Address (DSA). 3. Perform the following: a. Record the bus number value (BBBB) from the DSA and convert it to decimal format. b. Search for the decimal system bus number in the partition resources screens on the HMC. c. Record the frame or unit type and continue with the next step.4. Record the Cc value from the DSA. Is the Cc value greater than 00? Yes: Continue with the next step. No: The multi-adapter bridge number and the multi-adapter function number have not been identified, and so the card slot cannot be identified using the DSA. Look in the HMC partition resources for non-reporting or non-operational hardware. That will indicate which cards in which positions need to be replaced. Refer to Finding part locations for the model you are working on for information about the multi-adapter bridge that controls those card slots. That multi-adapter bridge is also a FRU. This ends the procedure.5. Is the right-most character (c) F? No: Continue with the next step. Yes: Only the multi-adapter bridge number has been identified. Record the multi-adapter bridge number (left-most character of Cc) for later use. Because the card slot cannot be identified with the DSA, refer to Finding part locations for the model you are working on for information about the multi-adapter bridge that controls the card slots. Consider all card slots controlled by the multi-adapter bridge to be FRUs. This ends the procedure.6. Refer to “Card positions” on page 12 and use the BBBB and Cc values that you recorded to identify the card position. Then return to the procedure that sent you here. This ends the procedure.Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair Use this procedure to verify a repair for the high-speed link, a system PCI bus, or for a multi-adapter bridge. Within this procedure, the terms ?system? and ?logical partition? are interchangeable when used individually. 1. Perform this procedure from the logical partition you were in when you were sent to this procedure, or from the HMC if this error was worked from the HMC. 2. If you previously powered off a system or logical partition, or an expansion unit during this service action, then you need to power it off again. 3. Install all cards, cables, and hardware, ensuring that all connections are tight. You can use the system configuration list to verify that the cards are installed correctly. 4. Power on any expansion unit, logical partition or system unit that was powered off during the service action. Is one of the following true? v If the system or a logical partition was powered off during the service action, does the IPL complete successfully to the IPL or does Install the System display? v If an expansion tower was powered off during the service action, does the expansion tower power on complete successfully? v If any IOP or IOA card locations were powered off using concurrent maintenance during the service action, do the slots power on successfully? v If you exchanged a FRU that should appear as a resource or resources to the system, such as an IOA, or I/O bridge, does the new FRU’s resource appear in HSM as operational? Yes: Continue with the next step. Chapter 2. Isolation Procedures 9
No: Verify that you have followed the power off, remove and replace, and power-on procedures correctly. When you are sure that you have followed the procedures correctly, then exchange the next FRU in the list. If there are no more FRUs to exchange, then contact your next level of support. This ends the procedure. 5. Does the system or logical partition have mirrored protection? Select Yes if you are not sure. No: Continue with the next step. Yes: From the Dedicated Service Tools (DST) display, select Work with disk units, and resume mirrored protection for all units that have a suspended status. 6. Choose from the following options: v If you are working from a partition, from the Start a Service Tool display, select Hardware service manager and look for the I/O processors that have a failed or missing status. v If you are working from a HMC, look at the CEC properties.a. Choose the I/O tab. b. Look for IOAs or IOPs that have a failed or missing status.Are all I/O processor cards operational?Note: Ignore any IOPs that are listed with a status of not connected. Yes: Go to step 10. No: Display the logical hardware resource information for the non-operational I/O processors. For all I/O processors and I/O adapters that are failing; record the bus number (BBBB), board (bb) and card information (Cc). Continue with the next step. 7. Perform the following: a. Return to the Dedicated Service Tools (DST) display. b. Display the Product Activity Log. c. Select All logs and search for an entry with the same bus, board, and card address information as the non-operational I/O processor. Do not include informational or statistical entries in your search. Use only entries that occurred during the last IPL. Did you find an entry for the SRC that sent you to this procedure? No: Continue with the next step. Yes: Ask your next level of support for assistance. This ends the procedure. 8. Did you find a B600 6944 SRC that occurred during the last IPL? Yes: Continue with the next step. No: A different SRC is associated with the non-operational I/O processor. Go to the Chapter 1, “Starting a Service Call,” on page 1 procedure and look up the new SRC to correct the problem. This ends the procedure. 9. Is there a B600 xxxx SRC that occurred during the last IPL other than the B600 6944 and informational SRCs? Yes: Use the other B600 xxxx SRC to determine the problem. Go to the Chapter 1, “Starting a Service Call,” on page 1 and look up the new SRC to correct the problem. This ends the procedure. No: You connected an I/O processor in the wrong card position. Use the system configuration list to compare the cards. When you have corrected the configuration, go to the start of this procedure to verify the bus repair. This ends the procedure. 10. If in a partition, use the hardware service manager function to print the system configuration list. Are there any configuration mismatches? No: Continue with the next step. Yes: Ask your next level of support for assistance. This ends the procedure.11. You have verified the repair of the system bus. 10 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. If for this service action only an expansion unit was powered off or only the concurrent maintenance function was used for an IOP or IOA, then continue with the next step. b. Otherwise, perform the following to return the system to the customer: 1) Power off the system or logical partition. See “Powering on and powering off” on page 588 for details. 2) Select the operating mode with which the customer was originally running. 3) Power on the system or logical partition.12. If the system has logical partitions and the entry point SRC was B600 xxxx, then check for related problems in other logical partitions that could have been caused by the failing part. This ends the procedure. Analyzing a RIO/HSL/12X or PCI bus reference code Use Word 7 of the reference code to determine the bus number, bus type, multi-adapter bridge number, multi-adapter bridge function number, and logical card number from the direct select address (DSA). Physical card slot labels and card positions for PCI buses are determined by using the DSA and the appropriate system unit or I/O unit card positions. See “Card positions” on page 12 for details.
Table 1. RIO/HSL/12X and PCI reference code analysis
Word of the reference code
Control panel function
Panel function characters
Format
Description
1
11
1–8
B600 uuuu or B700 uuuu
uuuu = unit reference code (69xx)
1 – extended reference code information
11
9–16
iiii
Frame ID of the failing resource
1 – extended reference code information
11
17–24
ffff
Frame location
1 – extended reference code information
11
25–32
bbbb
Board position
2
12
1–8
MIGVEP62 or MIGVEP63
See Hardware SRC formats.
3
12
9–16
cccc cccc
Component reference code
4
12
17–24
pppp pppp
Programming reference code
5
12
25–32
qqqq qqqq
Program reference code high order qualifier
6
13
1–8
qqqq qqqq
Program reference code low order qualifier
7
13
9–16
BBBB Ccbb
See “DSA translation” on page 12
8
13
17–24
TTTT MMMM
Type (TTTT) and model (MMMM) of the failing item (if not zero)
9
13
25–32
uuuu uuuu
Unit address (if not zero) Chapter 2. Isolation Procedures 11
DSA translation The Direct Select Address (DSA) may be coded in word 7 of the reference code. This DSA is either a PCI system bus number or a RIO/HSL/12X loop number, depending on the type of error. With the following information, and the information in either the card position table (for PCI bus numbers) or the information in the loop-number-to-NIC-port table (for RIO/HSL/12X loop numbers), you can isolate a failing PCI bus or RIO/HSL/12X loop. Use the following instructions to translate the DSA: 1. Separate the DSA into the bus number, multi-adapter bridge number, and multi-adapter bridge function number. The DSA is of the form BBBB Ccxx, and separates into the following parts: v BBBB = bus number v C = multi-adapter bridge number v c = multi-adapter bridge function number v xx = not used2. Is the bus number less than 0684? Yes: The bus number is a PCI bus number in hexadecimal. Convert the number to decimal, and then continue with the next step. No: The bus number is a RIO/HSL/12X loop number in hexadecimal. Convert the number to decimal, and then go to step 4.3. Use one of the following guides to determine the type of system unit or expansion unit in which the bus is located: v If you are using a Hardware Management Console (HMC) interface, view the managed system’s properties on the HMC. v If you are using AIX or Linux, use the command line interface to determine the enclosure type. On the command line, type the following:
lshwres -r io --rsubtype bus The result will be in the form: unit_phys_loc=Uxxxx.yyy.zzzzzzz,bus_id=a, ...... Find the bus ID ?a? entry that matches the decimal bus number you determined in step 2. Using the corresponding Uxxxx value, look up the unit model or enclosure type using the Unit Type and Locations table in Chapter 3, “Locating FRUs,” on page 335.4. Perform one of the following: v If you are working with a PCI bus number, refer to “Card positions” to search for the bus number, the multi-adapter bridge number, and the multi-adapter bridge function number that matches the system unit or I/O tower type where the bus is located. This ends the procedure. v If you are working with a RIO/HSL/12X loop number, refer to “Converting the loop number to RIO/HSL/12X port location labels” on page 14 to determine the starting ports for the RIO/HSL/12X loop with the failed link. This ends the procedure. Card positions This information specifies card positions for the various server models. Use the table for the model you are working on: Table 2 on page 13: Card positions for models 8203-E4A, 9407-M15, and 9408-M25 Table 3 on page 13: Card positions for 5095, 0595, and 7311-D20 expansion units Table 4 on page 13: Card positions for 7311-D11 and 5790 expansion units Table 5 on page 14: Card positions for 7314-G30 expansion unit
12 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 2. Card positions for models 8203-E4A, 9407-M15, and 9408-M25
Bus number in DSA (hexadecimal/decimal)
Item designated by the DSA
Position label and PCI data width
200/512
Embedded SAS controller
-P1
201/513
Embedded USB controller
-P1
202/514
PCI-X IOA card
-P1-C4
203/515
PCI-X IOA card
-P1-C5
204/516
PCI-E IOA card
-P1-C1
205/517
PCI-E IOA card
-P1-C2
206/518
Auxiliary cache adapter
-P1-C9
207/519
PCI-E IOA card
-P1-C3
Table 3. Card positions for 5095, 0595, and 7311-D20 expansion units
Bus number
Multi-adapter bridge number in DSA
Multi-adapter bridge function number in DSA
Item designated by the DSA
Position label and PCI data width
xxxx (assigned by LIC)
2
0 and 1
PCI IOP
C01
xxxx (assigned by LIC)
2
2 and 3
PCI IOP or IOA card
C02
xxxx (assigned by LIC)
2
4 and 5
PCI IOP or IOA card
C03
xxxx (assigned by LIC)
2
6 and 7
IOA card
C04
xxxx (assigned by LIC)
2
F
Multi-adapter bridge
CB1
xxxx (assigned by LIC)
2
Not used
Not used
C05
xxxx (assigned by LIC)
2
0 and 1
PCI IOP
C06
xxxx (assigned by LIC)
2
2 and 3
PCI IOP or IOA card
C07
xxxx (assigned by LIC)
2
6 and 7
IOA card
C08
xxxx (assigned by LIC)
2
F
Multi-adapter bridge
CB1
Table 4. Card positions for 7311-D11 and 5790 expansion units
Bus number in DSA (hexadecimal)
Multi-adapter bridge number in DSA
Multi-adapter bridge function number in DSA
Item designated by the DSA
Position label and PCI data width
Lower PCI bus number
2
0
PCI IOA card
-P1-C1 (64 bit)
Lower PCI bus number
2
1
Lower PCI bus number
2
2
PCI IOA card
-P1-C2 (64 bit)
Lower PCI bus number
2
3
Lower PCI bus number
2
4
Unused
Lower PCI bus number
2
5 Chapter 2. Isolation Procedures 13
Table 4. Card positions for 7311-D11 and 5790 expansion units (continued)
Bus number in DSA (hexadecimal)
Multi-adapter bridge number in DSA
Multi-adapter bridge function number in DSA
Item designated by the DSA
Position label and PCI data width
Lower PCI bus number
2
6
PCI IOA card
-P1-C3 (64 bit)
Lower PCI bus number
2
7
Lower PCI bus number
2
F
Multi-adapter bridge
-P1
Higher PCI bus number
2
0
PCI IOA card
-P1-C4 (64 bit)
Higher PCI bus number
2
1
Higher PCI bus number
2
2
PCI IOA card
-P1-C5 (64 bit)
Higher PCI bus number
2
3
Higher PCI bus number
2
4
Unused
Higher PCI bus number
2
5
Higher PCI bus number
2
6
PCI IOA card
-P1-C6 (64 bit)
Higher PCI bus number
2
7
Higher PCI bus number
2
F
Multi-adapter bridge
-P1
Table 5. Card positions for 7314-G30 expansion unit
Bus number
Multi-adapter bridge number in DSA
Multi-adapter bridge function number in DSA
Item designated by the DSA
Position label and PCI data width
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C1 (64 bit)
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C2(64 bit)
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C3(64 bit)
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C4(64 bit)
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C5(64 bit)
xxxx (assigned by LIC)
0
0
PCI-X IOA card
-P1-C6(64 bit) Converting the loop number to RIO/HSL/12X port location labels Use this table to convert the RIO/HSL/12X loop number to port location labels. 14 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 6. Converting the loop number to port location labels for Model 8203-E4A, 9407-M15, and 9408-M25.
Loop number (hex / dec)
Model
FRU position
RIO/HSL/12X port labels on system unit or processor tower
0686 / 1670
8203-E4A, 9407-M15, and 9408-M25 with RIO
-P1-C8
-P1-C8-T1 -P1-C8-T2
0688 / 1672
8203-E4A, 9407-M15, and 9408-M25 with RIO
-P1-C6
-P1-C6-T1 -P1-C6-T2
0781 / 1921
8203-E4A, 9407-M15, and 9408-M25 with 12X
-P1-C8
-P1-C8-T1 -P1-C8-T2
0782 / 1922
8203-E4A, 9407-M15, and 9408-M25 with 12X
-P1-C6
-P1-C6-T1 -P1-C6-T2 Fiber optic cleaning procedures Use the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures? for all fiber channel connections such as those used in optical high speed link (HSL) connections or fibre channel attached devices. See Chapter 8, “Part Information,” on page 691. This ends the procedure. Status indications for RIO/HSL A status indication for a RIO/HSL loop is identified in the reference code. Use the table below to determine if the status indication requires a service action. Record the rightmost four characters of word 4 of the reference code. These characters are the program return code (PRC), which indicates the RIO/HSL status. The leftmost four characters of word 7 indicate the RIO/HSL loop number (in hexadecimal format).
Table 7. Status indicated by the PRC
PRC
Indicated status
3204
A RIO error was detected, indicating that a RIO link failed. 1. To diagnose the error read through the transport manager flight recorder. 2. Check the failing link. If the link shows not to have failed, check the devices connected to either end of the link. Chapter 2. Isolation Procedures 15
Table 7. Status indicated by the PRC (continued)
PRC
Indicated status
3205
During IPL, LIC determined that the loop was not complete. v This is expected if there are no I/O units on the loop. v This error can also occur when an I/O unit, shared I/O unit, or another system on the loop did not complete powering on by the time this system’s LIC checked the loop for completeness. As a result, you may see this error in the serviceable event view you are working with. v When you find the same reference code logged from the same IPL against the same resource with a PRC of 3206 or 3208, the problem no longer exists. This can happen because the error was recovered when RIO/HSL hardware came on line, was properly configured, or the diagnostic code determined that there was not a problem based on the combination of machine types, features, configuration, and topology. In this case, you may close the problem entry. v In i5/OS®, this error may also appear in the serviceable event view if any I/O units were removed from the loop without deleting the RIO/HSL I/O bridge resources of those units from Hardware Service Manager (HSM). The service procedure identified with the reference code that sent you here will help you determine if the loop is functioning correctly or if service is required. v This error may also be caused by a problem in a rack, frame, or unit connected to the RIO/HSL loop if the problem prevents the unit from powering on or being detected by LIC. Follow the service procedures for this reference code. When necessary, you may be directed to work on other reference codes before returning to this procedure.
3206
During normal operation an RIO/HSL loop recovered its redundant path. The loop is now complete.
3208
During normal operation an RIO/HSL I/O bridge recovered a failed link on the loop.
3209
See “Indicated status for PRC 3209” below.
3210
An RIO/HSL link switched to a slower speed. The link is designed to run at a faster speed based on the link’s hardware and LIC levels at both ends. If there is a FRU list in the serviceable event view, use it to complete the repair action. If not, perform the following sections of this procedure below: 1. “Determining the RIO/HSL port label” on page 17 2. “Replacing the link’s failing end point FRUs” on page 21 (replace only the ?From Frame ID? end point FRU) Indicated status for PRC 3209 Recoverable CRC (cyclical redundancy check) errors have occurred on the loop. This error requires service action. An RIO/HSL cable or connection must be exchanged. Exchange only the cable that appears in the FRU list of this reference code in this document. If you are working from the serviceable event view, then only the FRU(s) required for this error will be displayed. Choose from the following scenarios: v If there is a serviceable event view entry with a cable FRU listed with a loop number, enclosure/unit ID, and port label identified with the cable FRU, then perform one of the following: – If the cable is optical, it is possible the optical connections need cleaning. You can choose either to clean the cable connections at each end without exchanging the cable, or to exchange the cable. – If the cable is copper, examine the screws that hold the connector at the end of the cable identified in the reference code or the first cable location listed in the serviceable event view entry. It is possible to get CRC errors when the connector screws are not tight. You can choose to tighten the cable connector screws without exchanging the cable only if they are loose. Otherwise, you must exchange the cable. – If neither of these actions resolves the problem, replace the FRUs in the serviceable event view one at a time. This ends the procedure. 16 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If the serviceable event view entry does not list any cable FRUs, then exchange the failing items listed in the serviceable event view entry by following the normal service procedures for those FRUs. This will be the case when the RIO/HSL connection is embedded. This ends the procedure. v If the serviceable event view entry has cable FRU(s) listed, but the loop number, enclosure/unit ID, and port label are not all listed with the cable FRU, go to “Determining the RIO/HSL port label” below.Determining the RIO/HSL port label Retrieve and record the following information: v Loop number. The loop number is displayed in hexadecimal format as the four leftmost digits of word 7 in the reference code. Convert the loop number to decimal format using “DSA translation” on page 12. Record both the hexadecimal and decimal formats of the loop number. If the loop is an internal loop on a 515, 520, 525, or 570 (loops 0680, 0683, 0686, 0689 hexadecimal or 1664, 1667, 1670, 1673 decimal), then record ?Internal?. v Frame ID. The frame ID is displayed in hexadecimal format as the four leftmost digits of word 5 of the reference code. You must convert the frame ID to decimal format to match what is displayed in user interfaces and problem views. Record both the hexadecimal and decimal formats of the frame ID. If the frame ID is not zero, then translate the frame ID into the correct machine type, model, and serial number by performing the following: 1. Log on to SST/DST. Attention: Do not IPL to DST. 2. Select Hardware Service Manager. 3. Select Packaging resources. 4. Selecting Display details for each unit listed until the frame ID matches the ID you are working with. Once you find the matching frame ID, record the unit’s machine type, model, and serial number.Note: The frame has an ID of 0000 at this point in the procedure. A frame ID of zero is indicating the NIC/RIO controller in a system unit. v Port number indicator. The port number indicator is the four rightmost digits of word 5 of the reference code.Use the frame ID and port number indicator in the following table to determine the RIO/HSL port label. If you are referred to “Converting the loop number to RIO/HSL/12X port location labels” on page 14, the failing item is in a system unit. You will need the RIO/HSL loop number to determine the FRU. The following tables provide hub, port, and location code information for each of the currently supported models. This data is used to determine which RIO loop to use for the B7006981. The asterisk in the ?Port indicator number? column indicates the leading port of the port pair.
Table 8. Determining which RIO/HSL port label to use on a 515, 520, 525, and 550
Model number
Loop number
Hub
Port indicator number
Location code
515, 520, and 525
1668
0
0*
?Internal?
1
?Internal?
1669
2*
P1-T4
3
P1-T3 Chapter 2. Isolation Procedures 17
Table 8. Determining which RIO/HSL port label to use on a 515, 520, 525, and 550 (continued)
Model number
Loop number
Hub
Port indicator number
Location code
550
1668
0
0*
?Internal?
1
?Internal?
1669
2*
P1-T12
3
P1-T11
1676
4
0
P1-C6-T2 (In an expansion card not-onboard)
1
P1-C6-T1 (In an expansion card not-onboard)
Table 9. Determining which RIO/HSL port label to use on a 570
Model number
Loop number
CEC
Hub
Port indicator number
Location code
570
1668
0
0
0*
?Internal? (Top CEC)
1
?Internal?
1669
0
0
2*
P1-T8
3
P1-T9
1676
0
4
0*
P1-C7-T2
1
P1-C7-T1
570
1684
1
8
0*
?Internal? (Second down)
1
?Internal?
1685
1
8
2*
P1-T8
3
P1-T9
1692
1
12
0
P1-C7-T2
1
P1-C7-T1
570
1700
2
16
0*
?Internal? (Third down)
1
?Internal?
1701
2
16
2*
P1-T8
3
P1-T9
1708
2
20
0
P1-C7-T2
1
P1-C7-T1
570
1716
3
24
0*
?Internal? (Fourth down)
1
?Internal?
1717
3
24
2*
P1-T8
3
P1-T9
1724
3
28
0
P1-C7-T2
1
P1-C7-T1 18 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 10. Determining which RIO/HSL port label to use on a 575
Model number
Loop number
Hub
Port indicator number
Location code
575
1668
0
0*
?Internal?
1
?Internal?
1669
2*
P1-T7
3
P1-T8
Table 11. Determining which RIO/HSL port label to use on a 590 and 595
Model number
Loop number
Hub
Port indicator number
Location code
590 / 0595
1668
0
0*
?Internal? FSPA
1
?Internal?
1669
0
2*
P1-C1-T1
3
P1-C1-T2
1670
1
0*
?Internal? FSPB
1
?Internal?
1671
1
2*
P1-C3-T1
3
P1-C3-T2
1672
2
2*
P1-C5-T1
3
P1-C5-T2
1674
3
2*
P1-C6-T1
3
P1-C6-T2
1676
4
2*
P1-C8-T1
3
P1-C8-T2
1678
5
2*
P1-C9-T1
3
P1-C9-T2
1680
6
2*
P1-C11-T1
3
P1-C11-T2
1682
7
2*
P1-C13-T1
3
P1-C13-T2 Chapter 2. Isolation Procedures 19
Table 11. Determining which RIO/HSL port label to use on a 590 and 595 (continued)
Model number
Loop number
Hub
Port indicator number
Location code
590 / 0595
1684
8
0*
?Internal? FSB (Second node position)
1
?Internal?
1685
8
2*
P1-C1-T1
3
P1-C1-T2
1686
9
0*
?Internal? FSPA
1
?Internal?
1687
9
2*
P2-C3-T1
3
P2-C3-T1
1688
10
2*
P2-C5-T1 (Optional card, no HSL)
3
P2-C5-T2
1690
11
2*
P2-C6-T1 (Required empty airflow)
3
P2-C6-T2
1692
12
2*
P2-C8-T1
3
P2-C8-T2
1694
13
2*
P2-C9-T1
3
P2-C9-T2
1696
14
2*
P2-C11-T1
3
P2-C11-T2
1698
15
2*
P2-C13-T1
3
P2-C13-T2
590 / 0595
1700
16
2*
P3-C1-T1 (Third position)
3
P3-C1-T2
1702
17
2*
P3-C3-T1
3
P3-C3-T2
1704
18
2*
P3-C5-T1
3
P3-C5-T2
1706
19
2*
P3-C6-T1
3
P3-C6-T2
1708
20
2*
P3-C8-T1
3
P3-C8-T2
1710
21
2*
P3-C9-T1
3
P3-C9-T2
1712
22
2*
P3-C11-T1
3
P3-C11-T2
1714
23
2*
P3-C13-T2
3
P3-C13-T2 20 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 11. Determining which RIO/HSL port label to use on a 590 and 595 (continued)
Model number
Loop number
Hub
Port indicator number
Location code
590 / 0595
1716
24
2*
P4-C1-T1
3
P4-C1-T2
1718
25
2*
P4-C3-T1
3
P4-C3-T2
1720
26
2*
P4-C5-T1
3
P4-C5-T2
1722
27
2*
P4-C6-T1
3
P4-C6-T2
1724
28
2*
P4-C8-T1
3
P4-C8-T2
1726
29
2*
P4-C9-T1
3
P4-C9-T2
1728
30
2*
P4-C131-T1
3
P4-C11-T2
1730
31
2*
P4-C13-T1
3
P4-C13-T2 Note: For 570, exchange the FRU on the correct unit (primary or secondary) by matching the serial number (if available in the FRU list on the serviceable event view) or by matching the loop number to the correct unit using “Converting the loop number to RIO/HSL/12X port location labels” on page 14. Determining if the cable is the cause of the problem If there is a cable attached to the failing port: v If the cable is optical, it is possible the optical connections need cleaning. You can choose either to clean the cable connections at each end without exchanging the cable, or to exchange the cable. v If the cable is copper, examine the screws that hold the connector at the end of the cable identified in the reference code or the first cable location listed in the serviceable event view entry. It is possible to get CRC errors when the connector screws are not tight. You can choose to tighten the cable connector screws without exchanging the cable only if they are loose. Otherwise, you must exchange the cable. v If neither of these actions resolves the problem, replace the cable. Perform “RIOIP08” on page 34 to determine the frame ID and RIO/HSL port label of the other end of the cable you will be exchanging (if you do not already know). Does this correct the problem? Yes: This ends the procedure. No: Continue with replacing the FRUs that the cable is connected to, starting with the FRU listed first.Replacing the link’s failing end point FRUs Use the following table to determine the end point FRUs on the ?From Frame ID? port. Replace the FRUs one at a time. Chapter 2. Isolation Procedures 21
If replacing the end point FRUs for the ?From Frame ID? port does not resolve the error, use the table below to replace the end point FRUs at the other end of the cable. Perform “RIOIP08” on page 34 to determine the system or I/O unit frame ID and RIO/HSL port label of the other end of the cable (if you do not already know).
Table 12. End point FRUs
Model
Loop number (hexadecimal / decimal)
End point FRUs
515, 520, 525, 570
0680 / 1664
The loop is embedded in the system unit planar. Exchange the system unit planar.
515, 520, 525, 570
0681 / 1665
For all port number values, the loop’s NIC/RIO controller is embedded in the system unit planar. Exchange the system unit planar.
570
0682 / 1666
For all port number values, the loop’s NIC/RIO controller is in the NIC/RIO controller card. Exchange the NIC/RIO controller.
570
0683 / 16670686 / 16700689 / 1673
The loop is embedded in the system unit planar. Exchange the system unit planar. Note: Exchange the FRU on the correct unit (primary or secondary) by matching the serial number (if available in the FRU list on the serviceable event view) or by matching the loop number to the correct unit using “Converting the loop number to RIO/HSL/12X port location labels” on page 14. This ends the procedure. RIO/HSL/12X connection problems The failing component is the RIO/HSL/12X connection When a RIO/HSL cable is disconnected, it may result in a lost connection between the units even after the cable is reconnected. Attention: To fix this problem, you need to cycle power on the unit with the locked RIO/HSL connection (see “Powering on and powering off an I/O expansion unit” on page 24 below). This problem does not occur with 12X cables.Note: If question marks (???) appear at the end of the location code, then the port could not be determined. Use the location code associated with the other end of the cable. If question marks appear for both port locations, use the isolation procedures suggested in the reference code table for this SRC. 1. Choose from the following: v If you were sent to this procedure from another symbolic FRU, locate that FRU in Table 13 on page 23 to see the description of the RIO/HSL/12X FRU. Then continue with the next step for more information about the FRU. v If you are working with this symbolic FRU in the FRU list, the failing component is an RIO/HSL/12X connection. Diagnostic code could not determine what kind of hardware was involved. The RIO/HSL/12X hardware can be any of the following: – Cable – Embedded RIO/HSL/12X link in a FRU (a planar, for example) – RIO/HSL/12X interposer cardThe RIO/HSL/12X link is on or between the other FRU or FRUs listed for the reference code. Continue with the next step.
22 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 13. RIO/HSL symbolic FRUs
RIO/HSL FRU
Description
HSL_OPT
This is an optical RIO/HSL cable. When exchanging optical RIO/HSL cables, use the optical cleaning kit and procedures. See “Fiber optic cleaning procedures” on page 15 for details. If an interposer card is called for, be sure the interposer type matches the cable type (optical or copper).
HSL1
There is a standard copper RIO/HSL cable at both ends (yellow connectors).
HSL1_UN
There is a standard copper RIO/HSL cable (yellow connector) at the detecting end, and an unknown connector type at the other end.
HSL2
There is a copper HSL2/RIO-G cable at both ends (black connectors), but the length of the cable could not be sensed.
HSL2_xx
There is a copper HSL2/RIO-G cable at both ends (black connectors). Use the xx value to determine the cable length from this list: v HSL2_01 = 1 meter HSL2 cable v HSL2_03 = 3 meter HSL2 cable v HSL2_08 = 8 meter HSL2 cable v HSL2_10 = 10 meter HSL2 cable v HSL2_15 = 15 meter HSL2 cable v HSL2_17 = 1.75 meter HSL2 cable v HSL2_25 = 2.5 meter HSL2 cable
HSLH
There is a hybrid RIO/HSL to HSL2/RIO-G cable (yellow connector at one end and black connector at the other end), but the length of the cable could not be sensed.
HSL_IB
This is an 12X cable (green connectors).
HSLH_xx
There is a hybrid RIO/HSL to HSL2/RIO-G cable (yellow connector at one end and black connector at the other end). Use the xx value to determine the cable length from this list: v HSLH_06 = 6 meter HSL/RIO to HSL2/RIO-G cable v HSLH_15 = 15 meter HSL/RIO to HSL2/RIO-G cable
HSL_I
There is a RIO/HSL interposer card for HSL/RIO cables (yellow connectors), HSL2/RIO-G cables (black connectors), or optical cables. The interposer card type could not be sensed.
HSL_I2 HSL_I3 HSL_I4
There is a RIO/HSL interposer card for HSL/RIO cables (yellow connectors), HSL2/RIO-G cables (black connectors), or optical cables. Use the x value to determine the interposer card type from this list: v HSL_I2 = Copper HSL2/RIO-G interposer card for HSL2/RIO-G cable connectors (black) in the system unit backplane position C08 or C09 of machine type 9406 model 825. v HSL_I3 = Optical HSL/RIO interposer card for optical RIO/HSL cables in system unit backplane position C08 or C09 of machine type 9406 model 825. v HSL_I4 = HSL2/RIO-G interposer/riser card on a 7040-61D I/O unit in location -P1-Riser or -P2-(Riser). 2. Choose from the following options: v If you are working from the serviceable event view, the location code or FRU description in the view will help determine the actual RIO/HSL/12X hardware to exchange. Continue with the next step. v If you are not working from the serviceable event view, or the view does not have a location code or better FRU description, then determine the location code of other FRUs in the FRU list for the error. Then continue with the next step.3. Use the location code and the information from the preceding table to determine the machine type, model, or unit feature involved in the error. If necessary, use the location code for other FRUs listed in Chapter 2. Isolation Procedures 23
the FRU list for this error to determine the failing RIO/HSL/12X connection and any related FRUs that are part of that connection. Use Chapter 3, “Locating FRUs,” on page 335 to find links to the locations information. Refer to the Parts Information chapter of the Service Guide for the failing machine type/model for part number information. Note: If you exchange all of the FRUs in the FRU list, but the problem still exists, contact your next level of support. You may be directed to exchange additional RIO/HSL/12X FRUs. “Additional RIO/HSL FRUs” has more information about RIO/HSL/12X FRUs on specific models and I/O units.Additional RIO/HSL FRUs The following are RIO/HSL/12X FRUs by model and/or unit type. For the model or unit type you are working on, there may be additional RIO/HSL/12X FRUs which were not listed in the FRU list of the error. Under the direction of your next level of support, you can try exchanging the additional FRUs. 1. In the following table, locate the unit type(s) on which you are working. Exchange the indicated RIO/HSL/12X loop connections (external or embedded) or RIO/HSL/12X interposer card. 2. Did the exchange correct the error? v Yes: The FRU you just replaced was the failing item. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. v No: Call your next level of support. This ends the procedure.
Table 14. RIO/HSL/12X cable or connections. To find locations information, refer to Chapter 3, “Locating FRUs,” on page 335 or the service guide for the I/O enclosure.
System model or unit type
Description
Location code
5094, 5096, 5294, 5296
HSL (optical) I/O bridge adapter
Un-CB1-C10
5094, 5096, 5294, 5296
HSL I/O bridge card
Un-CB1-C10
5094, 5096, 5294, 5296
HSL-2 I/O bridge adapter
Un-CB1-C10
7311-D11, 5790
RIO/HSL adapter
Un-P1-C7
0595, 5095
HSL (optical) I/O bridge adapter
Un-P1-C5
7311-D20, 0595, 5095
RIO/HSL I/O bridge adapter
Un-P1-C5
7311-D20, 0595, 5095
RIO/HSL-2 I/O bridge adapter
Un-P1-C5
7314-G30, 5796
Dual-port 12X adapter
Un-P1-C7
1519
Integrated xSeries Adapter Powering on and powering off an I/O expansion unit Follow these steps if you disconnected an RIO/HSL cable from an I/O expansion unit, and the RIO/HSL link on the unit did not recover when you reconnected the cable: 1. From the Hardware Service Manager screen, select Packaging hardware resources. 2. Select the frame that has just accidentally lost its RIO/HSL connection and select Concurrent maintenance. Then press Enter. 3. Select Power off domain. 4. After reconnecting the unit into the RIO/HSL loop, select Power on domain.
24 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
HSL loop configuration and status form Use this HSL loop configuration and status form to record the status of the HSL ports in the loop. Note: You may copy this form as necessary. HSL loop configuration and status worksheet for system _______________, Loop number ___________
Table 15. HSL loop configuration and status form
HSL resource information
Leading port information
Trailing port information
Resource type
Resource name
Frame ID
Port number (or internal)
Link status (operational or failed)
Port number (or internal)
Link status (operational or failed) Installed features in a PCI bridge set form Use this form to record the PCI bridge set card positions, and multi-adapter bridge function numbers. Note: You might find it helpful to copy this form as necessary.
Table 16. Installed features in a PCI bridge set
PCI bridge set card positions
Multi-adapter bridge function number
Record if IOP or IOA is installed.
0
1
2
3
4
5
6
7 Chapter 2. Isolation Procedures 25
RIO/HSL/12X link status diagnosis form Use this form to record the status of the RIO/HSL/12X links.
Column A (starting status)
Column B
Column C (column A is failed and column B is failed)
Column D
Column E (column B is failed and column D is failed)
Resource with failing link
Port info
Port status
Port status
Port status
First
Frame ID ____ Card Position ____ Port # ____
Port _0 (or internal) ____
Port _0 (or internal) ____
Port _0 (or internal) ____
Port _1 (or internal) ____
Port _1 (or internal) ____
Port _1 (or internal) ____
Second
Frame ID ____ Card Position ____ Port # ____
Port _0 (or internal) ____
Port _0 (or internal) ____
Port _0 (or internal) ____
Port _1 (or internal) ____
Port _1 (or internal) ____
Port _1 (or internal) ____ 26 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
HSL/RIO 12X Isolation Procedures Use RIO/HSL/12X isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “CONSL01”Use this procedure to exchange the I/O processor (IOP) for the system or partition console of i5/OS. “RIOIP01” on page 28Use this procedure to isolate a failure in a RIO/HSL/12X loop using i5/OS service tools. “RIOIP06” on page 34Use HSM to examine the RIO/HSL/12X Loop to determine if other systems are connected to the loop. “RIOIP08” on page 34Starting with the unit ID and RIO/HSL/12X port for one end of an RIO/HSL/12X cable, determine the unit ID and port location for the other end. “RIOIP09” on page 35This procedure offers a description and service action for RIO/HSL/12X reference code B600 6982. “RIOIP10” on page 36Use this procedure to determine if the RIO/HSL/12X loop is complete (with both primary and redundant paths functioning for each unit on the loop). “RIOIP11” on page 36Use this procedure to recover from a B7xx 6982 RIO/HSL/12X failure. “RIOIP12” on page 38Use this procedure to recover from a B7xx 6985 RIO/HSL/12X failure. “RIOIP56” on page 39Use this procedure to restore the RIO 12X link to optimal bandwidth.CONSL01 Use this procedure to exchange the I/O processor (IOP) for the system or partition console of i5/OS. 1. Is the system managed by an HMC? No: Go to step 6 on page 28. Yes: The HMC will be required for this procedure. Move to the HMC and continue with the next step only if the HMC is functional.2. Can the customer power off the partition at this time? Yes: Power off the partition from the operating system console or the HMC. Then, continue with the next step. No: The IOP controlling the partition’s console may be controlling other critical resources. The partition must be powered off to exchange this IOP. Perform this procedure when the customer is able to power off the partition. Then, continue with the next step.3. Perform the following to determine the unit machine type, model, and serial number where the console IOP is located and the location of the console IOP: a. From the Navigation Area of the HMC, select the Management Environment. b. Select and expand the HMC environment for this HMC. c. Select and expand Server and Partition. d. Select Server Management. e. Double-click the i5/OS partition you are working on. f. Select the Settings tab. g. Record the location of the load source IOP. The unit type, model, and serial number are the first three parts of the location code and are separated by periods. h. Continue with the next step.
Chapter 2. Isolation Procedures 27
4. Record the frame type or feature by using the frame ID and system configuration listing or by locating the frame with that ID and recording the frame type or feature. 5. Perform the following to exchange the IOP in that card position: a. Go to Chapter 3, “Locating FRUs,” on page 335 and select the unit type and model that you recorded. b. Locate the card position in the FRU locations table and use the exchange procedure that is identified. c. Power on the partition. This ends the procedure.6. The problem is in the i5/OS partition of a system with one or more partitions that is not managed by an HMC. Use the table below to determine the location of the load source IOP and the appropriate exchange procedure.
Model
Load source IOP location
Load source IOA location
Link to locations information
MMA
-P1-C1
embedded SCSI in -P1
Chapter 3, “Locating FRUs,” on page 335 This ends the procedure.RIOIP01 Use this procedure to isolate a failure in a RIO/HSL/12X loop using i5/OS service tools. Follow the steps in the “Main task” and you will be directed to the proper subtasks. Note: During this procedure, you will be disconnecting and reconnecting cables. If errors concerning missing resources (such as disk units and RIO/HSL/12X failures) occur, ignore them. Missing resources will report in again when the loop reinitializes. Main task 1. Were you sent here from a B600 xxxx reference code? No: Continue with the next step. Yes: Use the serviceable event view and the system service documentation to search for a B700 xxxx reference code with the same last four characters reported at approximately the same time. If you find one, perform service on that reference code first, and when you close that problem, close this one as well. If you do not find one, continue with the next step.2. Before powering down any system unit or expansion unit, work with the customer to end all subsystems in all of the partitions using each partition’s console. 3. From the partition control panel, IPL the system or partition to Dedicated Service Tools (DST). Attention: Do not use function 21! 4. Are all system and expansion units on the loop powered on? Yes: Go to step 6. No: Continue with the next step.5. Perform the following: a. Power on all system and expansion units on the loop. If a frame cannot be powered on, perform the “Cannot power on unit” on page 33 subtask below, and then continue with step 6. b. Was the RIO/HSL/12X link error cleared up when the frames were powered back on? v No: Continue with the next step. v Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.6. Perform the following: 28 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Access the Service Action Log (SAL) entry for this error; the field replaceable units (FRUs) should be listed there. Look for part numbers and descriptions for the FRUs containing the RIO/HSL/12X port for two frames. There should also be a FRU for the cable between them. The locations information for the FRUs is the location of the failed ports on the failed link. b. Record the loop number from the SAL (if it is displayed there in one of the FRU descriptions) or from the first four characters of word 7 of the reference code. Go to “Converting the loop number to RIO/HSL/12X port location labels” on page 14 to determine which RIO/HSL/12X cables on the system you are working with. Is this information in the SAL? Yes: Continue with the next step. No: Perform “Manually detecting the failed link” on page 33 below, and then continue with the next step of the main task.7. Is the cable connecting the failed ports an optical cable? No: Go to step 9. Yes: Continue with the next step.8. Perform the following: a. Clean the RIO/HSL/12X cable connectors and ports using the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures?. b. To determine if cleaning the connectors and ports solved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.9. There are now three cases to consider. Continue with the appropriate subtask of this procedure: v “The ports on both ends of the failed link are in different system units on the loop.” v “The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit” on page 30. v “The ports on both ends of the failed link are in an I/O unit” on page 31.The ports on both ends of the failed link are in different system units on the loop 1. There may be failed hardware that will report a different error on the other system units. Perform the following: a. Resolve any other RIO/HSL/12X problems in the serviceable event view on the other system units. b. Perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.2. Is the cable an optical RIO/HSL/12X cable? Yes: Go to step 4 on page 30. No: Continue with the next step.3. Perform the following: a. Verify that the cables are connected securely. For any cable that was loose, disconnect the cable at that end, wait 30 seconds, and reconnect the cable securely. If there are thumbscrews, you must tighten both thumb screws within 30 seconds of when the cable makes contact with the port. Chapter 2. Isolation Procedures 29
b. If you disconnected and reconnected the cable at either end, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.4. Replace the cable between the two system unit ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.5. Exchange the FRU with the RIO/HSL/12X port in one of the system units. If you are working with a serviceable event view and the RIO/HSL/12X FRUs are listed, exchange the FRU corresponding to the first RIO/HSL/12X cable port listed. Otherwise, exchange the FRU that is quickest and easiest to replace). To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.6. Exchange the remaining FRU with the RIO/HSL/12X port on the other system unit. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.7. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.8. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure.The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit 1. Switch the two RIO/HSL/12X cables on the I/O unit with the failed port, so that each cable is connected to the port where the other cable was previously connected. Disconnect both cables at the same time, wait 30 seconds, and then reconnect the cables one at a time. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 2. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33 below. Then continue with the next step. 3. Is the port on the system unit that was failed now working? 30 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Continue with the next step. Yes: Exchange the RIO/HSL/12X bridge FRU in the I/O unit. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.4. Switch the cables back to their original positions by disconnecting both cables at the same time, waiting 30 seconds, and then reconnecting the cables one at a time. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 5. Exchange the cable between the two ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.6. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.7. Exchange the RIO/HSL/12X FRU that contains the failing port in the system unit. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.8. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure.The ports on both ends of the failed link are in an I/O unit 1. Switch the two RIO/HSL/12X cables on the first (or ?From?) cable’s I/O unit with the failed port so that each cable is connected to the port where the other cable was previously connected. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 2. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33 below. Then continue with the next step. 3. Is the port on the I/O unit on which you did not switch the cables now working? No: Go to step 5 on page 32 Yes: Exchange the RIO/HSL/12X I/O bridge card in the I/O unit where you just switched the cables. The continue with the next step. 4. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 and return to this point. Did the ports you were working on have a status of ?failed?? Chapter 2. Isolation Procedures 31
No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step. 5. Switch the cables back to their original positions. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 6. Switch the two RIO/HSL/12X cables on the second (or ?To?) I/O unit with the failed port so that each cable is connected to the port where the other cable was previously connected. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 7. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33. Then continue with the next step. 8. Is the port on the I/O unit on which you did not switch cables now working? No: Go to step 10. Yes: Exchange the RIO/HSL/12X I/O bridge card in the I/O unit where you just switched the cables. Then continue with the next step. 9. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.10. Switch the cables back to their original positions. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 11. Exchange the RIO/HSL/12X cable between the two ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33, then return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.12. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.13. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure.
32 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Cannot power on unit 1. Work the errors related to powering on the unit(s), and then continue with the next step. If a unit still cannot be powered on, re-cable the RIO/HSL/12X loop without the I/O units and system units that cannot be powered on, allowing the loop to be complete (no disconnected cables). 2. To determine if re-cabling the loop resolved the problem, perform “Manually detecting the failed link” below and return to this point.Manually detecting the failed link 1. Get the loop number from the reference code if you do not already have it. The loop number is a hexadecimal number in word 7 of the reference code. v If you are working from the Product Activity Log (PAL®), then the loop number is the 4 leftmost characters of the DSA in word 7 (BBBB). The loop number is in hexadecimal. Convert the hexadecimal loop number to decimal format before continuing with this procedure. v If you are working from the Service Action Log (SAL), the loop number should be displayed in the FRU description area in decimal format. 2. Sign on to SST or DST (if you have not already done so). Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Select Resources associated with loop for the RIO/HSL/12X loop with the failed link. The RIO/HSL/12X bridges will be displayed under the loop. 4. Select Display detail for the loop with the failed link. 5. Record the name of the NIC/RIO controller resource you are starting from on the display. You will need to know this name to determine if you have followed the loop around and back to this resource. 6. If the leading port does not have a status of ?failed?, select Follow leading port until a leading port with a ?failed? status is found, or the display is showing information for the starting NIC/RIO resource you recorded. Did you find a leading port with a status of ?failed?? No: The loop is functioning properly. Return to the subtask that sent you here. Yes: Record the resource name at the leading port with a ?failed? status, and the type, model, and serial number for the resource with the failed status. Continue with the next step. 7. Select Follow leading port one more time and note all the information for the resource name with a failed trailing port. 8. Select Display system information and note the power controlling system’s type, model, and serial number (and name, if available). This info may be needed for FRU replacement at a later time. 9. Select Cancel twice to return to the previous screen. 10. Go to each resource name (found above) and select Associated packaging resource(s). This gives the description of the failing item and the unit ID. 11. Select Display detail to find the part number and location associated with the possible failing item. Then return to the step that sent you here.Refresh the port status 1. Wait one minute, and then sign on to SST or DST (if you have not already done so). 2. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine and select Display detail ? Include non-reporting resources. 4. If the display is not already showing the ports for one of the units you are working on, then select Follow leading port. Continue to select Follow leading port until the display is showing the ports for one of the units you are working on. Note the status of the port you were working on. Select Follow leading port until the display is showing the ports for the other unit you are working on, and note the status of the port you were working on. Chapter 2. Isolation Procedures 33
5. Select Cancel ? Refresh ? Display detail for the failing resource you are checking. Note any change in the status for the resource. Then return to the step that sent you here.RIOIP06 Use HSM to examine the RIO/HSL/12X Loop to determine if other systems are connected to the loop. 1. Sign on to SST or DST (if you have not already done so). 2. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine, and select Resources associated with loop. 4. Search for Remote RIO/HSL/12X NICs on the loop. Are there any Remote RIO/HSL/12X NICs on the loop? Yes: You have determined that there are other systems connected to this loop. This ends the procedure. No: You have determined that there are not any other systems connected to this loop. This ends the procedure.RIOIP08 Starting with the unit ID and RIO/HSL/12X port for one end of an RIO/HSL/12X cable, determine the unit ID and port location for the other end. 1. Sign on to SST or to DST if you have not already done so. 2. Select Start a Service Tool ? Hardware Service Manager ? Logical Hardware Resources ? High Speed Link (HSL) Resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine, and select Resources associated with loop ? Include non-reporting resources. The display that appears shows the loop resource and all the ?HSL I/O Bridge? and all the ?Remote HSL NIC? resources connected to the loop. 4. Perform the following for each of the HSL I/O Bridge resources listed until you are directed to do otherwise. a. Move the cursor to the HSL I/O Bridge resource and select Associated packaging resource(s). b. Compare the unit ID on the display with the unit ID (in hexadecimal format) that you started with. Are the unit IDs the same? Yes: Continue with the next step. No: Select Cancel to return to the Logical Hardware Associated with HSL Loops display. Repeat this for each HSL I/O Bridge under the loop, until you are directed to do otherwise.5. Perform the following: a. Select Associated logical resource(s). b. Move the cursor to the HSL I/O Bridge resource and select Display detail. c. Examine the Leading port and Trailing port information. Search the display for the RIO/HSL/12X port location label that you recorded prior to starting this procedure. If the label is part of the information for the Leading port, then select Follow leading port. If the label is part of the information for the Trailing port, then select Follow trailing port. d. Perform the step below that matches the function you selected in the previous step: v If you selected Follow leading port, then examine the display for the Trailing port information. Record, on the worksheet that you are using, the RIO/HSL/12X port location label shown on the ?Trailing port from previous resource? line. Record this information as the ?To HSL Port Label?. v If you selected Follow trailing port, then examine the display for the Leading port information. Record, on the worksheet that you are using, the RIO/HSL/12X port location label on the ?Leading port to next resource? line. Record this information as the?To HSL Port Label?.
34 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
e. Record the ?Link type? (Copper or Optical) on the worksheet that you are using in the field describing the cable type. f. Select Cancel ? Cancel ? Cancel to return to the Logical Hardware Associated With HSL Loops? display. g. Record the resource name on the display. h. Move the cursor to the resource with the resource name you recorded in step 5g. i. Select Associated packaging resource(s). j. Record the unit ID. k. Return to the procedure that sent you here. This ends the procedure.RIOIP09 This procedure offers a description and service action for RIO/HSL/12X reference code B600 6982. Note: A fiber optic cleaning kit may be required for optical RIO/HSL/12X connections.Note: This reference code can occur on an RIO/HSL/12X loop when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. 1. Is the reference code in the Service Action Log (SAL) or serviceable event view you are using? Yes: There is a connection failure on an RIO/HSL/12X link. A B600 6984 reference code may also appear in the Product Activity Log (PAL) or error log view you are using. Both reference codes are reporting the same problem. Continue with the next step. No: The reference code is only informational, and requires no service action. This ends the procedure.2. Multiple B600 6982 errors may occur due to retry and recovery activity. Is there a B600 6985 with ?xxxx 3206? in word 4 logged after all B600 6982 errors for the same RIO/HSL/12X loop in the PAL? Yes: The recovery efforts were successful. Close all of the B600 6982 entries for the same loop in the SAL. No service is required. This ends the procedure. No: Continue with the next step.3. Is there a B600 6987 reference code in the SAL, or serviceable event view you are using, logged at about the same time? Yes: Close this problem and work the B600 6987. This ends the procedure. No: Continue with the next step.4. Is there a B600 6981 reference code in the SAL, or serviceable event view you are using, logged at approximately the same time? Yes: Go to step 9 on page 36. No: Continue with the next step.5. Perform “RIOIP06” on page 34 to determine if this loop connects to any other systems and then return here. Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU. Is this loop connected to other systems? Yes: Continue with the next step. No: Go to step 9 on page 36.6. Check for RIO/HSL/12X failures in the serviceable event views on the other systems. RIO/HSL/12X failures are indicated by entries with RIO/HSL/12X I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Are there RIO/HSL/12X failures on other systems? Yes: Continue with the next step. No: Go to step 9 on page 36.
Chapter 2. Isolation Procedures 35
7. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985 reference code, with this loop’s resource name, that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 9.8. For the B600 6985 reference code you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? Yes: The problem has been resolved. Use “RIOIP01” on page 28 to verify that the loop is now working properly. This ends the procedure. No: Continue with the next step.9. The FRU list displayed in the SAL, or serviceable event view you are using, may be different from the failing item list given here. Use the FRU list in the serviceable event view if it is available. Does the reference code appear in the serviceable event view with HSL_LNK or HSLxxxx listed as a symbolic FRU? Yes: Perform “RIOIP01” on page 28. This ends the procedure. No: Exchange the FRUs in the serviceable event view according to their part action codes. This ends the procedure.RIOIP10 Use this procedure to determine if the RIO/HSL/12X loop is complete (with both primary and redundant paths functioning for each unit on the loop). 1. Is the system managed by a Hardware Management Console? Yes: Continue with the next step No: Go to step 3.2. The RIO/HSL/12X loop number found in the first 4 characters of word 7 of the SRC that sent you here is in hexadecimal. Convert this number to decimal. Locate the decimal loop number’s information in the RIO/HSL/12X Topology screen on the HMC. Are all links in this loop operational? Yes: The RIO/HSL/12X loop recovered. Return to the procedure that sent you here. This ends the procedure. No: The RIO/HSL/12X loop did not recover. Return to the procedure that sent you here. This ends the procedure.3. Search in Advanced System Management Interface (ASMI) for a B700 6985 informational SRC logged after the RIO/HSL/12X SRC you are working on. Did you find a B7000 6985 SRC? Yes: Continue with the next step. No: The status of the RIO/HSL/12X loop is unknown. Return to the procedure that sent you here. This ends the procedure.4. Compare the first half of word 7 in the B700 6985 informational log to the value that caused you to be sent to this procedure. Are the two values the same? Yes: The RIO/HSL/12X loop recovered. Return to the procedure that sent you here. This ends the procedure. No: The status of the RIO/HSL/12X loop is unknown. Return to the procedure that sent you here. This ends the procedure.RIOIP11 Use this procedure to recover from a B7xx 6982 RIO/HSL/12X failure. 1. Record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10.” 2. Did the RIO/HSL/12X loop recover? 36 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Continue with the next step Yes: Close the problem. This ends the procedure. 3. Work with the customer to determine if a processor enclosure or I/O enclosure on the RIO/HSL/12X loop has powered down normally. 4. Was a processor enclosure or I/O enclosure on the loop powered down normally? No: Go to 6. Yes: The loop remains in a failed state until all processor enclosures and I/O enclosures on the loop are powered on and functioning. Work with the customer to determine if all the powered down enclosures on the loop can be powered on. After all enclosures on the loop are powered on, continue with the next step. 5. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 6. Search for a serviceable event with a 1xxx xxxx SRC logged at approximately the same time and with one or more FRUs in the same unit as those in the FRU list for the SRC you are currently working. 7. Did you find a serviceable event with a 1xxx xxxx SRC? No: Go to 9 Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 8. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 9. In the serviceable event view, search for a B700 6981 error logged at approximately the same time and on the same RIO/HSL/12X loop (the first four characters of word 7 are the same). 10. Did you find a serviceable event with a B700 6981 SRC at approximately the same time and on the same RIO/HSL/12X loop? No: Go to 12. Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36.11. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.12. Using the FRU list that you are working with for this SRC, exchange one FRU at a time. After you exchange each FRU, determine if the loop has recovered.To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. After the loop recovers or after you have exchanged all the FRUs, continue with the next step. To replace a FRU, refer to Chapter 3, “Locating FRUs,” on page 335. 13. Did the RIO/HSL/12X loop recover? No: Contact your next level of support. This ends the procedure. Yes: Close the problem. This ends the procedure.
Chapter 2. Isolation Procedures 37
RIOIP12 Use this procedure to recover from a B7xx 6985 RIO/HSL/12X failure. 1. Work with the customer to determine if a processor enclosure or I/O enclosure on the RIO loop has powered down normally. 2. Was a processor enclosure or I/O enclosure on the loop powered down normally? No: Go to 4. Yes: The loop remains in a failed state until all processor enclosures and I/O enclosures on the loop are powered on and functioning. Work with the customer to determine if all the powered down enclosures on the loop can be powered on. After all processor enclosures and I/O enclosures on the loop are powered on, check to determine if the RIO/HSL/12X loop is complete. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 3. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 4. In the serviceable event view, search for a serviceable event with a 1xxx xxxx SRC logged at approximately the same time and with one or more FRUs in the same enclosure as those in the FRU list for the SRC you were currently working with. 5. Did you find a serviceable event with a 1xxx xxxx SRC? No: Go to 7 Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 6. Did the RIO/HSL/12X loop recover? No:: Continue with the next step. Yes:: Close the problem. This ends the procedure. 7. In the serviceable event view, search for a B700 6981 or B700 6986 error logged at approximately the same time and on the same RIO/HSL/12X loop (the first four characters of word 7 are the same). 8. Did you find a B700 6981 or a B700 6986 error logged at approximately the same time and on the same RIO/HSL/12X loop?. No: Go to step 10. Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 9. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.10. Search for a processor enclosure or I/O enclosure on the RIO/HSL/12X loop that has not powered up as expected. 11. Did you find a processor enclosure or I/O enclosure on the RIO/HSL/12X loop that has not powered up as expected? No: Go to 13 on page 39. Yes: Go to “Cannot power on SPCN-controlled I/O expansion unit” on page 136 and work that power symptom. Use the first half of word 7 to determine the loop number for later use. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working 38 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
that power symptom, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36.12. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.13. Using the FRU list that you are working with for this SRC, exchange one FRU at a time. After you exchange each FRU, determine if the loop has recovered. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. After the loop recovers or after you have exchanged all the FRUs, continue with the next step. To replace a FRU, refer to Chapter 3, “Locating FRUs,” on page 335. 14. Did the RIO/HSL/12X loop recover? No: Contact your next level of support. This ends the procedure. Yes: Close the problem. This ends the procedure.RIOIP56 Use this procedure to restore the RIO 12X link to optimal bandwidth. 1. Record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC. The loop number is in hexadecimal format and must be converted to decimal. 2. Is the system managed by a Hardware Management Console? Yes: Continue with the next step. No: Go to step 6 on page 40.3. Perform the following from the Hardware Management Console: a. Select the system you are working on. b. Navigate to Hardware (information) and expand it. c. Select View RIO -12X Topology. d. In the current topology area, scroll until you find the data for the RIO/HSL/12X decimal loop number you identified in step 1.Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue with the next step.4. Unplug both ends of the cable indicated in the FRU list for at least 30 seconds and then reconnect it. Refresh the View RIO-12X Topology view on the Hardware Management Console and verify that the width is now 12X for the decimal loop number you identified in step 1. Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue with the next step.5. Replace the cable. Refresh the View RIO-12X Topology view on the Hardware Management Console and verify that the width is now 12X for the decimal loop number you identified in step 1. Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue replacing the items in the FRU list until the problem is resolved. This ends the procedure.
Chapter 2. Isolation Procedures 39
6. Unplug both ends of the cable indicated in the FRU list for at least 30 seconds and then reconnect the cable. It is not possible to concurrently verify that the RIO 12X link has been restored to optimal bandwidth. If the same SRC occurs for this RIO 12X link after the next IPL, the problem has not been resolved. Replace the cable and check for the error condition after the next IPL. Continue replacing the items in the FRU list, and perform an IPL the system each time until the problem has been resolved. This ends the procedure. 40 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Multi-adapter bridge (MAB) isolation procedures Use multi-adapter bridge (MAB) isolation procedures if there is no HMC attached to the server. If the server is connected to the HMC, use the procedures that are available on the HMC to continue FRU isolation. “MABIP02”Use this procedure to resolve a problem with a multi-adapter bridge. “MABIP03”Use this procedure to isolate a failing PCI adapter under a multi-adapter bridge. “MABIP05”Use this procedure to reset an IOP. “MABIP50” on page 42Use this procedure to isolate a failing PCI I/O adapter under a multi-adapter bridge. “MABIP51” on page 42Use this procedure to resolve a problem with a multi-adapter bridge. “MABIP52” on page 42This procedure will isolate a failing PCI adapter from a reference code when an IPL is not successful on the system or logical partition. “MABIP53” on page 43Use this procedure to determine a card position when no location is given for a PCI adapter FRU. “MABIP54” on page 43Use this procedure to isolate the failing PCI I/O adapter from a reference code with a Direct Select Address when the serviceable event view does not indicate a location for the PCI adapter. “MABIP55” on page 43Use this procedure to isolate a failing I/O adapter.MABIP02 Use this procedure to resolve a problem with a multi-adapter bridge. Perform “MABIP51” on page 42. MABIP03 Use this procedure to isolate a failing PCI adapter under a multi-adapter bridge. Perform “MABIP50” on page 42. MABIP05 Use this procedure to reset an IOP. Attention: When the IOP reset is performed, all resources controlled by the IOP will be reset. Perform this procedure only if the customer has verified that the IOP reset can be performed at this time. 1. Go to the SST/DST display in the partition which reported the problem. Use STRSST if i5/OS is running; use function 21 if STRSST does not work; or IPL the partition to DST. 2. On the Start Service Tools Sign On display, type in a user ID with service authority and password. 3. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. 4. Page forward until you find the IOP that you want to reset. For help in identifying the IOP from the Direct Select Address (DSA) in the reference code, see “DSA translation” on page 12. 5. Verify that the IOP are correct by matching the resource name(s) on the display with the resource name(s) in the Service Action Log (SAL) for the problem you are working on. Chapter 2. Isolation Procedures 41
6. Move the cursor to the IOP that you want to reset, and select I/O Debug ? Reset IOP ? IPL IOP.This ends the procedure.MABIP50 Use this procedure to isolate a failing PCI I/O adapter under a multi-adapter bridge. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC MABIP51 Use this procedure to resolve a problem with a multi-adapter bridge. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC MABIP52 This procedure will isolate a failing PCI adapter from a reference code when an IPL is not successful on the system or logical partition. Attention: The removal and replacement procedure of all FRUs in this procedure are performed using dedicated maintenance (partition powered off). If not, the problem cannot be fixed. 1. Determine the PCI bridge set (multi-adapter bridge domain) by performing the following: a. Record the bus number (BBBB), the multi-adapter bridge number (C) and the multi-adapter bridge function number (c) from the Direct Select Address (DSA) in word 7 of the reference code. See “DSA translation” on page 12 for help in determining these values. b. Use the bus number that you recorded and the System Configuration Listing (or ask the customer) to determine what frame the bus is in. c. Record the frame type where the bus is located. d. Use the System Configuration Listing, the card position table for the frame type that you recorded, the bus number, and the multi-adapter bridge number to determine the PCI bridge set where the failure occurred. The PCI bridge set is the group of card positions controlled by the same multi-adapter bridge on the bus that you recorded. e. Use the card position table to record the PCI bridge set card positions. f. Examine the PCI bridge set in the frame, and record all the positions with IOA cards installed in them.2. Perform the following: a. Power off the partition. b. Remove all the IOA cards in the PCI bridge set identified in step 1. Be sure to record the card position of each IOA so that you can reinstall it in the same position later. To determine the removal and replacement procedures for the IOAs, go to Chapter 3, “Locating FRUs,” on page 335. c. Power on the partition. Does the reference code or failure that sent you to this procedure occur? No: Continue with the next step. Yes: The problem is the multi-adapter bridge. Continue with step 4 on page 43.3. Reinstall one of the IOAs and power on the partition. Does the reference code or failure that sent you to this procedure occur? Yes: The IOA that you just installed is the failing FRU. Replace the IOA.This ends the procedure. No: 42 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If a different SRC occurs, return to Chapter 1, “Starting a Service Call,” on page 1 and follow the service procedures for the new reference code. This ends the procedure. v If no SRC occurs and there are more IOAs to install, power off the partition and repeat this step. v If no SRC occurs and there are no more IOAs to install, the problem is intermittent; contact your next level of support. This ends the procedure. 4. Power off the partition. Determine which FRU contains the multi-adapter bridge. Locate the card position table for the frame type that you recorded. Perform the following: a. Using the multi-adapter bridge number that you recorded, search for the multi-adapter bridge function number ?F? in the card position table to determine the card position of the multi-adapter bridge’s FRU. b. Exchange the multi-adapter bridge’s FRU at the card position that you determined for it. See Chapter 3, “Locating FRUs,” on page 335 to determine the correct removal and replacement procedure. c. Install all IOAs in their original positions. d. Power on the partition.Does the reference code or failure that sent you to this procedure occur? No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support. This ends the procedure.MABIP53 Use this procedure to determine a card position when no location is given for a PCI adapter FRU. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. MABIP54 Use this procedure to isolate the failing PCI I/O adapter from a reference code with a Direct Select Address when the serviceable event view does not indicate a location for the PCI adapter. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. MABIP55 Use this procedure to isolate a failing I/O adapter. Attention: This procedure is for use with i5/OS only. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. 1. If the system is not IPLed, will it IPL to DST?
Option
Description
Yes
No
From the SAL display for the reference code, record the count. Continue with the next step.
Perform “MABIP54.” This ends the procedure. 2. Go to the SST/DST display in the partition which reported the problem. Use STRSST if i5/OS is running; use function 21 if STRSST does not work; or IPL the partition to DST. 3. On the Start Service Tools Sign On display, type in a user ID with QSRV authority and password. 4. Select Start a service tool > Hardware service manager > Logical hardware resources > System bus resources. Chapter 2. Isolation Procedures 43
5. Is there a resource name logged in the SAL entry?
No
Yes
Continue with the next step.
Go to step 12 6. Do you have a location for the I/O processor?
Option
Description
No
Yes
Record the Direct Select Address (DSA), word 7 of the reference code, from the SAL display. Then continue with the next step.
Go to step 10. 7. Return to the HSM System bus resources display. 8. Locate the I/O processor by performing the following: a. Select Display detail. b. Compare the DSA with the bus, card, and board information for the IOP. Note: The card information on the HSM display is in decimal format. You must convert the decimal card information to hexadecimal format to match the DSA format. c. Repeat this step until you find the IOP with the same DSA. 9. Select Cancel, and then go to step 13. 10. Locate the I/O processor in HSM by performing the following for each IOP: a. Select Associated packaging resource(s) > Display detail. b. Repeat until you find the IOP with the same location.11. Select Cancel > Cancel and go to step 13. 12. Page forward until you find the multi-adapter bridge and IOP where the problem exists. Verify that the multi-adapter bridge and IOP are correct by matching the resource name(s) on the display with the resource name(s) in the SAL for the problem you are working on. 13. For the IOP you are working on, select Resources associated with IOP (if the I/O adapters are not already displayed). 14. If there is an IOA that is listed in any state other than ?operational?, then perform steps 15 through 18 on page 45, starting with the disabled IOA by moving the cursor to the disabled IOA. Otherwise, move the cursor to the first IOA that is assigned to the IOP. 15. Select Associated packaging resource(s) > Concurrent maintenance > Power off domain. Record the unit ID of the slot you are powering off. Did the domain power off successfully? 44 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Yes
No
Perform “MABIP05” on page 41 and then return here and continue with the next step.
Choose from the following options: v If only one IOA was listed as failing, power down the system and replace the IOA. Re-IPL the system. If a different reference code occurred, perform problem analysis and work that reference code. If there was no reference code, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. v If there were multiple failed IOAs and concurrent maintenance did not work on one, then move to the next failed IOA and repeat steps 15 on page 44 through 18. v If concurrent maintenance does not work for multiple failed IOAs, this procedure will not be able to identify a failing I/O adapter. Return to the procedure that sent you here. This ends the procedure. v 16. Did the IOP reset and IPL successfully?
Yes
No
Check for the same failure that sent you to this procedure. Check the system control panel, the SAL for the partition that reported the problem, or the Work with partition status display for the partition that reported the problem. In the SAL, the count will increase if the reference code occurred again. Continue with the next step.
This procedure will not be able to identify a failing I/O adapter. Return to the procedure that sent you here. This ends the procedure. 17. Did the same reference code occur after the IOP was reset and IPL’d?
Yes
No
Perform the following: 1. Go to the Hardware Service Manager display. 2. Go to Packaging Hardware Resources. 3. Power on the IOA by selecting Power on domain. 4. Reassign the IOA to the IOP 5. Return to the HSL resource display, showing the IOP and associated resources. 6. Continue with the next step.
Go to step 19. 18. Is there any other IOA, assigned to the IOP, that you have not already powered off and on?
Header
Header
Move the cursor to another IOA assigned to the IOP, choosing IOAs with a status of ?unknown? or ?disabled? before moving on to IOAs with a status of ?operational?. Go to step 15 on page 44.
Go to step 21 on page 46. 19. The failing IOA is located. Exchange the I/O adapter that you just powered off. Use the location you recorded in step 15 on page 44 to locate the IOA. 20. Power on the IOA that you just exchanged. Does the same reference code that sent you to this procedure still occur? Chapter 2. Isolation Procedures 45
Yes
No
The IOA is not the failing item. Remove the IOA and reinstall the original IOA. Continue with the next step.
You have exchanged the failing IOA. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. 21. No failing IOAs were identified. Return to the procedure that sent you here. This ends the procedure. 46 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Communication isolation procedure This topic contains the procedure necessary to isolate a communications failure. Please read and observe the following warnings when using this procedure. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “COMIP01, COMPIP1”This procedure helps you to isolate problems with the communications input/output adapter (IOA) or input/output processor (IOP).COMIP01, COMPIP1 This procedure helps you to isolate problems with the communications input/output adapter (IOA) or input/output processor (IOP). Please read and observe the danger notices in “Communication isolation procedure” before proceeding with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. Chapter 2. Isolation Procedures 47
2. To determine which communications hardware to test, use the SRC from the problem summary form, or problem log, For details on line description information, see the Starting a Trace section of “Work with communications trace” on page 611. 3. Perform the following: a. Vary off the resources. b. On the Start a Service Tool display, select Hardware service manager ? Logical hardware resources ? System bus resources ? Resources associated with IOP for the attached IOPs in the list until you display the suspected failing hardware. c. Select Verify on the hardware you want to test. The Verify option may be valid on the IOP, IOA, or port resource. When it is valid on the IOP resource, any replaceable memory will be tested. Communications IOAs are tested by using the Verify option on the port resource.4. Run the IOA/IOP test(s). This may include any of the following: v Adapter internal test v Adapter wrap test (requires adapter wrap plug - available from your hardware service provider). v Processor internal test v Memory test v System port test Does the IOA/IOP test(s) complete successfully? No: The problem is in the IOA or IOP. If a verify test identified a failing memory module, replace the memory module. On multiple card combinations, exchange the IOA card before exchanging the IOP card. Exchange the failing hardware. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. Yes: The IOA/IOP is good. Do NOT replace the IOA/IOP. Continue with the next step.5. Before running tests on modems or network equipment, the remaining local hardware should be verified Since the IOA/IOP test(s) have completed successfully, the remaining local hardware to be tested is the external cable. Is the IOA adapter type 2838, with a UTP (unshielded twisted pair) external cable? Yes: Continue with the next step. No: Go to step 8.6. Is the RJ-45 connector on the external cable correctly wired according to the EIA/TIA-568A standard? That is, -Pins 1 and 2 using the same twisted pair, -Pins 3 and 6 using the same twisted pair, -Pins 4 and 5 using the same twisted pair, -Pins 7 and 8 using the same twisted pair. Yes: Continue with the next step. No: Replace the external cable with correctly wired cable. This ends the procedure.7. Do the Line Speed and Duplex values of the line description (DSPLINETH) match the corresponding values for the network device (router, hub or switch) port? No: Change the Line Speed and/or Duplex value for either the line description or the network device (router, hub or switch) port. This ends the procedure. Yes: Go to step 9 on page 49.8. Is the cable wrap test option available as a Verify test option for the hardware you are testing? v Yes: Verify the external cable by running the cable wrap test. A wrap plug is required to perform the test. This plug is available from your hardware service provider. Does the cable wrap test complete successfully? Yes: Continue with the next step. No: The problem is in the cable. Exchange the cable. This ends the procedure.
48 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v No: The communications IOA/IOP is not the failing item. One of the following could be causing the problem. – External cable. – The network. – Any system or device on the network – The configuration of any system or device on the network. – Intermittent problems on the network. – A new SRC - perform problem analysis or ask your next level of support for assistance. Work with the customer or your next level of support to correct the problem. This ends the procedure.9. All the local hardware is good. This completes the local hardware verification. The communications IOA/IOP and/or external cable is not the failing item. One of the following could be causing the problem: v The network v Any system or device on the network v The configuration of any system or device on the network v Intermittent problems on the network v A new SRC - perform problem analysis or ask your next level of support for assistance Work with the customer or your next level of support to correct the problem. This ends the procedure. Chapter 2. Isolation Procedures 49
Intermittent isolation procedures These procedures help you to correct an intermittent problem. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Use these procedures to correct an intermittent problem, if other problem analysis steps or tables sent you here. Only perform the procedures that apply to your system. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion unit where the FRU is located. See “Powering on and powering off” on page 588 before removing, exchanging, or installing a field-replaceable unit (FRU). Use the procedure below to identify intermittent problems and the associated corrective actions. “INTIP03” on page 52Use this procedure to isolate problems with external noise on AC voltage lines. “INTIP05” on page 53Use this procedure to isolate problems with external noise on twinaxial cables. 50 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“INTIP07” on page 53Use this procedure to lessen the effects of electrical noise (electromagnetic interference, or EMI) on the system. “INTIP08” on page 54Use this procedure to ensure that the system is electrically grounded correctly. “INTIP09” on page 55Use this procedure to check the AC electrical power for the system. “INTIP14” on page 57Use this procedure to isolate problems with station protectors. “INTIP16” on page 57Use this procedure when you need to copy a main storage dump to give to your next level of support. “INTIP18” on page 57Use this procedure to determine if one or more PTFs are available to correct this specific problem. “INTIP20” on page 57Use this procedure to analyze system performance problems. “INTIP24” on page 58Use this procedure to collect data when the service processor reports a suspected intermittent problem.
Chapter 2. Isolation Procedures 51
INTIP03 Use this procedure to isolate problems with external noise on AC voltage lines. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Electrical noise on incoming ac voltage lines can cause various system failures. The most common source of electrical noise is lightning. 1. Ask the customer if an electrical storm was occurring at the time of the failure to determine if lightning could have caused the failure. Could lightning have caused the failure? No: Go to step 3 on page 53. Yes: Continue with the next step.2. Determine if lightning protection devices are installed on the incoming ac voltage lines where they enter the building. There must be a dedicated ground wire from the lightning protection devices to earth ground. Are lightning protection devices installed? Yes: Continue with the next step. 52 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Lightning may have caused the intermittent problem. Recommend that the customer install lightning protection devices to prevent this problem from recurring. This ends the procedure.3. Have an installation planning representative perform the following: a. Connect a recording ac voltage monitor to the incoming ac voltage lines of the units that contain the failing devices with reference to ground. b. Set the voltage monitor to start recording at a voltage slightly higher than the normal incoming ac voltage.Does the system fail again with the same symptoms? No: This ends the procedure. Yes: Continue with the next step.4. Look at the recording and see if the voltage monitor recorded any noise when the failure occurred. Did the monitor record any noise when the failure occurred? Yes: Review with the customer what was happening external to the system when the failure occurred. This may help you to determine the source of the noise. Discuss with the customer what to do to remove the noise or to prevent it from affecting the server. This ends the procedure. No: Perform the next intermittent isolation procedure listed in the Isolation procedure column. This ends the procedure.INTIP05 Use this procedure to isolate problems with external noise on twinaxial cables. Electrical noise on twinaxial cables that are not installed correctly may affect the twinaxial workstation I/O processor card. Examples of this include open shields on twinaxial cables, and station protectors that are not being installed where necessary. Check for the following on the system: v There must be no more than 11 connector breaks in a twinaxial cable run. v Station protectors must be installed (in pairs) where a cable enters or leaves a building. v There can only be two station protectors for each twinaxial run. v There is a maximum of seven devices (with addresses 0-6) for each cable run. v There is a maximum cable length of 1524 meters (5000 feet) for each port. v All cable runs must be ended (terminated). v Disconnect all twinaxial cables that are not used. v Remove any cause of electrical noise in the twinaxial cables. v All workstations must be grounded.This ends the procedure. INTIP07 Use this procedure to lessen the effects of electrical noise (electromagnetic interference, or EMI) on the system. 1. Ensure that air flow cards are installed in all adapter card slots that are not used. 2. Keep all cables away from sources of electrical interference, such as ac voltage lines, fluorescent lights, arc welding equipment, and radio frequency (RF) induction heaters. These sources of electrical noise can cause the system to become powered off. 3. If you have an expansion unit, ensure that the cables that attach the system unit to the expansion unit are seated correctly. Chapter 2. Isolation Procedures 53
Note: If the failures occur when people are close to the system or machines that are attached to the system, the problem may be electrostatic discharge (ESD). 4. Have an installation planning representative use a radio frequency (RF) field intensity meter to determine if there is an unusual amount of RF noise near the server. You also can use it to help determine the source of the noise. This ends the procedure.INTIP08 Use this procedure to ensure that the system is electrically grounded correctly. 1. Have an installation planning representative or an electrician (when necessary), perform the following steps. 2. Power off the server and the power network branch circuits before performing this procedure. 3. Ensure the safety of personnel by making sure that all electrical wiring in the United States meets National Electrical Code requirements. 4. Check all system receptacles to ensure that each one is wired correctly. This includes receptacles for the server and all equipment that attaches to the server, including workstations. Do this to determine if a wire with primary voltage on it is swapped with the ground wire, causing an electrical shock hazard. 5. For each unit, check continuity from a conductive area on the frame to the ground pin on the plug. Do this at the end of the mainline ac power cable. The resistance must be 0.1 ohm or less. 6. Ground continuity must be present from each unit receptacle to an effective ground. Therefore, check the following: v The ac voltage receptacle for each unit must have a ground wire connected from the ground terminal on the receptacle to the ground bar in the power panel. v The ground bars in all branch circuit panels must be connected with an insulated ground wire to a ground point, which is defined as follows: – The nearest available metal cold water pipe, only if the pipe is effectively grounded to the earth (see National Electric Code Section 250-81, in the United States). – The nearest available steel beams in the building structure, only if the beam is effectively grounded to the earth. – Steel bars in the base of the building or a metal ground ring that is around the building under the surface of the earth. – A ground rod in the earth (see National Electric Code Section 250-83, in the United States).Note: For installations in the United States only, by National Electrical Code standard, if more than one of the preceding grounding methods are used, they must be connected together electrically. See National Electric Code Section 250, for more information on grounding. v The grounds of all separately derived sources (uninterruptable power supply, service entrance transformer, system power module, motor generator) must be connected to a ground point as defined above. v The service entrance ground bar must connect to a ground point as defined above. v All ground connections must be tight. v Check continuity of the ground path for each unit that is using an ECOS tester, Model 1023-100. Check continuity at each unit receptacle, and measure to the ground point as defined above. The total resistance of each ground path must be 1.0 ohm or less. If you cannot meet this requirement, check for faults in the ground path. v Conduit is sometimes used to meet wiring code requirements. If conduit is used, the branch circuits must still have a green (or green and yellow) wire for grounding as stated above. Note: The ground bar and the neutral bar must never be connected together in branch circuit power panels.
54 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The ground bar and the neutral bar in the power panels that make up the electrical power network for the server must be connected together. This applies to the first electrically isolating unit that is found in the path of electrical wiring from the server to the service entrance power panel. This isolating unit is sometimes referred to as a separately derived source. It can be an uninterruptable power supply, the system power module for the system, or the service entrance transformer. If the building has none of the above isolating units, the ground bar and the neutral bar must be connected together in the service entrance power panel. 7. Look inside all power panels to ensure the following: v There is a separate ground wire for each unit. v The green (or green and yellow) ground wires are connected only to the ground bar. v The ground bar inside each power panel is connected to the frame of the panel. v The neutral wires are connected only to the neutral bar. v The ground bar and the neutral bar are not connected together, except as stated in step 6 on page 54. 8. For systems with more than one unit, ensure that the ground wire for each unit is not connected from one receptacle to the next in a string. Each unit must have its own ground wire, which goes to the power source. 9. Ensure that the grounding wires are insulated with green (or green and yellow) wire at least equal in size to the phase wires. The grounding wires also should be as short as possible. 10. If extension-mainline power cables or multiple-outlet power strips are used, make sure that they must have a three-wire cable. One of the wires must be a ground conductor. The ground connector on the plug must not be removed. This applies to any extension mainline power cables or multiple-outlet power strips that are used on the server. It also applies for attaching devices such as personal computers, workstations, and modems. Note: Check all extension-mainline power cables and multiple-outlet power strips with an ECOS tester and with power that is applied. Ensure that no wires are crossed (for example, a ground wire crossed with a wire that has voltage on it).This ends the procedure.INTIP09 Use this procedure to check the AC electrical power for the system. 1. Have an installation planning representative or an electrician (when necessary), perform the following steps. 2. Power off the server and the power network branch circuits before performing this procedure. 3. To ensure the safety of personnel, all electrical wiring in the United States must meet National Electrical Code requirements. 4. Check ALL system receptacles to ensure that each is wired correctly. This includes receptacles for the server and all equipment that attaches to the server, including workstations. Do this to determine if a wire with primary voltage on it has been swapped with the ground wire, causing an electrical shock hazard. 5. When three-phase voltage is used to provide power to the server, correct balancing of the load on each phase is important. The units should be connected so that all three phases are used equally. 6. The power distribution neutral must return to the ?separately derived source? (uninterruptable power supply, service entrance transformer, system power module, motor generator) through an insulated wire that is the same size as the phase wire or larger. 7. The server and its attached equipment should be the only units that are connected to the power distribution network that the server gets its power. 8. The equipment that is attached to the server, such as workstations and printers, must be attached to the power distribution network for the server when possible. 9. Check all circuit breakers in the network that supply ac power to the server as follows: Chapter 2. Isolation Procedures 55
v Ensure that the circuit breakers are installed tightly in the power panel and are not loose. v Feel the front surface of each circuit breaker to detect if it is warm. A warm circuit breaker may be caused by: – The circuit breaker that is not installed tightly in the power panel. – The contacts on the circuit breaker that is not making a good electrical connection with the contacts in the power panel. – A defective circuit breaker. – A circuit breaker of a smaller current rating than the current load which is going through it. – Devices on the branch circuit which are using more current than their rating.10. Equipment that uses a large amount of current, such as: Air conditioners, copiers, and FAX machines, should not receive power from the same branch circuits as the system or its workstations. Also, the wiring that provides ac voltage for this equipment should not be placed in the same conduit as the ac voltage wiring for the server. The reason for this is that this equipment generates ac noise pulses. These pulses can get into the ac voltage for the server and cause intermittent problems. 11. Measure the ac voltage to each unit to ensure that it is in the normal range. Is the voltage outside the normal range? No: Continue with the next step. Yes: Contact the customer to have the voltage source returned to within the normal voltage range.12. The remainder of this procedure is only for a server that is attached to a separately derived source. Some examples of separately derived sources are an uninterruptable power supply, a motor generator, a service entrance transformer, and a system power module. The ac voltage system must meet all the requirements that are stated in this procedure and also all of the following: Notes: a. The following applies to an uninterruptable power supply, but it can be used for any separately derived source. b. System upgrades must not exceed the power requirements of your derived source.The uninterruptable power supply must be able to supply the peak repetitive current that is used by the system and the devices that attach to it. The uninterruptable power supply can be used over its maximum capacity if it has a low peak repetitive current specification, and the uninterruptable power supply is already fully loaded. Therefore, a de-rating factor for the uninterruptable power supply must be calculated to allow for the peak-repetitive current of the complete system. To help you determine the de-rating factor for an uninterruptable power supply, use the following: Note: The peak-repetitive current is different from the ?surge? current that occurs when the server is powered on. The de-rating factor equals the crest factor multiplied by the RMS load current divided by the peak load current where the: v Crest factor is the peak-repetitive current rating of the uninterruptable power supply that is divided by the RMS current rating of the uninterruptable power supply. If you do not know the crest factor of the uninterruptable power supply, assume that it is 1.414. v RMS load current is the steady state RMS current of the server as determined by the power profile. v Peak load current is the steady state peak current of the server as determined by the power profile. For example, if the de-rating factor of the uninterruptable power supply is calculated to be 0.707, then the uninterruptable power supply must not be used more than 70.7% of its kVA-rated 56 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
capacity. If the kVA rating of the uninterruptable power supply is 50 kVA, then the maximum allowable load on it is 35.35 kVA (50 kVA multiplied by 0.707). When a three-phase separately derived source is used, correct balancing of the load as specified in step 5 on page 55 is critical. If the load on any one phase of an uninterruptable power supply is more than the load on the other phases, the voltage on all phases may be reduced. 13. If the system is attached to an uninterruptable power supply or motor generator, then check for the following: v The system and the attached equipment should be the only items that are attached to the uninterruptable power supply or motor generator. Equipment such as air conditioners, copiers, and FAX machines should not be attached to the same uninterruptable power supply, or motor generator that the system is attached. v The system unit console and the Electronic Customer Support modem must get ac voltage from the same uninterruptable power supply or motor generator to which the system is attached. This ends the procedure.INTIP14 Use this procedure to isolate problems with station protectors. Station protectors must be installed on all twinaxial cables that leave the building in which the server is located. This applies even if the cables go underground, through a tunnel, through a covered outside hallway, or through a skyway. Station protectors help prevent electrical noise on these cables from affecting the server. 1. Look at the Product Activity Log to determine what workstations are associated with the failure. 2. Determine if station protectors are installed on the twinaxial cables to the failing workstations. Are station protectors installed on the twinaxial cables to the failing workstations? Yes: Perform the next intermittent isolation procedure listed in the Isolation procedure column. This ends the procedure. No: You may need to install station protectors on the twinaxial cables to the failing workstations. This ends the procedure.INTIP16 Use this procedure when you need to copy a main storage dump to give to your next level of support. For some problems, performing a dump of main storage helps to analyze the problem. The data on the dump is analyzed by support personnel to determine the cause of the problem and how to correct it. 1. Copy the main storage dump to tape. See “Copying a dump” on page 656. 2. Ask your next level of support to determine for assistance. This ends the procedure.INTIP18 Use this procedure to determine if one or more PTFs are available to correct this specific problem. 1. Ensure that all PTFs that relate to the problem have been installed. Note: Ensure that the latest platform LIC fix has been installed before you exchange a service processor. 2. Contact your next level of support for more information. This ends the procedure.INTIP20 Use this procedure to analyze system performance problems. Chapter 2. Isolation Procedures 57
1. Look in the Product Activity Log, ASM log, or HMC to determine if any hardware errors occurred at the same time that the performance problem occurred. Did any hardware problems occur at the same time that the performance problem occurred? Yes: Perform problem analysis and correct the hardware errors. This ends the procedure. No: The performance problems are not related to hardware. Continue with the next step.2. Perform the following steps: a. Ask the customer if they have asked software support for any software PTFs that relate to this problem. b. Recommend that the customer install a cumulative PTF package if they have not done so in the past three months. c. Inform the customer that performance could possibly be improved by having Software Support analyze the conditions. d. Inform the customer that your service provider has performance tools. Contact Software Support for more information. This ends the procedure.INTIP24 Use this procedure to collect data when the service processor reports a suspected intermittent problem. It is important that you collect data for this problem so that the problem can be corrected. Use this procedure to collect the data. There are several ways the system can display the SRC. Follow the instructions for the correct display method, defined as follows: v If this SRC is displayed in the Product Activity Log or ASM log, then record all of the SRC data words, save all of the error log data, and contact your next level of support to submit an APAR. v If the control panel is displaying SRC data words scrolling automatically through control panel functions 11, 12 and 13, and the control panel user interface buttons are not responding, then perform “FSPSP02” on page 206 instead of using this procedure. v If the SRC is displayed at the control panel, and the control panel user interface buttons respond normally, then record all of the SRC words.Do not perform an IPL until you perform a storage dump of the service processor. To get a storage dump of the service processor, perform the following: 1. Record the complete system reference code (SRC) (functions 11 through 20). 2. Perform a service processor dump. See “Performing dumps” on page 653. 3. Is a display shown on the console? v Yes: Continue with the next step. v No: The problem is not intermittent. Choose from the following options: – If you were sent here from another procedure, return there and follow the procedure for a problem that is not intermittent. – If the problem continues, replace the service processor hardware. This ends the procedure.4. The problem is intermittent. Copy the IOP dump to tape. See “Performing dumps” on page 653. 5. Complete the IPL. 6. Determine if there are available program temporary fixes (PTFs) for this problem. 7. If a PTF is found, apply the PTF. Then, return here and answer the following question. Did you find and apply a PTF for this problem? v Yes: This ends the procedure. v No: Record the following information, and contact your next level of support. – The complete SRC you recorded in this procedure 58 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
– The service processor dump to tape you obtained in step 4 on page 58. – All known system symptoms: - How often the intermittent problem occurs - System environment (IPL, certain applications) - If necessary, other SRCs that you suspect relate to the problem– Information needed to write an LICTR. This ends the procedure. Chapter 2. Isolation Procedures 59
I/O processor (IOP) isolation procedures Use these procedures to isolate a failure in the multiple function I/O card. Please read all safety procedures before servicing the system. Attention: Unless instructed otherwise, always power off the system or expansion tower where the field replaceable unit (FRU) is located before removing, exchanging, or installing a FRU. Attention: Disconnecting the J15 and J16 cables will not prevent the system unit from powering on. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “IOPIP01” on page 62Use this procedure to perform an IPL to dedicated service tool (DST) to determine if the same reference code occurs. “IOPIP13” on page 64Use this procedure to isolate problems on the interface between the I/O card and the storage devices. 60 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“IOPIP16” on page 66Use this procedure to isolate failing devices that are identified by FI codes FI01105, FI01106, and FI01107. “IOPIP17” on page 69Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. “IOPIP18” on page 71Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. “IOPIP19” on page 72You were sent to this procedure from unit reference code (URC) 9010, 9011, or 9013. “IOPIP20” on page 73Use this procedure to isolate the problem when two or more devices are missing from a disk array. “IOPIP21” on page 75Use this procedure to determine the failing disk unit when, a disk unit is not compatible with other disk units in the disk array, or when a disk unit has failed. If the URC is 9025 or 9030, the disk array is running, but it might not be protected. “IOPIP22” on page 75Use this procedure to gather error information and contact your next level of support. “IOPIP23” on page 76You were sent to this procedure from a unit reference code (URC) 9050. “IOPIP25” on page 76Use this procedure to isolate the problem when a device attached to the I/O card has functions that are not given support on the I/O card. “IOPIP26” on page 78Use this procedure to correct the problem when the I/O card recognizes that the attached disk unit must be initialized and formatted. “IOPIP27” on page 79I/O card cache data exists for a missing or failed device. “IOPIP28” on page 81You were sent to this procedure from unit reference code (URC) 9052. “IOPIP29” on page 81The failing item is in a migrated tower. “IOPIP30” on page 81Use this procedure to correct the problem when the system cannot find the required cache data for the attached disk units. “IOPIP31” on page 84Cache data associated with the attached devices cannot be found. “IOPIP32” on page 86You were sent to this procedure from unit reference code (URC) 9011. “IOPIP33” on page 87 The I/O processor card detected a device configuration error. The configuration sectors on the device may be incompatible with the current I/O processor card. “IOPIP34” on page 87You were sent to this procedure from unit reference code (URC) 9027. “IOPIP40” on page 88Use this procedure to isolate the problem when a storage I/O adapter is connected to an incompatible or non-operational auxiliary cache I/O adapter. Chapter 2. Isolation Procedures 61
“IOPIP41” on page 89Use this procedure to correct the problem when an auxiliary cache I/O adapter is not connected to a storage I/O adapter or when an auxiliary cache I/O adapter is connected to an incompatible or non-operational storage I/O adapter.IOPIP01 Use this procedure to perform an IPL to dedicated service tool (DST) to determine if the same reference code occurs. If a new reference code occurs, more analysis may be possible with the new reference code. If the same reference code occurs, you are instructed to exchange the failing items. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Was the IPL performed from disk (Type A or Type B)? No: Continue with the next step. Yes: Go to step 5. 3. Perform the following: a. Ensure that the IPL media is the correct version and level that are needed for the system model. b. Ensure that the media is not physically damaged. c. Choose from the following options to clean the IPL media: v If it is cartridge type optical media (for example, DVD), do not attempt to clean the media. v If it is non-cartridge type media (for example, CD-ROM), wipe the disc in a straight line from the inner hub to the outer rim. Use a soft, lint-free cloth or lens tissue. Always handle the disc by the edges to avoid finger prints. v If it is tape, clean the recording head in the tape unit. Use the correct Cleaning Cartridge Kit provided by your service provider. 4. Perform a Type D IPL in Manual mode. Does a system reference code (SRC) appear on the control panel? v No: Go to step 8 on page 63. v Yes: Is the SRC the same one that sent you to this procedure? Yes: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. Refer to the reference code list. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure. No: A different SRC occurred. Use the new SRC to correct the problem. See Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. 5. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 10 on page 63. 6. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v No: Continue with the next step. v Yes: Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: – If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. – If any of the reference codes are not 0000, go to step 10 on page 63, and use the reference code that is not 0000. 62 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: Use the characters in the Type column to find the correct reference code table. 7. Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The problem cannot be isolated any more. Use the original SRC and exchange the failing items. Start with the highest probable cause of failure in the failing item list for this reference code. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure. 8. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v Yes: Continue with the next step. v No: Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The problem is corrected. This ends the procedure. 9. Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, continue with the next step and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.10. Record the SRC. Are the SRC and unit reference code (URC) the same ones that sent you to this procedure? Yes: Continue with the next step. No: Use the new SRC or reference code to correct the problem. This ends the procedure.11. Perform the following steps: a. Power off the system or expansion tower. See “Powering on and powering off” on page 588. b. Exchange the FRUs in the failing item list for the SRC you have now. Start with the highest probable cause of failure in the failing item column in the reference code list. Perform the remaining steps of this procedure after you exchange each FRU until you determine the failing FRU.Note: If you exchange a disk unit, do not attempt to save customer data until instructed to do so in this procedure.12. Power on the system or the expansion unit. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 15 on page 64.13. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v Yes: Continue with the next step. v No: Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? – Yes: Continue with the next step. – No: The last FRU you exchanged was failing. Note: Before exchanging a disk unit, you should attempt to save customer data.
Chapter 2. Isolation Procedures 63
This ends the procedure.14. Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, go to step 10 on page 63, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.15. Record the SRC on the Problem summary form. See “Using the product activity log” on page 617 for details. Is the SRC the same one that sent you to this procedure? v Yes: The last FRU you exchanged is not the failing FRU. Go to step 11 on page 63 to continue FRU isolation. v No: Is the SRC B100 4504 or B100 4505 and have you exchanged disk unit 1 in the system unit, or are all the reference codes on the console 0000? – Yes: The last FRU you exchanged was failing. This ends the procedure. Note: Before exchanging a disk unit, you should attempt to save customer data. – No: Use the new SRC or reference code to correct the problem. This ends the procedure.IOPIP13 Use this procedure to isolate problems on the interface between the I/O card and the storage devices. The unit reference code (part of the SRC that sent you to this procedure) indicates the SCSI bus that has the problem:
Unit reference code (URC)
SCSI bus
3100
0
3101
1
3102
2
3103
3 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Were you performing an IPL from removable media (IPL type D) when the error occurred? No: Continue with the next step. Yes: Exchange the FRUs in the failing item list for the reference code that sent you to this procedure. This ends the procedure. 3. Perform the following steps: a. Look in the service action log (see “Using the Service Action Log” on page 609) for other errors logged at or around the same time as the 310x SRC. b. If no entries appear in the service action log, use the product activity log (see “Using the product activity log” on page 617). c. Use the other SRCs to correct the problem before performing an IPL. d. Contact your next level of support as necessary for assistance with SCSI bus problem isolation. e. If the problem is not corrected, continue with the next step. 4. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? 64 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Continue with the next step. Yes: Go to step 7. 5. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure – Yes: Continue with the next step. – No: Look at the product activity log. See “Using the product activity log” on page 617for details. Is an SRC logged as a result of this IPL? – Yes: Continue with the next step. – No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure see the failing item list for this reference code. If the failing item list contains FI codes, see the Chapter 3, “Locating FRUs,” on page 335 topic to help determine part numbers and location in the system. This ends the procedure. 6. Are all of the reference codes 0000? On some of the displays, you must press F11 to display reference codes. v No: Continue with the next step. Use the reference code that is not 0000. v Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure. 7. Is the SRC the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC. Then use the SRC description to correct the problem. This ends the procedure. 8. Perform the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Find the I/O card identified in the failing item list. c. Remove the I/O card and install a new I/O card. This item has the highest probability of being the failing item. d. Power on the system or the expansion tower.Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 12 on page 66. 9. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failurev Yes: Does the Display Unknown Mirrored Load-Source Status display appear on the console? Note: On some of these displays, you must press F11 to display reference codes. – Yes: Continue with the next step. – No: Are all of the reference codes 0000? Chapter 2. Isolation Procedures 65
- No: Go to step 12 using the reference code that is not 0000. - Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure.v No: Go to step 11.10. Is the reference code the same one that sent you to this procedure? v No: Either a new reference code occurred, or the reference code is 0000. There may be more than one problem. The original I/O card may be failing, but it must be installed in the system to continue problem isolation. Install the original I/O card by doing the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Remove the I/O card you installed in step 8 on page 65 and install the original I/O card. Note: Do not power on the system or the expansion unit now. A device connected to the I/O card could be the failing item. Go to “IOPIP16,” step (9) to continue isolating the problem. This ends the procedure.v Yes: Go to step 13.11. Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The I/O card, which you removed in step 8 on page 65, is the failing item. This ends the procedure.12. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC. Then use the SRC description to correct the problem. This ends the procedure.13. The original I/O card is not the failing item. Install the original I/O card by doing the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Remove the I/O card you installed in step 8 on page 65 of this procedure and install the original I/O card. Note: Do not power on the system or the expansion unit now.A device connected to the I/O card could be the failing item. Go to “IOPIP16,” step (9) to continue isolating the problem. This ends the procedure.IOPIP16 Use this procedure to isolate failing devices that are identified by FI codes FI01105, FI01106, and FI01107. During this procedure, you will remove devices that are identified by the FI code, and then you will perform an IPL to determine if the symptoms of the failure have disappeared, or changed. You should not remove the load-source disk until you have shown that the other devices are not failing. Removing the load-source disk can change the symptom of failure, although it is not the failing unit. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Use the Hardware Service Manager (HSM) verify function (use DST or SST), and verify that all tape and optical units attached to the SCSI bus (identified by FI01105, FI01106, or FI01107) are operating correctly. See Chapter 5, “Verifying a repair,” on page 557 for details. Note: Do not IPL the system to get to DST. 3. Choose from the following options: 66 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If verification was successful for all tape and optical units, then go to step 5. v If any tape or optical device could not be verified, or if it failed verification, then exchange the failing item. See the Chapter 3, “Locating FRUs,” on page 335 and continue with the next step. 4. Use the Hardware Service Manager (HSM) verify function (use SST or DST) and verify that the exchanged item is operating correctly. See Chapter 5, “Verifying a repair,” on page 557 for details. Was the verification successful? No: Replace the exchanged device with the original. See Chapter 3, “Locating FRUs,” on page 335 and continue with the next step. Yes: The newly exchanged tape or optical device was the failing item. This ends the procedure. 5. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 8. 6. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. – No: Continue with the next step. – Yes: Are all of the reference codes 0000? No: Go to step 8, and use the reference code that is not 0000. Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure. 7. Look at the Product Activity Log. See “Using the product activity log” on page 617 for details. Is a reference code logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original reference code and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 for additional details. This ends the procedure. 8. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC or reference code. Then, use the SRC or reference code to correct the problem. This ends the procedure. 9. Isolate the failing device by doing the following: a. Power off the system or the expansion unit if it is powered on. See “Powering on and powering off” on page 588. b. Go to Chapter 3, “Locating FRUs,” on page 335 to find the devices identified by FI code FI01105, FI01106, or FI01107 in the failing item list. c. Disconnect one of the devices that are identified by the FI code, other than the load-source disk unit. Note: The tape, or optical units should be the first devices to be disconnected, if they are attached to the SCSI bus identified by FI01105, FI01106, or FI01107.
Chapter 2. Isolation Procedures 67
d. Go to step 11.10. Continue to isolate the possible failing items by doing the following: a. Power off the system or the expansion unit. See Powering on and powering off. b. Disconnect the next device that is identified by FI codes FI01105, FI01106, or FI01107 in the FRU list. See the note in step 9 on page 67. Do not disconnect disk unit 1 (load-source disk) until you have disconnected all other devices and the load-source disk is the last device that is identified by these FI codes.11. Power on the system or the expansion tower. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 14.12. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. – Yes: Go to step 14. – No: Look at the Product Activity Log. See “Using the product activity log” on page 617 for details. Is a reference code logged as a result of this IPL? No: Continue with the next step. Yes: Go to step 14.13. You are here because the IPL completed successfully. The last device you disconnected is the failing item. Is the failing item a disk unit? No: Exchange the failing item and reconnect the devices you disconnected previously. See the Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. Yes: Exchange the failing FRU. Before exchanging a disk drive, you should attempt to save customer data. This ends the procedure.14. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC or reference code on the Problem summary form. Then go to step 16.15. The last device you disconnected is not failing. Have you disconnected all the devices that are identified by FI codes FI01105, FI01106, or FI01107 in the FRU list? No: Leave the device disconnected and return to step 10 to continue isolating the possible failing items. Yes: Replace the device backplane or backplanes associated with the devices you removed in the earlier steps. If the device backplane does not fix the problem, then you cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 for additional information. This ends the procedure.16. Is the SRC B1xx 4504, and have you disconnected the load-source disk unit? (The load-source disk unit is disconnected by disconnecting disk unit 1.) 68 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Yes: Continue with the next step. v No: Does one of the following displays appear on the console, and are all reference codes 0000? – Disk Configuration Error Report – Disk Configuration Attention Report – Disk Configuration Warning Report – Display Unknown Mirrored Load-Source Status – Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. Yes: Continue with the next step. No: A new SRC or reference code occurred. Perform problem analysis and correct the problem. This ends the procedure.17. The last device you disconnected may be the failing item. Exchange the last device you disconnected. See the Chapter 3, “Locating FRUs,” on page 335. Note: Before exchanging a disk drive, you should attempt to save customer data. Was the problem corrected by exchanging the last device you disconnected? No: Continue with the next step. Yes: This ends the procedure.18. Reconnect the devices you disconnected previously in this procedure. 19. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. Do not exchange the FRU that you exchanged in this procedure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure.IOPIP17 Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Were you performing an IPL from removable media (IPL type D) when the error occurred? v Yes: Exchange the FRUs in the failing item list for the reference code that sent you to this procedure. v No: Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 5 on page 70.3. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v No: Continue with the next step. v Yes: Does one of the following messages appear in the list? – Missing disk units in the configuration – Missing mirror protection disk units in the configuration – Device parity protected units in exposed mode. - No: Continue with the next step. Chapter 2. Isolation Procedures 69
- Yes: Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, go to step 5, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.4. Look at the product activity log. See “Using the Service Action Log” on page 609. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure (see the failing item list for this reference code in the (System Reference Codes)) topic. If the failing item list contains FI codes, see (Failing items) to help determine part numbers and location in the system. This ends the procedure.5. Record the SRC. Is the SRC the same one that sent you to this procedure? v No: A different SRC or reference code occurred. Use the new SRC or reference code to perform problem analysis and correct the problem. This ends the procedure. v Yes: Determine the device unit reference code (URC) from the SRC. If the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appears on the console, the device URC is displayed under Reference Code. This is on the same line as the missing device. Is the device unit reference code 3020, 3021, 3022, or 3023? Yes: Continue with the next step. No: Go to step 7.6. A unit reference code of 3020, 3021, 3022, or 3023 indicates that there is a problem on an I/O card SCSI bus. The problem can be caused by a device that is attached to the I/O card that: v Is not supported. v Does not match system configuration rules. For example, there are too many devices that are attached to the bus. v Is failing. Perform the following: a. Look at the characters on the control panel Data display or the Problem Summary Form for characters 9 - 16 of the top 16 character line of function 12 (word 3). Use the format BBBB-Cc-bb (BBBB = bus, Cc = card, bb = board) to determine the card slot location for the I/O card (see (Locations and addresses)). b. The unit reference code indicates the SCSI bus that has the problem:
URC
SCSI Bus
3020
0
3021
1
3022
2
3023
3 c. To find the bus and device locations, see Chapter 3, “Locating FRUs,” on page 335. d. Find the printout that shows the system configuration from the last IPL and compare it to the present system configuration. Note: If configuration is not the problem, a device on the SCSI bus may be failing. e. If you need to perform isolation on the SCSI bus, go to “IOPIP16” on page 66. This ends the procedure.7. The possible failing items are FI codes FI01105 (90%) and FI01112 (10%). Find the device unit address from the SRC (see “The System Reference Code (SRC) Format Description” on page 601). Use this 70 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
information to find the physical location of the device. Record the type and model numbers to determine if the addressed I/O card supports this device. Is the device given support on your system? v No: Continue with the next step. v Yes: Perform the following: a. Exchange the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.8. Perform the following steps: a. Remove the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP18 Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? v Yes: Go to step 5 on page 72. v No: Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? Yes: Continue with the next step. No: Go to step 4.3. Does one of the following messages appear in the list? v Missing disk units in the configuration v Missing mirror protection disk units in the configuration v Device parity protected units in exposed mode. – No: Continue with the next step. – Yes: Select option 5, press F11, and then press Enter to display the details. Choose from the following options: - If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. - If any of the reference codes are not 0000, go to step 5 on page 72, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.4. Look at the product activity log (see (Log) for details). Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure in the failing item column in the reference Chapter 2. Isolation Procedures 71
code list. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335to help determine part numbers and location in the system. This ends the procedure.5. Record the SRC. Is the SRC the same one that sent you to this procedure? Yes: Continue with the next step. No: A different SRC or reference code occurred. Use the new SRC or reference code to correct the problem. This ends the procedure.6. Determine the device unit reference code (URC) from the SRC. If the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appears on the console, the device URC is displayed under Reference Code. This is on the same line as the missing device. Is the device unit reference code 3020? v No: Continue with the next step. v Yes: A device reference code of 3020 indicates that a device is attached to the addressed I/O card. Either it is not supported, or it does not match system configuration rules. For example, there are too many devices that are attached to the bus. Perform the following steps: a. Find the printout that shows the system configuration from the last IPL and compare it to the present system configuration. b. Use the unit address and the physical address in the SRC to help you with this comparison. c. If configuration is not the problem, a device on the SCSI bus may be failing. Use FI code FI00884 in the Chapter 3, “Locating FRUs,” on page 335 table to help find the failing device. d. If you need to perform isolation on the SCSI bus, go to “IOPIP16” on page 66. This ends the procedure.7. The possible failing items are FI codes FI01105 (90%) and FI01112 (10%). Find the device unit address from the SRC. Use this information to find the physical location of the device. Record the type and model numbers to determine if the addressed I/O card supports this device. Is the device given support on your system? v No: Continue with the next step. v Yes: Perform the following steps: a. Exchange the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.8. Perform the following steps: a. Remove the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP19 You were sent to this procedure from unit reference code (URC) 9010, 9011, or 9013. Contact your next level of support for assistance. 72 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
IOPIP20 Use this procedure to isolate the problem when two or more devices are missing from a disk array. You were sent to this procedure from unit reference code (URC) 9020 or 9021. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media. 3. Have any other I/O card or device SRCs (other than a 902F SRC) occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? – Yes: Continue with the next step. – No: Contact your next level of support for assistance. This ends the procedure. 4. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here and view the ?Additional Information? to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a present address of 0 and an expected address that is not 0. Note: There might be more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing the enter key from the ?Display Detail Report for Resource? screen. View the ?Additional Information? for each entry to record the formatted log information. For example: There might be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. b. Continue with step 6. 5. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a present address of 0 and an expected address that is not 0. Note: There might be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. In order to interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 6. There are three possible ways to correct the problem: a. Find the missing devices and install them in the correct physical locations in the system. If you can find the missing devices and want to continue with this repair option, then continue with the next step. b. Stop the disk array that contains the missing devices. Attention: Customer data might be lost. If you want to continue with this repair option, go to step 8 on page 74. c. Initialize and format the remaining members of the disk array. Chapter 2. Isolation Procedures 73
Attention: Customer data will be lost. If you want to continue with this repair option, go to step 9. 7. Perform the following: a. Install the missing devices in the correct locations in the system. See Chapter 3, “Locating FRUs,” on page 335. b. Power on the system. See Powering on and powering off.Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure. 8. You have chosen to stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk configuration ? Work with device parity protection. Then, continue with the next step.c. Select Stop device parity protection. d. Follow the on-line instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem.This ends the procedure. Yes: This ends the procedure. 9. You have chosen to initialize and format the remaining members of the disk array. Perform the following steps: Attention: Customer data will be lost. a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures, and continue with the next step.10. Select Initialize and format disk unit. 11. Follow the online instructions to format and initialize the disk units. 12. Perform an IPL from disk. Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure.
74 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
IOPIP21 Use this procedure to determine the failing disk unit when, a disk unit is not compatible with other disk units in the disk array, or when a disk unit has failed. If the URC is 9025 or 9030, the disk array is running, but it might not be protected. You were sent to this procedure from a unit reference code (URC) of 9025, 9030, or 9032. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Is the device location information for this SRC available in the service action log (see “Using the Service Action Log” on page 609 for details)? Yes: Exchange the disk unit. No: Continue with the next step.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640.4. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address.Note: There may be more than one entry with the same Log ID. Entries with the same Log ID may be accessed by pressing the Enter key from the ?Display Detail Report for Resource? screen. Example: There may be a device specific SRC and/or an xxxx902F SRC entry in the Product Activity Log. The xxxx902F SRC will occur if there are more than 10 disk units in the array. c. Continue with the next step.5. Perform the following steps: a. Return to the SST or DST main menu. b. Select Work with disk units ? Display disk configuration ? Display disk configuration status. c. On the Display disk configuration status display, look for the devices attached to the I/O card that is identified in step 4. d. Find the device that has a status of ?DPY/Unknown? or ?DPY/Failed?. This is the device that is causing the problem. Show the device address by selecting Display Disk Unit Details ? Display Detailed Address. Record the device address. e. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit and find the following: v The card slot that is identified by the I/O card direct select address v The disk unit location that is identified by the device addressHave you determined the location of the I/O card and disk unit that is causing the problem? Yes: Exchange the disk unit that is causing the problem. This ends the procedure. No: Ask your next level of support for assistance. This ends the procedure.IOPIP22 Use this procedure to gather error information and contact your next level of support. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. Chapter 2. Isolation Procedures 75
2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.3. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here and view the ?Additional Information? to record the formatted log information. Record all the information. Note: There may be more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing the Enter key from the ?Display Detail Report for Resource? screen. View the ?Additional Information? for each entry to record the formatted log information. Example: There may be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. b. Continue with step 5.4. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record all the information. Then continue with the next step. Note: There may be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. In order to interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 5. Ask your next level of support for assistance. Note: Your next level of support may require the error information you recorded in the previous step.This ends the procedure.IOPIP23 You were sent to this procedure from a unit reference code (URC) 9050. If the failing item is in a migrated tower, use the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site to fix the problem. Otherwise, contact your next level of support for assistance. IOPIP25 Use this procedure to isolate the problem when a device attached to the I/O card has functions that are not given support on the I/O card. You were sent to this procedure from URC 9008. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Have any other I/O card or device SRCs occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. See the system reference codes. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? – Yes: Continue with the next step. 76 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
– No: Contact your next level of support for assistance. This ends the procedure.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.4. Did you perform a D IPL to get to DST? v No: Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the I/O card address. Then, view the ?Additional Information? to record the formatted log information. Record the addresses that are not 0000 0000 for all devices listed. Continue with the next step. v Yes: Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O Card is in the format BBBB-Cc-bb: – BBBB = hexadecimal offsets 4C and 4D – Cc = hexadecimal offset 51 – bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record the addresses that are not 0000 0000 for all devices listed. Continue with the next step.5. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following: v The card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. v The disk unit locations that are identified by the unit addresses. Have you determined the location of the I/O card and the devices that are causing the problem? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Have one or more devices been moved to this I/O card from another I/O card? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.6. Is the I/O card capable of supporting the devices attached, or is it in the correct mode to support the devices attached? Note: For information on I/O card modes, see “Storage I/O card modes and jumpers” on page 250. v No: Remove the devices from the I/O card. Note: You can remove disk units without installing another disk unit, and the system continues to operate.This ends the procedure. v Yes: Do you want to continue using these devices with this I/O card? Yes: Continue with the next step.v No: Either change the I/O card mode or remove the devices from the I/O card. This ends the procedure.7. Initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost.
Chapter 2. Isolation Procedures 77
a. Access SST or DST. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then continue with the next step.c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This may take 30 minutes or longer. The disk unit is now ready to be added to the system configuration. This ends the procedure.IOPIP26 Use this procedure to correct the problem when the I/O card recognizes that the attached disk unit must be initialized and formatted. You were sent to this procedure from URC 9092. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Have any other I/O card or device SRCs occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? Yes: Continue with the next step. No: Contact your next level of support for assistance. This ends the procedure.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640 for details. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.4. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address. c. Then view the ?Additional Information? to record the formatted log information. d. Record the addresses that are not 0000 0000 for all devices listed. e. Continue with step 6 on page 79.5. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O card is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F.b. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record the addresses that are not 0000 0000 for all devices listed. c. Continue with the next step.
78 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following: v The card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. v The disk unit locations that are identified by the unit addresses. Have you determined the location of the I/O card and the devices that are causing the problem? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Have one or more devices been moved to this I/O card from another I/O card? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.7. Do you want to continue using these devices with this I/O card? v Yes: Continue with the next step. v No: Remove the devices from the I/O card. Note: You can remove disk units without installing another disk unit, and the system continues to operate.This ends the procedure.8. Initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. a. Access SST or DST. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then continue with the next step.c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This may take 30 minutes or longer. The disk unit is now ready to be added to the system configuration. This ends the procedure.IOPIP27 I/O card cache data exists for a missing or failed device. You were sent to this procedure from a unit reference code (URC) of 9051. Note: For some storage I/O adapters, the cache card is integrated and not removable. Having I/O card cache data for a missing or failed device might be caused by the following conditions: v One or more disk units have failed on the I/O card. v The cache card of the I/O card was not cleared before it was shipped as a MES to the customer. In addition, the service representative moved devices from the I/O card to a different I/O card before performing a system IPL. v The cache card of the I/O card was not cleared before it was shipped to the customer. In addition, residual data was left in the cache card for disk units that manufacturing used to test the I/O card. v The I/O card and cache card were moved from a different system or a different location on this system after an abnormal power off. v One or more disk units were moved either concurrently, or they were removed after an abnormal power off. Chapter 2. Isolation Procedures 79
CAUTION: Any Function 08 power down (including from a D-IPL) is an abnormal power off. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media. 3. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address. c. Then view the ?Additional Information? to record the formatted log information. Record the device types and serial numbers for those devices that show a unit address of 0000 0000. d. Continue with step 5. 4. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O card is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D. v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. b. Record the device types and serial numbers for those devices that show a unit address of 0000 0000. c. Continue with the next step. 5. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Find the card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. 6. Choose from the following options: v If the devices from step 3 of this procedure have never been installed on this system, continue with the next step. v If the devices are not in the current system disk configuration, go to step 9 on page 81. v Otherwise, the devices are part of the system disk configuration; go to step 11 on page 81. 7. Choose from the following options: v If this I/O card and cache card were moved from a different system, continue with the next step. v Otherwise, the cache card was shipped to the customer without first being cleared. Perform the following: a. Make a note of the serial number, the customer number, and the device types and their serial numbers. These were found in step 3. b. Inform your next level of support. 80 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
c. Then go to step 10 to clear the cache card and correct the URC 9051 problem. 8. Install both the I/O card and the cache cards back into their original locations. Then re-IPL the system. There could be data in the cache card for devices in the disk configuration of the original system. After an IPL to DST and a normal power off on the original system, the cache card will be cleared. It is then safe to move the I/O card and the cache card to another location. 9. One or more devices that are not currently part of the system disk configuration were installed on this I/O card. Either they were removed concurrently, they were removed after an abnormal power off, or they have failed. Continue with the next step. 10. Use the Reclaim IOP cache storage procedure to clear data from the cache for the missing or failed devices as follows: a. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Reclaim the cache adapter card storage. See “Reclaiming IOP cache storage” on page 636.11. Choose from the following options: v If this I/O card and cache card were moved from a different location on this system, go to step 8. v If the devices from step 3 on page 80 of this procedure are now installed on another I/O card, and they were moved there before the devices were added to the system disk configuration, go to step 7 on page 80. (On an MES, the disk units are sometimes moved from one I/O card to another I/O card. This problem will result if manufacturing did not clear the cache card before shipping the MES.) v Otherwise, continue with the next step.12. One or more devices that are currently part of the system disk configuration are either missing or failed, and have data in the cache card. Consider the following: v The problem may be because devices were moved from the I/O card concurrently, or they were removed after an abnormal power off. If this is the case, locate the devices, power off the system and install the devices on the correct I/O card. v If no devices were moved, look for other errors logged against the device, or against the I/O card that occurred at approximately the same time as this error. Continue the service action by using these system reference codes.IOPIP28 You were sent to this procedure from unit reference code (URC) 9052. If the failing item is in a migrated tower, use the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site to fix the problem. Otherwise, contact your next level of support for assistance. IOPIP29 The failing item is in a migrated tower. You were sent to this procedure from URC 9012. Perform SDIOP-PIP29 in the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site . IOPIP30 Use this procedure to correct the problem when the system cannot find the required cache data for the attached disk units. You were sent to this procedure from URC 9050. Chapter 2. Isolation Procedures 81
1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did you just exchange the storage input/output (I/O) adapter as a result of a failure? No: Continue with step 13 on page 83. Yes: Continue with the next step. 3. Are you working with a 571F/575B card set? No: Go to step 5. Yes: Continue with the next step. 4. Remove the 571F/575B card set. Create a new card set with the following: Note: Label all parts (both original and new) before moving them. v The new replacement 571F storage IOA. v The cache directory card from the original 571F storage IOA. v The original 575B auxiliary cache adapter.See “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system and go to step 6. 5. Remove the I/O adapter. Install the new replacement storage I/O adapter with the following parts installed on it: Note: Label all parts (both old and new) before moving them. v The cache directory card from the original storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the original storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335 for information on removing and replacing parts.. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691. 6. Did the 9050 SRC that sent you to this procedure occur on a type-D IPL? Yes: Perform a type-D IPL and continue with the next step. No: Continue with the next step. 7. Has a new 9010 or 9050 SRC occurred in the Service Action Log or Product Activity Log? No: Go to step 10. Yes: Continue with the next step. 8. Was the new SRC 9050? No: Continue with the next step. Yes: Contact your next level of support. This ends the procedure. 9. The new SRC was 9010. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Note: When an auxiliary cache I/O adapter that is connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the reclaim does not result in lost sectors. Otherwise, the reclaim does result in lost sectors, and the system operator might want to restore data from the most recent saved tape after you complete the repair. 10. Are you working with a 571F/575B card set? No: Go to step 12 on page 83. 82 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Yes: Continue with the next step.11. Remove the 571F/575B card set. Create a new card set with the following: v The new 571F storage IOA v The cache directory card from the new 571F storage IOA v The new 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system. This ends the procedure.12. Remove the I/O adapter. Install the new replacement storage I/O adapter that has the following parts installed on it: v The cache directory card from the new storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the new storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691.This ends the procedure 13. Identify the affected disk units using information in the Product Activity Log. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.14. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here, then view Additional Information to record the formatted log information. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, and serial number are provided for up to three disk units. Additionally, the controller type and serial number for each of these disk units indicates the adapter to which the disk was last attached when it was operational.Note: You might find more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing Enter from the Display Detail Report for Resource screen. View Additional Information for each entry, and record the formatted log information. For example: You might find an entry for an xxxx902F SRC in the Product Activity Log when the array includes more than 10 disk units. b. Continue with step 16 on page 84.15. A formatted display of hexadecimal information for Product Activity Log entries is not available. To interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, and serial number are provided for up Chapter 2. Isolation Procedures 83
to three disk units. Additionally, the controller type and serial number for each of these disk units indicates the adapter to which the disk was last attached when it was operational.Note: You might find an entry for an xxxx902F SRC entry in the Product Activity Log when the array includes more than 10 disk units. To interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 16. Has the I/O card or have the devices been repaired or reconfigured recently? Yes: Continue with the next step. No: Contact your next level of support for assistance. This ends the procedure.17. You can use one of the following repair options to correct the problem: v Reunite the adapter and disk units identified in previous steps so that the cache data can be written to the disk units. If you can find the devices and adapters and want to continue with this repair option, then continue with the next step. v If the data for the disk units identified in previous steps is not needed on this or any other system, initialize and format these disk units. Attention: This repair option causes a loss of customer data. If you want to continue with this repair option, go to step 19.18. Perform the following: a. Restore the adapter and disk units back to their original configuration. For more information, see Chapter 3, “Locating FRUs,” on page 335. After the system writes cache data to the disk units and you power off the system normally, you can move the adapter and disk units to another location. b. Power on the system. For more information, see Powering on and powering off. Does the IP complete successfully? No: Perform problem analysis to correct the new problem. This ends the procedure. Yes: This ends the procedure.19. You have chosen to initialize and format the identified disk units. Perform the following steps: Attention: Performing the following steps causes a loss of customer data. a. If you are not already using dedicated service tools, perform an IPL to DST. For more information, see “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery > Disk unit problem recovery procedures, then continue with the next step.20. Select Initialize and format disk unit. 21. Follow the online instructions to format and initialize the disk units. 22. Perform an IPL from disk. Does the IPL complete successfully? No: Perform problem analysis and correct the new problem. This ends the procedure. Yes: This ends the procedure.IOPIP31 Cache data associated with the attached devices cannot be found. You were sent to this procedure from URC 9010. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Has the system been powered off for several days? 84 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v No: Go to step 4. v Yes: The cache battery pack may be depleted. Do NOT replace the I/O adapter or the cache battery pack. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Then continue with the next step. Note: When an auxiliary cache I/O adapter connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the Reclaim does not result in lost sectors. Otherwise, the Reclaim does result in lost sectors, and the system operator may want to restore data from the most recent saved tape after you complete the repair. 3. Does the IPL complete successfully? No: Contact your next level of support. This ends the procedure. Yes: This ends the procedure. 4. Are you working with a 571F/575B card set? No: Go to step 6. Yes: Continue with the next step. 5. Remove the 571F/575B card set. Create a new card set with the following: Note: Label all parts (both original and new) before moving them. v The new replacement 571F storage IOA v The cache directory card from the original 571F storage IOA v The original 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system and go to step 7. 6. Remove the I/O adapter. Install the new replacement storage I/O adapter with the following parts installed on it: Note: Label all parts (both old and new) before moving them. v The cache directory card from the original storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the original storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691. 7. Did the 9010 SRC that sent you to this procedure occur on a type-D IPL? No: Continue with the next step. Yes: Perform a type-D IPL and continue with the next step. 8. Has a new 9010 or 9050 SRC occurred in the Service Action Log? No: Go to step 11 on page 86. Yes: Continue with the next step. 9. Was the new SRC 9050? No: Continue with the next step. Yes: Contact your next level of support. This ends the procedure.10. The new SRC was 9010. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Chapter 2. Isolation Procedures 85
Note: When an auxiliary cache I/O adapter that is connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the reclaim does not result in lost sectors. Otherwise, the reclaim does result in lost sectors, and the system operator might want to restore data from the most recent saved tape after you complete the repair. 11. Are you working with a 571F/575B card set? No: Go to step 13. Yes: Continue with the next step.12. Remove the 571F/575B card set. Create a new card set with the following: v The new 571F storage IOA v The cache directory card from the new 571F storage IOA v The new 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system.This ends the procedure. 13. Remove the I/O adapter. Install the new replacement storage I/O adapter that has the following parts installed on it: v The cache directory card from the new storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the new storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see the Chapter 8, “Part Information,” on page 691.This ends the procedureIOPIP32 You were sent to this procedure from unit reference code (URC) 9011. Attention: There is data in the cache of this I/O card, that belongs to devices other than those that are attached. Customer data may be lost. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did you just exchange the storage I/O adapter as a result of a failure? v No: Continue with the next step. v Yes: Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure.3. Have the I/O cards been moved or reconfigured recently? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Perform the following steps: 86 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Power off the system. See Powering on and powering off for details. b. Restore all I/O cards to their original position. c. Select the IPL type and mode that are used by the customer. d. Power on the system. Does the IPL complete successfully? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP33 The I/O processor card detected a device configuration error. The configuration sectors on the device may be incompatible with the current I/O processor card. You were sent to this procedure from unit reference code (URC) 9001. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to (Determine if the system has logical partitions) before continuing with this procedure. 2. Has the I/O adapter been replaced with a different type of I/O adapter, or have the devices been moved from a different type of I/O adapter to this one? v No: Contact your next level of support. This ends the procedure. v Yes: Continue with the next step.3. Does the disk unit contain data that needs to be saved? v Yes: Continue with the next step. v No: Initialize and format the disk units. Attention: Any data on the disk unit will be lost. Perform the following:a. Access SST or DST. b. Select Work with disk units. c. Did you get to DST with a type D IPL? v No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then, continue with the next step. v Yes: Continue with the next step.d. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display will show that the status is complete. This may take 30 minutes or longer. e. The disk unit is now ready to be added to the system configuration. This ends the procedure.4. The disk unit contains data that needs to be saved. v If the I/O adapter has been replaced with a different type of I/O adapter, reinstall the original I/O adapter. Then continue with the next step. v If the disk units have been moved from a different type of I/O adapter to this one, return the disk units to their original I/O adapter. Then continue with the next step.5. Stop parity protection on the disk units, and power down the system normally with the I/O adapter in an operational state. The I/O adapter or disk units can now be returned to the configuration at the beginning of this procedure. This ends the procedure.IOPIP34 You were sent to this procedure from unit reference code (URC) 9027. The I/O processor card detected that an array is not functional due to the present hardware configuration. Chapter 2. Isolation Procedures 87
1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to (Determine if the system has logical partitions) before continuing with this procedure. 2. Has the I/O adapter been replaced with a different I/O adapter, or have the devices been moved from a different I/O adapter to this one? v No: Perform (IOPIP22). This ends the procedure. v Yes: Perform the following:a. Power off the system. See (Power on/off the system and logical partitions). b. Restore all I/O cards or devices to their original position. c. Power on the system.3. Does the IPL complete successfully? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: This ends the procedure.IOPIP40 Use this procedure to isolate the problem when a storage I/O adapter is connected to an incompatible or non-operational auxiliary cache I/O adapter. Perform the following: 1. Are you working on a 571F/575B combination storage and auxiliary cache IOA card set (uses two card slot locations)? v Yes: Continue with step 2. v No: Continue with step 3.2. Find the location of the card. v Use the location displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the I/O adapter. See The System Reference Code (SRC) Format Description. v The location identified is for the 571F side of the card set.You must configure both the 571F and the 575B in the same partition. Are both the 571F side of the card set and the 575B side of the card set configured in the same partition? v Yes: Replace the entire card set. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. v No: Change the configuration so that the same partition controls both cards in the card set. This ends the procedure.3. Ensure that the SCSI cable on the last port of the storage I/O adapter is connected to the auxiliary cache I/O adapter. Do the following: a. Use the location of the storage I/O adapter displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the storage I/O adapter. See The System Reference Code (SRC) Format Description. b. Determine the location of the storage I/O adapter. See Chapter 3, “Locating FRUs,” on page 335. c. Ensure that the SCSI cable on the last port of the storage I/O adapter is properly connected to an auxiliary cache I/O adapter. d. Ensure that both the auxiliary cache I/O adapter and the storage I/O adapter are in the same partition. e. Ensure that the slot power indicator is lit for the auxiliary cache I/O adapter. If it is not, use concurrent maintenance to power on the slot.4. Ensure that the auxiliary cache I/O adapter is supported for the storage I/O adapter to which it is connected. See the PCI adapter installation instructions for information about which adapters are compatible. 88 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Replace the SCSI cable on the last port of the storage I/O adapter that connects to the auxiliary cache I/O adapter. See Chapter 8, “Part Information,” on page 691 for cable part number information. If this does not fix the problem, replace the auxiliary cache I/O adapter. This ends the procedure.IOPIP41 Use this procedure to correct the problem when an auxiliary cache I/O adapter is not connected to a storage I/O adapter or when an auxiliary cache I/O adapter is connected to an incompatible or non-operational storage I/O adapter. Perform the following: 1. Are you working on a 571F/575B combination storage and auxiliary cache IOA card set (uses two card slot locations)? v Yes: Continue with step 2. v No: Continue with step 3.2. Find the location of the card. v Use the location displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the I/O adapter. See The System Reference Code (SRC) Format Description. v The location identified is for the 575B side of the card set.You must configure both the 571F and the 575B in the same partition. Are both the 571F side of the card set and the 575B side of the card set configured in the same partition? v Yes: Replace the entire card set. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on. This ends the procedure. v No: Change the configuration so that the same partition controls both cards in the card set. This ends the procedure.3. Ensure that the SCSI cable of the auxiliary cache I/O adapter is connected to the last port of the storage I/O adapter. Do the following: a. Use the location of the auxiliary cache I/O adapter displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the auxiliary cache I/O adapter. See The System Reference Code (SRC) Format Description. b. Determine the location of the auxiliary cache I/O adapter. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on. c. Ensure that the SCSI cable of the auxiliary cache I/O adapter is properly connected to the last port of the storage I/O adapter. d. Ensure that both the auxiliary cache I/O adapter and the storage I/O adapter are in the same partition. e. Ensure that the slot power indicator is lit for the storage I/O adapter. If it is not, use concurrent maintenance to power on the slot.4. Did you just replace the auxiliary cache I/O adapter because of a failure and did the new replacement auxiliary cache I/O adapter log a 9073 URC? v Yes: The SCSI bus on the storage I/O adapter may be disabled as a result of the initial failure. Use Hardware Service Manager to re-IPL the storage I/O adapter that is connected to the new replacement auxiliary cache I/O adapter. This ends the procedure. v No: Continue with the next step.5. Ensure that the auxiliary cache I/O adapter is supported for the storage I/O adapter to which it is connected. See the PCI adapter installation instructions for information about which adapters are compatible. 6. Replace the SCSI cable that connects the auxiliary cache I/O adapter to the storage I/O adapter. See Chapter 8, “Part Information,” on page 691 for cable part number information. If this does not fix the problem, replace the storage I/O adapter. This ends the procedure.
Chapter 2. Isolation Procedures 89
Licensed internal code (LIC) isolation procedures Use this section to isolate licensed internal code (LIC) problems. Please read and observe all safety procedures before servicing the system and while preforming a procedure. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “LICIP01” on page 91Licensed Internal Code detected an IOP programming problem. “LICIP03” on page 92Dedicated service tools (DST) found a permanent program error, or a hardware failure occurred. “LICIP04” on page 93The initial program load (IPL) service function ended. “LICIP07” on page 93The system detected a problem while communicating with a specific I/O processor. 90 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“LICIP08” on page 96Licensed Internal Code detected an operating system program problem. “LICIP11” on page 96Use this procedure to isolate a system STARTUP failure in the initial program load (IPL) mode. “LICIP12” on page 107Use this procedure to isolate an Independent Auxiliary Storage Pool (IASP) vary on failure. “LICIP13” on page 111A disk unit seems to have stopped communicating with the system. “LICIP14” on page 116Licensed Internal Code detected a card slot test failure. “LICIP15” on page 117Use this procedure to help you recover from an initial program load (IPL) failure. “LICIP16” on page 119Use this procedure to identify an adapter that is operational but is not located in the same partition as its associated adapter.LICIP01 Licensed Internal Code detected an IOP programming problem. You will need to gather data to determine the cause of the problem. If using OptiConnect, and the IOP is connected to another system, then collect this information from both systems. Read the “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Is the system operational: Did the SRC come from the Service Action Log, Product Activity Log, problem log, or system operator message? v No: Go to step 9 on page 92. v Yes: Is this a x6xx5121 SRC? No: Continue with the next step. Yes: Go to step 4. 3. If the IOP has DASD attached to it, then the IOP dump is in SID87 (or SID187 if the DASD is mirrored). Copy the IOP dump. See “Working with Storage Dumps” on page 636. 4. Print the Product Activity Log, including any IOP dumps, to removable media for the day which the problem occurred. Select the option to obtain HEX data. 5. Use the ?Licensed Internal Code log? service function under DST/SST to copy the LIC log entries to removable media for the day that the problem occurred. 6. Copy the system configuration list. See “Printing the System Configuration List” on page 643. 7. Provide the dumps to Service Support. 8. Check the Logical Hardware Resource STATUS field using Hardware Service Manager. If the status is not Operational then IPL the IOP using the I/O Debug option. Ignore resources with a status of not connected. To IPL a failed IOP, the following command can be used: VRYCFG CFGOBJ(XXXX) CFGTYPE(*CTL) STATUS(*RESET) or use DST/SST Hardware Service Manager. If the IPL does not work: v Check the Service Action Log for new SRC entries. See “Using the Service Action Log” on page 609. Use the new SRC and perform problem analysis to correct the problem. v If there are no new SRCs in the Service Action Log, contact your next level of support. This ends the procedure.
Chapter 2. Isolation Procedures 91
9. Has the system stopped but the DST console is still active: Did the SRC come from the Main Storage Dump manager screen on the DST console? Yes: Continue with the next step. No: Go to step 15.10. Complete a Problem Summary Form using the information in words 1-9 from the control panel, or from the DST Main Storage Dump screen. 11. The system has already taken a partial main storage dump for this SRC and automatically re-IPLed to DST. 12. Copy the main storage dump to tape. See “Working with Storage Dumps” on page 636. 13. When the dump is completed, the system will re-IPL automatically. Sign on to DST or SST. Obtain the data in steps 3 on page 91, 4 on page 91, 5 on page 91, and 6 on page 91. 14. Provide the dumps to Service Support. This ends the procedure. 15. Has the system stopped with an SRC at the control panel? Yes: Continue with the next step. No: Go to step 2 on page 91.16. Complete a Problem Summary Form using the information in words 1-9 from the control panel, or from the DST Main storage dump screen. 17. Do NOT power off the system. Perform a manual IPL to DST, and start the Main storage dump manager service function. 18. Copy the main storage dump to tape. 19. Obtain the data in steps 3 on page 91, 4 on page 91, 5 on page 91, and 6 on page 91. 20. Re-IPL the system. 21. Has the system stopped with an SRC at the control panel? Yes: Using the new SRC, perform problem analysis to correct the problem. This ends the procedure. No: Provide the dumps to Service Support. This ends the procedure.LICIP03 Dedicated service tools (DST) found a permanent program error, or a hardware failure occurred. Read the danger notices in the “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Does URC 50FF occur? No: Continue with the next step. Yes: Go to step 4.3. Perform a main storage dump, then perform an IPL by doing the following: a. Perform a main storage dump. See “Working with Storage Dumps” on page 636. b. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. c. When the IPL has completed, the system console should be at the DST display. Go to step 7 on page 93. If the DST display does not appear on the system console, contact your next level of support.4. Perform a main storage dump, then perform an IPL by doing the following: a. Perform a main storage dump. See “Working with Storage Dumps” on page 636. b. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does a display appear? 92 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Continue with the next step. Yes: Go to step 7.5. If the problem is in the logical partition, exchange the first workstation I/O processor card on bus 1. See Removing and Replacing FRUs. 6. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. Is the IPL or Install the System display shown? Yes: Continue with the next step. No: Ask your next level of support for assistance and report the problem. This ends the procedure.7. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636. 8. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP04 The initial program load (IPL) service function ended. Dedicated service tools (DST) was in the disconnected status or lost communications with the IPL console because of a console failure and could not communicate with the user. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Select function 21 (Make DST Available) on the control panel and press Enter to start DST again. Does the DST Sign On display appear? v No: Continue with the next step. v Yes: Perform the following steps (see “Dedicated Service Tools (DST)” on page 637 for details): a. Select Start a Service Tool ? Licensed Internal Code log. b. Perform a dump of the Licensed Internal Code log to tape. See “Start a service tool” on page 642 for details. c. Return here and continue with the next step.3. Perform a main storage dump. See “Working with Storage Dumps” on page 636 for details. 4. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636 for details. 5. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP07 The system detected a problem while communicating with a specific I/O processor. The problem could be caused by Licensed Internal Code, the I/O processor card, or by bus hardware. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did a previous procedure have you power off the system, perform an IPL in Manual mode, and is the system in Manual mode now? v Yes: Continue with the next step. v No: Perform the following steps: a. Power off the system. See “Powering on and powering off” on page 588 for details. Chapter 2. Isolation Procedures 93
b. Select Manual mode on the control panel. See “IPL type, mode, and speed options” on page 642 for details. c. Power on the system. d. Continue with the next step. 3. Does the SRC that sent you to this procedure appear on the control panel? v No: Continue with the next step. v Yes: Use the information in the SRC to determine the card direct select address. If the SRC is B6006910, you can use the last 8 characters of the top 16 character line of function 13 (word 7) to find the card direct select address in BBBBCcbb format. BBBB Bus number Cc Card direct select address bb board address Go to step 11 on page 95. 4. Does the console display indicate a problem with missing disks? Yes: Continue with the next step. No: Go to step 6. 5. Perform the following steps: a. Go to the DST main menu. b. On the DST sign-on display, enter the DST full authority user ID and password. See “Dedicated Service Tools (DST)” on page 637 for details. c. Select Start a service tool ? Hardware service manager. d. Check for the SRC in the service action log. See “Using the Service Action Log” on page 609. Did you find the same SRC that sent you to this procedure? v Yes: Note the date and time for that SRC. Go to the Product Activity Log and search all logs to find the same SRC. When you have found the SRC, go to step 9 on page 95. v No: Perform the following steps: 1) Return to the DST main menu. 2) Perform an IPL and return to the Display Missing Disk Units display. 3) Go to “LICIP11” on page 96. This ends the procedure. 6. Does the SRC that sent you to this procedure appear on the console or on the alternative console? v Yes: Continue with the next step. v No: Does the IPL complete successfully to the IPL or Install the System display? Yes: Continue with the next step. No: A different SRC occurred. Use the new SRC to correct the problem. This ends the procedure. 7. Perform the following: a. Use the full-authority password to sign on to DST. b. Search All logs in the product activity log looking for references of SRC B600 5209 and the SRC that sent you to this procedure. Note: Search only for SRCs that occurred during the last IPL.Did you find B600 5209 or the same SRC that sent you to this procedure? v Yes: Go to step 10 on page 95. v No: Did you find a different SRC than the one that sent you to this procedure? Yes: Continue with the next step. No: The problem appears to be intermittent. Ask your next level of support for assistance. This ends the procedure. 8. Use the new SRC to correct the problem. This ends the procedure. 94 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Use F11 to move through alternative views of the log analysis displays until you find the card position and frame ID of the failing IOP associated with the SRC. Was the card position and frame ID available, and did this information help you find the IOP? No: Continue with the next step. Yes: Go to step 12.10. Perform the following steps: a. Display the report for the log entry of the SRC that sent you to this procedure. b. Display the additional information for the entry. c. If the SRC is B6006910, use characters 9-16 of the top 16 character line of function 13 (word 7) to find the card direct select address in BBBBCcbb format. BBBB Bus number Cc Card direct select address bb board address 11. Use the BBBBCcbb information and refer to Chapter 3, “Locating FRUs,” on page 335 to determine the failing IOP and its location. 12. Go to “MABIP55” on page 43 to isolate an I/O adapter problem on the IOP you just identified. If this fails to isolate the problem, return here and continue with the next step. 13. Is the I/O processor card you identified in step 9 or step 11 the CFIOP? v No: Continue with the next step. v Yes: Perform the following steps: a. Exchange the failing CFIOP card. See Chapter 3, “Locating FRUs,” on page 335. Note: You will be prompted for the system serial number. Ignore any error messages regarding system configuration that appear during the IPL. b. Go to step 16.14. Perform the following steps: a. Power off the system. b. Remove the IOP card. c. Power on the system.Does the SRC that sent you to this procedure appear on the control panel or appear as a new entry in the service action log or product activity log? v No: Continue with the next step. v Yes: Perform the following steps: a. Power off the system. b. Install the IOP card you just removed. c. Replace the I/O backplane (Un-P1). This ends the procedure.15. Perform the following steps: a. Power off the system. b. Exchange the failing IOP card.16. Power on the system. Does the SRC that sent you to this procedure appear on the control panel, on the console, or on the alternative console? No: Continue with the next step. Yes: Go to step 18 on page 96.17. Does a different SRC appear on the control panel, on the console, or on the alternative console? v Yes: Use the new SRC to correct the problem. This ends the procedure. v No: On the IPL or Install the System display, check for the SRC in the service action log. See “Using the Service Action Log” on page 609 for details. Chapter 2. Isolation Procedures 95
Did you find the same SRC that sent you to this procedure? Yes: Continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.18. Perform the following steps: a. Power off the system. b. Remove the IOP card you just exchanged and install the original card. c. Go to (Bus-PIP1). This ends the procedure.19. Ask your next level of support for assistance and report a Licensed Internal Code problem. You may be asked to verify that all PTFs have been applied. If you are asked to perform the following, see the following: v Copy the main storage dump from disk to tape or diskette, see “Working with Storage Dumps” on page 636. v Print the product activity log, see “Using the product activity log” on page 617. v Copy the IOP storage dump to removable media, see “Working with Storage Dumps” on page 636. This ends the procedure.LICIP08 Licensed Internal Code detected an operating system program problem. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does the same SRC occur? v Yes: Go to step 5. v No: Does the same URC appear on the console? No: Continue with the next step. Yes: Go to step 4.3. Does a different SRC occur, or does a different URC appear on the console? v Yes: Use the new SRC or reference code to correct the problem. If the procedure for the new SRC sends you back to this procedure, then continue with the next step. This ends the procedure. v No: Select Perform an IPL on the IPL or Install the System display to complete the IPL. Is the problem intermittent? Yes: Continue with the next step. No: This ends the procedure.4. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636. 5. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP11 Use this procedure to isolate a system STARTUP failure in the initial program load (IPL) mode. Ensure you have read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 96 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
How to find the cause code 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Were you given a cause code by another procedure? No: Continue with the next step. Yes: Use the cause code given by the other procedure. Then, go to step 4.3. Look at the Data display characters in word 3. You can obtain these characters by either: v Looking at word 3 on the Problem summary form that was filled out earlier. v Selecting characters 9-16 of the top 16 character line of function 12 (word 3).4. The 4 leftmost characters of word 3 represent the cause code. Select the cause code to go to the correct isolation instructions:
“0001”
“0010” on page 103
“0020” on page 105
“0031” on page 106
“0002”
“0011” on page 103
“0021” on page 105
“0033” on page 106
“0004” on page 100
“0012” on page 103
“0022” on page 105
“0034” on page 106
“0005” on page 101
“0015” on page 103
“0023” on page 105
“0035” on page 107
“0006” on page 101
“0016” on page 103
“0024” on page 105
“0037” on page 107
“0007” on page 101
“0017” on page 103
“0025” on page 106
“0099” on page 107
“0008” on page 101
“0018” on page 103
“0026” on page 106
“0009” on page 101
“0019” on page 103
“0027” on page 106
“000A” on page 101
“001A” on page 104
“000B” on page 102
“001C” on page 104
“000C” on page 102
“001D” on page 104
“000D” on page 102
“001E” on page 104
“000E” on page 102
“001F” on page 104 0001 Disk configuration is missing. 1. Select Manual mode and perform an IPL to DST for the failing partition (see “Performing an IPL to DST” on page 640). Does the Disk Configuration Error Report display appear? Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure.2. Is Missing Disk Configuration information displayed? Yes: Continue with the next step. No: Go to step 1 on page 98 for cause code 0002.3. On the Missing Disk Configuration display, perform the following: a. Select option 5 ? Display Detailed Report ? Work with disk units ? Work with disk unit recovery ? Recover Configuration. b. Follow the instructions on the display. After the disk configuration is recovered, the system automatically performs an IPL. This ends the procedure.0002 Disk units are missing from the disk configuration. Data from the control panel can be used to find information about the missing disk unit. Chapter 2. Isolation Procedures 97
1. Did you enter this procedure because all the devices listed on the Display Missing Units display (reached from the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display) have a reference code of 0000? No: Continue with the next step. Yes: Go to step 20 on page 100. 2. Have you installed a new disk enclosure in a disk unit and not restored the data to the disk unit? No: Continue with the next step. Yes: Ignore SRC A600 5090. Continue with the disk unit exchange recovery procedure. This ends the procedure. 3. Use words 1-9 from the information recorded on the Problem summary form to determine the disk unit that is missing from the configuration: v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contain the IOP direct select address. v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit type, level and model number. v Characters 9-16 of the top 16 character line of function 13 (word 7) contains the disk unit serial number. Note: For 2105 and 2107 disk units, the 5 rightmost characters of word 7 contain the disk unit serial number. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the number of missing disk units. Are the problem disk units 432x, 433x, 660x, 671x, or 673x disk units? No: Continue with the next step. Yes: Go to step 5. 4. Attempt to get all devices attached to the MSIOP to Ready status by performing the following: a. The MSIOP address (MSIOP Direct Select Address) to use is characters 1-8 of the bottom 16 character line of function 12 (word 4). b. Verify the following and correct if necessary before continuing with step 10 on page 99. v All cable connections are made correctly and are tight. v All storage devices have the correct signal bus address, as indicated in the system configuration list. v All storage devices are powered on and ready. 5. Did you enter this procedure because there was an entry in the Service Action Log which has the reference code B6005090? Yes: Continue with the next step. No: Go to step 10 on page 99. 6. Are customer jobs running on the system now? Yes: Continue with the next step. No: Ensure that the customer is not running any jobs before continuing with this procedure. Then go to step 10 on page 99. 7. Select System Service Tools (SST) ? Work with disk units ? Display disk configuration ? Display disk configuration status. Are any disk units missing from the configuration (indicated by an asterisk *)? Yes: Continue with the next step. No: This ends the procedure. 8. Do all of the disk units that are missing from the configuration have a status of ?Suspended?? Yes: Continue with the next step. 98 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Ensure that the customer is not running any jobs before continuing with this procedure. Then go to step 10. 9. Use the Service Action Log to determine if there are any entries for the missing disk units (see “Using the Service Action Log” on page 609). Are there any entries in the Service Action Log for the missing disk units that were logged since the last IPL? Yes: Use the information in the Service Action Log, and look up the reference code to read its description.). Perform the action indicated for the unit reference code. This ends the procedure. No: Go to step 21 on page 100.10. Select Manual mode and perform an IPL to DST for the failing partition (see “Performing an IPL to DST” on page 640). Does the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear? Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure.11. Does one of the following messages appear in the list? v Missing disk units in the configuration v Missing mirror protected disk units in the configuration Yes: Continue with the next step. No: Go to step 16.12. Select option 5. Do the missing units have device parity protected status? (Device parity protection status is indicated by ?DPY/? as the first four characters of the status.) Yes: Continue with the next step. No: Go to step 14.13. Is the status DPY/Active? Yes: Continue with the next step. No: Use the Service Action Log to determine if there are any entries for the missing disk units or the IOA/IOP controlling them. See “Using the Service Action Log” on page 609 for details. This ends the procedure.14. Press F11, and press Enter to display the details. Do all of the disk units listed on the display have a reference code of 0000? Yes: Continue with the next step. No: Use the disk unit reference code shown on the display and look up the reference code.. Perform the action indicated for the unit reference code. This ends the procedure.15. Do all of the IOPs or devices listed on the display have a reference code of 0000? No: Use the IOP reference code shown on the display and look up the reference code.. Perform the action indicated for the reference code. This ends the procedure. Yes: Go to step 20 on page 100.16. Does the following message appear in the list: Unknown load-source status? Yes: Continue with the next step. No: Go to step 18.17. Select option 5, press F11, and then press Enter to display the details. Does the Assign Missing Load Source Disk display appear? No: Continue with the next step. Yes: Press Enter to assign the missing load-source disk unit. This ends the procedure.18. Does the following message appear in the list? Load source failure Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure.
Chapter 2. Isolation Procedures 99
19. Select option 5, press F11, and then press Enter to display the details. 20. The number of failing disk unit facilities (actuators) is the number of disk units displayed. A disk unit has a Unit number greater than zero. Find the failing disk unit by type, model, serial number, or address displayed on the console. 21. Is there more than one failing disk device attached to the IOA or MSIOP? Yes: Continue with the next step. No: Go to step 24.22. Use the SAL to determine if there are any entries that occurred around the time of the A6xx/B6xx 5090 SRC (see “Using the Service Action Log” on page 609). Are there any such entries? No: Continue with the next step. Yes: Use the information in the SAL and look up the reference code.). Perform the action indicated for the unit reference code. This ends the procedure.23. Are all the disk devices that are attached to the IOA or MSIOP failing? (If the disk units are using mirrored protection, select Display Disk Status to find out.) No: Continue with the next step. Yes: Go to step 25.24. Look up the SRC listed below for the specific disk unit and exchange the FRUs shown one at a time. Then return here and answer the question below the listed disk units.
Disk Unit
SRC to look up
2105
21053002
2107
21073002
432x
432x3002
433x
433x3002
660x
660x3002
671x
671x3002
673x
673x3002 Did the disk unit service information correct the problem? No: Continue with the next step. Yes: This ends the procedure.25. Exchange the IOA or MSIOP (see Chapter 3, “Locating FRUs,” on page 335). If exchanging the IOA or MSIOP did not correct the problem, use the original SRC and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, Look up the FI code to help determine parts and locations. This ends the procedure.0004 Some disk units are unprotected but configured into a mirrored ASP. These units were originally DPY protected but protection was disabled. Perform the following: 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638 ). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Select Work with disk units and take the actions to protect the system. 100 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If you do not know what actions to take, select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. When the Disk configuration error report appears, the recovery actions are listed in the Help text for the error message ?Unprotected disk units in a mirrored ASP?. This ends the procedure.0005 A disk unit using parity protection is operating in exposed mode. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Choose from the following options: v If the same reference code appears, ask your next level of support for assistance. v If no reference code appears and the IPL completes successfully, the problem is corrected. v If a different reference code appears, use it to perform problem analysis and correct the new problem. This ends the procedure.0006 There are new devices attached to the system that do not have Licensed Internal Code installed. Ask your next level of support for assistance. 0007 Some of the configured disk units have device parity protection disabled when the system expected device parity protection to be enabled. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Correct device parity protection. b. Follow the online instructions. This ends the procedure.0008 A disk unit has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See The System Reference Code (SRC) Format Description. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 433x 102E, 660x 102E, 671x 102E, 673x 102E ( Look up a reference code.).This ends the procedure.0009 The procedure to restore a disk unit from the tape unit did not complete. Continue with the disk unit exchange recovery procedure. 000A There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. Perform the following: 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. Chapter 2. Isolation Procedures 101
No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes associated with the missing disk units. 6. Go to Reference codes to correct the problem. This ends the procedure.000B Some system IOPs require cache storage be reclaimed. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Reclaim the cache adapter card storage. See “Reclaiming IOP cache storage” on page 636. Note: The system operator may want to restore data from the most recent saved tape after you complete the repair. This ends the procedure.000C One of the mirror protected disk units has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See “System Reference Code (SRC) Information” on page 601. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 433x 102E, 660x 102E, 671x 102E, 673x 102E ( Look up a reference code.). This ends the procedure.000D The system disk capacity has been exceeded. For more information about disk capacity, see iSeries Handbook, GA19-5486-20. 000E Start compression failure. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Recover from start compression failure. b. Follow the on-line instructions. This ends the procedure.
102 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
0010 The disk configuration has changed. The operating system must be installed again, and all customer data must be restored. 1. Select Manual mode on the control panel. 2. Perform an IPL to reinstall the operating system. 3. The customer must restore all data from the latest system backup. This ends the procedure.0011 The serial number of the control panel does not match the system serial number. 1. Select Manual mode on the control panel. 2. Perform an IPL. You will be prompted for the system serial number. This ends the procedure.0012 The operation to write the vital product data (VPD) to the control panel failed. Exchange the multiple function I/O processor card. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on for the location of the card and a link to the remove and replace procedure. 0015 The mirrored load-source disk unit is missing from the disk configuration. Go to step 1 on page 98 for cause code 0002. 0016 A mirrored protected disk unit is missing. Wait six minutes. If the same reference code appears, go to step 1 on page 98 for cause code 0002. 0017 One or more disk units have a lower level of mirrored protection than originally configured. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Review the detailed display, which shows the new and the previous levels of mirrored protection. This ends the procedure.0018 Load-source configuration problem. The load-source disk unit is using mirrored protection and is configured at an incorrect address. Ensure that the load-source disk unit is in device location 1. 0019 One or more disk units were formatted incorrectly. The system will continue to operate normally. However, it will not operate at optimum performance. To repair the problem, perform the following: 1. Record the unit number and serial number of the disk unit that is formatted incorrectly. 2. Sign on to DST. See “Accessing Dedicated Service Tools” on page 637. 3. Select Work with disk units ? Work with disk unit configuration ? Remove unit from configuration. 4. Select the disk unit you recorded earlier in this procedure. 5. Confirm the option to remove data from the disk unit. This step may take a long time because the data must be moved to other disk units in the auxiliary storage pool (ASP). 6. When the remove function is complete, select Add unit to configuration. Chapter 2. Isolation Procedures 103
7. Select the disk unit you recorded earlier in this procedure. 8. Confirm the add. The disk unit is formatted during functional operation. This ends the procedure.001A The load-source disk unit data is down-level. The load-source disk unit is mirror protected. The system is using the load-source disk unit that does not have the current level of data. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. Does the Disk Configuration Error Report display appear? No: The system is now using the correct load-source. This ends the procedure. Yes: Continue with the next step.2. Does a ?Load source failure? message appear in the list? Yes: Continue with the next step. No: The system is now using the correct load-source. This ends the procedure.3. Select option 5, press F11, and then press Enter to display details. The load-source type, model, and serial number information that the system needs is displayed on the console. Is the load-source disk unit (displayed on the console) attached to an MSIOP that cannot be used for a load-source? Yes: Contact your next level of support. This ends the procedure. No: The load-source disk unit is missing. Go to step 1 on page 98 for cause code 0002.001C The disk units that are needed to update the system configuration are missing. Perform an IPL by doing the following: 1. Select Manual mode on the control panel. 2. Perform an IPL. Use the IPL information to determine the cause of the problem. This ends the procedure.001D 1. Is the Disk Configuration Attention Report, or the Disk Configuration Warning Report displayed? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.2. On the Bad Load Source Configuration message line, select 5, and press Enter to rebuild the load-source configuration information. If there are other types of warnings, select option 5 on the warnings, and correct the problem. This ends the procedure.001E The load-source data must be restored. 001F Licensed Internal Code was installed on the wrong disk unit of the load-source mirrored pair. The system performed an IPL on a load source that may not contain the same level of Licensed Internal Code that was installed on the other load source. The type, model, and address of the active device are displayed in words 4-7 of the SRC. Choose from the following options: 104 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. If the load-source disk unit in position 1 contains the correct level of Licensed Internal Code, perform the following: a. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. Is the Disk Configuration Attention Report or Disk Configuration Warning Report displayed? Yes: Select option 5 on the Incorrect Licensed Internal Code Install message line. When the Display Incorrect Licensed Internal Code Install display appears on the console, press Enter. No: The system is now using the correct load source. This ends the procedure.2. If the load-source disk unit in position 1 of the system unit does not contain the correct level of Licensed Internal Code, restore the Licensed Internal Code to the disk unit in position 1 of the system unit. This ends the procedure.0020 The system appears to be a one disk unit system. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 0021 The system password verification failed. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. When prompted, enter the correct system password. If the correct system password is not available perform the following: a. Select Bypass the system password. b. Have the customer contact the marketing representative immediately to order a new system password from your service provider. This ends the procedure.0022 A different compression status was expected on a reporting disk unit. Accept the warning. The reported compression status will be used as the current compression status. 0023 There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. The system is capable of IPLing in this state. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes associated with the missing disk units. 6. Go to the Reference codes topic and use the SRC information to correct the problem. This ends the procedure.0024 The system type or system unique ID needs to be entered. Chapter 2. Isolation Procedures 105
1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. When prompted, enter the correct system type or system unique ID. This ends the procedure.0025 Hardware Resource Information Persistence disabled. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Contact your next level of support for instructions on how to enable the Hardware Resource Information Persistence function. This ends the procedure.0026 A disk unit is incorrectly configured for an LPAR system. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes (B6xx 53xx) that are associated with the error. 6. Using the SRC information, look up the reference code. and use the information to correct the problem. This ends the procedure.0027 The user ASP has overflowed. Contact your next level of support. 0031 A problem was detected with the installation of Licensed Internal Code service displays. The cause may be defective media, the installation media being removed too early, a device problem or a Licensed Internal Code problem. v Ask your next level of support for assistance. Characters 13-16 of the top 16 character line of function 12 (4 rightmost characters of word 3) contain information regarding the install error. v If the customer does not require the service displays to be in the national language, you may be able to continue by performing another system IPL. This ends the procedure.0033 System model not supported. This model of hardware does not support the System Licensed Internal Code version and release that is being used. Use a supported version and release of the System Licensed Internal Code. 0034 Insufficient main storage capacity. There is not enough main storage capacity. For details about how much more capacity is required, see the ?Insufficient Main Storage Capacity? screen, which is displayed when the system is IPLed in manual mode. Typically, this error occurs when you have moved memory between logical partitions, and one partition no longer has a sufficient amount of main storage. 106 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
0035 Data from a User ASP has overflowed into the System ASP (ASP 1). There is not enough free space in the User ASP to move the overflowed data from the System ASP back into the User ASP. The system will continue to run in this condition, but if a disk failure in the System ASP causes the System ASP to be cleared, the data in the User ASP will also be cleared out. You should delete some files or objects from the User ASP so that enough free space exists in the User ASP to allow the data that is overflowed into the System ASP to be moved back. 0037 One or more functional connections to a disk unit in a multi-path environment have not been detected. The connections to the disk unit were established by running ESS Specialist. If you use the server in this state, you may cause a loss of data. You must ensure that all of the functional connections are still established between the disk and the Input/Output Adapters (IOAs) attached to this server and this logical partition. If there is an IOA which has a connection to the disk unit that has been moved to a different logical partition or different server, you should not continue with the IPL. Notify your next level of support. 0099 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. LICIP12 Use this procedure to isolate an Independent Auxiliary Storage Pool (IASP) vary on failure. Message CPDB8E0 occurred if the user attempted to vary on the IASP. Read the Danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. How to find the cause code 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Were you given a cause code by another procedure? No: Continue with the next step. Yes: Use the cause code given by the other procedure. Then go to step 4.3. Look at the characters in word 3. You can obtain these characters by doing the following: a. On the command line, enter the Start System Service Tools (STRSST) command. If you cannot get to SST, use function 21 to get to DST. Do not IPL the system to get to DST. b. On the Start Service Tools Sign On display, type in a User ID with service authority and password. c. Select Start a Service Tool ? Hardware Service Manager ? Work with service action log. d. On the Select Timeframe display, change the From: Date and Time to a date and time prior to when the user attempted to vary on the IASP. e. Search for a B6005094 system reference code that occurred at the time the user attempted to vary on the IASP. Display the failing item information for this entry. f. Select the function key for Additional details. g. The 4 leftmost characters of word 3 is the cause code to be used in this procedure.4. Find the cause code below:
“0002” on page 108
“000A” on page 110
“002C” on page 110
“0030” on page 111
“0004” on page 109
“000B” on page 110
“002D” on page 111
“0032” on page 111 Chapter 2. Isolation Procedures 107
“0007” on page 109
“000D” on page 110
“002E” on page 111
“0099” on page 111
“0009” on page 110
“000E” on page 110
“002F” on page 111 0002 Disk units are missing from the IASP disk configuration. 1. Have you installed a new disk enclosure in a disk unit and not restored the data to the disk unit? v No: Continue with the next step. v Yes: Ignore SRC A600 5094. Continue with the disk unit exchange recovery procedure. This ends the procedure. 2. Use words 1-9 from the information in the Service Action Log to determine the disk unit that is missing from the configuration: v Word 4 contains the IOP direct select address. v Word 5 contains the unit address. v Word 6 contains the disk unit type, level and model number. v Word 7 contains the disk unit serial number. v Word 8 contains the number of missing disk units. Are the problem disk units 432x, 660x, or 671x Disk Units? v Yes: Continue with the next step. v No: Attempt to get all devices attached to the IOP to Ready status by performing the following: a. The IOP address (IOP Direct Select Address) to use is Word 4. b. Verify the following, and correct if necessary: – Ensure all cable connections are made correctly and are tight. – Ensure the configuration within the device is correct. – Ensure all storage devices are powered on and ready.c. Continue with the next step. 3. Perform the following: Select System Service Tools (SST) ? Work with disk units ? Display disk configuration ? Display disk configuration status. Are any disk units missing-indicated with an asterisk (*)- from the IASP configuration? Yes: Continue with the next step. No: This ends the procedure. 4. Use the Service Action Log to determine if there are any entries other than B6xx 5094 for the missing disk units or the IOA or IOP that is controlling them. See “Using the Service Action Log” on page 609. Are there any entries in the Service Action Log other than B6xx 5094 for the missing disk units or the IOA or IOP that is controlling them? No: Continue with the next step. Yes: Use the information in the Service Action Log to solve the problem. See “Using the Service Action Log” on page 609. This ends the procedure. 5. Did you enter this procedure because there was a B6xx 5094 cause code of 0030? v No: Continue with the next step. v Yes: Work with the customer to recover the unknown configuration source disk unit. Use a workstation with System i Navigator installed to select the disk pool with the problem, and then select Recover unknown configuration source for this disk pool. This ends the procedure.
108 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Use Hardware Service Manager to display logical resources connected to the IOP. See “Hardware Service Manager” on page 644. 7. Is every device attached to the IOP failing? v Yes: Continue with the next step. v No: Are all of the disk units that are attached to one IOA missing? – No: Continue with the next step. – Yes: Exchange the IOA. Use the IOP direct select address and the first character of the unit address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. 8. Is there more than one storage IOA attached to the IOP? v Yes: Exchange the IOP. Use the IOP direct select address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. v No: Go to step 10. 9. Go to the service information for the specific disk unit that is listed below and perform the action indicated. Then return here and answer the following question. v 2105 Disk Units: Use SRC 3002 exchange the FRUs shown one at a time. v 432x, 660x, 671x Disk Units: Use SRC 3002 and exchange the FRUs shown one at a time. Did the disk unit service information correct the problem? No: Continue with the next step. Yes: This ends the procedure.10. Perform the following: a. Exchange the IOA. Use the IOP direct select address and the first character of the unit address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. b. If exchanging the IOA does not correct the problem, exchange the IOP. Use the IOP direct select address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. c. If exchanging the IOP does not correct the problem, exchange the failing items in the following FRU list starting with the first item in the list. 1) FI01140 2) System backplane 3) FI00580 4) AJDG301This ends the procedure.0004 Some disk units are unprotected but configured into a mirrored IASP. These units were originally DPY protected but protection was disabled. Direct the customer to take the actions necessary to start protection on these disk units. This ends the procedure. 0007 Some of the configured disk units have device parity protection disabled when the system expected device parity protection to be enabled. 1. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Correct device parity protection mismatch. Chapter 2. Isolation Procedures 109
b. Follow the on-line instructions. This ends the procedure.0008 A disk unit has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See The System Reference Code (SRC) Format Description. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 660x 102E, 671x 102E This ends the procedure.0009 The procedure to restore a disk unit from the tape unit did not complete. Continue with the disk unit exchange recovery procedure. This ends the procedure. 000A There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. Use the Service Action Log to find system reference codes associated with the missing disk units by changing the From: Date and Time on the Select Timeframe display to a date and time prior to when the user attempted to vary on the IASP. For information on how to use the Service Action Log, see “Using the Service Action Log” on page 609. This ends the procedure. 000B Some system IOPs require cache storage be reclaimed. 1. Start SST. 2. Reclaim the cache adapter card storage by performing the following: a. Select Work with disk units ? Work with disk unit recovery ? Reclaim IOP Cache Storage. b. Follow the on-line instructions to reclaim cache storage. c. After you complete the repair, the system operator may want to restore data from the most recently saved tape. This ends the procedure.000D The system disk capacity has been exceeded. For more information about disk capacity, see the iSeries Handbook. This ends the procedure. 000E Start compression failure. 1. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Recover from start compression failure. b. Follow the on-line instructions. This ends the procedure.002C A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. 110 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
002D The IASP configuration source disk unit data is down-level. The system is using the IASP configuration source disk unit that does not have the current level of data. Work with the customer to recover the configuration. On a workstation with System i Navigator installed, select the disk pool with the problem, and then select Recover configuration. This ends the procedure. 002E The Independent ASP is assigned to another system or a Licensed Internal Code program error occurred. Work with the customer to check other systems to determine if the Independent ASP has been assigned to it. If the Independent ASP has not been assigned to another system, ask your next level of support for assistance. This ends the procedure. 002F The system version and release are at a different level than the IASP version and release. The system version and release must be upgraded to be the same as the system version and release in which the IASP was created. This ends the procedure. 0030 The mirrored IASP configuration source disk unit has a disk configuration status of unknown and is missing from the disk configuration. Go to step 1 on page 108 for cause code 0002. 0032 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. 0099 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. LICIP13 A disk unit seems to have stopped communicating with the system. The system has stopped normal operation until the cause of the disk unit failure is found and corrected. Ensure you have read the Danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. If the disk unit that stopped communicating with the system has mirrored protection active, normal operation of the system stops for one to two minutes. Then the system suspends mirrored protection for that disk unit and continues normal operation. Note: Do not power off the system or partition using the white button, function 08, ASMI, or HMC immediate power-off when performing this procedure. If this procedure or other isolation procedures referenced by this procedure direct you to IPL or power off the system, Chapter 2. Isolation Procedures 111
v perform a partition main storage dump (see “Performing dumps” on page 653), or v if additional dump information is not needed, perform a function 03 IPL or restart the system or partition using the HMC. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Was a problem summary form completed for this problem? No: Continue with the next step. Yes: Use the problem summary form information and go to step 4. 3. Fill out a Problem Reporting Form ... completely with the instructions provided. 4. Recovery from a device command time-out may have caused the communications loss condition (indicated by an SRC on the control panel or in the HMC). This communications loss condition has the following symptoms: v The A6xx SRC does not increment within two minutes. v The system continues to run normally after it recovers from the communications loss condition and the reference code is cleared from the control panel. Does the communication loss condition have the above symptoms? Yes: Continue with the next step. No: Go to step 7. 5. Verify that all Licensed Internal Code PTFs have been applied to the system. Apply any Licensed Internal Code PTFs that have not been applied to the system. Does the intermittent condition continue? Yes: Print all product activity logs. Print the LIC logs with a major code of 1000. Provide this information to your next level of support. This ends the procedure. No: This ends the procedure. 6. Is the storage hosted by another partition? Yes: Contact your next level of support. No: Continue with the next step. 7. A manual reset of the IOP may clear the attention reference code. Perform the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 25 and press Enter. c. Select Function 26 and press Enter. d. Select Function 67 and press Enter to reset the IOP. e. Wait 10 minutes. f. Select Function 25 and press Enter to disable the service functions on the control panel.If you are working from the HMC: a. These need to be updated for the new HMC UI.... b. In the Navigation Area, open the Service Applications folder. c. Select Service Focal Point. d. In the contents area, select Service Utilities. e. In the Service Utilities window, select the system you are working on. f. Select Selected ? Operator Panel Service Functions. g. Select the logical partition, and then select Partition Functions. h. Select Disk Unit IOP Reset/Reload (67). i. Wait 10 minutes.
112 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Did the reset successfully clear the control panel SRC or HMC panel value and can commands be entered on the partition console? No: Continue with the next step. Yes: Look for a Service Action Log (SAL) entry since the last IPL, and use it to fix the problem (see “Using the Service Action Log” on page 609). If a B6xx 5090 SRC occurred since the last IPL, look for other SRC entries and take action on them first. This ends the procedure. 8. Is the SRC the same reference code that sent you here? Yes: The same reference code occurred. Continue with the next step. No: Collect all words of the reference code and perform, problem analysis to resolve the new problem. This ends the procedure. 9. Powering off and powering on the affected IOP domain may clear the attention reference code. Perform the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 25 and press Enter. c. Select Function 26 and press Enter. d. Select Function 68 and press Enter to power off the domain. e. After the domain has been powered off or 10 minutes have passed, select Function 69 and press Enter to power on the domain. f. Wait 10 minutes. g. Select Function 25 and press Enter to disable the service functions on the control panel.If you are working from the HMC: a. In the Navigation Area, open the Service Applications folder. b. Select Service Focal Point. c. In the contents area, select Service Utilities. d. In the Service Utilities window, select the system you are working on. e. Select Selected ? Operator Panel Service Functions. f. Select the logical partition, and then select Partition Functions. g. Select Power off domain (68). h. After the domain has been powered off or 10 minutes have passed, select Power on domain (69). i. Wait 10 minutes.Did this successfully clear the control panel SRC or HMC panel value, and can commands be entered on the partition console? No: Continue with the next step. Yes: Look for a SAL entry since the last IPL, and use it to fix the problem (see “Using the Service Action Log” on page 609). If a B6xx 5090 SRC occurred since the last IPL, look for other SRC entries and take action on them first. This ends the procedure.10. Is the SRC the same reference code that sent you here? Yes: The same reference code occurred. Continue with the next step. No: Collect all words of the reference code and perform problem analysis to resolve the new problem. This ends the procedure.11. Perform a main storage dump, then perform an IPL by performing the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 22 and press Enter to dump the main storage to the load-source disk unit. c. Wait for SRC A100 300x to occur, indicating that the dump is complete. d. Then perform an IPL to DST (see “Performing an IPL to DST” on page 640).
Chapter 2. Isolation Procedures 113
If you are working from the HMC: a. In the Navigation Area, open Server and Partition. b. Select Server Management. c. In the contents area, open the server on which the logical partition is located. d. Select Partitions. e. Right-click the logical partition profile and select Restart Partition. f. In the Restart Partition window, select the Dump restart option. Does a different SRC occur, or does a display appear on the console showing reference codes? No: Continue with the next step. Yes: Perform problem analysis to correct the new problem. This ends the procedure.12. Does the same reference code occur? v Yes: Continue with the next step. v No: The problem is intermittent. Perform the following: a. Print the system product activity log for the magnetic storage subsystem and print the LIC logs with a major code of 1000. b. Copy the main storage dump to removable media (see “Managing dumps” on page 653). c. Contact your next level of support and provide them with this information. This ends the procedure.13. Are characters 7-8 of the top 16 character line of function 12 (2 rightmost characters of word 2) equal to 13 or 17? Yes: Continue with the next step. No: Go to step 16.14. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system: v Characters 9-16 of the top 16 character line of function 12 (word 3) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contains the unit address. v Characters 1-8 of the top 16 character line of function 13 (word 6) may contain the disk unit type, level and model number. v Characters 13-16 of the top 16 character line of function 13 (4 rightmost characters of word 7) may contain the disk unit reference code. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) may contain the disk unit serial number. Note: For 2105 and 2107 disk units, characters 4-8 of the bottom 16 character line of function 13 (5 rightmost characters of word 8) contain the disk unit serial number.15. Is the disk unit reference code 0000? v No: Using the information from step 14, find the table for the indicated disk unit type. Perform problem analysis for the disk unit reference code. This ends the procedure. v Yes: Perform the following steps: a. Determine the IOP type by using characters 9-12 of the bottom 16 character line of function 13 (4 leftmost characters of word 9). b. Find the unit reference code table for the IOP type. Determine the unit reference code by using characters 13-16 of the bottom 16 character line of function 13 (4 rightmost characters of word 9). c. Perform problem analysis for the unit reference code. This ends the procedure.16. Are characters 7-8 of the top 16 character line of function 12 (the two rightmost characters of word 2) equal to 27? 114 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Yes: Continue with the next step. No: Go to step 20.17. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system: v Characters 9-16 of the top 16 character line of function 12 (word 3) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contains the disk unit address v Characters 9-16 of the bottom 16 character line of function 12 (word 5) contains the disk unit type, level and model number. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit serial number. Note: For 2105 and 2107 Disk Units, characters 4-8 of the bottom 16 character line of function 13 (5 rightmost characters of word 8) contain the disk unit serial number. v Characters 13-16 of the bottom 16 character line of function 13 (4 rightmost characters of word 9) contain the disk unit reference code.18. Is the disk unit reference code 0000? v No: Continue with the next step. v Yes: Find the table for the indicated disk unit type. Then find unit reference code (URC) 3002 in the table, and exchange the FRUs for that URC, one at a time. Note: Do not perform any other isolation procedures that are associated with URC 3002.This ends the procedure.19. Are characters 9-16 of the bottom 16 character line of function 13 (word 9) B6xx 51xx? Yes: Using the B6xx table, perform problem analysis for the 51xx unit reference code. This ends the procedure. No: Using the information from step 17, find the table for the indicated disk unit type. Perform problem analysis for the disk unit reference code. This ends the procedure.20. Are the 2 rightmost characters of word 2 on the Problem summary form equal to 62? No: Use the information in characters 9-16 of the bottom 16 character line of function 13 (word 9) and use this information instead of the information in word 1 for the reference code. This ends the procedure. Yes: Continue with the next step.21. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 00010004? Yes: Continue with the next step. No: Go to step 24 on page 116.22. Are characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) equal to 0000? No: Continue with the next step. Yes: Go to step 25 on page 116.23. Note the following: v Characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) contain the disk unit reference code. v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit address. v Characters 9-16 of the top 16 character line of function 13 (word 7) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit type, level and model number.
Chapter 2. Isolation Procedures 115
Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 - 4 leftmost characters of word 8), and use characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) as the unit reference code. This ends the procedure. 24. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 0002000D? v Yes: Continue with the next step. v No: Use the information in characters 9-16 of the bottom 16 character line of function 13 (word 9), instead of the information in word 1 for the reference code, and perform problem analysis. – Characters 1-8 of the top 16 character line of function 13 (word 6) may contain the disk unit address. – Characters 9-16 of the top 16 character line of function 13 (word 7) may contain the IOP direct select address. – Characters 1-8 of the bottom 16 character line of function 13 (word 8) may contain the disk unit type, level and model number. This ends the procedure.25. Note the following: v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit address. v Characters 9-16 of the top 16 character line of function 13 (word 7) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit type, level and model number. Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 (4 leftmost characters of word 8) and use 3002 as the unit reference code. Exchange the FRUs for URC 3002 one at a time. This ends the procedure.LICIP14 Licensed Internal Code detected a card slot test failure. 1. Has the I/O adapter moved to a new card location? Yes: Continue with the next step. No: Go to step 4.2. Perform one of the following, and then continue with the next step: v Use the concurrent maintenance option in Hardware Service Manager in SST/DST to power off, remove, reinsert, and power on the I/O adapter. OR v Power off the system, remove and reinsert the I/O adapter. Then IPL the system.3. Does the reference code occur again for this same I/O adapter? v Yes: Continue with the next step. v No: No further service action is needed. This ends the procedure.4. Move the I/O adapter to a different card location, that has no I/O processors in the PCI bridge set, by performing one of the following, and then continue with the next step: v Use the concurrent maintenance option in Hardware Service Manager in SST/DST to power off, remove the I/O adapter, install the I/O adapter in a different card location, and power on the I/O adapter.OR v Power off the system, remove the I/O adapter, install the I/O adapter in a different card location, and then IPL the system.5. Does the same reference code occur again for this I/O adapter? v Yes: Replace the I/O adapter. This ends the procedure. v No: Replace the backplane. 116 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
This ends the procedure.LICIP15 Use this procedure to help you recover from an initial program load (IPL) failure. 1. Is the system HMC-managed or managed by Integrated Virtualization Manager (IVM)? Yes: Continue with the next step. No: Go to step 4. 2. Check the LPAR configuration to ensure that the load source and alternate load source devices are valid. Is the LPAR configuration correct? Yes: Continue with the next step. No: Correct the LPAR configuration problem. This ends the procedure. 3. Is the load source hosted by another partition? Yes: Contact your next level of support. No: Continue with the next step. 4. Did the failure occur when you were performing a type-D IPL? v No: Go to step 10 on page 118. v Yes: Perform the following: a. Ensure that the device is ready and has valid install media. b. Ensure that the device has the correct SCSI address and that any cables are properly connected and terminated. If a correction is made during the above checks, retry the IPL. If none of the above items resolve the problem, continue with the next step. 5. Are the load source and alternate load source devices controlled by the same I/O adapter, and does the load source disk unit have SLIC loaded on it? Yes: Continue with the next step. No: Go to step 7. 6. Perform a type-B IPL in manual mode. Does the same SRC occur? v No: Continue with the next step. v Yes: Replace the following items, one at a time, and retry the IPL until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. The I/O adapter controlling load source and alternate load source devices. Note: The I/O adapter may be embedded on the system unit backplane. b. The common cable, if present, attached between both the load source and alternate load source and the controlling I/O adapter. c. If none of the items above resolve the problem, contact your next level of support. This ends the procedure. 7. Replace the following items, one at a time, and retry the type-D IPL until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Media in the alternate load source device b. Device cables (if present) c. Media device d. Media backplane e. I/O adapter controlling the alternate load source device Note: The I/O adapter may be embedded on the system unit backplane f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure.
Chapter 2. Isolation Procedures 117
8. You performed a type A or type B IPL. Is the load source I/O adapter a Fibre Channel adapter? Yes: Continue with the next step. No: Continue with step 10. 9. Perform a type-D IPL in manual mode to DST. Look for other SRCs and use them to resolve the problem. If there are no SRCs, or if the SRCs do not resolve the problem, perform the actions for the 2847 3100 SRC. This ends the procedure. 10. Is the device in a valid location (see Chapter 3, “Locating FRUs,” on page 335)? Yes: Continue with the next step. No: Correct the device location problem and retry the IPL. If the problem persists, continue with the next step.11. Perform a type-D IPL in manual mode to DST. Is the type-D IPL successful? v No: Continue with the next step. v Yes: Look for other SRCs and use them to resolve the problem. If there are no SRCs, or the SRCs do not resolve the problem, replace the following items, one at a time, until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Load source disk drive b. Cables (if present) c. Disk drive backplane d. I/O adapter controlling the load source device Note: The I/O adapter may be embedded on the system unit backplane e. Backplane that the I/O adapter is plugged into f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure.12. The type-D IPL in manual mode to DST was not successful. Is the I/O adapter embedded on the system unit backplane? No: Continue with the next step. Yes: Replace the system unit backplane and retry the IPL. If the IPL still fails, contact your next level of support. This ends the procedure.13. Are the load source and alternate load source controlled by the same I/O adapter? No: Go to step 16 on page 119. Yes: Continue with the next step.14. Replace the I/O adapter and perform a type-A or type-B IPL. Does the IPL complete successfully? Yes: This ends the procedure. No: Continue with the next step.15. Perform a type-D IPL in manual mode to DST. Is the type-D IPL successful? v No: Continue with the next step. v Yes: Look for other SRCs and use them to resolve the problem. If there are no SRCs, or the SRCs do not resolve the problem, replace the following items, one at a time, until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Load source disk drive b. Cables (if present) c. Disk drive backplane d. I/O adapter controlling the load source device Note: The I/O adapter may be embedded on the system unit backplane e. Backplane that the I/O adapter and I/O processor are plugged into 118 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure.16. Replace the backplane that the I/O adapter is plugged into and retry the IPL. If the IPL still fails, contact your next level of support. This ends the procedure.LICIP16 Use this procedure to identify an adapter that is operational but is not located in the same partition as its associated adapter. An adapter identified that its associated adapter is operational but is not located in the same partition. Use this procedure to identify the serial number and then find the location of the associated adapter and reassign it so that both adapters are in the same partition. Note: If the associated adapter is located in a different i5/OS partition, there might also be a B600690A logged against the associated adapter in that partition. 1. The adapter against which the B600690A is logged has identified that its associated adapter can not be found in this partition. Find the resource name that this error was logged against. This can be obtained from the Service Action Log (see ?Using the Service Action Log?). Then, using the resource name, perform the following: a. Access SST or DST. b. Select Start a Service Tool. c. Select Hardware Service Manager. d. Select Locate resource by resource name. e. Enter the resource name that this error was logged against. f. Take the option to Display detail for the adapter.2. The bottom of the resource detail screen displays any combination of the following information: Attached storage IOA resource name. : Attached storage IOA serial number. : Attached storage IOA link status. . : Or Attached auxiliary IOA resource name: Attached auxiliary IOA serial number: Attached auxiliary IOA link status. : Or Remote storage IOA resource name. . : Remote storage IOA serial number. . : Remote storage IOA link status. . . : 3. Using the serial number information displayed for the Attached or Remote IOA, have the customer determine which partition currently owns the adapter with that serial number by using logical resource or VPD utilities in each of the partitions on the system. Note: The CCIN of the associated adapter is the first four characters of word 6 of the SRC. 4. Then, have the customer ensure that both adapters are owned by the same partition. For further assistance, the customer should contact their software service provider. This ends the procedure. Chapter 2. Isolation Procedures 119
Logical partition (LPAR) isolation procedure These procedures help you to identify logical partition (LPAR) configuration conditions and the associated corrective actions. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “LPRIP01”Use this procedure to isolate the problem when LPAR configuration data does not match the current system configuration.LPRIP01 Use this procedure to isolate the problem when LPAR configuration data does not match the current system configuration. 1. Is there only one B6005311 error logged, and is it against the load source device for the partition, in either the Primary or a secondary partition? v Yes: Is the reporting partition the Primary partition? – Yes: Continue with the next step. – No: Go to step 3 on page 121.
120 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v No: Go to step 4. 2. Was the load source disk unit migrated from another partition within the same system? v Yes: Is this load source device intended to be the load source for the Primary partition? – Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unit. This ends the procedure. – No: Power off the system. Return the original load source disk to the Primary partition and perform a system IPL. This ends the procedure.v No: The load source disk unit has not changed. Contact your next level of support. This ends the procedure. 3. The reporting partition is a secondary partition. Since the last IPL of the reporting partition, have one of the following occurred? v Has the Primary partition time/date been moved backward to a time/date earlier than the previous setting? v Has the system serial number been changed? v Was the load source disk unit in this secondary partition, replaced intentionally with a load source from another system or another partition from the same system?v Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. v No: Contact your next level of support. This ends the procedure. 4. Are there multiple B600 5311 SRCs logged in the same partition? v Yes: Continue with the next step. v No: None of the conditions in this procedure have been met, call your next level of support. This ends the procedure. 5. Is the resource for one of the B600 5311 SRCs the load source device and are all of the other B600 5311 entries for resources which are non-configured disk units? Note: To determine if a disk unit is a non-configured disk unit, refer to the ?Work with disk unit options? section in the ?DST options? section of the ?DST chapter? in the iSeries Service Functions information. v Yes: Is the partition that is reporting the error the Primary partition? – Yes: Continue with the next step. – No: Go to step 7.v No: Go to step 8 on page 122. 6. Was the load source disk unit migrated from another partition within the same system? v Yes: Is this load source device intended to be the load source for the Primary partition? – Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. – No: Power off the system. Return the original load source disk to the Primary partition and perform a system IPL. This ends the procedure.v No: The load source disk unit has not changed. Contact your next level of support. This ends the procedure. 7. The reporting partition is a secondary partition. Since the last IPL of the reporting partition, have one of the following occurred: v Has the Primary partition time/date been moved backward to a time/date earlier than the previous setting? v Has the system serial number been changed? Chapter 2. Isolation Procedures 121
v Was the load source disk unit in this secondary partition, replaced intentionally with a load source from another system or another partition from the same system?v Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. v No: Contact your next level of support. This ends the procedure. 8. One or more B600 5311 SRCs have been logged in the same partition. Do all of the B600 5311 errors have a resource which is a non-configured disk unit in the partition?Note: To determine if a disk unit is a non-configured disk unit, refer to the ?Work with disk unit options? section in the ?DST options? section of the ?DST chapter? in the iSeries Service Functions information. v Yes: Continue with the next step. v No: None of the conditions in this procedure have been met, call your next level of support. This ends the procedure. 9. Were any disk unit resources associated with the B600 5311 SRCs added to the partition, since the last IPL of this partition? v No: Continue with the next step. v Yes: Perform the following to clear non-configured disk unit configuration data: a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select each unit in the list which is new to the system and press Enter. c. Continue the system IPL. This ends the procedure.10. None of the resources that are associated with the B600 5311 SRCs are disk units that were added to the partition since the last IPL of the partition. Has a scratch install recently been performed on the partition that is reporting the error(s)? v No: Continue with the next step. v Yes: Go to step 13.11. If a scratch install was not performed, was the clear configuration data option recently used to discontinue LPAR use? v Yes: Continue with the next step. v No: The Clear configuration data option was not used. Contact your next level of support. This ends the procedure.12. Perform the following to clear non-configured disk unit configuration data: a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select each unit in the list which is new to the system and press Enter. c. Continue the system IPL. This ends the procedure.13. Was the load source device previously mirrored before the scratch install? v Yes: Continue with the next step. v No: Go to step 15.14. Perform the following to clear the old configuration data from the disk unit that was mirroring the old load source disk a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select the former load source mirror in the list and press Enter.15. Is the Primary partition reporting the B600 5311 error(s)? v No: This ends the procedure. 122 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Yes: Does the customer want multiple partitions on the system? – No: This ends the procedure. – Yes: Use the Recover primary partition configuration data option to retrieve the LPAR configuration data from other devices in the system. a. Go to SST/DST in the primary partition and select Work with system partitions ? Recover configuration data ? Recover primary partition configuration data. The system will perform an automatic IPL. b. Verify the information that appears. - The device should be a former load source device from a secondary partition. - The time and date should reflect a time when that partition was active. It should be more recent than the last change to the logical partition configuration. This ends the procedure. Chapter 2. Isolation Procedures 123
Operations Console isolation procedures These procedures help you to isolate a failure with the Operations Console. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)For direct cable only. The following symptoms can be caused by Operations Console failing to connect: v SRC A600 5008 v The status in the Operations Console window remains ?Connecting console?.The following symptoms can be caused by a defective remote control panel cable: v Remote control panel (hung) System control panel functions. v SRC 0000 DDDD with attention light on the system panel. v Remote control panel accepts mode selections, however the system does not respond. v The remote control panel does not function. v The status in the Operations Console window remains ?Connecting console?.
124 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The following safety notices apply throughout this section. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion tower where the field-replaceable unit (FRU) is located before removing, exchanging, or installing a FRU. “OPCIP03”Use this procedure to isolate a bringup failure with Operations Console.OPCIP03 Use this procedure to isolate a bringup failure with Operations Console. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Use this procedure to isolate an Operations Console bringup failure when the SRC on the panel is A6xx5008 or B6xx5008. If you are not using the Operations Console, see A6005004. This procedure only works with cable-connected and LAN configurations. It is not valid for dial connected configurations. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to Determine if the system has logical partitions before continuing with this procedure. Chapter 2. Isolation Procedures 125
2. Is the SRC on the panel A6xx5008 or B6xx5008? v No: This ends the procedure. v Yes: Are you connecting Operations Console using the ASYNC adapter? Yes: Continue with the next step. No: You are connecting using a LAN adapter. Go to step 6.3. Are words 17, 18, and 19 all equal to 00000000? v Yes: Report the problem to your next level of support. This ends the procedure. v No: Is word 17 equal to 00000001? No: Continue with the next step. Yes: The ASYNC adapter was not detected. Ensure that the ASYNC adapter card is installed, or replace the IOA and try again. This ends the procedure. 4. Is word 17 equal to 00000002? v Yes: On the ASYNC adapter card that was found, no cable was detected. Word 18 contains the card position. Locate the ASYNC adapter card in this card position, and ensure that the external cable is attached. Install or replace the external cable. This ends the procedure. v No: Is word 17 equal to 00000003? No: Continue with the next step. Yes: The cable that was detected does not have the correct cable ID. Word 18 contains the card position. Word 19 contains the cable ID. Locate the ASYNC adapter card in this card position, and verify that the correct cable is attached, or replace the cable. This ends the procedure. 5. Is word 17 equal to 00000004? No: Report the problem to you next level of support. This ends the procedure. Yes: Operations Console failed to make a connection because the port is already being used. Word 18 contains the card position. Disconnect the active communications session and try using the resource again. This ends the procedure.6. Are words 13, 14 and 15 all equal to 00000000? v Yes: Report the problem to you next level of support. This ends the procedure. v No: Is word 13 equal to 00000002? No: Continue with the next step. Yes: The LAN hardware failed to activate. Replace the LAN IOA being used. This ends the procedure.7. Is word 13 equal to 00000003? v No: Continue with the next step. v Yes: A hardware error occurred. Word 14 contains the error code, (example: 53001A80). Word 15 contains the card position. Is the error code equal to 53001A80? Yes: The network cable is not attached to the LAN adapter, the cable is defective, or the network is not operational. This ends the procedure. No: The LAN adapter hardware is not operational. Replace the hardware and try again. This ends the procedure. 8. Is word 13 equal to 00000004? v Yes: The console did not respond. Word 14 contains the number of attempts made. Word 15 contains the card position. The system is inserted into the network but there is no connection to the client (PC). Verify the configuration for the network at the system and client; verify the configuration of Operations Console. This ends the procedure. v No: Is word 13 equal to 00000005? No: Report the problem to your next level support. This ends the procedure. 126 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Yes: IP information was received from the console. Word 14 contains the received IP address. Verify the configuration data for the client (PC) or verify the configuration for the network. This ends the procedure. Chapter 2. Isolation Procedures 127
Power isolation procedures Use power isolation procedures for isolating a problem in the power system. Use isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. Some field replaceable units (FRUs) can be replaced with the unit powered on. Follow the instructions in Chapter 3, “Locating FRUs,” on page 335 when directed to remove, exchange, or install a FRU. The following safety notices apply throughout the power isolation procedures. Read all safety procedures before servicing the system and observe all safety procedures when performing a procedure. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “Power problems” on page 129Use the following table to learn how to begin analyzing a power problem. “PWR1900” on page 141Determine which procedure to use based on the model number. “PWR1905” on page 142A system unit power supply load fault is occurring. 128 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“PWR1906” on page 144The server detected an error in the power system. “PWR1907” on page 146A unit was dropped from the SPCN configuration. “PWR1908” on page 148A power supply fault or load fault has occurred in a 5088 or 0588 expansion unit. “PWR1909” on page 150A power supply load fault is occurring in a system expansion unit or I/O tower. “PWR1911” on page 152You are here because of a power problem on a dual line cord system. If the failing unit does not have a dual line cord, return to the procedure that sent you here or go to the next item in the FRU list. “PWR1917” on page 157This procedure is used to display or change the configuration ID. “PWR1918” on page 159A voltage regulator card might be failing. “PWR1920” on page 160Use this procedure to verify that the lights on the server control panel and the display panel on all attached I/O expansion units are operating correctly.Power problems Use the following table to learn how to begin analyzing a power problem.
Table 17. Analyzing power problems
Symptom
What you should do
System unit does not power on.
See “Cannot power on system unit.”
The processor or I/O expansion unit does not power off.
See “Cannot power off system or SPCN-controlled I/O expansion unit” on page 132.
The system does not remain powered on during a loss of incoming ac voltage and has an uninterruptible power supply (UPS) installed.
Refer to the UPS user’s guide that was provided with your unit.
An I/O expansion unit does not power on.
See “Cannot power on SPCN-controlled I/O expansion unit” on page 136. “Cannot power on system unit”Perform this procedure until you correct the problem and you can power on the system. “Cannot power off system or SPCN-controlled I/O expansion unit” on page 132Use this procedure to analyze a failure of the normal command and control panel procedures to power off the system unit or an SPCN-controlled I/O expansion unit. “Cannot power on SPCN-controlled I/O expansion unit” on page 136You are here because an SPCN-controlled I/O expansion unit cannot be powered on, and might be displaying a 1xxx-C62E reference code.Cannot power on system unit Perform this procedure until you correct the problem and you can power on the system. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Attempt to power on the system (see “Powering on and powering off” on page 588). Does the system power on, and is the system power status indicator light on continuously? Note: The system power status indicator blinks at the slower rate (one blink per two seconds) while powered off, and at the faster rate (one blink per second) during a normal power-on sequence.
Chapter 2. Isolation Procedures 129
No: Continue with the next step. Yes: Go to step 13 on page 131. 2. Are there any characters displayed on the control panel (a scrolling dot may be visible as a character)? No: Continue with the next step. Yes: Go to step 5. 3. Are the mainline ac power cables from the power supply, power distribution unit, or external uninterruptible power supply (UPS) to the customer’s ac power outlet connected and seated correctly at both ends? Yes: Continue with the next step. No: Connect the mainline ac power cables correctly at both ends and go to step 1 on page 129. 4. Perform the following: a. Verify that the UPS is powered on (if it is installed). If the UPS will not power on, follow the service procedures for the UPS to ensure proper line voltage and UPS operation. b. Disconnect the mainline ac power cable or ac power jumper cable from the system’s ac power connector at the system. c. Use a multimeter to measure the ac voltage at the system end of the mainline ac power cable or ac power jumper cable. Note: Some system models have more than one mainline ac power cable or ac power jumper cable. For these models, disconnect all the mainline ac power cables or ac power jumper cables and measure the ac voltage at each cable before continuing with the next step. Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? No: Go to step 8 on page 131. Yes: Continue with the next step. 5. Perform the following: a. Disconnect the mainline ac power cable(s) from the power outlet. b. Exchange the system unit control panel (Un-D1) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? No: Continue with the next step. Yes: The system unit control panel was the failing item. This ends the procedure. 6. Perform the following: a. Disconnect the mainline ac power cable(s) from the power outlet. b. Exchange the power supply or supplies (Un-E1, Un-E2) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? No: Continue with the next step. Yes: The power supply was the failing item. This ends the procedure. 7. Perform the following: a. Disconnect the mainline ac power cable(s). b. Replace the system backplane (Un-P1) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? 130 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Continue with the next step. Yes: The system backplane was the failing item. This ends the procedure. 8. Are you working on a system unit with a power distribution unit with tripped breakers? v No: Continue with the next step. v Yes: Perform the following: a. Reset the tripped power distribution breaker. b. Verify that the removable ac power cable is not the problem. Replace the cord, as installed, if it is defective. c. If the breaker continues to trip, install a new power supply (as installed) in each location until the defective one is found. This ends the procedure. 9. Does the system have an external UPS installed? Yes: Continue with the next step. No: Go to step 11.10. Use a multimeter to measure the ac voltage at the external UPS outlets. Is the ac voltage from 200 V ac to 240 V or from 100 V ac to 127 V ac? No: The UPS needs service. For 9910 type UPS, call IBM Service Support. For all other UPS types, have the customer call the UPS provider. In the meantime, go to step 12 to bypass the UPS. Yes: Exchange the ac power cable, as installed (see Chapter 8, “Part Information,” on page 691). This ends the procedure.11. Perform the following: a. Disconnect the mainline ac power cable from the customer’s ac power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet. Note: Some system models have more than one mainline ac power cable. For these models, disconnect all the mainline ac power cables and measure the ac voltage at all ac power outlets before continuing with this step. Is the ac voltage from 200 V ac to 240 V ac or from 100 V ac to 127 V ac? Yes: Exchange the mainline ac power cable. See the Chapter 8, “Part Information,” on page 691. Then go to step 1 on page 129. No: Inform the customer that the ac voltage at the power outlet is not correct. When the ac voltage at the power outlet is correct, reconnect the mainline ac power cables to the power outlet. This ends the procedure.12. Perform the following to bypass the UPS unit: a. Power off your system and the UPS unit. b. Remove the signal cable used between the UPS and the system. c. Remove any power jumper cords used between the UPS and the attached devices. d. Remove the country or region-specific power cord used from the UPS to the wall outlet. e. Use the correct power cord (the original country or region-specific power cord that was provided with your system) and connect it to the power inlet on the system. Plug the other end of this cord into a compatible wall outlet. f. Attempt to power on the system.Does the power-on standby sequence complete successfully? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: Go to step 5 on page 130.13. Display the selected IPL mode on the system unit control panel. See ’IPL information’ in Progress Codes, SA76-0093. Is the selected mode the same mode that the customer was using when the power-on failure occurred? No: Go to step 15 on page 132. Chapter 2. Isolation Procedures 131
Yes: Continue with the next step.14. Is a function 11 reference code displayed on the system unit control panel? No: Go to step 16. Yes: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.15. Perform the following: a. Power off the system. b. Select the mode on the system unit control panel that the customer was using when the power-on failure occurred. c. Attempt to power on the system.Does the system power on? Yes: Continue with the next step. No: Exchange the system unit control panel (Un-D1) (see “Physical Locations” on page 339). This ends the procedure.16. Continue the IPL. Does the IPL complete successfully? Yes: This ends the procedure. No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.Cannot power off system or SPCN-controlled I/O expansion unit Use this procedure to analyze a failure of the normal command and control panel procedures to power off the system unit or an SPCN-controlled I/O expansion unit. Attention: To prevent loss of data, ask the customer to verify that no interactive jobs are running before you perform this procedure. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Is the power off problem on the system unit? No: Continue with the next step. Yes: Go to step 3. 2. Ensure that the SPCN cables that connect the units are connected and seated correctly at both ends. Does the I/O unit power off, and is the power indicator light blinking slowly? Yes: This ends the procedure. No: Go to step 7 on page 133. 3. Attempt to power off the system. Does the system unit power off, and is the power indicator light blinking slowly? No: Continue with the next step. Yes: The system is not responding to normal power off procedures which could indicate a Licensed Internal Code problem. Contact your next level of support. This ends the procedure. 4. Attempt to power off the system using ASMI. Does the system power off? Yes: The system is not responding to normal power off procedures which could indicate a Licensed Internal Code problem. Contact your next level of support. This ends the procedure. No: Continue with the next step. 5. Attempt to power off the system using the control panel power button. Does the system power off? Yes: Continue with the next step. No: Go to step 10 on page 133. 6. Is there a reference code logged in either ASMI, the control panel, or the HMC that indicates a power problem? Yes: Perform problem analysis for the reference code in the log. This ends the procedure. 132 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Contact your next level of support. This ends the procedure. 7. Is the I/O expansion unit that will not power off part of a shared I/O tower loop? Yes: Go to step 9. No: Continue with the next step. 8. Attempt to power off the I/O expansion unit. Were you able to power off the expansion unit? Yes: This ends the procedure. No: Go to step 10. 9. The unit will only power off under certain conditions: v If the unit is in private mode, it should power off with the system unit that is connected by the SPCN frame-to-frame cable. v If the unit is in switchable mode, it should power off if the ?owning? system is powered off or is powering off, and the system unit that is connected by the SPCN frame-to-frame cable is powered off or is powering off. Does the I/O expansion unit power off? No: Continue with the next step. Yes: This ends the procedure.10. Ensure there are no jobs running on the system or partition, and verify that an uninterruptible power supply (UPS) is not powering the system or I/O expansion unit. Then continue with the next step. 11. Perform the following: a. Remove the system or I/O expansion unit ac power cord from the external UPS or, if an external UPS is not installed, from the customer’s ac power outlet. If the system or I/O expansion unit has more than one ac line cord, disconnect all the ac line cords. b. Exchange the following FRUs one at a time (see “Physical Locations” on page 339 and Chapter 8, “Part Information,” on page 691). If the system unit is failing: 1) Power supply (Un-E1 or Un-E2). Go to step 12 2) System backplane (Un-P1) 3) System control panel (Un-D1) If an I/O expansion unit is failing: 1) Each power supply. Go to step 12 2) I/O backplane 3) I/O backplane in the unit preceding the unit that will not power off 4) SPCN frame-to-frame cable This ends the procedure.12. A power supply might be the failing item. Attention: When replacing a redundant power supply, a 1xxx 1504, 1514, 1524, or 1534 reference code may be logged in the error log. If you just removed and replaced the power supply in the location associated with this reference code, and the power supply came ready after the install, disregard this reference code. If you had not previously removed and replaced a power supply, the power supply did not come ready after installation, or there are repeated fan fault errors after the power supply replacement, continue to follow these steps. Is the reference code 1xxx-15xx? No: Continue with the next step. Yes: Perform the following:a. Find the unit reference code in one of the following tables to determine the failing power supply. b. Ensure that the power cables are properly connected and seated. Chapter 2. Isolation Procedures 133
c. Is the reference code 1xxx-1500, 1510, 1520, or 1530 and is the failing unit configured with a redundant power supply option (or dual line cord feature)? v Yes: Perform “PWR1911” on page 152 before replacing parts. v No: Continue with step 12dd. Refer to “Physical Locations” on page 339 to determine the location and part number of the failing item. e. Replace the failing power supply (see the following tables to determine which power supply to replace). f. If the new power supply does not fix the problem, perform the following : 1) Reinstall the original power supply. 2) Try the new power supply in each of the other positions listed in the table. 3) If the problem still is not fixed, reinstall the original power supply and go to the next FRU in the list. 4) For reference codes 1xxx-1500, 1510, 1520, and 1530, exchange the power distribution backplane if a problem persists after replacing the power supply.
Table 18. Model 8203-E4A
Unit Reference Code
Power Supply
1510, 1511, 1512, 1513, 1514, 7110
E1
1520, 1521, 1522, 1523, 1524, 7120
E2
Table 19. Model 8203-E4A (Quiet office)
Unit Reference Code
Power Supply
1510, 1511, 1512, 1513, 1514, 7110
E1 Attention: For reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts.
Table 20. 5088, 0588 expansion units
Unit Reference Code
Power Supply
1510, 1511, 1512, 1513, 1514, 1516
P02
1520, 1521, 1522, 1523, 1524, 1526
P01 Attention: On a dual line cord system, for reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts. On a single line cord system, check the ac jumper to the power supply before replacing parts.
Table 21. 5094, 5294 I/O expansion units (single line cord)
Unit Reference Code
Power Supply
1510, 1511, 1512, 1513, 1514, 1516
P01
1520, 1521, 1522, 1523, 1524, 1526
P02
1530, 1531, 1532, 1533, 1534, 1536
P03 Attention: On a dual line cord system, for reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts. On a single line cord system, check the ac jumper to the power supply before replacing parts.
134 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Attention: For 5094, 5294 expansion units, do not install power supplies P00 and P01 ac jumper cables on the same ac input module.
Table 22. 5094, 5294 I/O expansion units (dual line cord)
Unit Reference Code
Power Supply
1500, 1501, 1502, 1503
P00
1510, 1511, 1512, 1513, 1514, 1516
P01
1520, 1521, 1522, 1523, 1524, 1526
P02
1530, 1531, 1532, 1533
P03
Table 23. 5095, 0595, 5790, 7311-D11, 7311-D20, 7314-G30 expansion units
Unit Reference Code
Power Supply
1510, 1511, 1512, 1513, 1514, 1516, 1517
P01/E1
1520, 1521, 1522, 1523, 1524, 1526, 1527
P02/E2 This ends the procedure. 13. Is the reference code 1xxx-2600, 2603, 2605, or 2606? v No: Continue with the next step. v Yes: Perform the following:a. Refer to “Physical Locations” on page 339 to determine the location and part number of the failing item. b. Replace the failing power supply. c. Perform the following if the new power supply does not fix the problem: 1) Reinstall the original power supply. 2) Try the new power supply in each of the other positions listed in the table. 3) If the problem still is not fixed, reinstall the original power supply and go to the next FRU in the list. Attention: Do not install power supplies P00 and P01 ac jumper cables on the same ac input module.
Table 24. Failing power supplies
System or Feature Code
Failing Power Supply
8203-E4A, 9407-M15, or 9408-M25
Un-E1, Un-E2
5094, 5294 (single line cord)
P01, P02, P03
5094, 5294 (dual line cord)
P00, P01, P02, P03
5088, 0588
P02, P01
5095, 0595
P01, P02
5790, 7311-D11, 7311-D20, 7314-G30
E1, E2 This ends the procedure.14. Is the reference code 1xxx 8455 or 8456? v No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. v Yes: One of the power supplies is missing, and must be installed. Use the following table to determine which power supply is missing, and install the power supply ( see “Physical Locations” on page 339 to determine the part number and exchange procedure).
Chapter 2. Isolation Procedures 135
Table 25. Missing power supplies
Reference Code
Missing Power Supply
1xxx 8455
Un-E1
1xxx 8456
Un-E2 This ends the procedure.Cannot power on SPCN-controlled I/O expansion unit You are here because an SPCN-controlled I/O expansion unit cannot be powered on, and might be displaying a 1xxx-C62E reference code. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Power on the system. 2. Starting from SPCN 0 or SPCN 1 on the system unit (see “Physical Locations” on page 339), go to the first unit in the SPCN frame-to-frame cable sequence that does not power on. Is the Data display background light on, or is the power-on LED blinking, or are there any characters displayed on the I/O expansion unit display panel?Note: The background light is a dim yellow light in the Data area of the display panel. Yes: Go to step 12 on page 138. No: Continue with the next step. 3. Use a multimeter to measure the ac voltage at the customer’s ac power outlet. Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? v Yes: Continue with the next step. v No: Inform the customer that the ac voltage at the power outlet is not correct. This ends the procedure. 4. Is the mainline ac power cable from the ac module, power supply, or power distribution unit to the customer’s ac power outlet connected and seated correctly at both ends? v Yes: Continue with the next step. v No: Connect the mainline ac power cable correctly at both ends. This ends the procedure. 5. Perform the following: a. Disconnect the mainline ac power cable from the ac module, power supply, or power distribution unit. b. Use a multimeter to measure the ac voltage at the ac module, power supply, or power distribution unit end of the mainline ac power cable.Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? No: Continue with the next step. Yes: Go to step 7 on page 137. 6. Are you working on a power distribution unit with tripped breakers? v No: Exchange the mainline ac power cable or power distribution unit (as installed). This ends the procedure. v Yes: Perform the following: a. Reset the tripped power distribution breaker. b. Verify that the removable ac line cord is not the problem. Replace the cord, as installed, if it is defective. c. Install a new power supply (as installed) in all power locations until the defective one is found.
136 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
This ends the procedure. 7. Does the unit you are working on have ac power jumper cables installed? Note: The ac power jumper cables connect from the ac module, or the power distribution unit to the power supplies. Yes: Continue with the next step. No: Go to step 11. 8. Are the ac power jumper cables connected and seated correctly at both ends? v Yes: Continue with the next step. v No: Connect the ac power jumper cables correctly at both ends. This ends the procedure. 9. Perform the following: a. Disconnect the ac power jumper cables from the ac module, or power distribution unit. b. Use a multimeter to measure the ac voltage at the ac module or power distribution unit (that goes to the power supplies).Is the ac voltage at the ac module or power distribution unit from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? v Yes: Continue with the next step. v No: Exchange the following as they are installed (see Chapter 8, “Part Information,” on page 691): – ac module – Power distribution unit This ends the procedure.10. Perform the following: a. Connect the ac power jumper cables to the ac module, or power distribution unit. b. Disconnect the ac power jumper cable at the power supplies. c. Use a multimeter to measure the voltage of the power jumper cables input to the power supplies.Is the voltage 200 V ac to 240 V ac or 100 V ac to 127 V ac for each power jumper cable? v No: Exchange the power jumper cable. This ends the procedure. v Yes: Exchange the following parts, as installed, one at a time: a. One of the FRUs listed in the following table, according to the enclosure:
Table 26. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane b. Display unit c. Power supply 1 d. Power supply 2 e. Power supply 3 This ends the procedure.11. Perform the following: Chapter 2. Isolation Procedures 137
a. Disconnect the mainline ac power cable (to the expansion unit) from the customer’s ac power outlet. b. Exchange the following FRUs, one at a time: v Power supply. v One of the FRUs listed in the following table, according to the enclosure:
Table 27. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane c. Reconnect the mainline ac power cables (from the expansion unit) into the power outlet. d. Attempt to power on the system.Does the expansion unit power on? v Yes: The unit you exchanged was the failing item. This ends the procedure. v No: Repeat this step and exchange the next FRU in the list. If you have exchanged all of the FRUs in the list, ask your next level of support for assistance. This ends the procedure.12. Is there a reference code displayed on the display panel for the I/O unit that does not power on? v Yes: Continue with the next step. v No: Replace the FRU listed in the following table, according to the enclosure:
Table 28. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane This ends the procedure.13. Is the reference code 1xxx xx2E? v Yes: Continue with the next step. v No: Use the new reference code and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.14. Do the SPCN optical cables (A) connect the failing unit (B) to the preceding unit in the chain or loop?
.---------. (A) SPCN
| System | Optical Cables -. .----- SPCN
| Unit | | V Optical Adapter
| SPCN 0 | .-. V .-.
’----.----’ | +------------+ |
| | +------------+ |
.----’----. .-------’-+ .-------’-+ .---------.
| J15 | |Sec J16| |Sec J15| | Sec |
138 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
|Sec Unit +-+UNIT J15| |Unit J16+-+J15 Unit |
| 1 | | 2 | | 3 | | 4 |
’---------’ ’---------’ ’---------’ ’---------’
^
|
’---- (B) Failing Unit
Yes: Continue with the next step. No: Go to step 18 on page 140.15. Remove the SPCN optical adapter (A) from the preceding frame in the string that cannot become powered on. .---------. .--- (A) SPCN Optical Adapter | System | | | Unit | V | SPCN 0 | .-. .-. ’----.----’ | +------------+ | 322 | +------------+ | .----’----. .-------’-+ .-------’-+ .---------. | J15 | |Sec J16| |Sec J15| | Sec | |Sec Unit +-+Unit J15| |Unit J16+-+J15 Unit | | 1 | | 2 | | 3 | | 4 | ’---------’ ’---------’ ’---------’ ’---------’ ^ | ’-- Failing Unit 16. Perform the following: Notes: a. The cable may be connected to either J15 or J16. b. Use an insulated probe or jumper when performing the voltage readings.a. Connect the negative lead of a multimeter to the system frame ground. b. Connect the positive lead of a multimeter to pin 2 of the connector from which you removed the SPCN optical adapter in the previous step of this procedure. c. Note the voltage reading on pin 2. d. Move the positive lead of the multimeter to pin 3 of the connector or SPCN card. e. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v Yes: Continue with the next step. v No: Exchange the FRU listed in the following table, according to the enclosure:
Table 29. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane This ends the procedure.17. Exchange the following FRUs, one at a time: Chapter 2. Isolation Procedures 139
a. In the failing unit (first frame with a failure indication), replace the following:
Table 30. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane b. In the preceding unit in the string, replace the FRU in Table 30. c. SPCN optical adapter (A) in the preceding unit in the string. d. SPCN optical adapter (B) in the failing unit. e. SPCN optical cables (C) between the preceding unit in the string and the failing unit.This ends the procedure.
(A) SPCN Optical .----- (C) SPCN Adapter ----. | Optical Cables | | | | .-- (B) SPCN .---------. | | | Optical | System | | | | Adapter | Unit | V | V | SPCN 0 | .-. V .-. ’----.----’ | +------------+ | | | +------------+ | .----’----. .-------’-+ .-------’-+ .---------. |Sec J15 | | Sec J16| |Sec J15| | Sec | |Unit J16+-+J15 Unit | |Unit J16+-+J15 Unit | | 1 | | 2 | | 3 | | 4 | ’---------’ ’---------’ ’---------’ ’---------’ ^ | | ’--- Failing Unit 18. Perform the following: a. Power off the system. b. Disconnect the SPCN frame-to-frame cable from the connector of the first unit that cannot be powered on. c. Connect the negative lead of a multimeter to the system frame ground. d. Connect the positive lead of the multimeter to pin 2 of the SPCN cable. Note: Use an insulated probe or jumper when performing the voltage readings. e. Note the voltage reading on pin 2. f. Move the positive lead of the multimeter to pin 3 of the SPCN cable. g. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v No: Continue with the next step. v Yes: Exchange the following FRUs one at a time: 140 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. In the failing unit, replace the FRU listed in the following table:
Table 31. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane b. In the preceding unit in the string, replace the FRU in Table 31. c. SPCN frame-to-frame cable. This ends the procedure.19. Perform the following: a. Follow the SPCN frame-to-frame cable back to the preceding unit in the string. b. Disconnect the SPCN cable from the connector. c. Connect the negative lead of a multimeter to the system frame ground. d. Connect the positive lead of a multimeter to pin 2 of the connector. Note: Use an insulated probe or jumper when performing the voltage readings. e. Note the voltage reading on pin 2. f. Move the positive lead of the multimeter to pin 3 of the connector. g. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v Yes: Exchange the following FRUs one at a time: a. SPCN frame-to-frame cable. b. In the failing unit, replace the following:
Table 32. Enclosures FRU to be exchanged
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane c. In the preceding unit in the string, replace the FRU in Table 32. This ends the procedure. v No: Exchange the FRU in Table 32 from the unit from which you disconnected the SPCN cable in the previous step of this procedure. This ends the procedure.PWR1900 Determine which procedure to use based on the model number. Follow the instructions for the model or expansion unit you are servicing. For models 8203-E4A, 9407-M15, and 9408-M25 perform “PWR1905” on page 142. Chapter 2. Isolation Procedures 141
For 5094 and 5294 units, perform “PWR1906” on page 144. For 5088 and 0588 units, perform “PWR1908” on page 148. For 5095, 0595, 5790, 5796, 7314-G30. 7311-D11, and 7311-D20 units, perform “PWR1909” on page 150. This ends the procedure. PWR1905 A system unit power supply load fault is occurring. Refer to “Power isolation procedures” on page 128 for important safety information before servicing the system. Instructions for Models 8203-E4A, 9407-M15, or 9408-M25 1. Is the reference code 1xxx 1B01? Yes: Continue with the next step. No: This procedure only isolates problems that cause 1xxx 1B01 to be logged. Return to the procedure that sent you here. This ends the procedure. 2. Perform the following: a. Power off the system and disconnect the ac power cable from the unit you are working on. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) by sliding them partially out of the system unit (see “Physical Locations” on page 339). c. Remove and label all cards (for example, PCI adapters, GX adapter(s), RIO/HSL, and RAID cards if installed). d. Reconnect the ac power cable or cables to the unit you are working on. e. Power on the system (see “Powering on and powering off” on page 588).Does a power reference code occur? Yes: Continue with the next step. No: Go to step 5. 3. Perform the following: a. Power off the system. b. Replace one of the system fans (see “Physical Locations” on page 339). c. Power on the system (see “Powering on and powering off” on page 588).Does a power reference code occur? Yes: Continue with the next step. No: The fan you just replaced was the failing item. This ends the procedure. 4. Have you tried replacing all the fans? v Yes: Reinstall the fan you just replaced in step 3 and continue with the next step. v No: Perform the following: a. Power off the system. b. Reinstall the fan that you just removed in step 3 to its original location. c. Repeat step 3 5. Perform the following: a. Power off the system. b. Reinstall all the cards (PCI adapters, memory DIMMs, GX adapter(s), RIO/HSL and RAID cards) you removed in step 2 into their original locations. c. Power on the system.Does a power reference code occur? 142 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Yes: Continue with the next step. No: Go to step 8. 6. Perform the following: a. Power off the system. b. Disconnect one of the cards you reinstalled in step 5 on page 142. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Exchange the last card you disconnected in this step (see “Physical Locations” on page 339). This ends the procedure. 7. Have you disconnected all the cards? No: Repeat step 6. Yes: Reinstall all of the parts removed or exchanged in this procedure and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. 8. Perform the following: a. Power off the system. b. Reconnect all of the I/O devices (tape, diskette, optical, and disk units) that you disconnected in step 2 on page 142. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: The problem has been resolved. This ends the procedure. 9. Perform the following: a. Power off the system. b. Disconnect one of the I/O devices (tape, diskette, optical, and disk units) that you reconnected in step 8. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Exchange the last I/O device you disconnected in this step (see “Physical Locations” on page 339). This ends the procedure.10. Have you tried disconnecting all of the I/O devices? No: Repeat step 9. Yes: Continue with the next step.11. Replace the system backplane (Un-P1). Reinstall all of the parts you have removed or exchanged in this procedure. Does a power reference code occur? Yes: Continue with the next step. No: The system is fixed. This ends the procedure.12. Does the system contain only one power supply? Yes: Continue with the next step. No: Go to step 14 on page 144.13. Replace the power supply. Does a power reference code occur? Yes: Contact your service support.This ends the procedure. No: The system is fixed. This ends the procedure.
Chapter 2. Isolation Procedures 143
14. Power off the system. Remove one of the power supplies, then power on the system. Does a power reference code occur? Yes: Continue with the next step. No: The power supply you just removed is defective. Replace it. This ends the procedure.15. Power off the system. Reinstall the power supply that was removed in step 14, and remove the other power supply. Power on the system. Does a power reference code occur? Yes: Continue with the next step. No: The power supply you just removed is defective. Replace it. This ends the procedure.16. Power off the system. Replace both power supplies. Power on the system. Does a power reference code occur? Yes: Contact your service support.This ends the procedure. No: The system is fixed. This ends the procedure.PWR1906 The server detected an error in the power system. Please see “Power isolation procedures” on page 128 for important safety information before servicing the system. PWR1906 Instructions for 5094 and 5294 expansion units 1. Perform the following: a. Power off the unit you are working on. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) from the unit that you are working on by sliding them partially out of the unit. See the service guide for the unit you are servicing. c. Remove and label all of the cards that are installed in the PCI adapter slots. d. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 9 on page 145. 2. Perform the following: a. Power off the unit you are working on. b. Remove, in order, all power supplies except the first one (either P00 or P01 depending on the configuration). c. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 4 on page 145. 3. Perform the following: a. Power off the unit you are working on. b. Remove the power supply that was left installed in step 2. c. Reconnect the next power supply in order (P01 or P02). d. Power on the unit you are working on.Does a power reference code occur? v Yes: Go to step 5 on page 145. v No: Exchange the power supply you removed in step 3b. 144 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
This ends the procedure. 4. Perform the following: a. Power off the unit you are working on. b. Reconnect the next power supply in order. c. Power on the unit you are working on.Does a power reference code occur? v No: Repeat this step until all power supplies have been reconnected. This ends the procedure. v Yes: Exchange the power supply that you reconnected in this step. This ends the procedure. 5. Perform the following: a. Remove one of the fans from the unit you are working on that you did not previously remove during this procedure. Note: Disregard a fan reference code if it occurs during this step. b. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan that you removed in this step is the failing item. This ends the procedure. 6. Have you removed all of the fans, one at a time? v Yes: Install all of the fans and continue with the next step. v No: Perform the following: a. Power off the unit you are working on. b. Reinstall the fan that was removed in step 5 to its original location. c. Repeat step 5. 7. Perform the following: a. Remove the power cable (that was not previously removed) from one of the lower DASD device boards (CB1 or CB2). b. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: Replace the DASD device board that the power cable was removed from in this step. This ends the procedure. 8. Is a second lower DASD backplane installed and its power cable connected? v Yes: Repeat step 7. v No: Replace the following one at a time: a. DASD upper device board (DB3) b. I/O enclosure backplane (CB1). This ends the procedure. 9. Perform the following: a. Power off the unit you are working on. b. Reinstall all of the cards that you removed in step 1 on page 144. c. Reconnect the ac power cable to the unit that you are working on. d. Power on the unit you are working on.Does a power reference code occur? Chapter 2. Isolation Procedures 145
Yes: Continue with the next step. No: Go to step 12.10. Perform the following: a. Power off the unit you are working on. b. Disconnect one of the cards that you connected in step 9 on page 145. c. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: Exchange the last card that you disconnected in this step. This ends the procedure.11. Have you disconnected all the cards? Yes: Continue with the next step. No: Repeat step 10.12. Perform the following: a. Power off the unit you are working on. b. Reconnect all of the I/O devices (tape, diskette, optical, or disk units) that you disconnected in step 1 on page 144. c. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: This ends the procedure.13. Perform the following: a. Power off the unit you are working on. b. If you have not already done so, disconnect all of the I/O devices (tape, diskette, optical, or disk units) that you reconnected in step 12. c. Reconnect one of the I/O devices (tape, diskette, optical, or disk units) that you just disconnected. d. Power on the unit you are working on.Does a power reference code occur? v No: Continue with the next step. v Yes: Exchange the last I/O device that you reconnected in this step. This ends the procedure.14. Have you reconnected all the I/O devices? Yes: This ends the procedure. No: Repeat step 13 (you can skip part b).PWR1907 A unit was dropped from the SPCN configuration. 1. Is the reference code you are working with 1xxx 913B? v No: Continue with the next step. v Yes: A system power control network (SPCN) firmware update is needed, but not started due to the SPCN firmware update policy setting. A manual update needs to be started.Notes: – Do not perform maintenance on an expansion unit or modify the SPCN network while the SPCN firmware update is being performed. 146 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
– Performing firmware updates or powering off the system will interrupt SPCN firmware updates and the firmware update will need to be started again after these actions.a. Access the Advanced System Management Interface (ASMI) and select System Configuration and then Configure I/O Enclosures. b. Record the current SPCN firmware update policy setting so it can be restored later. c. Change the SPCN firmware update policy setting to expanded and click Save Policy Setting to allow for SPCN firmware updates to be done over the RIO/HSL and serial SPCN interfaces. d. Select Start SPCN Firmware Update. The SPCN firmware will now be downloaded to the expansion units that require an update. e. Change the SPCN firmware update policy setting back to what it was originally set to in step 1b and click on Save Policy Setting. Note: The SPCN firmware update can be stopped using the Stop SPCN Firmware Update button. However, to allow the expansion units to be updated to the latest SPCN firmware level, it is recommended that the firmware update be allowed to complete. The progress of the SPCN firmware update can be monitored by clicking on Configure I/O Enclosures to update the screen. Do not use the browser Back or Refresh button to monitor the update progress. The Power Control Network Firmware Update Status column shows the percentage complete and In Progress is displayed while the download is progresses. Not Required is displayed when the download process completes. This ends the procedure. 2. Is the reference code you are working with 1xxx 90F0? v No: Contact your next level of support. v Yes: A unit was dropped from the SPCN configuration. This can be caused by any of the following: – The rack or unit has lost all ac or dc power. – The SPCN function in the unit has an error. – The SPCN frame-to-frame cables, RIO cable or the remote I/O (RIO) adapter card has failed.3. Using the HMC or ASMI, find the 1xxx 90F0 SRC in the error log (see “Displaying error and event logs” on page 649). Use the option Show details to display the location information for the failing unit. 4. After locating the failing unit, ensure that the SPCN frame-to-frame cable and RIO cables are seated correctly; reseat the cables if necessary. Are the cables connected correctly? v No: Correctly reconnect the cables, or replace them if necessary. This ends the procedure. v Yes: Continue with the next step.5. Are the ac line cords on the failing unit connected properly at both ends? v No: Reconnect the ac line cords, or replace them if necessary. This ends the procedure. v Yes: Continue with the next step.6. Check the voltage at the customer’s ac outlet. Is the voltage correct? v No: Inform the customer that the voltage at the ac power outlet is incorrect. This ends the procedure. v Yes: Continue with the next step.7. Are the power supplies functional? v No: Perform the following: a. Refer to Chapter 3, “Locating FRUs,” on page 335 to determine the location and part number for each power supply, and to find the appropriate procedure for exchanging the power supplies. b. Replace each power supply one at a time, until the problem has been resolved. Chapter 2. Isolation Procedures 147
c. If the problem persists after replacing all of the power supplies, continue with the next step.v Yes: Continue with the next step.8. Use the following table to determine the FRUs to replace. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing FRUs. This ends the procedure.
Table 33. Expansion enclosures FRUs
Enclosure
FRU
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane PWR1908 A power supply fault or load fault has occurred in a 5088 or 0588 expansion unit. For important safety information before servicing the system, see “Power isolation procedures” on page 128. For location information, see the service guide for the enclosure you are working on. PWR1908 instructions for 5088 or 0588 expansion units Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Perform the following: a. Power off the frame that you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove and label all cards installed in the PCI backplane area. c. Power on the frame by reconnecting the ac jumper cable to the unit.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 8 on page 149.2. Is the reference code 1xxx 2603? v No: Continue with the next step. v Yes: Replace the PCI backplane. This ends the procedure.3. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P01. c. Remove fan assembly B01 from power supply P01 and install it on a new power supply P01. d. Install the new power supply P01. e. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that you replaced in this step was the failing item. This ends the procedure.4. Perform the following: 148 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove the new power supply P01 and replace it with the original power supply P01. c. Remove power supply P02. d. Remove fan assembly B02 from power supply P02 and install it on a new power supply P02. e. Install the new power supply P02. f. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that you replaced in this step was the failing item. This ends the procedure.5. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove both the new power supply P02 and the fan assembly B02. c. Reinstall the original power supply P02 and a new fan assembly B02. d. Power on the frame by reconnecting the ac power cables to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan assembly B02 that you replaced in this step was the failing item. This ends the procedure.6. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P02. c. Replace the new fan assembly B02 with the original fan assembly B02. d. Reinstall power supply P02. e. Remove power supply P01. f. Remove fan assembly B01 and replace it with a new fan assembly B01. g. Reinstall power supply P01. h. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan assembly B01 that you removed in this step is the failing item. This ends the procedure.7. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P01. c. Replace fan assembly B01 with the original fan assembly B01. d. Reinstall power supply P01. e. Replace the following FRUs one at a time: v Display panel. v PCI backplane assembly CB1. This ends the procedure.8. Perform the following: Chapter 2. Isolation Procedures 149
a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Reinstall one of the cards that you removed in step 1 on page 148. c. Power on the frame by reconnecting the ac power cables to the unit you are working on.Does a power reference code occur? v No: Continue with the next step. v Yes: Exchange the last card that you reinstalled in this step. This ends the procedure.9. Have you reinstalled all of the cards? v Yes: Replace the PCI backplane assembly CB1. This ends the procedure. v No: Repeat step 8 on page 149, reinstalling the next card.PWR1909 A power supply load fault is occurring in a system expansion unit or I/O tower. For important safety information before servicing the system, refer to “Power isolation procedures” on page 128. Instructions for 5095, 0595, 5790, 5796, 7311-D11, 7311-D20 and 7314-G30 Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Perform the following: a. Power off the system. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) from the expansion unit or I/O tower you are working on by sliding them partially out of the unit (see “Physical Locations” on page 339). c. Remove and label all cards installed in the PCI adapters area. d. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 8 on page 151. 2. Perform the following: a. Power off the system. b. Remove one of the fans from the expansion unit or I/O tower that you have not previously removed during this procedure. Note: If a fan reference code occurs during this step, ignore it. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan you removed in this step is the failing item. This ends the procedure. 3. Have you removed all of the fans one at a time? v No: Perform the following: a. Power off the system. b. Reinstall the fan that you removed in step 2 into its original location. c. Repeat step 2.
150 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Yes: Reinstall all of the fans and continue with the next step. 4. Perform the following: Note: If there are no DASD installed in this enclosure, go to step 6. a. Power off the system. b. Remove the I/O tower power supply cable, at the DASD backplane, that you have not previously removed. c. Power on the system.Does a power reference code occur? v No: The DASD backplane that was disconnected in this step is the failing item. This ends the procedure. v Yes: Continue with the next step. 5. Have you disconnected the power cables from each of the DASD backplanes one at a time? Yes: Continue with the next step. No: Repeat step 4. 6. Perform the following: a. Power off the system. b. Remove a power supply that you have not previously removed, and replace it with a new one. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that was removed in this step is the failing item. This ends the procedure. 7. Have you removed all of the power supplies one at a time? v Yes: Perform the following: a. Remove the new power supply that you installed in step 6 and reinstall the original power supply. b. Replace the I/O backplane if you are working on a 7311-D11, 7314-G30, 5790 or 5796. Replace the tower backplane if you are working on a 5095 or 0595. This ends the procedure.v No: Remove the new power supply that you installed in step 6 and reinstall the original power supply. Then, repeat step 6. 8. Perform the following: a. Power off the system. b. Reinstall all of the cards you removed in step 1 on page 150. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 11 on page 152. 9. Perform the following: a. Power off the system. b. Disconnect one of the cards you reconnected in step 8. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. Chapter 2. Isolation Procedures 151
v No: Exchange the last card you disconnected in this step (see Chapter 8, “Part Information,” on page 691) Reinstall all the parts. This ends the procedure.10. Have you disconnected all the cards? v No: Repeat step 9 on page 151. v Yes: Reinstall all the parts and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.11. Perform the following: a. Power off the system. b. Reconnect all of the I/O devices (tape, diskette, optical, or disk units) that you disconnected in step 1 on page 150. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: This ends the procedure.12. Perform the following: a. Power off the system. b. Disconnect one of the I/O devices you reconnected in step 11. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: Exchange the last I/O device you disconnected in this step. This ends the procedure.13. Have you disconnected all of the I/O devices? v No: Repeat step 12. v Yes: Reinstall all the parts and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.PWR1911 You are here because of a power problem on a dual line cord system. If the failing unit does not have a dual line cord, return to the procedure that sent you here or go to the next item in the FRU list. The following steps are for the system unit, unless other instructions are given. For important safety information before servicing the system, refer to “Power isolation procedures” on page 128. 1. If an uninterruptible power supply is installed, verify that it is powered on before proceeding. 2. Are all the units powered on? v Yes: Go to step 7 on page 154. v No: On the unit that does not power on, perform the following: a. Disconnect the ac line cords from the unit that does not power on. b. Use a multimeter to measure the ac voltage at the system end of both ac line cords.
Table 34. Correct ac voltage
Model or expansion unit
Correct ac voltage
Model 8203-E4A and 5095, 0595, and 7311-D20 expansion units
100 V to 127 V or 200 V to 240 V
5796 and 7314-G30 expansion units
200 V to 240 V 152 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 34. Correct ac voltage (continued)
Model or expansion unit
Correct ac voltage
5088, 0588, 5094, 5294, 5790, and 7311-D11 expansion units
200 V to 240 V ac or dc c. Is the voltage correct (refer to Table 34 on page 152)? Yes: Continue with the next step. No: Go to step 6 on page 154. 3. Are you working on a model 8203-E4A or a 5095, 0595, 5790, 5796, 7314-G30, 7311-D11, or 7311-D20 expansion unit? v No: Continue with the next step. v Yes: Perform the following: a. Reconnect the ac line cords. b. Verify that the failing unit fails to power on. c. Replace the failing power supply. Use the table below to determine which power supply needs replacing, and then see Chapter 3, “Locating FRUs,” on page 335 for its location, part number, and exchange procedure.
Table 35. Failing power supply for models 8203-E4A and 5095, 0595, 5790, 5796, 7314-G30, 7311-D11 or 7311-D20 expansion units
Reference code
Models or expansion units
Failing item name
1510
8203-E4A
Power supply 1
5095, 0595, 5790, 5796, 7314-G30, 7311-D11, 7311-D20
Power supply 1
1520
8203-E4A
Power supply 2
5095, 0595, 5790, 5796, 7314-G30, 7311-D11, 7311-D20
Power supply 2 This ends the procedure. 4. Perform the following: a. Reconnect the ac line cord to the ac modules. b. Remove the ac jumper cables at the power supplies. c. Use a multimeter to measure the ac voltage at the power-supply end of the jumper cable.Is the ac voltage from 200 V to 240 V? v No: Continue with the next step. v Yes: Replace the failing power supply. Use the following table to determine which power supply must be replaced, and then see Chapter 3, “Locating FRUs,” on page 335 for its location, part number, and exchange procedure. Attention: Do not install power supplies P00 and P01 ac jumper cables on the same ac module.
Table 36. Failing power supply for 5094, 5294, 5088 or 0588 expansion units
Reference code
Failing item name
1500
Power supply 0
1510
Power supply 1 for 5094 or 5294
Power supply 2 for 5088 or 0588
1520
Power supply 2 for 5094 or 5294
Power supply 1 for 5088 or 0588
1530
Power supply 3
Chapter 2. Isolation Procedures 153
This ends the procedure. 5. Perform the following: a. Disconnect the ac jumper cable at the ac module output. b. Use a multimeter to measure the ac voltage at the ac module output.Is the ac voltage from 200 V to 240 V? v Yes: Exchange the ac jumper cable. This ends the procedure. v No: Exchange the ac module (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure. 6. Perform the following: a. Disconnect the ac line cords from the customer’s ac power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet.Is the ac voltage correct (refer to Table 34 on page 152)? v Yes: Exchange the failing ac line cord. This ends the procedure. v No: Perform the following: a. Inform the customer that the ac voltage at the power outlet is not correct. b. Reconnect the ac line cords to the power outlet after the ac voltage at the power outlet is correct. This ends the procedure. 7. Is the reference code 1xxx00AC? v No: Continue with the next step. v Yes: This reference code may have been caused by an ac outage. If the system will power on without an error, no parts need to be replaced. This ends the procedure. 8. Is the reference code 1xxx1510 or 1520? v No: Continue with the next step. v Yes: Perform the following: a. Use the following table, figures and Chapter 3, “Locating FRUs,” on page 335 to locate the failing parts.
Table 37. Power Reference Code Table
Model or expansion unit
Reference code
Locate these parts
8203-E4A
1xxx 1510
Power supply E1 and ac line cord 1
1xxx 1520
Power supply E2 and ac line cord 2
5088 and 0588 (see Figure 2 on page 156)
1xxx 1510
ac jumper cable connected to power supply 2 and the ac module
1xxx 1520
ac jumper cable connected to power supply 1 and the ac module 154 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 37. Power Reference Code Table (continued)
Model or expansion unit
Reference code
Locate these parts
5294 and 5094 (see Figure 1)
1xxx 1500
ac jumper cable connected to power supply 0 and the ac module
1xxx 1510
ac jumper cable connected to power supply 1 and the ac module
1xxx 1520
ac jumper cable connected to power supply 2 and the ac module
1xxx 1530
ac jumper cable connected to power supply 3 and the ac module
5095 and 0595
1xxx 1510
Power supply 1 and ac line cord 1
1xxx 1520
Power supply 2 and ac line cord 2
5790, 5796, 7314-G30, 7311-D11, 7311-D20
1xxx 1510
Power supply 1 and ac line cord 1
1xxx 1520
Power supply 2 and ac line cord 2 Figure 1. Dual line cord drawing for 5094
Chapter 2. Isolation Procedures 155
b. Locate the ac line cord or the ac jumper cable for the reference code you are working on. c. Go to step 10. 9. Is the reference code 1xxx 1500 or 1xxx 1530? v No: Perform Problem Analysis using the reference code. This ends the procedure. v Yes: Locate the ac jumper cables for the reference code you are working on (see Table 37 on page 154), and then continue with the next step: – If the reference code is 1xxx 1500, determine the locations of ac jumper cables that connect to power supply P00 (see the preceding figures). – If the reference code is 1xxx 1530, determine the locations of ac jumper cables that connect to power supply P03 (see the preceding figures).10. Perform the following: Attention: Do not disconnect the other system line cord when powered on. Attention: Do not disconnect the other ac jumper cable when powered on. a. For the reference code you are working on, disconnect either the ac jumper cable or the ac line cord from the power supply. b. Use a multimeter to measure the ac voltage at the power supply end of the ac jumper cable or the ac line cord. Figure 2. Dual line cord drawing for 5088, or 0588 installed on 5094
156 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Is the ac voltage correct (see Table 34 on page 152)? No: Continue with the next step. Yes: Exchange the failing power supply. Refer to Table 35 on page 153 and Table 36 on page 153 for its position, and then see Chapter 3, “Locating FRUs,” on page 335 for part numbers and directions to the correct exchange procedures. This ends the procedure.11. Perform the following: a. Disconnect the ac line cords from the power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet.Is the ac voltage correct (see Table 34 on page 152)? v Yes: Exchange the following, one at a time: – Failing ac line cord – Failing ac jumper cable (if installed) – Failing ac module (if installed) (see Chapter 3, “Locating FRUs,” on page 335 for part numbers and directions to the correct exchange procedures) This ends the procedure. v No: Perform the following: a. Inform the customer that the ac voltage at the power outlet is not correct. b. Reconnect the ac line cords to the power outlet after the ac voltage at the power outlet is correct. This ends the procedure.PWR1917 This procedure is used to display or change the configuration ID. 1. Use either the Advanced System Management Interface (ASMI) or the control panel to display and change the configuration ID. v If you are using the ASMI, refer to the operations guide for your system and use the instructions for Changing system configuration. Use Table 38 on page 158 to find the correct configuration ID. v If you are using the control panel, continue with the next step.2. Perform the following to display the configuration ID: Attention: The system or unit that will display the ID must be powered off with ac power applied.Notes: v If you have just restored power to the system, the service processor must return to standby before control panel functions will work correctly. Returning the service processor to standby takes a few minutes after the panel appears to be operational. v You must have the panel in manual mode to access function 7 options.a. Select function 07 on the system control panel. Press Enter (07** will be displayed). b. Use the arrow keys to increment or decrement to subfunction A8. 07A8 will be displayed. Press Enter (07A8 00 will be displayed). c. Use the arrow keys to increment or decrement to the first byte of the unit address (usually 3C) for the box you want to check. 07nn will be displayed, where nn is the first byte of the unit address. d. Press Enter (073C 00, for example, will be displayed). e. Use the arrow keys to increment or decrement to the second byte of the unit address (usually 01, 02, etc for I/O expansion units) for the box you want to check. 07nn will be displayed, where nn is the second byte of the unit address (0701, for example, for a unit). Press Enter (0701 00, for example, will be displayed).Note: The display on the addressed I/O expansion unit being addressedshould be blinking on and off while displaying the configuration ID as the last two characters of the bottom line.
Chapter 2. Isolation Procedures 157
f. Use the following table to check the unit configuration ID.
Table 38. Unit Configuration IDs
Model or expansion unit
Configuration ID
5088 and 0588
89
5094 and 5294
8A
5095 and 0595
8B
7311-D11, 5790
88
7311-D20
8C
7314-G30, 5796
8D g. Is the correct configuration ID displayed for the tower selected? v No: Continue with the next step. v Yes: Go to step 6.3. You need to set the unit configuration ID. Are you starting this step from the function 01 view on the control panel? v No: To ensure that the control panel operates properly, return to function 01. Do the following: a. The operator panel should still show the incorrect configuration ID (for example, 07C0). b. Press Enter. The control panel will now show 07xx 00 (for example, 07C0 00). c. Use the arrow keys to display 07**, then press Enter. The control panel will now show 07. d. Use the arrow keys to get the display to function 01, then press Enter. You should now be at the regular function 01 control panel view. e. Continue with the next step.v Yes: Continue with step 4.4. Set the unit configuration ID. Do the following: a. Select function 07 on the system control panel. Press Enter (07** will be displayed). b. Use the arrow keys to increment/decrement to subfunction A9 (07A9 will be displayed). Press Enter (07A9 00 will be displayed). c. Use the arrow keys to increment/decrement to the first byte of the unit address (usually 3C) for the box that you want to change. 07nn (073C, for example) will be displayed, where nn is the first byte of the unit address. Press Enter (073C 00, for example, will be displayed). d. Use the arrow keys to increment/decrement to the second byte of the unit address (usually 01, 02, etc for I/O Expansion units) for the box you want to check. 07nn will be displayed, where nn is the second byte of the unit address (01, for example, for a unit). Press Enter (0701 00, for example, will be displayed).Note: The display on the addressed I/O expansion unit will be blinking on and off. e. Use the arrow keys to increment/decrement to the correct configuration ID (refer to Table 38). 07xx will be displayed where xx is the configuration ID. f. Press Enter (07xx 00 will be displayed). After 20 to 30 seconds, the display on the addressed I/O expansion unit will stop blinking and return to the normal display format. Note: To return the panel to normal display, scroll to 07** and press Enter. g. Continue with the next step.5. Power on the system. Do you still get SRC 1xxxx84D0 or 1xxxx840E? v No: This ends the procedure. v Yes: Continue with the next step.6. Perform the following: 158 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Power off the system. b. Exchange the SPCN card in the failing frame. This ends the procedure.PWR1918 A voltage regulator card might be failing. For instructions for removing and replacing any FRUs called out by the following procedure, refer to Chapter 3, “Locating FRUs,” on page 335. 1. Is the reference code 1xxx 8450? No: Continue with the next step. Yes: At least one processor VRM is missing. Inspect all of the processor cards and install the VRM that are missing (see “Physical Locations” on page 339). This ends the procedure.2. Is the reference code 1xxx 2630? No: Continue with the next step. Yes: Replace VRM1A and VRM1B (which are one CRU/FRU) on processor 1 (Un-P1-C13) (see “Physical Locations” on page 339). This ends the procedure.3. Is the reference code 1xxx 2640? No: Continue with the next step. Yes: Replace VRM1A and VRM1B (which are one CRU/FRU) on processor 2 (Un-P1-C14). This ends the procedure.4. Is the reference code 1xxx 2623? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Base RAID card (Un-P1-C10) b. Auxiliary RAID card (Un-P1-C9)This ends the procedure. 5. Is the reference code 1xxx 2624? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Ethernet card (Un-P1-C7) b. Base RAID card (Un-P1-C10) c. Auxiliary RAID card (Un-P1-C9) d. System backplane (Un-P1) e. DASD drives f. DASD and media backplane (Un-P2)This ends the procedure. 6. Is the reference code 1xxx 2625? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Base RAID card (Un-P1-C10) b. Auxiliary RAID card (Un-P1-C9) c. DASD drives d. DASD and media backplane (Un-P2) e. Ethernet card (Un-P1-C7) f. System backplane (Un-P1)This ends the procedure. Chapter 2. Isolation Procedures 159
7. Is the reference code 1xxx 2626? v No Continue with the next step. v Yes: Replace the following, if present: a. Ethernet card (Un-P1-C7)This ends the procedure. 8. Is the reference code 1xxx 2632? v No Continue with the next step. v Yes: Replace the following, one at a time, and in the order listed: a. VRM2 for processor 1 (Un-P1-C18) b. System backplane (Un-P1)This ends the procedure. 9. Is the reference code 1xxx 2642? v No Continue with the next step. v Yes: Replace the following, one at a time, and in the order listed: a. VRM2 for processor 2 (Un-P1-C20) b. System backplane (Un-P1)This ends the procedure.PWR1920 Use this procedure to verify that the lights on the server control panel and the display panel on all attached I/O expansion units are operating correctly. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Activate the lamp test by performing one of the following: v Select function 04 lamp test on the control panel and press Enter. v Sign on to ASMI and click System Configuration -> Service Indicators -> Lamp Test.2. Look at the server control panel and the display panels on all attached I/O towers. The lamp test is active only for 25 seconds after you press Enter. Check the following lights on the server control panel and all I/O enclosures and towers: v Power-on light. v Attention light. v All dots for the 32 character display.Are all the lights on the control panel and the I/O display panels on? v No: Go to step 4. v Yes: These control panel lights are working correctly. Continue with the next step.3. Are any abnormal characters or character patterns (not reference codes or normal display mode) displayed? v No: The lights are operating correctly. This ends the procedure. v Yes: Continue with the next step.4. Verify that all cables are seated correctly. If the problem persists, replace the control panel. If the problem still persists, use the following table to determine the possible causes for the lamp test failure: 160 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 39. Failing unit
Failing unit
FRU
8203-E4A, 9407-M15, or 9408-M25
System backplane
5088, 0588
Tower backplane assembly
5094, 5294, 8094-002
Tower backplane assembly
5095, 0595
Tower backplane assembly
7311-D11, 7314-G30, 5790 or 5796
I/O backplane
7311-D20
I/O backplane Chapter 2. Isolation Procedures 161
Router isolation procedures These procedures serve as a guide to the correct isolation procedures from the reference code tables. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Perform these procedures only when directed to do so from another procedure. “RTRIP01” on page 163Gives a link to a topic that might assist you when exchanging the I/O processor (IOP) for the system or partition console of i5/OS™. “RTRIP02” on page 163Gives a link to a topic that might assist you when diagnosing workstation IOP detected errors. “RTRIP03” on page 163Gives links to topics to assist you when diagnosing workstation IOP detected errors. “RTRIP05” on page 163Use the attached procedure when this reference code occurs for a RIO/HSL/12X loop resource, when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. “RTRIP06” on page 164Use the attached procedure when this reference code occurs in a service action code (SAL). 162 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“RTRIP07” on page 165Gives a link to assist you when diagnosing a keyboard error. “RTRIP08” on page 165Gives a link to assist when the Licensed Internal Code detected an IOP programming problem.RTRIP01 Gives a link to a topic that might assist you when exchanging the I/O processor (IOP) for the system or partition console of i5/OS™. Perform “CONSL01” on page 27. RTRIP02 Gives a link to a topic that might assist you when diagnosing workstation IOP detected errors. Perform “TWSIP01” on page 234. RTRIP03 Gives links to topics to assist you when diagnosing workstation IOP detected errors. If you have a twinaxial terminal for the console, perform “TWSIP01” on page 234. Otherwise, perform “WSAIP01” on page 241. RTRIP05 Use the attached procedure when this reference code occurs for a RIO/HSL/12X loop resource, when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. Note: This reference code can occur for the RIO/HSL/12X loop resource when an I/O expansion unit on the loop is powered off for a concurrent maintenance action.Note: A fiber optic cleaning kit may be required for optical RIO/HSL/12X connections. 1. Multiple B600 6982 errors may occur due to efforts to retry and recover. If the recovery efforts were successful, there will be a B600 6985 reference code with xxxx 3206 in word 4 logged after all B600 6982 reference codes in the product activity log (PAL). If this is the case, close out all the B600 6982 entries. Then continue with the next step. 2. Is there a B600 6987 reference code in the service action log (SAL) logged at about the same time? Yes: Close this problem and work the B600 6987. This ends the procedure. No: Continue with the next step.3. Is there a B600 6981 reference code in the SAL logged at approximately the same time? Yes: Go to step 8 on page 164. No: Continue with the next step.4. Perform “RIOIP06” on page 34 to determine if any other systems are connected to this loop and then return here. Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU.Are there other systems connect to this loop? Yes: Continue with the next step. No: Go to step 8 on page 164.5. Check for HSL failures in the SALs on the other systems before replacing parts. HSL failures are indicated by SAL entries with HSL I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Chapter 2. Isolation Procedures 163
Are there HSL failures on other systems? Yes: Continue with the next step. No: Go to step 8.6. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985, along with this loop’s resource name, that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 8.7. For the B600 6985 reference code you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? Yes: The problem has been resolved. This ends the procedure. No: Go to step 8.8. The FRU list displayed in the SAL may be different from the failing item list given here. Use the SAL’s FRU list when it is available. Does this reference code appear in the SAL with the symbolic FRU HSL_LNK listed as a FRU? Yes: Perform “RIOIP01” on page 28. This ends the procedure. No: Exchange the FRUs listed in the SAL according to their part action codes. This ends the procedure.RTRIP06 Use the attached procedure when this reference code occurs in a service action code (SAL). Note: A fiber optic cleaning kit may be required for optical HSL connections. 1. Is the reference code in the service action log (SAL)? v Yes: Continue with the next step. v No: The reference code is informational. Use “Status indications for RIO/HSL” on page 15 to determine what the reference code means. This ends the procedure.2. This error can appear in the SAL if a tower or another system in the loop did not complete powering on before Licensed Internal Code (LIC) checked this loop for errors. Search the product activity log (PAL) for all B600 6985 reference codes logged for this loop and use “Status indications for RIO/HSL” on page 15 to determine if this error requires service. Is further service required? Yes: Continue with the next step. No: This ends the procedure.3. There may be multiple B600 6985 reference codes, with xxxx 3205 in word 4, for the same loop resource in the SAL. This is caused by attempts to retry and recover. If there is a B600 6985 reference code with xxxx 3206 or xxxx 3208 in word 4 after the above B600 6985 entries in the PAL, then the recovery efforts were successful. If this is the case, close all the B600 6985 entries for that loop resource in the SAL. Then continue with the next step. 4. Is there a B600 6981 reference code in the SAL? Yes: Close that problem and go to step 9 on page 165. No: Continue with the next step.5. Perform “RIOIP06” on page 34 to determine if any other systems are connected to this loop and then return here. 164 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU.Are there other systems connected to this loop? Yes: Continue with the next step. No: Go to step 9.6. Check for HSL failures in the SALs on the other systems before replacing parts. HSL failures are indicated by SAL entries with HSL I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Are there HSL failures on other systems? Yes: Continue with the next step. No: Go to step 9.7. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985 reference code that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 9.8. For the B600 6985 log you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? v Yes: The problem has been resolved. This ends the procedure. v No: Go to step 9.9. The FRU list displayed in the SAL may be different from the failing item list given here. Use the SAL’s FRU list when it is available. Does this reference code appear in the SAL with the symbolic FRU HSL_LNK listed as a FRU? v Yes: Perform “RIOIP01” on page 28. This ends the procedure. v No: Exchange the FRUs listed in the SAL according to their part action codes. This ends the procedure.RTRIP07 Gives a link to assist you when diagnosing a keyboard error. Perform “WSAIP01” on page 241. RTRIP08 Gives a link to assist when the Licensed Internal Code detected an IOP programming problem. Perform a system IPL. Is the IPL successful? Yes: Perform “LICIP01” on page 91 to determine the cause of the problem.This ends the procedure. No: Perform the action described in the new reference code. This ends the procedure. Chapter 2. Isolation Procedures 165
SAS isolation procedures Use the following Serial Attached SCSI (SAS) isolation procedures if an HMC is not attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “SIP3110” on page 168This procedure resolves problems when disk units are incompatible or a disk unit is missing or failed. “SIP3111” on page 169This procedure resolves the problem when two or more disk units are missing from a RAID 5 or RAID 6 disk array. “SIP3112” on page 171This procedure resolves the problem when one or more disk array members are not at the required physical locations. “SIP3113” on page 172This procedure resolves problems when a disk array is or would become exposed and parity data is out of synchronization. 166 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“SIP3120” on page 174This procedure resolves the problem when Cache data associated with attached disk units cannot be found. “SIP3121” on page 175Use this procedure to resolve the following problem: RAID adapter resources not available due to previous problems (SRC xxxx9054). “SIP3130” on page 175Use this procedure to resolve the following problem: Adapter does not support function expected by one or more disk units (SRC xxxx9008). “SIP3131” on page 177Use this procedure to resolve the following problem: Required cache data cannot be located for one or more disk units (SRC xxxx9050). “SIP3132” on page 180Use this procedure to resolve the following problem: Cache data exists for one or more missing or failed disk units (SRC xxxx9051). “SIP3134” on page 181Use this procedure to resolve the following problem: Disk unit requires format before use (SRC xxxx9092). “SIP3140” on page 183Use this procedure to resolve the following problem: Multiple adapters connected in an invalid configuration (SRC xxxx9073) “SIP3141” on page 184Use this procedure to resolve the following problem: Multiple adapters not capable of similar functions or controlling same set of devices (SRC xxxx9074) “SIP3142” on page 185Use this procedure to resolve the following configuration error: incorrect connection between cascaded enclosures (SRC xxxx4010). “SIP3143” on page 187Use this procedure to resolve the following configuration error: connections exceed adapter design limits (SRC xxxx4020). “SIP3144” on page 188Use this procedure to resolve problems with multipath connections. “SIP3145” on page 191Use this procedure to resolve the following problem: Unsupported enclosure function detected (SRC xxxx4110). “SIP3146” on page 193Use this procedure to resolve the configuration error: Incomplete multipath connection between enclosures and device detected (SRC xxxx4041). “SIP3147” on page 194Use this procedure to resolve the following problem: Missing remote adapter (SRC xxxx9076) “SIP3148” on page 195Use this procedure to resolve the following problem: Attached enclosure does not support required multipath function (SRC xxxx4050). “SIP3149” on page 197Use this procedure to resolve the following problem: Incomplete multipath connection between adapter and remote adapter (SRC xxxx9075) “SIP3150” on page 197Use this procedure to perform serial attached SCSI (SAS) fabric problem isolation. “SIP3152” on page 199Use this procedure to resolve possible failed connection problems Chapter 2. Isolation Procedures 167
“SIP3153” on page 201Go to SIP3152.SIP3110 This procedure resolves problems when disk units are incompatible or a disk unit is missing or failed. The following SRCs are possible: v SRC xxxx9025. Indicates that an incompatible disk unit is installed at the disk unit location that caused the array to be exposed. v SRC xxxx9030. Indicates that a disk array is exposed due to a missing or failed disk unit. v SRC xxxx9032. Indicates that a disk unit in a disk array is missing or failed, but the array is still protected.If you received SRC xxxx9030 or xxxx9032, one of the following occurred: v A disk unit has failed and the RAID array protection is exposed or will become exposed if another disk unit fails because no hot spare disk unit was available to replace it. If the array is exposed, then the array will continue to be exposed until the disk unit has been replaced and a manual rebuilding of the array has been started. v A disk unit has failed in a RAID array, but a hot spare was used to automatically start rebuilding the array. Replace, format, and configure the failed disk unit as a hot spare. Note: If the previous hot spare disk unit was a larger capacity than the failed disk unit, ensure that the customer understands that the replacement disk unit might not provide adequate hot spare coverage for all of the arrays under this adapter.1. Is the device location information for this SRC available in the service action log (see “Using the Service Action Log” on page 609 for details)? No: Continue with the next step. Yes: Exchange the disk unit. See “Disk Drive” on page 379. This ends the procedure.2. Identify the affected adapter and disk units by examining the product activity log. Perform one of the following to access system service tools (SST) or dedicated service tools (DST): v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640.3. Perform the following steps: a. Access the product activity log and display the SRC that sent you here. b. Press the F9 key for address information. This is the adapter address. c. Continue with the next step.4. Perform the following steps: a. Return to the SST or DST main menu. b. Select Work with disk units ? Display disk configuration ? Display disk configuration status. c. On the Display disk configuration status display, look for the devices attached to the adapter that is identified in step 3.Is there a device that has a status of RAID 5/Unknown, RAID 6/Unknown, RAID 5/Failed, or RAID 6/Failed? No: Continue with step 7 on page 169. Yes: Continue with the next step.
168 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Find the device that has a status of RAID 5/Unknown, RAID 6/Unknown, RAID 5/Failed, or RAID 6/Failed. This is the device that is causing the problem. Show the device address by selecting Display Disk Unit Details ? Display Detailed Address. Record the device address. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or of the expansion unit and find the following items: v The slot that is identified by the direct select address of the adapter v The disk unit location that is identified by the device address6. Have you determined the location of the adapter and disk unit that is causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Exchange the disk unit that is causing the problem. See “Disk Drive” on page 379. This ends the procedure.7. Press the function key to cancel and to return to the Display Disk Configuration menu, then do the following: a. Select Display disk hardware status. b. Find a device that is either Not operational or Read/write protected. c. Display details for the device and get the location of the failed disk unit. d. Exchange the disk unit and configure it as a hot spare. Go to “Disk Drive” on page 379. This ends the procedure.SIP3111 This procedure resolves the problem when two or more disk units are missing from a RAID 5 or RAID 6 disk array. The following SRCs are possible: v xxxx9020 v xxxx9021 v xxxx90221. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Chapter 2. Isolation Procedures 169
Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Perform one of the following options (listed in order of preference): Option 1 Power off the system or partition and install the identified disk units in the correct physical locations (that is the expected addresses) in the system. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 2c. Yes: Continue with substep 2c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the remaining members of the disk array by performing the following steps:Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.
170 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3112 This procedure resolves the problem when one or more disk array members are not at the required physical locations. The possible SRC is SRC xxxx9023. 1. Identify the affected adapter and disk units that are not at their required locations by examining the Product Activity Log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. Record all devices that are not at their required locations. These are the array members that have a current address and an expected address that do not match. A current address of 0 is acceptable, and no action is needed to correct it for a known failed drive in the array. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are not at their required locations. These are the array members that have a current address and an expected address that do not match. A current address of 0 is acceptable, and no action is needed to correct it for a known failed drive in the array.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Perform only one of the following options (listed in order of preference): Chapter 2. Isolation Procedures 171
Option 1 Power off the system or partition and install the identified disk units in the correct physical locations (that is the expected addresses) in the system. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 2c. Yes: Continue with the next substep 2c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the remaining members of the disk array by performing the following steps:Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3113 This procedure resolves problems when a disk array is or would become exposed and parity data is out of synchronization. The possible SRC is xxxx9027. 172 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Have the adapter or disk units been physically moved recently? No: Contact your hardware service provider. This ends the procedure. Yes: Continue with the next step.3. Perform one of the following two options (listed in order of preference): Option 1 Power off the system or partition and restore the I/O adapter and disk units back to their original configuration. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? Chapter 2. Isolation Procedures 173
No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 3c. Yes: Continue with substep 3c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3120 This procedure resolves the problem when Cache data associated with attached disk units cannot be found. The possible SRC is xxxx9010 1. Has the server been powered off for several days? No Go to step 2. Yes Go to step 5 on page 175.2. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: Note: Label all parts (original and new) before moving them. v The cache directory card from the original storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the original storage adapter. This applies only to certain adapters that have a removable cache card.3. Has a new SRC of xxxx9010 or xxxx9050 occurred? 174 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No Go to step 6. Yes Go to step 44. Was the new SRC xxxx9050? No The new SRC was xxxx9010. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636.Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs a xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.Go to step 6. Yes: Contact your hardware service provider. This ends the procedure.5. If the server has been powered off for several days after an abnormal power-down, the cache battery pack might be depleted. Do not replace the adapter or the cache battery pack. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636. Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs a xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.This ends the procedure. 6. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: v The cache directory card from the new storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the new storage adapter. This only applies to certain adapters which have a removable cache card.This ends the procedure.SIP3121 Use this procedure to resolve the following problem: RAID adapter resources not available due to previous problems (SRC xxxx9054). Power off the system and remove all new or replacement disk units. IPL the system to DST. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. Look for Product Activity Log entries for other reference codes and take action on them. This ends the procedure. SIP3130 Use this procedure to resolve the following problem: Adapter does not support function expected by one or more disk units (SRC xxxx9008). 1. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST or DST. Chapter 2. Isolation Procedures 175
v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Have the adapter or disk units been physically moved recently, or were the disk units previously used by the AIX or Linux operating system? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.3. Perform one of the following options (listed in order of preference): Option 1 Power off the system or partition and restore the adapter and disk units back to their original configuration. This ends the procedure. 176 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Option 2 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3131 Use this procedure to resolve the following problem: Required cache data cannot be located for one or more disk units (SRC xxxx9050). 1. Did you just exchange the adapter as the result of a failure? No: Go to step 3. Yes: Attention: a. The failed adapter that you have just exchanged contains cache data that is required by the disk units that were attached to that adapter. If the adapter that you just exchanged is failing intermittently, reinstalling it and performing an IPL of the system might allow the data to be successfully written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter can be replaced without data being lost. Otherwise, continue with this procedure. b. Label all parts (old and new) before moving them.Go to step 2. 2. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: v The cache directory card from the original storage adapter. Refer to “Replacing the cache directory card.” v The removable cache card from the original storage adapter. This applies only to certain adapters that have a removable cache card.Go to step 8 on page 179. 3. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. Chapter 2. Isolation Procedures 177
v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc hHexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step. 4. Have the adapter or disk units been physically moved recently? No: Contact your hardware service provider. Yes: Go to step 5. 5. Is the data on the disk units needed for this or any other system? No: Go to step 7 on page 179. Yes: Go to step 6 6. Restore the adapter and disk units back to their original configuration. The adapter and disk units must be rejoined so that the cache data can be written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter or disk units can be moved to another location. This ends the procedure 178 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
7. Perform only one of the following options, listed in the order of preference: Option 1 Reclaim adapter cache storage. See Reclaiming IOP cache storage. Attention: Data on the disk array will be lost.This ends the procedure. Option 2 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk units will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure. 8. Has a new SRC xxxx9010 or xxxx9050 occurred? No: Go to step 10. Yes: Go to step 9. 9. Was the new SRC xxxx9050? No: The new SRC was xxxx9010. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636.Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs an xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.Go to step 10. Yes: Contact your hardware service provider.10. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: Chapter 2. Isolation Procedures 179
v The cache directory card from the new storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the new storage adapter. This only applies to certain adapters that have a removable cache card.This ends the procedure.SIP3132 Use this procedure to resolve the following problem: Cache data exists for one or more missing or failed disk units (SRC xxxx9051). The possible causes are: v One or more disk units have failed on the adapter. v One or more disk units were either moved concurrently or were removed after an abnormal power off. v The adapter was moved from a different system or a different location on this system after an abnormal power off. v The cache of the adapter was not cleared before it was shipped to the customer.1. Identify the affected adapter and disk units by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for 180 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Are there other disk unit or adapter errors that have occurred at approximately the same time as this error? No: Go to step 3. Yes: Go to step 6.3. Is the data on the disk units (and thus the cache data for the disk units) needed for this or any other system? No: Go to step 7. Yes: Go to step 44. Have the adapter card or disk units been physically moved recently? No: Contact your hardware service provider. Yes: Go to step 5.5. Restore the adapter and disk units back to their original configuration. The adapter and disk units must be rejoined so that the cache data can be written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter or disk units can be moved to another location. This ends the procedure. 6. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 7. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636. Attention: Data will be lost. This ends the procedure.SIP3134 Use this procedure to resolve the following problem: Disk unit requires format before use (SRC xxxx9092). The possible causes are: v Disk unit is a previously failed disk unit from a disk array and was automatically replaced by a hot spare disk unit. v Disk unit is a previously failed disk unit from a disk array and was removed and later reinstalled on a different adapter or different location on this adapter. v Appropriate service procedures were not followed when replacing disk units or reconfiguring the adapter, such as not performing a normal power off of the system prior to reconfiguring disk units and adapters. v Disk unit is a member of a disk array, but was detected subsequent to the adapter being configured. v Disk unit has multiple or complex configuration problems.1. Identify the affected adapter and disk units by examining the product activity log. Perform the following: a. Access SST or DST. Chapter 2. Isolation Procedures 181
v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and Worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and Worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Are there other disk unit or adapter errors that have occurred at about the same time as this error? No: Go to step 3. Yes: Go to step 5 on page 183.3. Have the adapter card or disk units been physically moved recently? No: Go to step 4. Yes: Go to step 6 on page 183.4. Is the data on the disk units needed for this or any other system? 182 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
No: Go to step 7. Yes: Go to step 6.5. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 6. Perform one of the following options that is most applicable to your situation: Option 1 Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Take action for any other errors that are now occurring. This ends the procedure. Option 2 Power off the system or partition and restore the adapter and disk units to their original configuration. This ends the procedure. Option 3 Remove the disk units from this adapter. This ends the procedure.7. Do the following to format the disk units: Attention: All data on the disk units will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3140 Use this procedure to resolve the following problem: Multiple adapters connected in an invalid configuration (SRC xxxx9073) The possible causes are: v Incompatible adapters are connected to each other. This includes invalid adapter combinations such as the following: – Adapters with different write cache sizes – One adapter is not supported by i5/OS – An adapter that does not support auxiliary cache is connected to an auxiliary cache adapter – An adapter that supports Dual Storage IOAs is connected to another adapter which does not have the same support – Greater than 2 adapters are connected for Dual Storage IOAs Chapter 2. Isolation Procedures 183
– Adapter code levels are not up to date or are not at the same level of functionalityv One adapter, of a connected pair of adapters, is not operating under the i5/OS operating system. Connected adapters must both be controlled by i5/OS. Additionally, both adapters must be in the same system partition. v Adapters connected for Dual Storage IOAs are not cabled correctly. Each type of Dual Storage IOA configuration requires specific cables be used in a supported manner.Determine which of the possible causes applies to the current configuration and take the appropriate actions to correct it. If this does not correct the error, contact your hardware service provider. This ends the procedure. SIP3141 Use this procedure to resolve the following problem: Multiple adapters not capable of similar functions or controlling same set of devices (SRC xxxx9074) 1. This error relates to adapters connected in a Dual Storage IOA configuration. To obtain the reason or description for this failure, you must find the formatted error information in the Product Activity Log. This should also contain information about the connected adapter. Perform the following: a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a D IPL was NOT performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for ?Address Information?. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The Problem description field indicates the type of problem. The type, serial number, and Worldwide ID of the connected adapter is also available. If a D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB hexadecimal offsets 4C and 4D Cc hexadecimal offset 51 bb hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Problem description field indicates the type of problem. The type, serial number, and Worldwide ID of the connected adapter is also available.2. Find the problem description and information for the connected adapter (remote adapter) shown in the error log, and perform the action listed for the reason in the following table: 184 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 40. RAID array reason for failure
Problem description
Full description
Action
Adapter on which to perform the action
Secondary does not support RAID level being used by primary.
Secondary adapter detected that the primary has a RAID array with a RAID level that the secondary does not support.
Customer needs to upgrade the type of secondary adapter or change the RAID level of the array on the primary to a level that is supported by the secondary.
Physically change the type of adapter that logged the error. Change RAID level on primary adapter (remote adapter indicated in the error log).
Secondary does not support disk unit function being used by primary.
Secondary adapter detected a device function that it does not support.
Customer might need to upgrade the adapter code or upgrade the type of secondary adapter.
Adapter that logged the error.
Secondary is unable to find devices found by the primary.
Secondary adapter cannot discover all the devices that the primary has.
Verify the connections to the devices from the adapter logging the error. View the disk units under each adapter using HSM to determine the SAS port with the problem.
Adapter that logged the error.
Secondary found devices not found by the primary.
Secondary adapter has discovered more devices than the primary. After this error is logged, an automatic failover will occur.
Verify the connections to the devices from the remote adapter as indicated in the error log. View the disk units under each adapter using HSM to determine the SAS port with the problem.
Remote adapter indicated in the error log.
Secondary port not connected to the same numbered port on primary.
SAS connections from the adapter to the devices are incorrect. Common disk expansion drawers must be connected to the same numbered SAS port on both adapters.
Verify connections and re-cable SAS connections as necessary.
Either adapter.
Primary lost contact with disk units accessible by secondary.
Link failure from primary adapter to devices. An automatic failover will occur.
Verify cable connections from the adapter which logged the error. Possible disk expansion drawer failure.
Adapter that logged the error.
Other
Not currently defined.
Contact your hardware service provider. This ends the procedure.SIP3142 Use this procedure to resolve the following configuration error: incorrect connection between cascaded enclosures (SRC xxxx4010). The possible causes are: v Incorrect cabling of cascaded device enclosures v Use of an unsupported device enclosure
Chapter 2. Isolation Procedures 185
Considerations: v To prevent hardware damage, power off the system, partition, or card slot as appropriate, before connecting and disconnecting cables or devices.1. Identify the affected adapter and its port by examining the product activity log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional Information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. If unsupported device enclosures are attached, then either remove or replace them with supported device enclosures. 3. Perform the following to cause the adapter to rediscover the devices and connections: 186 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.SIP3143 Use this procedure to resolve the following configuration error: connections exceed adapter design limits (SRC xxxx4020). The possible causes are: v Unsupported number of cascaded device enclosures v Improper cabling of cascaded device enclosuresConsiderations: v To prevent hardware damage, power off the system, partition, or card slot as appropriate, before connecting and disconnecting cables or devices.1. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through Chapter 2. Isolation Procedures 187
3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Reduce the number of cascaded device enclosures. Device enclosures can only be cascaded one level deep, and only in certain configurations. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. 3. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.SIP3144 Use this procedure to resolve problems with multipath connections. This procedure is used to resolve the following configuration errors: v Configuration error, incorrect multipath connection (SRC xxxx4030) v Configuration error, incomplete multipath connection between adapter and enclosure detected (SRC xxxx4040)The possible causes are: v Incorrect cabling to device enclosure. Note: Pay special attention to the requirement that a Y0-cable, YI-cable, or X-cable must be routed along the right side of the rack frame (as viewed from the rear) when connecting to a disk expansion drawer. Review the device enclosure cabling and correct the cabling as required. To see example device configurations with serial attached SCSI (SAS) cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. v A failed connection caused by a failing component in the SAS fabric between, and including, the adapter and device enclosure.Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS related problem because the SAS interface logic is on the system board. 188 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a SAS adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (that is, not the secondary adapter).Attention: When SAS fabric problems exist, replacing RAID adapters is not recommended without assistance from your service provider. Because the adapter might contain non-volatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. 1. Was the SRC xxxx4030? No: Go to step 5 on page 190. Yes: Go to step 2.2. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb hHexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.
Chapter 2. Isolation Procedures 189
c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.3. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. 4. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.5. The SRC is xxxx4040. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) screen, select Start a Service Tool then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.6. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to re-IPL the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Note: At this point, ignore any problems found and continue with the next step. 7. Determine if the problem still exists for the adapter that logged this error by examining the SAS connections by performing the actions in step 5 again. Do all expected devices appear in the list and are all paths marked as Operational? NO Go to step 8 on page 191. YES This ends the procedure.
190 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
8. Because the problem persists, some corrective action is needed to resolve the problem. Proceed by doing the following: Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Reseat cables if present on adapter and device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.9. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 5 on page 190 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 8. Yes: This ends the procedure.SIP3145 Use this procedure to resolve the following problem: Unsupported enclosure function detected (SRC xxxx4110). The possible causes are: v Device enclosure or adapter code levels are not up to date. v Unsupported type of device enclosure or device.Considerations: To prevent hardware damage or erroneous diagnostic results, remove power from the system as appropriate before connecting and disconnecting cables or devices. 1. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. Chapter 2. Isolation Procedures 191
v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter that is reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstock Have you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Check for the latest PTFs for the device enclosure or adapter and apply them. If unsupported device enclosures or devices are attached, then either remove or replace them with supported device enclosures or devices. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. 3. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor. Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.
192 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
SIP3146 Use this procedure to resolve the configuration error: Incomplete multipath connection between enclosures and device detected (SRC xxxx4041). The possible cause is a failed connection caused by a failing component within the device enclosure, including the device itself. Note: The adapter is not a likely cause of this problem. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a serial attached SCSI (SAS) adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (that is, not the secondary adapter).Attention: Removing functioning disk units in a disk array is not recommended without assistance from your service provider. If functioning disk units are removed, a disk array might become unprotected or failed and additional problems could be created. 1. Determine the resource name of the adapter that reported the problem by performing the following: a. Access SST or DST. b. Access the product activity log and record the resource name that this error is logged against. If the resource name is an adapter resource name, use it and continue with the next step. If the resource name is a disk unit resource name, use Hardware Service Manager to determine the resource name of the adapter that is controlling this disk unit.2. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) display, select Start a Service Tool, and then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.3. Perform the following steps to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. Chapter 2. Isolation Procedures 193
b. Vary on any other resources attached to the virtual I/O processor.4. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 193 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.5. Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Review the device enclosure cabling and correct the cabling as required. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. v Reseat the cables, if present, on the adapter and the device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.6. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 193 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 5. Yes: The error condition no longer exists. This ends the procedure..SIP3147 Use this procedure to resolve the following problem: Missing remote adapter (SRC xxxx9076) 1. An adapter attached in either an Auxiliary Cache or Dual Storage IOA configuration was not discovered in the allotted time. To obtain additional information about the configuration involved, locate the formatted log in the Product Activity Log. a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. 194 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a D IPL was NOT performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for ?Address Information?. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The ?Type of adapter connection? field indicates the type of configuration involved.If a D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB hexadecimal offsets 4C and 4D Cc hexadecimal offset 51 bb hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The ?Type of adapter connection? field indicates the type of configuration involved.2. Determine which of the following is the cause of your specific error and take the appropriate actions listed. If this does not correct the error, contact your hardware service provider. The possible causes are: v An attached adapter for the configuration is not installed or is not powered on. Some adapters are required to be part of a Dual Storage IOA Configuration. Ensure that both adapters are properly installed and powered on. v If this is an Auxiliary Cache or Dual Storage IOA configuration, then both adapters may not be in the same partition. Ensure that both adapters are assigned to the same partition. v An attached adapter does not support the desired configuration. v An attached adapter for the configuration is failed. Take action on the other errors that have occurred at the same time as this error. v Adapter code levels are not up to date or are not at the same level of functionality. Ensure that the code for both adapters is at the latest level.Note: The adapter that is logging this error will run in a performance degraded mode, without caching, until the problem is resolved. This ends the procedure.SIP3148 Use this procedure to resolve the following problem: Attached enclosure does not support required multipath function (SRC xxxx4050). The possible cause is the use of an unsupported device enclosure. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage.1. Identify the adapter and adapter port associated with the problem by examining the product activity log. Perform the following steps: a. Access SST or DST. Chapter 2. Isolation Procedures 195
v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstock Have you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. If unsupported device enclosures are attached to the identified adapter port, then either remove or replace them with supported device enclosures. 3. Perform the following steps to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor. Did the error recurr? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.
196 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
SIP3149 Use this procedure to resolve the following problem: Incomplete multipath connection between adapter and remote adapter (SRC xxxx9075) The possible cause is incorrect cabling between SAS RAID adapters. Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. Review the device enclosure cabling and correct the cabling as required. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, order number SA76-0091. This ends the procedure. SIP3150 Use this procedure to perform serial attached SCSI (SAS) fabric problem isolation. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS related problem because the SAS interface logic is on the system board. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure.Attention: v Before you replace a RAID adapter. Because the adapter might contain nonvolatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. v Before you remove functioning disk units in a disk array. A disk array might become unprotected or might fail and additional problems might be created if functioning disk units are removed from a disk array. 1. Was the SRC xxxx3020? No Go to step 3. Yes Go to step 2. 2. The possible causes are: v More devices are connected to the adapter than the adapter supports. Change the configuration to reduce the number of devices below what is supported by the adapter. v A SAS device has been incorrectly moved from one location to another. Either return the device to its original location or move the device while the adapter is powered off. v A SAS device has been incorrectly replaced by a SATA device. A SAS device must be used to replace a SAS device.This ends the procedure. 3. Determine the status of the disk units in the array by doing the following steps: a. Access the product activity log and display the SRC that sent you here. b. Press the F9 key for address information. This is the adapter address. Chapter 2. Isolation Procedures 197
c. Return to the SST or DST main menu. d. Select Work with disk units ? Display disk configuration ? Display disk configuration status. e. On the Display disk configuration status screen, look for the devices attached to the adapter that was identified.Is there a device that has a status of ?RAID 5/Unknown?, ?RAID 6/Unknown?, ?RAID 5/Failed?, or ?RAID 6/Failed?? No Go to step 5. Yes Go to step 4 4. Other errors should have occurred related to the disk array having degraded protection. Take action on these errors to replace the failed disk unit and restore the disk array to a fully protected state. This ends the procedure. 5. Have other errors occurred at the same time as this error? No Go to step 7. Yes Go to step 6 6. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 7. Was the SRC xxxxFFFE? No Go to step 10. Yes Go to step 8. 8. Check for the latest PTFs for the device, device enclosure, and adapter and apply them. Did you find and apply a PTF? No Go to step 10. Yes Go to step 9. 9. This ends the procedure. 10. Identify the adapter and adapter port associated with the problem by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and display the SRC that sent you here. Record the adapter address and the adapter port by doing one of the following: v If the SRC is xxxxFFFE, press the F9 key for address information. The adapter address is the bus, board, card information. The port is shown in the I/O bus field. Convert the port value from decimal to hexadecimal. v Press the F9 key for address information. The adapter address is the bus, board, card information. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information, if available. The adapter port is characters 1 and 2 of the unit address. For example, if the unit address is 123456FF, the port would be 12. v Go to “Hexadecimal Product Activity Log data” on page 617 to obtain the address information. The adapter address is the bus, board, card information. The adapter port is characters 1 and 2 of the unit address. For example, if the unit address is 123456FF, the port would be 12.11. Use the adapter address to find the location of the adapter (see Chapter 3, “Locating FRUs,” on page 335). On the tailstock of the adapter, find the port identified in the previous step. This is the port that is used to attach the device, or device enclosure, that is experiencing the problem. 198 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
12. Because the problem persists, some corrective action is needed to resolve the problem. Proceed by doing the following: Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Reseat cables, if present, on adapter and device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the device. See “Disk Drive” on page 379.Note: If there are multiple devices with a path that is not Operational, then the problem is not likely to be with a device. v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following: a. Power off the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.13. Does the problem still occur after performing the corrective action? No: This ends the procedure. Yes: Go to step 12.SIP3152 Use this procedure to resolve possible failed connection problems This procedure is used to resolve the following problems: v Multipath redundancy level is worse (SRC xxxx4060) v Device bus fabric error (SRC xxxx4100) v Temporary device bus fabric error (SRC xxxx4101)The possible causes are: v A failed connection caused by a failing component in the serial attached SCSI (SAS) fabric between, and including, the adapter and device enclosure. v A failed connection caused by a failing component within the device enclosure, including the device itself.Note: For SRC xxxx4060, the failed connection was previously working, and may have already recovered.
Chapter 2. Isolation Procedures 199
Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS-related problem because the SAS interface logic is on the system board. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a SAS adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (not the secondary adapter).Attention: v When SAS fabric problems exist, replacing RAID adapters is not recommended without assistance from your service provider. Because the adapter might contain nonvolatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. v Removing functioning disk units in a disk array is not recommended without assistance from your service provider. A disk array might become unprotected or might fail if functioning disk units are removed. The removal of functioning disk units might also result in additional problems in the disk array.1. Determine the resource name of the adapter that reported the problem by performing the following: a. Access SST or DST. b. Access the product activity log and record the resource name that this error is logged against. If the resource name is an adapter resource name, use it and continue with the next step. If the resource name is a disk unit resource name, use the Hardware Service Manager to determine the resource name of the adapter that is controlling this disk unit.2. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) screen, select Start a Service Tool and then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.3. Perform the following to cause the adapter to rediscover the devices and connections: 200 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.4. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 200 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.5. Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, then proceed to the next one in the list. v Reseat cables, if present, on adapter and device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from adapter to device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the device. See “Disk Drive” on page 379.Note: If there are multiple devices with a path that is not Operational, then the problem is not likely to be with a device. v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.6. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 200 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 5. Yes: The error condition no longer exists. This ends the procedure..SIP3153 Go to SIP3152. Chapter 2. Isolation Procedures 201
Service processor isolation procedures Use service processor isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “FSPSP01” on page 205A part vital to system function has been deconfigured. Review the system error logs for errors that call out parts relevant to each reason code. If replacing those parts does not resolve the error, use this procedure. “FSPSP02” on page 206This procedure is for boot failures that terminate very early in the boot process. “FSPSP03” on page 206A system operator or user error has occurred. “FSPSP04” on page 206A problem has been detected in the service processor firmware. “FSPSP05” on page 206The service processor has detected a problem in the platform firmware. 202 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“FSPSP06” on page 207The service processor reported a suspected intermittent problem. “FSPSP07” on page 207The time of day has been reset to the default value. “FSPSP09” on page 207A problem has been detected with a memory DIMM, but it cannot be isolated to a specific memory DIMM. “FSPSP10” on page 207The part indicated in the FRU list that follows this procedure is not valid or missing for this system’s configuration. “FSPSP11” on page 207The service processor has detected an error on the RIO/HSL port in the system unit. “FSPSP12” on page 208The DIMM FRU that was previously replaced did not correct the memory error. “FSPSP14” on page 208The service processor cannot communicate with the system firmware. The server firmware will continue to run the system and partitions while it attempts to recover communications. Server firmware recovery actions will continue for approximately 30 to 40 minutes. “FSPSP16” on page 208Save any error log and dump data and contact your next level of support for assistance. “FSPSP17” on page 208A system uncorrectable error has occurred. “FSPSP18” on page 208A problem has been detected in the platform LIC. “FSPSP20” on page 209A failing item has been detected by a hardware procedure. “FSPSP22” on page 209The system has detected that a processor chip is missing from the system configuration because JTAG lines are not working. “FSPSP23” on page 209The system needs to perform a service processor dump. “FSPSP24” on page 209The system is running degraded. Array bit steering may be able to correct this problem without replacing hardware. “FSPSP25” on page 209The server has detected an over-temperature thermal fault. “FSPSP27” on page 210An attention line has been detected as having a problem. If the FRU replaced before this procedure did not correct the problem, perform the following: “FSPSP28” on page 210The resource ID (RID) of the FRU could not be found in the vital product data (VPD) table. “FSPSP29” on page 210The system has detected that all I/O bridges are missing from the system configuration. “FSPSP30” on page 211A problem has been encountered accessing the VPD card or the data found on the VPD card has been corrupted. “FSPSP31” on page 211The service processor has detected that one or more of the required fields in the system VPD has not been initialized. “FSPSP32” on page 211A problem with the enclosure has been found. Chapter 2. Isolation Procedures 203
“FSPSP33” on page 212A problem has been detected in the connection with the HMC. “FSPSP34” on page 212The memory cards are plugged in an invalid configuration and cannot be used by the system. “FSPSP35” on page 213The system has detected a problem with a memory controller. “FSPSP38” on page 213The system has detected an error within the JTAG path. If the FRU replaced before this procedure did not correct the problem, perform the following: “FSPSP42” on page 213An error communicating between two system processors was detected. “FSPSP45” on page 214The system has detected an error with FSI path. “FSPSP46” on page 214Some corrupt areas of flash or RAM have been detected on the service processor. “FSPSP47” on page 214The system has detected an error within the PSI link. To resolve the problem, perform the following: “FSPSP48” on page 214A diagnostic function detects an external processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP49” on page 214A diagnostic function detects an internal processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP50” on page 215A diagnostic function detects a connection problem between a processor chip and a GX chip. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP51” on page 215Runtime diagnostics detected a memory bus correctable error that is exceeding the threshold. The memory bus correctable error will not cause an immediate loss of system operations. However, the system is operating in a degraded mode. “FSPSP52” on page 215A problem has been detected on a memory bus. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP54” on page 215A processor over-temperature has been detected. Check for any environmental issues before replacing any parts. “FSPSP55” on page 216An error has been detected on a bus between two FRUs. The endpoint FRUs have been called out. However, the source of the error could be the bus path between the FRUs. “FSPSP63” on page 216The system has experienced a power error. Please review previous error logs for power-related issues. “FSPSP64” on page 216All the processor support interface (PSI) links of the system are either nonfunctional or deconfigured, so the system cannot perform an IPL appropriately. Look for previous error logs that deconfigure hardware. “FSPSPC1” on page 217If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. “FSPSPD1” on page 218If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor.
204 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
FSPSP01 A part vital to system function has been deconfigured. Review the system error logs for errors that call out parts relevant to each reason code. If replacing those parts does not resolve the error, use this procedure. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following steps: 1. Is SRC B1xxB10C or B1xxB10D (xx indicates that the Subsystem ID is irrelevant)? v No: Go to step 5 v Yes: The system has detected that one of the following has occurred: – A memory controller that is required for the system to function is deconfigured. – There is not enough memory. – The memory DIMM is plugged incorrectly.Continue with the next step.2. Perform the following steps: a. Reseat all of the memory DIMMs on the system backplane but do not replace any memory DIMM at this time. b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.3. Use the following procedure to replace memory DIMMs, and perform a slow boot after each replacement until the problem is resolved. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs. a. Replace the first memory DIMM quad. b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Repeat this step and replace the second memory DIMM quad. If you have replaced both of the memory DIMM quads, then continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.4. Use the following procedure to replace the system backplane. a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Contact your next level of support. This ends the procedure. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.5. Is SRC B1xxB107 or B1xxB108? v No: Continue with the next step. v Yes: The system has detected a problem with the clock module. Replace the system backplane (Un-P1). This ends the procedure.6. Is SCR B1xxB106? v No: Continue with the next step. v Yes: The system has detected that the system backplane is deconfigured. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665. Chapter 2. Isolation Procedures 205
c. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.7. Is SRC B1xxB110 or B1xxB111? v No: Contact your next level of support. This ends the procedure. v Yes: The system has detected that the I/O bridge is deconfigured. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665. c. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.FSPSP02 This procedure is for boot failures that terminate very early in the boot process. This error path is indicated when the SRC data words are scrolling automatically through control panel functions 11, 12, and 13, and the control panel interface buttons are not responsive. Perform the following: 1. Push the white power button to reset the system and start it on the other side of platform LIC. Note: The white power button will only reset the system and attempt to reach standby. 2. Did an SRC occur after starting the system on the other side? No: Verify that the system’s firmware is at the latest level. Update the system’s firmware on the failing side if necessary.This ends the procedure. Yes: Continue with the next step.3. Is the SRC the same SRC that brought you to this procedure? v No: Return to Chapter 1, “Starting a Service Call,” on page 1 to service this new SRC. This ends the procedure. v Yes: Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335. If the problem is not resolved, contact your next level of support. This ends the procedure.FSPSP03 A system operator or user error has occurred. Refer to the documentation for the task that you were attempting to perform. FSPSP04 A problem has been detected in the service processor firmware. Refer to Operations Guide for ASMI and for Nonpartitioned Systems (SA76-0094) or Operations Guide for the Hardware Management Console and Managed Systems (SA76-0085) and verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. FSPSP05 The service processor has detected a problem in the platform firmware. Refer to the Operations Guide for ASMI and for Nonpartitioned Systems, SA76-0094 OR the Operations Guide for the Hardware Management Console and its Managed Systems, SA76-0085 and verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. 206 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
FSPSP06 The service processor reported a suspected intermittent problem. Contact your next level of support. FSPSP07 The time of day has been reset to the default value. 1. To set the time of day, refer to the systems operations guide. 2. If the problem persists, replace the TOD battery (Un-P1-E1): a. Power off the system (see “Powering on and powering off” on page 588). b. Replace the TOD battery. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. c. Perform a slow boot. This ends the procedure.FSPSP09 A problem has been detected with a memory DIMM, but it cannot be isolated to a specific memory DIMM. 1. Replace all the memory DIMMs on the system backplane. See Chapter 3, “Locating FRUs,” on page 335 for instructions. 2. Perform a slow boot after replacing all the memory DIMMs. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP10 The part indicated in the FRU list that follows this procedure is not valid or missing for this system’s configuration. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following to correct the problem: 1. Does word 8 (the 8 leftmost characters in the 2nd line of function 13) of the reference code end with 02 or 04? No: Go to step 3. Yes: Continue with the next step.2. The FRU that is called out after this procedure is either missing or not valid. Is that FRU installed and connected or plugged in correctly? Yes: The installed FRU is not valid. Remove that FRU. Then contact your next level of support to determine the correct FRU. This ends the procedure. No: The FRU is missing. If the FRU is present but not connected, reconnect it and perform a slow boot (see “Performing a slow boot” on page 665). Otherwise, contact your next level of support to determine the missing FRU. This ends the procedure.3. Does word 8 end with 01 or 05? No: Return to the Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. Yes: The FRU that is listed after this procedure has the same serial number as another FRU in the system. Remove all but one of the FRUs that are listed after this procedure and then perform a slow boot (see “Performing a slow boot” on page 665). This ends the procedure.FSPSP11 The service processor has detected an error on the RIO/HSL port in the system unit. 1. Verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. Chapter 2. Isolation Procedures 207
2. If the problem persists, replace the system backplane (Un-P1). 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP12 The DIMM FRU that was previously replaced did not correct the memory error. Perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). See Chapter 3, “Locating FRUs,” on page 335 for instructions. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP14 The service processor cannot communicate with the system firmware. The server firmware will continue to run the system and partitions while it attempts to recover communications. Server firmware recovery actions will continue for approximately 30 to 40 minutes. Perform the following: 1. Record the time the log was created or when the SRC was first noticed. Continue with the next step. 2. Are progress codes being displayed on the panel? v Yes: Server firmware was able to reset the service processor. This ends the procedure. v No: Continue with the next step.3. Has an A7006995 SRC been displayed on the panel? v Yes: Partitions are being powered off and a server dump will be attempted. Follow the A7006995 SRC description if the partitions do not terminate as requested. This ends the procedure. v No: Continue with the next step.4. Has the A1xx SRC remained on the panel for more than 40 minutes? v Yes: Server firmware could not begin termination of the partitions. Contact your next level of support to assist in attempting to terminate any remaining partitions and forcing a server dump. Collect the dump for support, power off and power on the system. This ends the procedure. v No: Contact your next level of support. This ends the procedure.FSPSP16 Save any error log and dump data and contact your next level of support for assistance. FSPSP17 A system uncorrectable error has occurred. 1. Look for other serviceable events and use the call outs listed with them to correct the problem. 2. If you need to run the system in a degraded mode until you can perform the service actions, do the following: a. Power off the system (see “Powering on and powering off” on page 588). b. Power on the system (see “Powering on and powering off” on page 588) to allow the memory diagnostics to clean up the memory. Guard out any defective parts.This ends the procedure.FSPSP18 A problem has been detected in the platform LIC. 208 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Refer to the Operations Guide for the Hardware Management Console and Managed Systems, SA76-0085, or the Operations Guide for ASMI and for Nonpartitioned Systems, SA76-0094, and verify that the server firmware is at the latest level. Update the server firmware if necessary. FSPSP20 A failing item has been detected by a hardware procedure. Perform a slow boot to run full hardware diagnostics. If a new SRC occurs, repair the system using that reference code. If an incomplete occurs, go to “Accessing the ASMI” on page 646 menus to power off, check for deconfigured components and perform a slow boot. This ends the procedure. FSPSP22 The system has detected that a processor chip is missing from the system configuration because JTAG lines are not working. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP23 The system needs to perform a service processor dump. 1. Perform a service processor dump (see “Performing a platform system or service processor dump” on page 653). 2. Attempt to perform an IPL on the system. 3. Save the service processor dump to storage (see “Managing dumps” on page 653). 4. Contact your next level of support. This ends the procedure.FSPSP24 The system is running degraded. Array bit steering may be able to correct this problem without replacing hardware. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Go to “Performing a slow boot” on page 665. This ends the procedure. 3. If the problem persists, replace the FRU that is called out after this procedure. See Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure.FSPSP25 The server has detected an over-temperature thermal fault. 1. Before replacing any server hardware FRU callouts, look in the error log for thermal problems related to fans, power supplies, etc. Perform all service actions for the thermal problem SRCs first before continuing with any other callouts in the current SRC. Thermal problems are associated with 1100 xxxx SRCs, where xxxx may be any of the following: v 1514 v 1524 Chapter 2. Isolation Procedures 209
v 7201 v 7203 v 7205 v 7610 v 7611 v 7620 v 7621 v 7630 v 7631 v 7640 v 76412. If no thermal-related SRCs or problems can be found, replace the server hardware FRU called out in the current SRC. See Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure.FSPSP27 An attention line has been detected as having a problem. If the FRU replaced before this procedure did not correct the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing any FRUs called out by this procedure. 1. Was the FRU called out before this procedure a memory DIMM (Un-P1-C14 through Un-P1-C17 or Un-P1-C21 through Un-P1-C24) or a GX adapter card (Un-P1-C8 or Un-P1-C6) No: This ends the procedure. Yes: Continue with the next step.2. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot. This ends the procedure.FSPSP28 The resource ID (RID) of the FRU could not be found in the vital product data (VPD) table. 1. Find another callout that reads ?FSPxxxx? where xxxx is a 4 digit hex number that represents the RID. Record the RID and the model of the system. 2. Call next level of support to find out which FRU the RID represents. 3. Replace the FRU that the RID represents. 4. Perform a slow boot to ensure full hardware diagnostics. This ends the procedure.FSPSP29 The system has detected that all I/O bridges are missing from the system configuration. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. 210 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
FSPSP30 A problem has been encountered accessing the VPD card or the data found on the VPD card has been corrupted. This error occurred before VPD collection was completed. No location codes have been created. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Power off the system (see “Powering on and powering off” on page 588) and remove ac power. 2. Clear any deconfiguration errors for the VPD card. 3. Replace the VPD card (Un-P1-C11). See Chapter 3, “Locating FRUs,” on page 335. 4. Perform a slow boot. Refer to “Performing a slow boot” on page 665). Does the problem persist? Yes: Continue with the next step. No: This ends the procedure.5. Replace the system backplane (Un-P1). This ends the procedure.FSPSP31 The service processor has detected that one or more of the required fields in the system VPD has not been initialized. 1. Log into ASMI with authorized service provider authority (see “Accessing the ASMI” on page 646). 2. Set the system VPD values (see “Setting the system enclosure type” on page 650 and “Setting the system identifiers” on page 650). Note: The service processor will automatically reset when leaving the ASMI after updating the system VPD. 3. Power on the system (“Powering on and powering off” on page 588). This ends the procedure.FSPSP32 A problem with the enclosure has been found. The problem is resulting from one of the following: v The enclosure VPD cannot be found. v The enclosure serial number is not programmed. v The enclosure feature code is not programmed.Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following: 1. Record the reason code (the last 4 characters of word 11) from the SRC by looking at the operator panel or accessing the error log with the ASMI. 2. Is the reason code A46F? No: Go to step 6 on page 212. Yes: Continue with the next step.3. Check for and apply any server firmware updates. Does the problem persist? No: This ends the procedure. Yes: Continue with the next step.4. Power off the system (see “Powering on and powering off” on page 588). Chapter 2. Isolation Procedures 211
5. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot (see “Performing a slow boot” on page 665).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.6. Is the reason code A460? No: Go to step 8. Yes: Continue with the next step.7. Perform the following: a. Set the enclosure serial number using the ASMI (see “Setting the system identifiers” on page 650). b. The service processor will automatically reset when leaving the ASMI after updating the serial number. c. Perform a slow boot (see “Performing a slow boot” on page 665).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.8. Is the reason code A45F? No: Contact your next level of support. This ends the procedure. Yes: Continue with the next step.9. Perform the following: a. Set the enclosure feature code using the ASMI (see “Setting the system enclosure type” on page 650). The service processor will automatically reset when leaving the ASMI after updating the serial number. b. Perform a slow boot (see “Performing a slow boot” on page 665).).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.FSPSP33 A problem has been detected in the connection with the HMC. 1. Ensure that the cable connectors to the network from the HMC, managed system, managed system partitions, and other HMCs are securely connected. If the connections are not secure, plug the cables back into the proper locations and make sure that the connections are good. 2. Check to see if the HMC is working correctly or if the HMC was disconnected incorrectly from the managed system, managed system partitions, and other HMCs. If either has happened, reboot the HMC. 3. Verify that the network connection between the HMC, managed system, managed system partitions, and other HMCs is working properly. 4. If applicable, service the next FRU. See Chapter 3, “Locating FRUs,” on page 335 for instructions. 5. If the problem continues to persist, contact your next level of support. This ends the procedure.FSPSP34 The memory cards are plugged in an invalid configuration and cannot be used by the system. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 212 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. Is the SRC B1xx C02A? Yes: A memory module is missing from the system. The additional parts in the FRU callout list will include all memory modules in the group with the missing card. To correct the error, visually check the system to determine which of these missing is missing, and add the module. This ends the procedure. No: Continue with the next step.2. Is the SRC B1xx C029? Yes: A memory module is a different type than the other memory modules in the same group. The additional parts in the FRU callout list will include all memory modules in the group that contain the error. To correct the error, exchange the memory modules of the incorrect type with those of the desired type. This ends the procedure. No: Continue with the next step.3. Is the SRC B1xx C02B? Yes: A group of memory modules are missing and are required so that other memory modules on the board can be configured. The additional parts in the FRU callout list will include all missing memory modules in the group. To correct the error, add or remove these modules to the desired locations. This ends the procedure. No: Continue with the next step.4. Is the SRC B1xx C036? Yes: A memory module is not supported by this system. The additional parts in the FRU callout list will include all memory modules in the group that contains the unsupported modules. To correct the error, remove these modules from the system or replace them with the correct type. This ends the procedure. No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.FSPSP35 The system has detected a problem with a memory controller. Perform the following to enable redundant utilization: 1. Power off the system (see “Powering on and powering off” on page 588). 2. “Performing a slow boot” on page 665. This ends the procedure.FSPSP38 The system has detected an error within the JTAG path. If the FRU replaced before this procedure did not correct the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing any FRUs called out by this procedure. 1. Was the FRU called out before this procedure a memory DIMM (Un-P1-C14 through Un-P1-C17 or Un-P1-C21 through Un-P1-C24) or a GX adapter card (Un-P1-C8 or Un-P1-C6)? No: No: This ends the procedure Yes: Continue with the next step.2. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot. This ends the procedure.FSPSP42 An error communicating between two system processors was detected. Chapter 2. Isolation Procedures 213
Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. There is a communication error between the processors and the FRUs called out before this procedure. If you were unable to correct the problem by replacing FRUs that were previously specified before coming to this procedure, consider the possibility of failing system backplanes. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). 3. Perform a slow boot (see “Performing a slow boot” on page 665). This ends the procedure. FSPSP45 The system has detected an error with FSI path. 1. Replace the system backplane (Un-P1) 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP46 Some corrupt areas of flash or RAM have been detected on the service processor. 1. Replace the system backplane (Un-P1). See Chapter 3, “Locating FRUs,” on page 335 for instructions. 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP47 The system has detected an error within the PSI link. To resolve the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Replace the system backplane (Un-P1) 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP48 A diagnostic function detects an external processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP49 A diagnostic function detects an internal processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665.
214 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
This ends the procedure. FSPSP50 A diagnostic function detects a connection problem between a processor chip and a GX chip. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP51 Runtime diagnostics detected a memory bus correctable error that is exceeding the threshold. The memory bus correctable error will not cause an immediate loss of system operations. However, the system is operating in a degraded mode. To resolve the problem, replace the FRU called out after this procedure. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. FSPSP52 A problem has been detected on a memory bus. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP54 A processor over-temperature has been detected. Check for any environmental issues before replacing any parts. 1. Is the ambient room temperature in the normal operating range (less than 35 degrees C/95 degrees F)? No: Notify the customer. The customer must lower the room temperature so that it is within the normal range. Do not replace any parts. This ends the procedure. Yes: Continue to the next step.2. Are the front and rear of the CEC drawer, and the front and rear rack doors, free of obstructions that would impede the airflow through the drawer? No: Notify the customer. The system must be free of obstructions for proper airflow. Clean the air inlets and exits in the drawer as required. Do not replace any parts. This ends the procedure. Yes: Continue to the next step.3. Are all of the fans, especially those at the back of the power supply, functioning normally? No: Replace any fans that are not turning or are turning slowly. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure. Yes: There are no environmental issues with the cooling of the processors. This ends the procedure.
Chapter 2. Isolation Procedures 215
FSPSP55 An error has been detected on a bus between two FRUs. The endpoint FRUs have been called out. However, the source of the error could be the bus path between the FRUs. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Is word 6 (the 8 leftmost characters of panel function 13) of the reference code 00000001? No: Go to the next step. Yes: Perform the following:a. The XYZ bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.2. Is word 6 of the reference code 00000002? No: Go to the next step. Yes: Perform the following:a. The AB bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.3. Is word 6 of the reference code 00000004 or 00000005? No: Go to the next step. Yes: Perform the following:a. The processor - GX card bus or PSI bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.4. For any other value in word 6 of the reference code, the bus error is corrected by the FRUs called out after this procedure. Replace the FRUs called out after this procedure. This ends the procedure.FSPSP63 The system has experienced a power error. Please review previous error logs for power-related issues. FSPSP64 All the processor support interface (PSI) links of the system are either nonfunctional or deconfigured, so the system cannot perform an IPL appropriately. Look for previous error logs that deconfigure hardware. 216 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
FSPSPC1 If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. Attention: You should periodically check the system firmware level on all your servers and update the firmware to the latest level, if appropriate. If you were directed to this procedure because the server displayed B1817201, C1001014, or C1001020, or a combination of these codes, the latest firmware can help avoid a recurrence of this problem. Even if the customer cannot update the firmware on this system at this time, all of their systems should be updated to the latest firmware level as soon as possible to help prevent this problem from occurring on other systems. Resetting the service processor on systems with a physical control panel 1. If the Advanced System Management Interface (ASMI) is available, reset the service processor using the ASMI menus. Were you able to use the ASMI menus to reset the service processor? Yes: This ends the procedure. No: Continue with the next step.2. Activate the service processor pinhole reset switch on the system’s control (operator) panel by carefully performing these steps: a. Using an insulated paper clip, unbend the paper clip so that it has a straight section about 2 inches long. b. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. c. When you engage the reset switch, you should feel the detent of the switch. Pressing the reset switch resets the service processor and causes the system to shut down.3. Reboot the system in slow mode from the permanent side, using control panel function 02 or the ASMI menus, if available. 4. If the hang repeats, verify whether there is a firmware update that is available that fixes the problem; apply if available. For more information, see: v If the system is managed by an HMC, see the Operations Guide for the HMC and Managed Systems, SA76-0085. v If the system is not managed by an HMC, see the Operations Guide for the ASMI and Nonpartitioned Systems, SA76-0094.5. Choose from the following options: v If no firmware update is available, continue with the next step. v If a firmware update is available, apply it using the Service Focal Point in the Hardware Management Console (HMC), if attached.Did the update resolve the problem so that the system now boots? Yes: This ends the procedure. No: You are here because there is no HMC attached to the system, the flash update failed, or the updated firmware did not fix the hang. Continue with the next step.6. Choose from the following options: v If you are a customer, contact your authorized hardware service provider. This ends the procedure. v If you are the authorized hardware service provider, continue with the next step.7. Replace the service processor (see Chapter 3, “Locating FRUs,” on page 335). 8. If replacing the service processor does not fix the problem, contact your next level of support. This ends the procedure.
Chapter 2. Isolation Procedures 217
FSPSPD1 If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. Attention: You should periodically check the system firmware level on all your servers and update the firmware to the latest level, if appropriate. If you were directed to this procedure because the server displayed B1817201, C1001014, or C1001020, or a combination of these codes, the latest firmware can help avoid a recurrence of this problem. Even if the customer cannot update the firmware on this system at this time, all of their systems should be updated to the latest firmware level as soon as possible to help prevent this problem from occurring on other systems. Resetting the service processor on systems with a physical control panel 1. If the Advanced System Management Interface (ASMI) is available, reset the service processor using the ASMI menus. Were you able to use the ASMI menus to reset the service processor? Yes: This ends the procedure. No: Continue with the next step.2. Activate the service processor pinhole reset switch on the system’s control (operator) panel by carefully performing these steps: a. Using an insulated paper clip, unbend the paper clip so that it has a straight section about 2 inches long. b. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. c. When you engage the reset switch, you should feel the detent of the switch. Pressing the reset switch resets the service processor and causes the system to shut down.3. Reboot the system in slow mode from the permanent side, using control panel function 02 or the ASMI menus, if available. 4. If the hang repeats, verify whether there is a firmware update that is available that fixes the problem; apply if available. For more information, see: v If the system is managed by an HMC, see the Operations Guide for the HMC and Managed Systems, SA76-0085. v If the system is not managed by an HMC, see the Operations Guide for the ASMI and Nonpartitioned Systems, SA76-0094.5. Choose from the following options: v If no firmware update is available, continue with the next step. v If a firmware update is available, apply it using the Service Focal Point in the Hardware Management Console (HMC), if attached.Did the update resolve the problem so that the system now boots? Yes: This ends the procedure. No: You are here because there is no HMC attached to the system, the flash update failed, or the updated firmware did not fix the hang. Continue with the next step.6. Choose from the following options: v If you are a customer, contact your authorized hardware service provider. This ends the procedure. v If you are the authorized hardware service provider, continue with the next step.7. Replace the service processor (see Chapter 3, “Locating FRUs,” on page 335). 8. If replacing the service processor does not fix the problem, contact your next level of support. This ends the procedure. 218 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Tape unit isolation procedures This topic contains the procedures necessary to isolate a failure in a tape device. In these procedures, the term tape unit may be any one of the following: v An internal tape drive, including its electronic parts and status indicators v An internal tape drive, including its tray, power regulator, and AMDs v An external tape drive, including its power supply, power switch, power regulator, and AMDs You should interpret the term tape unit to mean the tape drive you are working on. However, these procedures use the terms tape drive and enclosure to indicate a more specific meaning. Read and observe all safety procedures before servicing the system and while performing the procedures in this topic. Unless instructed otherwise, always power off the system or expansion unit where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). Chapter 2. Isolation Procedures 219
DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “TUPIP02” on page 221Use this procedure to perform the 8mm tape drive read self-test and hardware self-test. “TUPIP03” on page 223You were directed here because you may need to exchange a failing part. “TUPIP04” on page 224Use this procedure to reset an IOP and its attached tape units. Read the (overview) before continuing with this procedure. “TUPIP06” on page 228Use this procedure to isolate a Device Not Found message during installation from an alternate device. “Tape unit self-test procedure” on page 229This procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit. “Tape device ready conditions” on page 231All the conditions that are listed for the device, must be correct for the device to be ready.
220 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
TUPIP02 Use this procedure to perform the 8mm tape drive read self-test and hardware self-test. The write test is performed in “TUPIP03” on page 223. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Press the Unload switch on the front of the 8mm tape drive. Is a data cartridge present? v No: Continue with the next step. v Yes: Attempt to remove the data cartridge. Can you remove the data cartridge? Yes: Continue with the next step. No: The tape drive is the failing part. Go to (Tape cartridge - Manual removal). After removing the data cartridge, exchange the tape drive (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure.3. Clean the tape drive by using the cleaning cartridge (part 16G8467). If the tape drive ejects the cleaning cartridge with the Disturbance light on, a new cleaning cartridge is needed. The tape drive unloads automatically when cleaning is complete. Cleaning takes up to 5 minutes. Is the Disturbance light on continuously? No: Continue with the next step. Yes: Repeat this step while using a new cleaning cartridge. If you are using a new cleaning cartridge and the Disturbance light does not go off, the possible failing part is the 8mm tape drive. Go to “TUPIP03” on page 223. This ends the procedure.4. Is the Disturbance light blinking? v No: Continue with the next step. v Yes: Choose from the following: – If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. – If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure.5. Perform the read self-test: Notes: a. The 8mm tape drive read self-test and hardware self-test can take up to 5 minutes to run. b. The tape drive runs the read self-test first. During the read self-test, the Read-Write light blinks, and the Disturbance and Ready lights are off.To start the read self-test and hardware self-test, insert the diagnostic cartridge (part 46G2660) into the tape drive. The following conditions indicate that the read self-test ended successfully: v The Read-Write light stops blinking. v The diagnostic cartridge ejects automatically. v The three status lights go on to indicate the start of the hardware test. Does the read self-test end successfully? v No: Does the tape drive eject the diagnostic cartridge? – Yes: Continue with the next step. – No: The tape drive is the failing part. Go to (Tape cartridge - Manual removal). After removing the diagnostic cartridge, exchange the tape drive (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure.
Chapter 2. Isolation Procedures 221
v Yes: Go to step 8.6. Is the Disturbance light blinking approximately four times per second? v No: Continue with the next step. v Yes: The possible failing part is the diagnostic cartridge (part 46G2660). – If this is your first time through this step, get a new diagnostic cartridge. Go to step 3 on page 221 of this procedure to clean the tape drive again. Afterwards, run the read self-test and hardware self-test while using the new diagnostic cartridge. – If this is your second time through this step, the possible failing part is the 8mm tape drive. Go to “TUPIP03” on page 223. This ends the procedure.7. The Disturbance light is blinking approximately once per second. v If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. v If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure. 8. The tape drive runs the hardware self-test. During the hardware self-test, the three tape drive status lights are on for 15 to 30 seconds. The three status lights go off when the hardware self-test ends successfully. Does the hardware self-test end successfully? v Yes: Continue with the next step. v No: When the hardware self-test does not end successfully, the following conditions occur: – The three status lights do not go off. – The Ready and Read-Write lights go off. – The Disturbance light blinks approximately once per second. Possible failing parts are: – If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. – If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure.9. The read self-test and hardware self-test ended successfully. Was the user’s original tape identified as the probable cause of failure? v Yes: Perform the following: a. Mark and date the data cartridge to indicate that it failed with a permanent error. b. Discard this data cartridge when: – Volume statistics (if available) indicate a problem with the data cartridge. – A total of three permanent errors have occurred with the same data cartridge.c. If possible, continue operations with a new data cartridge. Go to “TUPIP03” on page 223. This ends the procedure. v No: Go to “TUPIP03” on page 223. This ends the procedure.
222 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
TUPIP03 You were directed here because you may need to exchange a failing part. The failing part was determined from one of the following: v Other problem isolation procedures v The Failing item column of the tape unit reference code table v Tape unit service guideNote: Occasionally, the system is available but not performing an alternate IPL (type D IPL). In this instance, any hardware failure of the tape unit I/O processor, or any device attached to it is not critical. With the exception of the loss of the affected devices, the system remains available. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Do you need to exchange a possible failing device? v No: Do you need to exchange the tape unit I/O processor? No: Continue with the next step. Yes: Exchange the tape unit I/O processor (see Chapter 3, “Locating FRUs,” on page 335). When you have completed the remove and replace procedure, continue with the next step.v Yes: Perform the following: – For an internal tape unit, go to Chapter 3, “Locating FRUs,” on page 335. – For an external tape unit, go to the remove and replace procedures in the device service information. 3. Are you working with a tape unit in the system unit or in an expansion unit? v Yes: Is the system available, and can you enter commands on the command line? No: Continue with the next step. Yes: Go to step 9 on page 224.v No: Continue with the next step. 4. Display the selected IPL type. Is the displayed IPL type D? v No: Do you want to perform an alternate IPL (type D)? No: Continue with the next step. Yes: Go to step 6.v Yes: Go to step 6. 5. Perform an IPL from disk by doing the following: a. Power off the system (see Powering on and powering off). b. Select IPL type A in manual mode. c. Power on the system. d. Go to step 8. 6. Place the first tape of the latest set of SAVSYS tapes or SAVSTG tapes, or the first Software Distribution tape in the alternate IPL tape drive. The tape drive automatically becomes ready for the IPL operation (this may take several minutes). 7. Perform an alternate IPL by doing the following: a. Power off the system. b. Select IPL type D in Manual mode. c. Power on the system. 8. The IPL may take one or more hours to complete. Chapter 2. Isolation Procedures 223
Does an unexpected reference code appear on the control panel, and is the System Attention light on? v No: Does the IPL complete successfully? Yes: Continue with the next step. No: Perform problem analysis to continue analyzing the problem. This ends the procedure.v Yes: Go to step 10. 9. Perform the following to test the tape unit: a. Enter VFYTAP (the Verify Tape command) on the command line. b. Follow the prompts on the Verify Tape displays, then return here and answer the following question.Does the VFYTAP command end successfully? No: Continue with the next step. Yes: This ends the procedure.10. Record the SRC. Is the SRC the same one that sent you to this procedure? Yes: You cannot continue to analyze the problem. Use the original SRC and exchange the FRUs. Begin with the FRU which has the highest percent of probable failure (see the failing item list for this reference code). This ends the procedure. No: A different SRC occurred. Use the new SRC to correct the problem. This ends the procedure.TUPIP04 Use this procedure to reset an IOP and its attached tape units. Read the (overview) before continuing with this procedure. If disk units are attached to an IOP, you must power off the system, then power it on to reset the IOP. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Is the tape unit powered on? v No: Continue with the next step. v Yes: Perform the following: a. Press the Unload switch on the front of the tape unit you are working on. b. If a data cartridge or a tape reel is present, do not load it until you need it. c. Continue with the next step of this procedure. 3. Verify the following: v If the external device has a power switch, ensure that it is set to the On position. v Ensure that the power and external signal cables are connected correctly. Note: For every 8mm and 1/4 inch tape unit, the I/O bus terminating plug for the SCSI external signal cable is connected internally. These devices do not need and should not have an external terminating plug. 4. Did you press the Unload switch in step 2? v Yes: Can you enter commands on the command line? Yes: Continue with the next step. No: Go to step 11 on page 226.v No: Press the Unload switch on the front of the tape unit you are working on. If a data cartridge or a tape reel is present, do not load it until you need it. Continue with the next step of this procedure.
224 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Has the tape unit operated correctly since it was installed? If you do not know, continue with the next step of this procedure. Yes: Continue with the next step. No: Go to step 11 on page 226. 6. If a system message displayed an I/O processor name, a tape unit resource name, or a device name, record the name for use in the next step. You may continue without a name. Does the I/O processor give support to only one tape unit? If you do not know, continue with the next step of this procedure. v No: Continue with the next step. v Yes: Perform the following. You must complete all parts of this step before you press Enter. a. Enter WRKCFGSTS *DEV *TAP ASTLVL(*INTERMED) (the Work with Configuration Status command) on the command line. b. If the device is not varied off, select Vary off before continuing. c. Select Vary on for the failing tape unit. d. Enter RESET(*YES) (the Reset command) on the command line. e. Press Enter. This ends the procedure. 7. This step determines if the I/O processor for the tape unit gives support to other tape units or to a disk unit. Notes: a. If you cannot determine the tape unit you are attempting to use, go to step 11 (See 11 on page 226). b. System messages refer to other tape units that the I/O processor gives support to as associated devices.Enter WRKHDWRSC *STG (the Work with Hardware Resources command) on the command line. Did you record an I/O processor (IOP) resource name in step 6? v No: Perform the following: a. Select Work with resources for each storage resource IOP (CMB01, SIO1, and SIO2 are examples of storage resource IOPs). b. Find the Configuration Description name of the tape unit you are attempting to use, and then record the Configuration Description names of all tape units that the I/O processor gives support to. c. Record whether the I/O processor for the tape unit also gives support to any disk unit resources. d. Continue with the next step.v Yes: Perform the following: a. Select Work with resources for that resource. b. Record the Configuration description name of all tape units for which the I/O processor provides support. c. Record whether the I/O processor for the tape unit also gives support to any disk unit resources. d. Continue with the next step. 8. Does the I/O processor give support to any disk unit resources? No: Continue with the next step. Chapter 2. Isolation Procedures 225
Yes: The Reset option is not available. Go to step 11. 9. Does the I/O processor give support to only one tape unit? v No: Continue with the next step. v Yes: Perform the following: a. Select Work with configuration description and press Enter. b. Select Work with status and press Enter. Note: You must complete the remaining parts of this step before you press Enter again. c. If the device is not varied off, select Vary off before continuing. d. Select Vary on for the failing tape unit. e. Enter RESET(*YES) (the Reset command) on the command line. f. Press Enter. This ends the procedure.10. Perform the following: a. Enter WRKCFGSTS *DEV *TAP ASTLVL(*INTERMED) (the Work with Configuration Status command) on the command line. b. Select Vary off for the failing tape unit and associated devices (the devices you identified in step 7 on page 225), and then press Enter. Note: You must complete the remaining parts of this step before you press Enter again. c. Select Vary on for the failing tape unit. d. Enter RESET(*YES) (the Reset command) on the command line. e. Press Enter. f. Select Vary on for the associated devices (tape units) you identified in step 7 on page 225. It is not necessary to use the Reset option again.Does a system message indicate that the vary on operation failed? Yes: Continue with the next step. No: This ends the procedure.11. The Reset is not available, or you were not able to find the Configuration Description name when using WRKHDWRSC *STG (the Work with Hardware Resources command). You can perform an I/O processor (IOP) reset by performing an IPL of the I/O processor. All devices that are attached to the IOP will reset. The following steps describe how to load an IOP, how to configure a tape drive, how to vary on tape devices, and how to make tape devices available. 12. Is the tape device you are working on an 8mm tape drive? v No: Continue with the next step. v Yes: Verify the following on the 8mm tape drive: – The power and external signal cables are connected correctly. – The Power switch is set to the On position (pushed in).
226 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: The SCSI I/O bus terminating plug for the system-external signal cables is connected internally in the 8mm tape drive. The 8mm tape drive does not need, and must not have an external terminating plug. Is the 8mm tape drive Power light on, and is the Disturbance light off? Yes: Continue with the next step. No: Go to “TUPIP02” on page 221 to correct the problem. This ends the procedure.13. Is a data cartridge or a tape reel installed in the tape device? No: Continue with the next step. Yes: Remove the data cartridge or tape reel. Continue with the next step.14. Can you enter commands on the command line? v Yes: Continue with the next step. v No: Perform the following: a. Power off the system (see Powering on and powering off). b. Power on the system. The system performs an IPL and resets all devices. If the tape device responds to SCSI address 7, the system configures the tape device. This ends the procedure.15. Verify that automatic configuration is on by entering DSPSYSVAL QAUTOCFG (the Display System Value command) on the command line. Is the Autoconfigure device option set to 1? v Yes: Continue with the next step. v No: Perform the following: a. Press Enter to return to the command line. b. Set automatic configuration to On by entering CHGSYSVAL QAUTOCFG ’1’ (the Change System Value command) on the command line.Note: QAUTOCFG resets to its initial value in step 21 on page 228. c. Continue with the next step.16. Perform the following: a. Enter STRSST (the Start SST command) on the command line. b. On the Start Service Tools Sign On display, type in a User ID with QSRV authority and Password. c. Select Start a Service Tool ? Hardware Service Manager ? Logical Hardware Resources ? System Bus Resources. The Logical Hardware Resources on System Bus display shows all of the IOPs. d. Find the IOP you want to reset. You must ensure that no one is using any of the tape units, communication channels, or display stations that are attached to the IOP you want to reset.Does a ?*? indicator appear to the right of the IOP description? v No: Continue with the next step. v Yes: Disk units are attached to the IOP. Perform the following: a. Press F3 until the Exit System Service Tools display appears. b. Press Enter. Chapter 2. Isolation Procedures 227
c. Power off the system (see Powering on and powering off). d. Power on the system. The system performs an IPL and resets all devices. This ends the procedure.17. Perform the following: a. Select I/O debug ? IPL the I/O processor. b. When the IOP reset is complete, continue with the next step of this procedure.18. Perform the following: a. Press F12 to return to the Logical Hardware Resources on System Bus display. b. Select Resources associated with IOP for the IOP you reset.Did the IOP detect the tape unit? v Yes: Continue with the next step. v No: The IOP did not detect the tape unit. Consider the following: – Ensure that the tape unit is powered on and that the signal cables are connected correctly. If you find and correct a power or a signal cable problem, return to step 16 on page 227. – The tape unit may be failing. Go to the tape unit service information and perform the procedures for analyzing device problems. If you find and correct a tape unit problem, return to step 16 on page 227. – If none of the above are true, ask your next level of support for assistance. This ends the procedure.19. Press F3 until the Exit System Service Tools display appears. Then press Enter. 20. Was automatic configuration Off before you performed step 15 on page 227? Yes: Continue with the next step. No: This ends the procedure.21. Enter CHGSYSVAL QAUTOCFG ’0’ (the Change System Value command) on the command line to reset QAUTOCFG to its initial value. This ends the procedure.TUPIP06 Use this procedure to isolate a Device Not Found message during installation from an alternate device. There are several possible causes: v The alternate installation device was not correctly defined. v The alternate installation device was not made ready. v The alternate installation device does not contain installation media. v The alternate installation device is not powered on. v The alternate installation device is not connected properly. v There is a hardware error on the alternate installation device or the attached I/O processor.Read the danger notices in “Tape unit isolation procedures” on page 219 before continuing with this procedure. 1. Is the device that you are using for alternate installation defined as the alternate installation device? v Yes: Is the alternate installation device ready? Yes: Continue with the next step. No: Make the alternate installation device ready and retry the alternate installation. This ends the procedure.
228 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v No: Correct the alternate installation device information and retry the alternate installation. This ends the procedure.2. Is there installation media in the alternate installation device? v Yes: Is the alternate installation device an external device? Yes: Continue with the next step. No: Go to step 5.v No: Load the correct media and retry the alternate installation. This ends the procedure.3. Is the alternate installation device powered on? v Yes: Make sure that the alternate installation device is properly connected to the I/O processor or I/O adapter card. Is the alternate installation device properly connected? Yes: Go to step 5. No: Correct the problem and retry the alternate installation. This ends the procedure.v No: Continue with the next step.4. Ensure that the power cable is connected tightly to the power cable connector at the back of the alternate device. Ensure that the power cable is connected to a power outlet that has the correct voltage. Set the alternate device Power switch to the Power On position. The Power light should go on and remain on. If a power problem is present, one of the following power failure conditions may occur: v The Power light flashes, then remains off. v The Power light does not go on. v Another indication of a power problem occurs. Does one of the above power failure conditions occur? v No: The alternate device is powered on and runs its power-on self-test. Wait for the power-on self-test to complete. Does the power-on self-test complete successfully? No: Go to the service information for the specific alternate installation device to correct the problem. Then retry the alternate installation. This ends the procedure. Yes: Retry the alternate installation. This ends the procedure.v Yes: Perform the following: a. Go to the service information for the specific alternate device to correct the power problem. b. When you have corrected the power problem, retry the alternate installation. This ends the procedure.5. Was a device error recorded in the Product Activity Log? No: Contact your next level of support. This ends the procedure. Yes: See Reference codes to correct the problem. This ends the procedure.Tape unit self-test procedure This procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit. The following procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit, without impacting your system operation. This test can also be used to verify good performance of individual tape cartridges. Enter diagnostic mode: 1. Verify that a cartridge is not loaded in the tape unit. To unload a cartridge, press the eject button on the front of the tape unit. If the cartridge does not eject, refer to (Tape unit - manual removal). Chapter 2. Isolation Procedures 229
2. Press and hold the eject button for about 6 seconds until the amber LED starts flashing slowly, then release the button. The amber (left) LED will flash, indicating that the tape unit is waiting for a cartridge to be inserted.Running the self-test 1. Self-testing begins when a scratch data cartridge is inserted into the tape unit. The Ready (left) LED will flash, indicating that self-testing is in progress.Note: A cartridge must be loaded within 15 seconds, otherwise, the tape unit will automatically revert back to normal operation. If necessary, return to step 1 on page 229 to reenter diagnostic mode. 2. For fastest results, we recommend using an SLR100 Test Tape (P/N 35L0967) which was originally provided with your iSeries™ server. Attention: Use a blank cartridge that does not contain customer data. During this self-test, the cartridge will be rewritten with a test pattern and any customer data will be destroyed.Note: Use a cartridge that is not write-protected. If a write-protected cartridge is inserted while the tape unit is in diagnostic mode, the cartridge will be ejected, see Incorrect cartridge below. Self-testing will only be performed using a write-compatible cartridge type, and with a cartridge that is not damaged, see Incorrect cartridge below. If a cleaning cartridge is inserted while the tape unit is in diagnostic mode, drive cleaning will occur and the tape unit will then return to normal operating mode. Return to step 1 on page 229 to reenter diagnostic mode. 3. At any time, self-testing can be stopped by pressing the eject button. After the current operation is completed, the cartridge will be ejected and tape unit will return to normal operating mode. 4. The Ready (left) LED will continue to flash during the following: v The cartridge load sequence has a approximate duration of 30 seconds. The center LED indicates tape movement. v The hardware test has an approximate duration of 2 and 1/2 minutes. During that time, a static test is performed on tape unit electrical components. No tape motion occurs during this step. v The cartridge load/unload test has an approximate duration of 1 and 1/2 minutes. During that time, the Ready LED will continue to flash while a dynamic test is performed on tape unit mechanical components. Two cartridge load cycles are included. v Duration of the write/read test will vary, depending on what type of cartridge is loaded into the tape unit. When an SLR100 Test Tape is used, typical duration will be 5 minutes. Use of other cartridge types can increase the write/read test duration to 30-40 minutes. During this test, the Ready LED will continue to flash. The center LED indicates tape movement.Interpreting the results Test Passed: When self-testing has completed successfully, and no problems are detected, the cartridge is unloaded from the tape unit and all LEDs are off. Proper function of both the tape unit and tape cartridge have now been verified. Note: A solid amber light indicates that self-testing has completed successfully, but the tape unit requires cleaning. Clean the tape unit by inserting an Dry Process Cleaning Cartridge (P/N 35L0844). Test Failed: The cartridge will remain loaded inside the tape unit, and the amber LED will flash when a problem is detected with either the tape unit or cartridge. Note: To isolate failure to either tape unit or cartridge, return to step 1 on page 229 and repeat this self-test using a different scratch cartridge.
230 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Incorrect cartridge: When the center (green) and right (amber) LEDs flash and a cartridge is unloaded, the tape unit has determined that an incorrect tape cartridge has been inserted, and self-testing cannot be performed. Verify that your tape cartridge is not one of the following: v Write-protected v Damaged v Unsupported media type v Media which is not write-compatible with tape unit.Press the eject button, to end self-test and return the tape unit to normal operating mode. Then return to step 1 on page 229 and run the self-test using another cartridge, or one which is not write-protected. This ends the procedure. Tape device ready conditions All the conditions that are listed for the device, must be correct for the device to be ready. If the device is not ready, use the Action column or other instructions, and go to the service information for the specific tape device. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593).
Table 41. Tape device ready conditions
Storage device
Ready description
Action
3480 or 3490
v Power switch is set to the On position. v Power light is on. v DC Power light is on. v Control unit On-line switch is set to the On-line position. v Control unit Normal/Test switch is set to the Normal position. v Control unit channel Enable/Disable switch is set to the Enable position. v Tape unit On-line/Off-line switch is set to the On-line position. v Tape is loaded. v Tape unit displays Ready U or Ready F.
See the 3480 Magnetic Tape Subsystem Operator’s Guide, SA32-0066, or 3490 Magnetic Tape Subsystem Operator’s Guide, SA32-0124, for instructions on making the tape unit ready. Chapter 2. Isolation Procedures 231
Table 41. Tape device ready conditions (continued)
Storage device
Ready description
Action
7208
v Power switch is on (pressed). v Power light is on. v Data cartridge is inserted. v Ready light is on. v System external signal cable is connected to the type 2621 I/O processor and to the 7208 tape drive.Note: The SCSI I/O bus terminating plug for the system external signal cable is connected internally in the 7208 tape drive. The 7208 tape drive does not need, and must not have, an external terminating plug.
See the 7208 8 mm Tape Drive Operator’s Manual for instructions on making the tape drive ready. If you cannot make the 7208 Model 012 tape drive ready, go to “TUPIP02” on page 221.
9348
v Power switch is set to the On position. v Power light is on. v Tape is loaded. v Status display shows 00 A002. v On-line light is on.
See the 9348 Customer Information manual, SA21-9567, for instructions on making the tape unit ready. If you cannot make the tape unit ready, go to the ?Analyzing Problems? section of 9348 Tape Unit Service Information, SY31-0697. 232 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Twinaxial workstation I/O processor isolation procedure Use the procedure below to isolate a failure which has been detected by the twinaxial workstation I/O processor. If you are using a personal computer, an emulation program must be installed and working. Please read and observe all safety procedures before servicing the system and while performing the procedure below. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: When instructed, remove and connect cables carefully. You may damage the connectors if you use too much force. “TWSIP01” on page 234The workstation IOP detected an error.
Chapter 2. Isolation Procedures 233
TWSIP01 The workstation IOP detected an error. Please read the danger notices in “Twinaxial workstation I/O processor isolation procedure” on page 233 before performing this procedure. One of the following occurred: v All of the workstations on one port are not working. v All of the workstations on the system are not working. v One of the workstations on the system is not working. v The reference code table instructed you to perform this procedure. v The Remote Operations Console is not working. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Are you using a workstation adapter console? Note: A personal computer (used as a console) that is attached to the system by using a console cable feature is known as a workstation adapter console. The cable (part number 46G0450, 46G0479, or 44H7504) connects the system port on the personal computer to a communications I/O adapter on the system. No: Continue with the next step. Yes: Go to “WSAIP01” on page 241. This ends the procedure. 3. Is the device you are attempting to repair a personal computer (PC)? No: Continue with the next step. Yes: PC emulation programs operate and report system-to-PC communications problems differently. See the PC emulation information for details on error identification. Then, continue with the next step. 4. Perform the following steps: a. Verify that all the devices you are attempting to repair, the primary console, and any alternative consoles are powered on. b. Verify that the all the devices you are attempting to repair, the primary console, and any alternative consoles have an available status. For more information on displaying the device status, see “Hardware Service Manager” on page 644. c. Verify that the workstation addresses of all workstations on the failing port are correct. Each workstation on the port must have a separate address, from 0 through 6. See the workstation service information for details on how to check addresses. d. Verify that the last workstation on the failing port is terminated. All other workstations on that port must not be terminated. e. Ensure that the cables that are attached to the device or devices are tight and are not visibly damaged. f. If there were any cable changes, check them carefully. g. If all of the workstations on the system are not working, disconnect them by terminating at the console. h. Verify the device operation (see the device information for instructions). i. The cursor position can assist in problem analysis. v If the cursor is in the upper right corner, it indicates a communication problem between the workstation IOP and the device. Continue with the next step. 234 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If the cursor is in the upper left corner, it indicates a communication problem between the workstation IOP and the operating system. Perform the following steps: 1) Verify that all current PTFs are loaded. 2) Ask your next level of support for assistance. This ends the procedure. 5. Is the system powered off? Yes: Continue with the next step. No: Go to step 8. 6. Perform the following: a. Power on the system in Manual mode. See “IPL type, mode, and speed options” on page 642 for details. b. Wait for a display to appear on the console or a reference code to appear on the control panel.Does a display appear on the console? v No: Continue with the next step. v Yes: If you disconnected any devices after the console in step 4 on page 234, perform the following: a. Power off the system. b. Reconnect one device. Note: Ensure that you terminate the device you just reconnected and remove the termination from the device previously terminated. c. Power on the system. d. If a reference code appears on the control panel, go to step 9. e. If no reference code appears, repeat steps a through d of this step until you have checked all devices disconnected previously. f. Continue to perform the initial program load (IPL). This ends the procedure. 7. Does the same reference code that sent you to this procedure appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis for this new problem. This ends the procedure. 8. Perform the following to make DST available: a. Ensure that Manual mode on the control panel is selected. b. Select function 21 Make DST Available. c. Check the console and any alternative consoles for a display.Does a display appear on any of the console displays? v No: Continue with the next step. v Yes: If you disconnected any devices after the console in step 4 on page 234, perform the following: a. Power off the system. b. Reconnect one device. Note: Ensure that you terminate the device you just reconnected and remove the termination from the previously terminated device. c. Power on the system. d. If a reference code appears on the control panel or on the HMC, go to step 9. e. If no reference code, repeat steps a through d of this step until you have checked all devices disconnected previously. f. Continue to perform the initial program load (IPL). This ends the procedure. 9. Ensure that the following conditions are met: Chapter 2. Isolation Procedures 235
v The workstation addresses of all workstations on the failing port must be correct. Each workstation on the port must have a separate address, from 0 through 6. See the workstation service information if you need help with checking addresses. Did you find a problem with any of the above conditions? Yes: Continue with the next step. No: Go to step 11.10. Perform the following: a. Correct the problem. b. Select function 21 Make DST Available. c. Check the console and any alternative consoles for a display.Does a display appear on any of the consoles? v Yes: Continue to perform the IPL. This ends the procedure. v No: Does the same reference code appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis for this new problem. This ends the procedure.11. Is the reference code one of the following: 0001, 0003, 0004, 0005, 0006, 0101, 0103, 0104, 0105, 0106, 5004, 5082, B000, D010, or D023? No: Continue with the next step. Yes: Go to step 15 on page 237.12. Does the system have an alternative console on a second workstation IOP? Yes: Continue with the next step. No: Go to step 14.13. There is either a Licensed Internal Code problem, or there are two device failures on the workstation IOPs, consoles, or cables. The console and any alternative consoles are the most probable causes for this failure. v See the service information for the failing display to attempt to correct the problem. If a display is connected to the system by a link protocol converter, use the link protocol converter information to attempt to correct the problem. The link protocol converter may be the failing item. v If you have another working display, you can exchange the console and alternative consoles and perform an IPL to attempt to correct the problem. v Exchange the following parts one at a time until you determine the failing item: a. Console b. Alternative console c. Cables d. Workstation IOA for the console e. The multi-adapter bridge. This ends the procedure.14. The console, cables, or the workstation IOP card is the most probable causes for this failure. If the console is connected to the system by a link protocol converter, the link protocol converter is possibly the failing item. Use one or more of the following options to correct the problem: a. See the service information for the failing displays for more information. If a display is connected to the system by a link protocol converter, see the link protocol converter information to attempt to correct the problem. b. If you have another working display, you can exchange the console and perform an IPL to attempt to correct the problem. c. Exchange the following parts one at a time until you determine the failing item: 1) Console 2) Workstation IOA 236 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3) The multi-adapter bridge. 4) Twinaxial attachment (cable) This ends the procedure.15. To continue problem analysis, use a port tester, part 93X2040 or 59X4262, which you may have with your tools or the customer may have one. The port tester has either two or three lights. Is a port tester available? v Yes: Continue with the next step. v No: Check or exchange the cables from the system to the failing display. Did this correct the problem? Yes: You corrected the problem. This ends the procedure. No: Go to step 12 on page 236.16. To use the port tester to isolate the problem, perform the following: v Verify that the port tester is operating correctly by doing a self-test. A self-test can be made at any time, even when the port tester is attached to a port or cable. Perform the following steps to do a self-test: a. Move the selector switch to the center (0) position. b. Push and hold the test button until all lights go on. The yellow lights should go on immediately, and the green light should go on approximately 5 seconds later. The port tester is ready for use if all lights go on.v Leave the system power on.17. Find the input cable to the failing console or port. Is the failing console or the failing port attached to a protocol converter? v No: Perform the following: a. Disconnect the input cable from the failing console. b. Connect the port tester to the input cable. c. Continue with the next step.v Yes: Perform the following: a. Disconnect the cable that comes from the system at the protocol converter. b. Connect the port tester to the cable. c. Continue with the next step.18. Perform the following: a. Set the selector switch on the port tester to the left (1) position for a twinaxial connection. Set the switch to the right (2) position for a twisted pair connection. b. Press and hold the test switch on the port tester for 15 seconds and observe the lights. c. Choose from the following options: v If the port tester has three lights, do the following: – If only the top (green) light is on, go to step 27 on page 239. – If both the top (green) and center (yellow) lights are on, go to step 20 on page 238. Note: The center (yellow) light is always on for twisted pair cable and may be on for fiber optical cable. – If only the bottom (yellow) light is on, go to step 21 on page 238. – If all lights are off, go to step 22 on page 238. – If all lights are on, go to step 19 on page 238.v If the port tester has two lights, do the following: – If only the top (green) light is on, go to step 27 on page 239. – If only the bottom (yellow) light is on, go to step 21 on page 238. – If both lights are off, go to step 22 on page 238. Chapter 2. Isolation Procedures 237
– If both lights are on, continue with the next step.19. The tester is in the self-test mode. Check the position of the selector switch. v If the selector switch is not in the correct position, go to step 18 on page 237. v If the selector switch is already in the correct position, the port tester is not working correctly. Exchange the port tester, and go to step 16 on page 237.20. The cable you are testing has an open shield. Note: The open shield can be checked only on the cable from the twinaxial workstation attachment to the device or from device to device. Only one section of cable can be checked at a time. See the SA41-3136, Port Tester Use information.This ends the procedure. 21. The cable network is bad. The wires in the cable between the console and the twinaxial workstation attachment are reversed. Go to step 26. 22. Perform the following: a. Find the twinaxial workstation attachment to which the failing console is attached. b. Disconnect the cable from port 0 on that twinaxial workstation attachment. c. Connect the port tester to port 0 on the attachment. d. Set the selector switch on the port tester to the left (1) position.23. Perform the following: a. Press and hold the test switch on the port tester for 15 seconds and observe the lights. b. If the port tester has three lights, do the following: v If both the top (green) and center (yellow) lights are on, continue with step 24. Note: The center (yellow) light is always on for twisted pair cable and may be on for fiber optical cable. v If only the bottom (yellow) light is on, continue with step 24. v If all lights are off, continue with step 24. v If only the top (green) light is on, go to step 26. v If all lights are on, go to step 25.c. If the port tester has two lights, do the following:v If only the top (green) light is on, go to step 26. v If only the bottom (yellow) light is on, continue with step 24. v If both lights are off, continue with step 24. v If both lights are on, go to step 25.24. The test indicated that there was no signal from the system. Reconnect the cable you disconnected and perform the following: a. Exchange the following parts: 1) Twinaxial workstation IOA card 2) The multi-adapter bridge.b. Power on the system to perform an IPL. This ends the procedure.25. The tester is in the self-test mode. Check the position of the selector switch: v If the selector switch is not in the left (1) position, set the switch to the left (1) position. Then go to step 23. v If the selector switch is already in the left (1) position, the port tester is not working correctly. Exchange the port tester and go to step 22.26. The cable to the workstation is the failing item. Cable maintenance is a customer responsibility. The cable must be repaired or exchanged. Then, power on the system to perform an IPL. This ends the procedure. 238 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
27. The port tester detects most problems, but it does not always detect an intermittent problem or some cable impedance problems. The tester may indicate a good condition, although there is a problem with the workstation IOP card or cables. a. If the failing display is connected to a link protocol converter, the link protocol converter is the failing item. See the link protocol converter service information to correct the problem. b. Exchange the following parts: 1) Console 2) Twinaxial workstation IOA 3) The multi-adapter bridge. 4) Cablesc. If you have another working display, you can exchange the console and perform an IPL to attempt to correct the problem. See the service information for the failing display for more information. d. If exchanging the failing items did not correct the problem and the reference code was a 5002, 5082, or 50FF, there may be a Licensed Internal Code problem. Go to “LICIP03” on page 92. e. The problem may be caused by devices that are attached after the console on port 0. This ends the procedure. Chapter 2. Isolation Procedures 239
Workstation adapter isolation procedure Contains the procedure necessary to isolate a failure that is detected by the workstation adapter, and is used when no display is available with which to perform on-line problem analysis. The workstation adapter detected a problem while communicating with the workstation that is used as the primary console. Note: If you are using a PC, you must install an emulation program. Please read and observe all safety procedures before servicing the system and while performing the procedure below. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “WSAIP01” on page 241Use this procedure if your console has a keyboard error with a ?K? on the display.
240 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
WSAIP01 Use this procedure if your console has a keyboard error with a ?K? on the display. Note: If the console has a keyboard error, there may be a ?K? on the display. See the workstation service information for more information. Perform the following procedure from the logical partition that reported the problem: 1. Select the icon on the workstation to make it the console (you may have already done this). You must save the console selection. 2. Access Dedicated Service Tools (DST) by performing the following: a. Select Manual mode on the control panel. b. Use the selection switch on the control panel to display function 21, Make DST Available, and press Enter on the control panel. c. Wait for a display to appear on the console or for a reference code to appear on the control panel.Does a display appear on the console? No: Continue with the next step. Yes: The problem is corrected. This ends the procedure.3. Isolate the problem to one server and one workstation (console) by performing the following: a. Disconnect the power cable from the workstation. b. Eliminate all workstations, cables, and connector boxes from the network except for one server, one console, two connector boxes, and one cable. c. Ensure that the cables that are connected to the console, the keyboard, and the server are connected correctly and are not damaged.4. Perform the following: a. Ensure that the server console is terminated correctly. b. Set the Power switch on the console to the On position. c. Select the SNA*PS icon on the console. d. See the workstation information for more information.5. Access DST by performing the following: a. Select Manual mode on the control panel. b. Use the selection switch on the control panel to display function 21, Make DST Available, and press Enter on the control panel. c. Wait for a display to appear on the console or for a reference code to appear on the control panel.Does a display appear on the console? No: Continue with the next step. Yes: The problem is in a cable, connector box, or device you disconnected in step 3. This ends the procedure.6. Does the reference code A600 5005 appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis using this reference code. This ends the procedure.7. Do you have another workstation, cable, and two connector boxes you can exchange with the workstation connected to the server? v Yes: Continue with the next step. v No: One of the following is causing the problem: Note: The items at the top of the list have a higher probability of fixing the problem than the items at the bottom of the list. – Workstation adapter Licensed Internal Code Chapter 2. Isolation Procedures 241
– Workstation adapter configuration – Workstation – Cable – Connector box – Workstation IOA – Workstation IOP If you still have not corrected the problem, ask your next level of support for assistance. This ends the procedure.8. Repeat steps 3 on page 241 through 7 on page 241 of this procedure, using a different workstation, cable, and connector boxes. Do you still have a problem? Yes: Continue with the next step. No: The problem is in the cable, connector boxes, or workstation you disconnected. This ends the procedure.9. One of the following is causing the problem: Note: The items at the top of the list have a higher probability of fixing the problem than the items at the bottom of the list. v Workstation adapter Licensed Internal Code v Workstation adapter configuration v Workstation IOA v Communications IOP To bring up a workstation other than the console, perform the following: a. Connect another workstation into this network. b. Select Normal mode on the control panel. c. Perform an IPL. If the sign-on display appears, the following parts are good: v Communications IOP v Workstation IOANote: If a printer connected to this assembly is not working correctly, it may look like the display is bad. Perform a self-test on the printer to ensure that it prints correctly (see the printer service information). If you still have not corrected the problem, ask your next level of support for assistance. This ends the procedure. 242 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Workstation adapter console isolation procedure Contains the procedure necessary to isolate a failure that is detected by the workstation adapter console. Use this procedure when no display is available with which to perform online problem analysis. Note: If you are using a PC, you must install an emulation program. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion tower where the FRU is located, see Powering on and powering off before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Read and understand the following service procedures before using this section: v Powering on and powering off v “Determining a primary or alternative console” on page 591 v Chapter 3, “Locating FRUs,” on page 335
Chapter 2. Isolation Procedures 243
Note: If the console has a keyboard error, there may be a K on the display. See the workstation service information for more information. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Ensure that your workstation meets the following conditions: v The workstation that you are using for the console is powered on. v The emulation program is installed and is working. v The input/output adapter (IOA) is installed and the workstation console cable is attached. Notes: a. Card information: Hardware that is associated with 6A59 feature is the type 2745 card. b. Cable information: The cable attaches directly to the IOA.Did you find a problem with any of the conditions listed above? No: Continue with the next step. Yes: Correct the problem. Then, perform an IPL of the system. This ends the procedure.3. Perform the following to make dedicated service tool (DST) available: a. If there is an alternative console, ensure that it is powered on. b. Ensure that Manual mode on the control panel is selected. c. Select function 21, Make DST Available on the control panel, and press Enter.Does a display appear on either the console or any alternative console? No: Continue with the next step. Yes: Complete the IPL. When the operating system display appears, use the Work with Problem command (WRKPRB) or Analyze Problem command (ANZPRB) to analyze and correct or report any console problems. This ends the procedure.4. Do you have SRC A600 5001, A600 5004, A600 5007, or B075 xxxx (where xxxx is any value)? v No: Continue with the next step. v Yes: Perform the following: a. Disconnect any cables that are attached to the IOA. b. Install the wrap plug on the IOA. The 2745 wrap plug label is QQ. c. Perform an IPL in Manual mode.5. Does SRC 6A59 5007 occur? v No: Continue with the next step. v Yes: One of the following is causing the problem: – Workstation emulation program – Workstation – Workstation console cableThis ends the procedure.6. Did SRC A600 5001, A600 5004, or 6A59 5008 occur? No: This is a new problem. Use the new reference code to perform problem analysis for the problem, or ask your next level of support for assistance. This ends the procedure. Yes: The Type 2745 workstation adapter is the failing item. This ends the procedure. 244 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Isolating problems on servers that run the AIX or Linux operating systems Use the procedures for AIX or Linux servers if there is not an HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “MAP 0210: General problem resolution” on page 246Use this procedure to exchange the FRUs in the order of their failure probability. “Problems with loading and starting the operating system (AIX and Linux)” on page 246If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot AIX or Linux while other partitions boot successfully and run the operating system successfully. “Storage I/O card modes and jumpers” on page 250 “SCSI service hints” on page 252Use one or more of the following procedures when servicing SCSI adapter or devices. “MAP 0020: Problem determination procedure” on page 259Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. “MAP 0030: Additional Problem Determination” on page 266This MAP is used for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. “MAP 0040: Intermittent Problem Isolation” on page 267This MAP provides a structured way of analyzing intermittent problems. “MAP 0050: SCSI bus problems” on page 269Use this MAP to analyze problems with a SCSI bus. “MAP 0054: PCI-X SCSI Bus Problems” on page 276Use this MAP to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. “MAP 0070: 888 sequence in operator panel display” on page 278Use this MAP when you receive an 888 sequence on the operator panel display or monitor. “MAP 0220: Hot-Swap FRU problem resolution” on page 279Use this procedure to exchange hot-swappable field replaceable units (FRUs). “MAP 0230: Platform error problem resolution” on page 284Use this MAP to resolve problems reported by SRNs A00-xxx to A25-xxxx. “MAP 0260: System hangs during resource configuration” on page 288Use this MAP when the system unit hangs while configuring a resource. “MAP 0270: SCSI RAID problem resolution and verification” on page 291Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. “MAP 0280: Console and Keyboard Problem Resolution” on page 295Use this MAP to resolve console and keyboard problems when the system is booting. “MAP 0285: Multipath I/O (MPIO) Problem Resolution” on page 296Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit Failing Function Code (FFC) of an SCSD drive) to check the path from adapter to device. “MAP 0291: Missing Device or Bus Problem Resolution” on page 299Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. “PFW1540: Problem isolation procedures” on page 301The PFW1540 procedures are used to locate problems in the processor subsystem or I/O subsystem(s). “PFW1542: I/O problem isolation procedure” on page 302This I/O problem-determination procedure isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards that are failing will have been replaced or reseated. Chapter 2. Isolation Procedures 245
“PFW1548: Memory and processor subsystem problem isolation procedure” on page 316Use this problem isolation procedure to aid in solving memory and processor problems that are not found by normal diagnostics.MAP 0210: General problem resolution Use this procedure to exchange the FRUs in the order of their failure probability. v Step 0210-1 Did the problem report provide a part number for the FRU you are about to replace? NO Go to Step 0210-2. YES Go to Step 0210-3.v Step 0210-2 Find the failing function codes in the Chapter 8, “Part Information,” on page 691, and record the FRU part number and description of each FRU. v Step 0210-3 Obtain the replacement part. v Step 0210-4 Go to Chapter 3, “Locating FRUs,” on page 335, and follow the procedures for the FRU you are servicing.Problems with loading and starting the operating system (AIX and Linux) If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot AIX or Linux while other partitions boot successfully and run the operating system successfully. It is the customer’s responsibility to move devices between partitions. If a device must be moved to another partition to run standalone diagnostics, contact the customer or system administrator. If the optical drive must be moved to another partition, all SCSI devices connected to that SCSI adapter must be moved because moves are done at the slot level, not at the device level. Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device. This is particularly true for tape and network boot attempts. If booting from an optical drive or tape drive, watch for activity on the drive’s LED indicator. A blinking LED indicates that the loading of either the boot image or additional information required by the operating system being booted is still in progress. If the checkpoint is displayed for an extended period of time and the drive LED is not indicating any activity, there might be a problem loading the boot image from the device. Notes: 1. For network boot attempts, if the system is not connected to an active network or if the target server is inaccessible (which can also result from incorrect IP parameters being supplied), the system will still attempt to boot. Because time-out durations are necessarily long to accommodate retries, the system may appear to be hung. Refer to checkpoint CA00 E174. 2. If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated, then reactivated before attempting to reboot. 3. If a BA06 000x error code is reported, the partition is already deactivated and in the error state. Reboot by activating the partition. If the reboot is still not successful, go to step 3 on page 247.
246 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
This procedure assumes that a diagnostic CD-ROM and an optical drive from which it can be booted are available, or that diagnostics can be run from a NIM (Network Installation Management) server. Booting the diagnostic image from an optical drive or a NIM server is referred to as running standalone diagnostics. 1. Is a Hardware Management Console attached to the managed system? Yes: Continue with the next step. No: Go to step 3. 2. Look at the service action event error log in Manage Serviceable Events on the HMC. Perform the actions necessary to resolve any open entries that affect devices in the boot path of the partition or that indicate problems with I/O cabling. Then try to reboot the partition. Does the partition reboot successfully? Yes: This ends the procedure. No: Continue with the next step. 3. Boot to the SMS main menu: v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition and select Boot to SMS, then activate the partition. v If you are rebooting from platform standby, access the ASMI (see “Accessing the ASMI using a Web browser” on page 646. Select Power/Restart Control, then Power On/Off System. In the AIX/Linux partition mode boot box, select Boot to SMS menu ? Save Settings and Power On. At the SMS main menu, select Select Boot Options and verify whether the intended boot device is correctly specified in the boot list. Is the intended load device correctly specified in the boot list? v Yes: Perform the following: a. Remove all removable media from devices in the boot list from which you do not want to load the operating system. b. If you are attempting to load the operating system from a network, go to step 4. c. If you are attempting to load the operating system from a disk drive or an optical drive, go to step 7 on page 248. d. No: Go to step 5. 4. If you are attempting to load the operating system from the network, perform the following: v Verify that the IP parameters are correct. v Use the SMS ping utility to attempt to ping the target server. If the ping is not successful, have the network administrator verify the server configuration for this client. v Check with the network administrator to ensure that the network is up. Also ask the network administrator to verify the settings on the server from which you are trying to load the operating system. v Check the network cabling to the adapter. Restart the partition and try loading the operating system. Does the operating system load successfully? Yes: This ends the procedure. No: Go to step 7 on page 248. 5. Use the SMS menus to add the intended boot device to the boot sequence. Can you add the device to the boot sequence? Yes: Restart the partition. This ends the procedure. No: Continue with the next step. 6. Ask the customer or system administrator to verify that the device you are trying to load from is assigned to the correct partition. Then select List All Devices and record the list of bootable devices that displays. Is the device from which you want to load the operating system in the list? Yes: Go to step 7 on page 248. No: Go to step 10 on page 248.
Chapter 2. Isolation Procedures 247
7. Try to load and run standalone diagnostics against the devices in the partition, particularly against the boot device from which you want to load the operating system. You can run standalone diagnostics from an optical drive or a NIM server. To boot standalone diagnostics, follow the detailed procedures in “Running the online and eServer stand-alone diagnostics” on page 672. Note: When attempting to load diagnostics on a partition from partition standby, the device from which you are loading standalone diagnostics must be made available to the partition that is not able to load the operating system, if it is not already in that partition. Contact the customer or system administrator if a device must be moved between partitions in order to load standalone diagnostics. Did standalone diagnostics load and start successfully? Yes: Go to step 8. No: Go to step 14 on page 249. 8. Was the intended boot device present in the output of the option Display Configuration and Resource List, which is run from the Task Selection menu? v Yes: Continue with the next step. v No: Go to step 10. 9. Did running diagnostics against the intended boot device result in a No Trouble Found message? Yes: Go to step 12 on page 249. No: Go to the list of service request numbers and perform the repair actions for the SRN reported by the diagnostics. When you have completed the repair actions, go to step 13 on page 249.10. Perform the following actions: a. Perform the first item in the action list below. In the list of actions below, choose SCSI or IDE based on the type of device from which you are trying to boot the operating system. b. Restart the system or partition. c. Stop at the SMS menus and select Select Boot Options. d. Is the device that was not appearing previously in the boot list now present? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: Perform the next item in the action list and then return to step 10b. If there are no more items in the action list, go to step 11 on page 249.Action list: Note: See Chapter 3, “Locating FRUs,” on page 335 for part numbers and links to exchange procedures. a. Verify that the SCSI or IDE cables are properly connected. Also verify that the device configuration and address jumpers are set correctly. b. Do one of the following: v SCSI boot device: If you are attempting to boot from a SCSI device, remove all hot-swap disk drives (except the intended boot device, if the boot device is a hot-swap drive).If the boot device is present in the boot list after you boot the system to the SMS menus, add the hot-swap disk drives back in one at a time, until you isolate the failing device. v IDE boot device: If you are attempting to boot from an IDE device, disconnect all other internal SCSI or IDE devices. If the boot device is present in the boot list after you boot the system to the SMS menus, reconnect the internal SCSI or IDE devices one at a time, until you isolate the failing device or cable.c. Replace the SCSI or IDE cables. d. Replace the SCSI backplane (or IDE backplane, if present) to which the boot device is connected. e. Replace the intended boot device. f. Replace the system backplane.
248 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
11. Choose from the following: v If the intended boot device is not listed, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure. v If an SRN is reported by the diagnostics, go to the list of service request numbers and follow the action listed. This ends the procedure.12. Have you disconnected any other devices? Yes: Reinstall each device that you disconnected, one at a time. After you reinstall each device, reboot the system. Continue this procedure until you isolate the failing device. Replace the failing device, then go to step 13. No: Perform an operating system-specific recovery process or reinstall the operating system. This ends the procedure.13. Is the problem corrected? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: If replacing the indicated FRUs did not correct the problem, or if the previous steps did not address your situation, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure.14. Is a SCSI boot failure (where you cannot boot from a SCSI-attached device) also occurring? v Yes: Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure. v No: Continue to the next step.15. Perform the following actions to determine if another adapter is causing the problem: a. Remove all adapters except the one to which the optical drive is attached and the one used for the console. b. Reload the standalone diagnostics. Can you successfully reload the standalone diagnostics? v Yes: Perform the following: 1) Reinstall the adapters that you removed (and attach devices as applicable) one at a time. After you reinstall each adapter, retry the boot operation until the problem reoccurs. 2) Replace the adapter or device that caused the problem. 3) Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.v No: Continue with the next step.16. The graphics adapter (if installed), optical drive, IDE or SCSI cable, or system board is most likely defective. Does your system have a PCI graphics adapter installed? Yes: Continue with the next step. No: Go to step 1817. Perform the following to determine if the graphics adapter is causing the problem: a. Remove the graphics adapter. b. Attach a TTY terminal to the system port. c. Try to reload standalone diagnostics. Do the standalone diagnostics load successfully? Yes: Replace the graphics adapter. This ends the procedure. No: Continue with the next step.18. Replace the following (if not already replaced), one at a time, until the problem is resolved: a. Optical drive b. IDE or SCSI cable that goes to the optical drive c. System board that contains the integrated SCSI or IDE adapters.If this resolves the problem, go to Chapter 5, “Verifying a repair,” on page 557. If the problem still persists or if the previous descriptions did not address your particular situation, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. Chapter 2. Isolation Procedures 249
This ends the procedure.Storage I/O card modes and jumpers The 2748 and 2778 I/O cards can run in either compression mode or enhanced mode. v Compression Mode: Compression mode should only be used when you want to run Integrated Hardware Disk Compression. In compression mode, the write cache size is 4 MB. v Enhanced Mode: Enhanced mode has an enhanced write cache size. In enhanced mode, the write cache size for the 2748 I/O card is 26MB, and the maximum compressed write cache size for the 2778 I/O card is 104MB. Enhanced mode also provides support for Extended Adaptive Cache (by attachment of a read cache device) and Extended Adaptive Cache Simulator.The following subtopics are included here: v “Setting or changing the mode of an I/O card.” v When an error regarding the mode of the storage I/O card appears, the error message indicates whether the jumper is missing or overridden. – If the error message is CPPEA20, the I/O card has overridden the mode set by the jumper. See “Mode jumper has been overridden” on page 251. – If the error message is CPPEA21, the I/O card has detected that the jumper is missing. See “Mode jumper is missing” on page 252. “Setting or changing the mode of an I/O card”Provides detailed steps for setting or changing the mode of an I/O card. “Mode jumper has been overridden” on page 251Covers how to correct the error message that appears when the mode jumper has been overridden. “Mode jumper is missing” on page 252The error message CPPEA21 appears when the mode jumper is missing from the storage I/O card.Setting or changing the mode of an I/O card Provides detailed steps for setting or changing the mode of an I/O card. How to set or change the mode of an I/O card: The 2748 and 2778 I/O cards support two modes: compression and enhanced. The rear of the I/O card is marked with a C, indicating compression mode, and an E, indicating enhanced mode. Three pins on the rear of the I/O card determine the mode of the card, depending on which of the pins have a jumper placed on them. The jumper may be moved at any time, but changes only take effect during a system IPL. Refer to the illustration below while changing the mode jumper. 250 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. Power off the system, ensuring that no jobs are currently processing (see Powering on and powering off). 2. Choose from the following options: v To set the mode of the card to compressed (C), place the jumper on the top and middle pins. v To set the mode of the card to enhanced (E), place the jumper on the middle and bottom pins.3. Select the IPL type and mode. 4. Power on the system.Mode jumper has been overridden Covers how to correct the error message that appears when the mode jumper has been overridden. The error message CPPEA20 appears when the mode jumper has been overridden. This is due to cache data that is in a conflicting mode. 1. To begin correcting this error, examine the mode jumpers in the rear section of the storage I/O card. Note: The mode jumper is accessible without requiring the removal of the I/O card. 2. Choose from the following options: v If the mode jumper is not in the desired position, follow the steps in “Setting or changing the mode of an I/O card” on page 250. v If the mode jumper is in the desired position and any other I/O card or device reference codes occurred at about the same time as this error, contact your service representative. v If the mode jumper is in the desired position and no other I/O card or device reference codes occurred at about the same time as this error, then power the system down normally and IPL. See Powering on and powering off. This ends the procedure. Figure 3. Changing the mode jumper
Chapter 2. Isolation Procedures 251
Mode jumper is missing The error message CPPEA21 appears when the mode jumper is missing from the storage I/O card. 1. Locate the jumper. 2. Power off the system. See Powering on and powering off. 3. Install the jumper in the correct position. See “Setting or changing the mode of an I/O card” on page 250. 4. Power on the system. This ends the procedure.SCSI service hints Use one or more of the following procedures when servicing SCSI adapter or devices. General SCSI Configuration Checks With any type of SCSI problem, begin with the following steps: 1. Verify that all SCSI devices on the SCSI bus have a unique address. 2. Verify that all cables are connected securely and that there is proper termination at both ends of the SCSI bus. 3. Verify that the cabling configuration does not exceed the maximum cable length for the adapter in use. 4. Verify that the adapters and devices that you are working with are at the appropriate microcode levels for the customer situation. If you need assistance with microcode issues, contact your service support structure. 5. If there are multiple SCSI adapters on the SCSI bus, verify that the customer is using the appropriate software (such as HACMP™ or HANFS) to support such an arrangement. If the correct software is not in use, some SCSI errors should be expected when multiple adapters attempt to access the same SCSI device. Also, each adapter should have a unique address.High Availability or Multiple SCSI System Checks If you have a high-availability configuration, or if more than one system is attached to the same SCSI bus, do the following: 1. Verify that the adapters and devices have unique SCSI addresses. The default SCSI adapter address is always 7. If you have more than one adapter on the bus, change the address of at least one by using SMIT (SMIT Devices ? SCSI Adapter ? Change/Show characteristics of an adapter). You must make the changes to the database only, then reboot the system in order for the change to take effect.Note: Diagnostics defaults to using ID 7 (it is recommended that this ID not be used in high availability configurations). 2. If RAID devices such as the 7135 or 7137 are attached, run the appropriate diagnostics for the device. If problems occur, contact your service support structure for assistance. If the diagnostics are run incorrectly on these devices, misleading SRNs can result. 3. Diagnostics cannot be run against OEM devices; doing so results in misleading SRNs. 4. Verify that all cables are connected securely and that both ends of the SCSI bus is terminated correctly. 5. Verify that the cabling configuration does not exceed the maximum cable length for the adapter in use. Refer to the SCSI Cabling section in the RS/6000® eServer™ pSeries® Adapters, Devices, and Cable Information for Multiple Bus Systems for more details on SCSI cabling issues. 6. Verify that the adapter and devices are at the appropriate microcode levels for the customer situation. If you need assistance with microcode issues, contact your service support structure.SCSI-2 Single-Ended Adapter PTC Failure Isolation Procedure Before replacing a SCSI-2 single-ended adapter, use these procedures to determine if a short-circuit condition exists on the SCSI bus. The same positive temperature coefficient (PTC) resistor is used for both the internal and external buses. The PTC protects the SCSI bus from high currents due to short-circuiting 252 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. Unless instructed to do so by these procedures, do not replace the adapter because of a tripped PTC resistor. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. These procedures determine if the PTC resistor is still tripped and then determine if there is a short-circuit somewhere on the SCSI bus. Determining Where to Start Use the following to determine the adapter configuration and select the appropriate procedure: v If there are external cables attached to the adapter, start with the External Bus PTC Isolation Procedure for your type of adapter. v If there are no external cables attached, start with the “Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure” on page 254. v If there is a combination of external and internal cables start with the External Bus PTC Isolation Procedure for your type of adapter. If this procedure does not resolve the problem, continue with the Internal Bus PTC Isolation Procedure for your type of adapter.External SCSI-2 Single-Ended Bus PTC Isolation Procedure Isolate the external SCSI bus PTC fault with the following procedure: Note: The external bus is of single-ended design. 1. Ensure the system power and all externally attached device power is turned off. All testing is accomplished with the power off. 2. Disconnect any internal and external cables from the adapter and remove the adapter from the system. 3. Verify with a digital ohmmeter that the internal PTC resistor, labeled Z1, (refer to the illustration after Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure, step 3 on page 254) is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC resistor to cool and measure again. 4. Locate Capacitor C1 and measure the resistance across it by using the following procedure: a. Connect the positive lead to the side of the capacitor where the + is indicated on the board near C1. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the negative lead to the opposite side of the capacitor marked ?GND.? Be sure to probe at the solder joint where the capacitor and board come together. c. If there is no short-circuit present, then the resistance reading is high, typically hundreds of Ohms.Note: Because this is a measurement across unpowered silicon devices, the reading is a function of the ohmmeter used. v If there is a fault, the resistance reading is low, typically below 10 Ohms. Because there are no cables attached, the fault is on the adapter. Replace the adapter. Note: Some multi-function meters label the leads specifically for voltage measurements. When using this type of meter to measure resistance, the plus lead and negative lead my not be labeled correctly. If you are not sure that your meter leads accurately reflect the polarity for measuring resistance, repeat this step with the leads reversed. If the short-circuit is not indicated with the leads reversed, the SCSI bus is not faulted (short-circuited).
Chapter 2. Isolation Procedures 253
v If the resistance measured was high, proceed to the next step.5. Reattach the external cable to the adapter, then do the following: a. Measure across C1 as previously described. b. If the resistance is still high, in this case above 10 Ohms, then there is no apparent cause for a PTC failure from this bus. If there are internal cables attached, continue to the “Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure.” c. If the resistance is less than 10 Ohms, there is a possibility of a fault on the external SCSI bus. Troubleshoot the external SCSI bus by disconnecting devices and terminators. Measure across C1 to determine whether the fault has been removed. Replace the failing component. Go to Chapter 5, “Verifying a repair,” on page 557.External SCSI-2 Single-Ended Bus Probable Tripped PTC Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by improperly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v Plugging or unplugging a cable or terminator while the system is turned on (hot plugging). v A short-circuited device. v Differential devices or terminators are attached to the single-ended SCSI bus. Note: The SCSI-2 Fast/Wide and Ultra PCI Adapters use an onboard electronic terminator on the external SCSI bus. When power is removed from the adapter, as in the case of this procedure, the terminator goes to a high impedance state and the resistance measured cannot be verified, other than it is high. Some external terminators use an electronic terminator, which also goes to a high impedance state when power is removed. Therefore, this procedure is designed to find a short-circuited or low resistance fault as opposed to the presence of a terminator or a missing terminator.Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure Isolate the internal SCSI bus PTC resistor fault with the following procedure: Note: The internal bus is single-ended. 1. Ensure that system power and all externally attached device power is turned off. 2. Disconnect any internal and external cables from the adapter, then remove the adapter from the system. 3. Verify with a digital ohmmeter, that the internal PTC resistor, labeled Z1, is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC to cool and measure again. Refer to the following illustration. 254 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: Ensure that only the probe tips are touching the solder joints. Do not allow the probes to touch any other part of the component. 4. Locate capacitor C1 and measure the resistance across it using the following procedure: a. Connect the positive lead to the side of the capacitor where the + is indicated. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the negative lead to the opposite side of the capacitor. Be sure to probe at the solder joint where the capacitor and board come together. c. If there is no short-circuit present, the resistance reading is high, typically hundreds of Ohms.Note: Because this is a measurement across unpowered silicon devices, the reading is a function of the ohmmeter used. v If there is a fault, the resistance reading is low, typically below 10 Ohms. Because there are no cables attached, the fault is on the adapter. Replace the adapter. Note: Some multi-function meters label the leads specifically for voltage measurements. When using this type of meter to measure resistance, the plus lead and negative lead my not be labeled correctly. If you are not sure that your meter leads accurately reflect the polarity for measuring resistance, repeat this step with the leads reversed. Polarity is important in this measurement to prevent forward-biasing diodes, which lead to a false low resistance reading. If the short circuit is not indicated with the leads reversed, the SCSI bus is not faulted (short-circuited). v If the resistance is high and there is no internal cable to reattach, there is no apparent cause for the PTC resistor diagnostic failure. v If the resistance is high and there is an internal cable to reattach, proceed to the next step.5. Reattach the internal cable to the adapter, then do the following: a. Measure across C1 as described previously. Chapter 2. Isolation Procedures 255
b. If the resistance is still high, above 25 Ohms, there is no apparent cause for a PTC failure. c. If the resistance is less than 10 Ohms, a fault on the internal SCSI bus is possible. Troubleshoot the internal SCSI bus by disconnecting devices and terminators. Measure across C1 to determine if the fault has been removed.Note: Some internal cables have nonremovable terminators.Internal SCSI-2 Single-Ended Bus Probable Tripped PTC Resistor Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by incorrectly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v A short-circuited device. v On some systems, the terminator is fixed to the internal cable and cannot be removed. If all devices are removed from the cable and the resistance is still low, then the cable should be replaced. Note: The SCSI-2 Fast/Wide and Ultra PCI adapters use an onboard electronic terminator on the internal SCSI bus. When power is removed from the adapter, as in the case of this procedure, the terminator goes to a high impedance state and the resistance measured cannot be verified, other than it is high. Some internal terminators use an electronic terminator, which also goes to a high impedance state when power is removed. Therefore, this procedure is designed to find a short-circuit or low resistance fault as opposed to the presence of a terminator or a missing terminator.SCSI-2 Differential Adapter PTC Failure Isolation Procedure Use this procedure when SRN xxx-240 or xxx-800 has been indicated. The differential adapter can be identified by the 4-B or 4-L on the external bracket plate. Before replacing a SCSI-2 differential adapter, use these procedures to determine if a short-circuit condition exists on the SCSI Bus. The PTC protects the SCSI bus from high currents due to short-circuits on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. Unless instructed to do so by these procedures, do not replace the adapter because of a tripped PTC resistor. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. These procedures determine if the PTC resistor is still tripped and then determine if there is a short-circuit somewhere on the SCSI bus. External SCSI-2 Differential Adapter Bus PTC Isolation Procedure Isolate the external SCSI bus PTC fault with the following procedure: Notes: 1. Ensure that only the probe tips are touching the solder joints. Do not allow the probes to touch any other part of the component. 2. The external bus is differential.1. Ensure that system power and all externally attached device power is turned off. 256 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
2. Check to ensure all devices are marked SCSI Differential and that the terminator on the end of the SCSI bus is also marked differential. If not, you may have a single-ended SCSI device or terminator on the differential SCSI bus. Single-ended devices do not work on a differential SCSI bus and may cause a PTC type error to be reported. The entire SCSI bus may appear to be intermittent. After ensuring the system is completely differential, continue. 3. Disconnect the external cables from the adapter and remove the adapter from the system. 4. Verify with a digital ohmmeter that the internal PTC resistor, labeled Z1, (refer to the illustration on page “External SCSI-2 Differential Adapter Bus PTC Isolation Procedure” on page 256) is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC resistor where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC resistor to cool and measure again. 5. Locate capacitor C1 and measure the resistance across it using the following procedure: a. Connect the negative lead to the side of the capacitor marked ?GND?. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the positive lead to the side of the capacitor marked ?Cathode D1? on the board near C1. Be sure to probe at the solder joint where the capacitor and board come together. v If there is no fault present, then the resistance reading is 25 to 35 Ohms. The adapter is not faulty. Continue to the next step. v If the resistance measured is higher than 35 Ohms, check to see if RN1, RN2, and RN3 are plugged into their sockets. If these sockets are empty, you are working with a Multi-Initiators or High-Availability system. With these sockets empty, a resistive reading across C1 cannot be verified other than it measures a high resistance (not a short-circuit). If the resistance measurement is not low enough to be suspected as a fault (lower than 10 Ohms), continue to the next step. Chapter 2. Isolation Procedures 257
v If the resistance is high and there is no external cable to reattach, there is no apparent cause for the PTC diagnostic failure. v If the resistance reading is low, typically below 10 Ohms, there is a fault. Because there are no cables attached, the fault is on the adapter. Replace the adapter. v If the resistance measured was high and there is an external cable to reattach, proceed to the next step.6. Reattach the external cable to the adapter. a. Measure across C1 as previously described. b. If the resistance is between 10 to 20 Ohms, there is no apparent cause for a PTC resistor failure. c. If the resistance is less than 10 Ohms, there is a possibility of a fault on the external SCSI bus. Troubleshoot the external SCSI bus by disconnecting devices and terminators. Measure across C1 to determine if the fault has been removed.SCSI-2 Differential Adapter Probable Tripped PTC Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by incorrectly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v Plugging or unplugging a cable or terminator while the system is turned on (hot-plugging). v A short-circuited device. v Single-ended devices are attached to the differential SCSI bus.Dual-Channel Ultra SCSI Adapter PTC Failure Isolation Procedure Use the following procedures if diagnostics testing indicates a potential positive temperature coefficient (PTC) resistor fault or the TERMPWR short-circuited LED is lit. This procedure is used for SRNs 637-240 and 637-800 on the Dual-Channel Ultra SCSI Adapter. If the TERMPWR short-circuited LED is lit, use this procedure to help isolate the source of the problem on the failing channel. 1. Identify the adapter by its label of 4-R on the external bracket. Then, determine if the failure is on channel A or channel B. 2. The same PTC is used for both the internal and external buses. The PTC protects the SCSI bus from high currents due to short-circuits on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. 3. If this same error persists, or the TERMPWR short-circuited LED is lit, replace the components of the failing channel in the following order (wait five minutes between steps): a. If the failure is on the external cable, replace the following: 1) Cable 2) Device 3) Attached subsystem 4) Adapterb. If the failure is on the internal cable, replace the following: 1) Cable 2) Device 258 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3) Backplane 4) Adapterc. If the failure persists, verify that the parts exchanged are in the correct channel (internal or external, A or B). If the errors are still occurring, continue isolating the problem by going to “MAP 0050: SCSI bus problems” on page 269.64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure Use the following procedures if diagnostics testing indicates a potential self-resetting thermal fuse problem. This procedure is used for SRN 2524-702 on the integrated dual-channel SCSI adapter in a 7039/651 system. 1. Identify the adapter as the one embedded in the system board. Then, determine if the failure is on channel 0 or channel 1. 2. The thermal fuse protects the SCSI bus from high currents due to short-circuits on the terminator, cable, or device. It is unlikely that the thermal fuse can be tripped by a defective adapter. A fault (short-circuit) causes an increase in resistance and temperature of the thermal fuse. The increase in temperature causes the thermal fuse to halt current flow. The thermal fuse returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 10 seconds for the thermal fuse to reset itself and recover, then retest. 3. If the same error persists, replace the components of the failing channel in the following order. Wait 10 seconds for the thermal fuse to reset itself between steps. a. Cable b. Device c. DASD backplane (if present) d. System board (adapter)4. If the failure persists, verify that the parts exchanged are in the correct channel (0 or 1). If the errors are still occurring, continue isolating the problem by going to “MAP 0050: SCSI bus problems” on page 269.MAP 0020: Problem determination procedure Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. Purpose of this MAP Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. If you are unable to power the system on, refer to the “Power isolation procedures” on page 128. v Step 0020-1 Visually check the server for obvious problems such as unplugged power cables or external devices that are powered off. Did you find an obvious problem? NO Go to Step 0020-2. YES Fix the problem, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0020-2 Are the AIX online diagnostics installed?Note: If AIX is not installed on the server or partition, answer no to the above question. NO If the operating system is running, perform its shutdown procedure. Get help if needed. Go to Step 0020-4. Chapter 2. Isolation Procedures 259
YES Go to Step 0020-3.v Step 0020-3 Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. Run online diagnostics in concurrent mode when the customer does not let you power-off the system unit. To run online diagnostics in service mode, go to substep 5. If the system unit is already running in the service mode and you want to run online diagnostics, proceed to the question at the bottom of this MAP step. Otherwise, continue with 1 through 4 in the following procedure. 1. Log in with root authority or use CE login. If necessary, ask the customer for the password. 2. Enter the diag -a command to check for missing resources. a. If you see a command line prompt, proceed to substep 3 below. b. If the DIAGNOSTIC SELECTION menu is displayed, with the letter M shown next to any resource, select that resource, then press Commit (F7 key). Follow any instructions displayed. If you are prompted with a message Do you want to review the previously displayed error select Yes and press Enter. If an SRN displays, record it, and go to Step 0020-15. If there is no SRN, go to substep 3 below. c. If the MISSING RESOURCE menu is displayed, follow any instructions displayed. If you are prompted with a message Do you want to review the previously displayed error select Yes and press Enter. If an SRN displays, record it, and go to Step 0020-15. If there is no SRN, go to substep 3 below.3. Enter the diag command. 4. Go to Step 0020-5. 5. If the operating system is running, perform its shutdown procedure (get help if needed). 6. Turn off the system unit power and wait 45 seconds before proceeding. 7. Turn on the system unit power. 8. Load the online diagnostics in service mode. 9. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the Diagnostic Operating Instructions displayed? NO Go to Step 0020-16. YES Go to Step 0020-5.v Step 0020-4 Note: If you are working on a partition, do not remove the power as directed in the following procedure. Remove the power only if you are working on a server that does not have multiple partitions. 1. If the server does not have multiple partitions, disconnect the power from the server, wait 45 seconds, then reconnect the power. 2. If the server supports slow boot (See “Performing a slow boot” on page 665) set the server to perform a slow boot for the next boot that is performed. If the system does not support slow boot, do a normal boot in the next step. 3. Refer to “Running the online and eServer stand-alone diagnostics” on page 672 to load the standalone diagnostics. Before continuing to the next step, ensure that the server power is turned on, or if you are working on a partition, the partition is started. The server or partition should be booting the standalone diagnostics from a CD-ROM or a network server. 4. Wait until the Diagnostic Operating Instructions display or the server boot appears to have stopped. Are the Diagnostic Operating Instructions displayed? NO Go to Step 0020-16. YES Go to Step 0020-5.
260 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Step 0020-5 Are the Diagnostic Operating Instructions displayed (screen number 801001) with no obvious problem (for example, blurred or distorted)? NO For display problems, go to Step 0020-12. YES To continue with diagnostics, go to Step 0020-6.v Step 0020-6 Press the Enter key. Is the FUNCTION SELECTION menu displayed (screen number 801002)? NO Go to Step 0020-13. YES Go to Step 0020-7.v Step 0020-7 1. Select the option ADVANCED DIAGNOSTICS ROUTINES. Notes: a. If the terminal type is not defined, do so now. You cannot proceed until this is complete. b. If you have SRNs from a Previous Diagnostics Results screen, process these Previous Diagnostics Results SRNs prior to processing any SRNs you may have received from an SRN reporting screen.2. If the DIAGNOSTIC MODE SELECTION menu (screen number 801003) displays, select the option PROBLEM DETERMINATION. 3. Find your system response in the following table. Follow the instructions in the Action column.
System Response
Action
Previous Diagnostic Results. Do you want to review the previously displayed error?
You have a pending item in the error log for which there is no corresponding Log Repair Action. To see this error, select YES at the prompt. Information from the error log is displayed in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to Step 0020-15.
The RESOURCE SELECTION menu or the ADVANCED DIAGNOSTIC SELECTION menu is displayed (screen number 801006).
Go to Step 0020-8.
The system halted while testing a resource.
Record SRN 110-xxxx, where xxxx is the first four digits of the menu number displayed in the upper-right corner of the diagnostic menu. Go to Step 0020-15. Chapter 2. Isolation Procedures 261
System Response
Action
The MISSING RESOURCE menu is displayed or the letter M is displayed alongside a resource in the resource list.
If the MISSING RESOURCE menu is displayed, follow the displayed instructions until either the ADVANCED DIAGNOSTIC SELECTION menu or an SRN is displayed. If an M is displayed in front of a resource (indicating that it is missing) select that resource then choose the Commit (F7 key). Notes: 1. Run any supplemental media that may have been supplied with the adapter or device, and then return to substep 1 of Step 0020-7. 2. If the SCSI enclosure services device appears on the missing resource list along with the other resources, select it first. 3. ISA adapters cannot be detected by the system. The ISA adapter configuration service aid in standalone diagnostics allows the identification and configuration of ISA adapters.If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, go to Step 0020-11. If an 8-digit error code is displayed, record it and go to Chapter 1, “Starting a Service Call,” on page 1. If an SRN is displayed, record it, and go to Step 0020-15.
The message The system will now continue the boot process is displayed continuously on the system unit’s console.
Go to Step 0020-4.
The message Processing supplemental diagnostic diskette media is displayed continuously on the system unit’s console.
Call your service support.
The diagnostics begin testing a resource. Note: If the option Problem Determination was selected from the DIAGNOSTIC MODE SELECTION menu, and if a recent error has been logged in the error log, the diagnostics automatically begin testing the resource.
Follow the displayed instructions. If the No Trouble Found screen is displayed, press Enter. If another resource is tested, repeat this step. If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, go to Step 0020-11. If an SRN is displayed, record it, and go to Step 0020-15. If an 8-digit error code is displayed, go to Chapter 1, “Starting a Service Call,” on page 1.
The system did not respond to selecting the advanced diagnostics option.
Go to Step 0020-13.
A system unit with a beeper did not beep while booting.
Record SRN 111-947 and then go to Step 0020-15
The system unit emits a continuous sound from the beeper.
Record SRN 111-947 and then go to Step 0020-15.
An SRN or an eight-digit error code is displayed.
Record the error code, the FRU names, and the location code for the FRUs. If a SRN is displayed, go to Step 0020-15. If an 8-digit error code is displayed, go to Chapter 1, “Starting a Service Call,” on page 1.
The system stopped with a 3-digit or 4-digit code displayed in the operator panel display.
Record SRN 101-xxx (where xxx is the rightmost three digits of the displayed code). Go to Step 0020-15. 262 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
System Response
Action
An 888 message is displayed in the operator panel display. Note: The 888 may or may not be flashing.
Go to “MAP 0070: 888 sequence in operator panel display” on page 278. v Step 0020-8 On the DIAGNOSTIC SELECTION or ADVANCED DIAGNOSTIC SELECTION menu, look through the list of resources to make sure that all adapters and SCSI devices are listed including any new resources. Notes: 1. Resources attached to serial and parallel ports may not appear in the resource list. 2. If running diagnostics in a partition within a partitioned system, resources assigned to other partitions will not be displayed on the resource list.Did you find all the adapters or devices on the list? NO Go to Step 0020-9. YES Go to Step 0020-11.v Step 0020-9 Is the new device or adapter an exact replacement for a previous one installed at same location? NO Go to Step 0020-10. YES The replacement device or adapter may be defective. If possible, try installing it in an alternate location if one is available; if it works in that location, then suspect that the location where it failed to appear has a defective slot. Schedule time to replace the hardware that supports that slot. If it does not work in alternate location, suspect a bad replacement adapter or device. If you are still unable to detect the device or adapter, contact your service support structure.v Step 0020-10 Is the operating system software to support this new adapter or device installed? NO Load the operating system software. YES The replacement device or adapter may be defective. If possible, try installing it in an alternate location if one is available; if it works in that location, then suspect that the location where it failed to appear has a defective slot. Schedule time to replace the hardware that supports that slot. If it does not work in alternate location, suspect a bad replacement adapter or device. If you are still unable to detect the device or adapter, contact your service support structure.v Step 0020-11 Select and run the diagnostic test problem determination or system verification on one of the following: – The resources with which the customer is having problems. If the resource is not shown on the DIAGNOSTIC SELECTION menu, then run diagnostics on its parent (the adapter or controller to which the resource is attached). – The resources you suspect are causing a problem. – All resources. Note: When choosing All Resources, interactive tests are not done. If no problem is found running All Resources, select each of the individual resources on the selection menu to run diagnostics tests on to do the interactive tests
Chapter 2. Isolation Procedures 263
Find the response in the following table, or follow the directions on the test results screen.
Diagnostic Response
Action
An SRN or an eight-digit error code is displayed on the screen.
Record the error code, the FRU names, and the location code for the FRUs. If an SRN is displayed, go to Step 0020-15. If an 8-digit error code is displayed, go to the information center, and perform a search on the error code to obtain the name and location of the failing FRU. Perform the listed action.
The TESTING COMPLETE menu and the No trouble was found message are displayed, and you have not tested all of the resources.
Press Enter and continue testing other resources.
The TESTING COMPLETE menu and the No trouble was found message are displayed, and you have tested all of the resources.
Go to Step 0020-14. Note: If you have not run the sysplanar test, do so before going to Step 0020-14.
The system halted while testing a resource.
Record SRN 110-xxxx, where xxxx is the first three or four digits of the menu number displayed in the upper-right corner of the diagnostic menu screen. Go to Step 0020-15.
When running the Online Diagnostics, an installed device does not appear in the test list.
Ensure that the diagnostic support for the device was installed. The display configuration service aid can be used to determine whether diagnostic support is installed for the device. Record SRN 110-101. Go to Step 0020-15. Note: Supplemental diskettes may be required if service aids are run from standalone diagnostics.
The IBM ARTIC960 Quad T1/E1 adapter diagnostics display a message indicating that the interface board (PMC) is either not installed or is malfunctioning.
Install a PMC board if not already installed. When running online diagnostics on any of the IBM ARTIC960 family of adapters and the message indicates that the PMC is not installed, but it is installed, do the following: v Reseat the PMC board, then run diagnostics. v If the response is the same, replace the PMC and then go to Chapter 5, “Verifying a repair,” on page 557.
The symptom was not found in the table.
Return to the Chapter 1, “Starting a Service Call,” on page 1. v Step 0020-12 The following step analyzes a console display problem. Find your type of console display in the following table. Follow the instructions given in the Action column.
Type of Console Display
Action
TTY-type terminal
Be sure the TTY terminal attributes are set correctly. See ?Running the Diagnostic Programs from a TTY Terminal? in “Running the online and eServer stand-alone diagnostics” on page 672. If you did not find a problem with the attributes, go to the documentation for this type of TTY terminal, and continue problem determination. If you do not find the problem, record SRN 111-259, then go the Step 0020-15.
Graphics display
Go to the documentation for this type of graphics display, and continue problem determination. If you do not find the problem, record SRN 111-82c, then go to Step 0020-15. 264 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Type of Console Display
Action
HMC (Hardware Management Console)
Go to the HMC service procedures in Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. If HMC tests find no problem, there may be a problem with the communication between the HMC and the managed system. If the HMC communicates with the managed system through a network interface, verify whether the network interface is functional. If the HMC communicates with the managed system through the HMC interface, check the cable between the HMC and the managed system. If it is not causing the problem, suspect a configuration problem of the HMC communications setup. v Step 0020-13 There is a problem with the keyboard. Find the type of keyboard you are using in the following table. Follow the instructions given in the Action column.
Keyboard Type
Action
Type 101 keyboard (U.S.). Identify by the size of the Enter key. The Enter key is in only one horizontal row of keys.
Record SRN 111-736, then go to Step 0020-15.
Type 102 keyboard (W.T.). Identify by the size of the Enter key. The Enter key extends into two horizontal rows.
Record SRN 111-922; then go to Step 0020-15.
Kanji-type keyboard. (Identify by the Japanese characters.)
Record SRN 111-923; then go to Step 0020-15.
TTY terminal keyboard
Go to the documentation for this type of TTY terminal and continue problem determination.
HMC (Hardware Management Console)
Go to the HMC service procedures in Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. If HMC tests find no problem, there may be a problem with the communication between the HMC and the managed system. If the HMC communicates with the managed system through a network interface, verify whether the network interface is functional. If the HMC communicates with the managed system through the HMC interface, check the cable between the HMC and the managed system. If it is not causing the problem, suspect a configuration problem of the HMC communications setup. v Step 0020-14 The diagnostics did not detect a problem. If the problem is related to either the system unit or the I/O expansion box, refer to the service documentation for that unit. If the problem is related to an external resource, use the problem determination procedures, if available, for that resource. If a problem occurs when running online diagnostics but not when running the stand-alone diagnostics, suspect a software problem. Check for the presence of supplemental diagnostic material, such as diskettes or documentation. This is possibly a problem with software or intermittent hardware. If you think that you have an intermittent hardware problem, go to “MAP 0040: Intermittent Problem Isolation” on page 267. v Step 0020-15 Take the following actions: 1. Handle multiple SRNs and error codes in the following order: a. 8-Digit Error Codes. Chapter 2. Isolation Procedures 265
b. SRNs with a source code other than F or G. c. SRNs with a source code of F. Run online diagnostics in advanced and problem determination mode to obtain maximum isolation. d. SRNs with a source code of G. Note: The priority for multiple SRNs with a source of G is determined by the time stamp of the failure. Follow the action for the SRN with the earliest time stamp first. e. Device SRNs and error codes (5-digit SRNs). If a group has multiple SRNs, it does not matter which SRN is handled first. 2. Find the SRN. If the SRN is not listed, look for it in the following: – Any supplemental service manual for the device – The diagnostic problem report screen for additional information – The ?Service Hints? service aid in “Running the online and eServer stand-alone diagnostics” on page 6723. Perform the action listed. 4. If you replace a part, go to Chapter 5, “Verifying a repair,” on page 557.v Step 0020-16 Look up the AIX IPL progress codes for definitions of configuration program indicators. They are normally 0xxx or 2xxx. Is a configuration program indicator displayed? NO Go to the “Problems with loading and starting the operating system (AIX and Linux)” on page 246. YES Record SRN 101-xxxx (where xxxx is the rightmost three or four digits or characters of the configuration program indicator). Go to Step 0020-17.v Step 0020-17 Is a physical location code or an AIX location code displayed on the operator panel display? NO Go to Step 0020-15. YES Record the location code, then go to Step 0020-15.MAP 0030: Additional Problem Determination This MAP is used for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. Purpose of This MAP Use this MAP for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. Note: Check the action text of the SRN before proceeding with this MAP. If there is an action listed, perform that action before proceeding with this MAP. v Step 0030-1 Some external devices (including rack drawers that contain devices) have their own problem determination procedures. If the problem is related to an external device that has its own problem determination procedure, run those procedures if not already run. If they do not correct the problem, continue with this MAP. v Step 0030-2 266 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The problem may have been caused by a resource that has not been tested. System Checkout tests all resources. If the online Diagnostics are installed and you are able to load them, then All Resources under the Diagnostic Selection menu should be run. If you get a different SRN, look up the SRN in the SRN chapters and perform the listed action. If you are unable to run All Resources under the Diagnostic Selection menu or you do not get another SRN when running it, continue with this MAP. v Step 0030-3 If the problem is related to a SCSI device, SCSI bus, or SCSI controller, go to “MAP 0050: SCSI bus problems” on page 269. If you are unable to isolate the problem with MAP 0050, continue with Step 0030-4. v Step 0030-4 1. Find the resource(s) that are identified by the SRN or error code in the following table. 2. Perform the first action listed for the resource. 3. If you exchange a FRU or change a switch setting, test the resource again. 4. If the action does not correct the problem, perform the next action until all actions have been tried. If an action says to exchange a FRU that you have already exchanged, go to the next action. If an action corrects the problem, go to Chapter 5, “Verifying a repair,” on page 557. 5. If you perform all of the actions and do not correct the problem, check the Service Hints service aid for information. If the service aid does not help, call your service support structure.
Failing Resource
Repair Action
SCSI Device
Exchange the SCSI controller. Replace the power supply.
Pluggable SCSI or IDE controller
Exchange the backplane into which the adapter is plugged.
Keyboard, tablet, mouse, dials, LPF keys, diskette drive
Check the cable attaching the device to its adapter. If you do not find a problem, exchange the device’s adapter.
Pluggable adapters, CPU cards, and controllers
Determine whether the adapter contains any attached FRUs such as fuses, DRAMs, and crossover cables. 1. Check or exchange any attached FRU on the resource. 2. If the adapter is plugged into a riser card, check or exchange the riser card. 3. Exchange the backplane into which the adapter is plugged.
System and I/O backplanes
Contact your service support structure.
Built-In system ports
Replace the service processor if present.
A device attached to the system by a cable and an adapter.
1. Replace the adapter for the device. 2. Replace the cable to the device.
IDE Device
Replace the cable between the IDE controller and the device. If the IDE controller is packaged on a backplane, replace that backplane, otherwise replace the adapter containing the IDE controller. MAP 0040: Intermittent Problem Isolation This MAP provides a structured way of analyzing intermittent problems. Purpose of This MAP This MAP provides a structured way of analyzing intermittent problems. It consists of two tables: Hardware Symptoms and Software Symptoms. Because software or hardware can cause intermittent problems, consider all symptoms relevant to your problem. Chapter 2. Isolation Procedures 267
How to Use This MAP This MAP contains information about causes of intermittent symptoms. In the following tables, find your symptoms, and read the list of things to check. When you exchange a FRU, go to Chapter 5, “Verifying a repair,” on page 557 to check out the system. Hardware SymptomsNote: This table spans several pages.
Symptom of Hardware Problem
Things to Check For
Any hardware log entry in the error log.
Use the Hardware Error Report service aid to view the error log and check for: v Multiple errors on devices attached to the same SCSI bus. v Multiple errors on devices attached to the same async adapter. v Multiple errors on internally installed devices only.Contact your service support structure for assistance with error report interpretation.
Hardware-caused system crashes
v The connections on the CPU planar or CPU card v Memory modules for correct connections v Connections to the system planar. v Cooling fans operational v The environment for a too-high or too-low operating temperature. v Vibration: proximity to heavy equipment.
System unit powers off a few seconds after powering On.
v Fan speed. Some fans contain a speed-sensing circuit. If one of these fans is slow, the power supply powers the system unit off. v Correct voltage at the outlet into which the system unit is plugged. v Loose power cables and fan connectors, both internal and external.
System unit powers off after running for more than a few seconds.
v Excessive temperature in the power supply area. v Loose cable connectors on the power distribution cables. v Fans turning at full speed after the system power has been on for more than a few seconds.
Only internally installed devices are failing.
Check the following items that are common to more than one device: v Ground connections on all of the disk drives and other types of drives installed. v Loose connections on the power cables to the planars, drives, fans, and battery. v System unit cooling. Is the input air temperature within limits? Are all the fans running at full speed? Are any of the vent areas blocked? v Signal cables to the diskette drives, and the power supply. v SCSI device signal cables for loose connectors and terminators. v Loose SCSI device address jumpers. v Possible contamination of any device that has a cleaning procedure. See the operator guide for cleaning instructions. v Excessive static electricity. v Correct voltage at the system unit power outlet 268 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Symptom of Hardware Problem
Things to Check For
Only externally attached devices are failing.
Check the following items that are common to more than one device. v Check the SCSI signal cables to the devices for loose connectors and terminators. v Check devices that use jumpers to set the SCSI address for loose jumpers. v Check any device that has a cleaning procedure for contamination. See the operator guide for cleaning instructions. v Check for excessive static electricity. v Check the outlet that the device is plugged into for correct voltage. v Check the error log for entries for the adapter driving the failing devices. v Check the temperature of the devices. Are the cooling vents blocked? Are the fans running? v Check for other devices near the failing device that may be radiating noise (displays, printers, and so on). Software Symptoms
Symptom of Software Problem
Things to Check For
Any symptom you suspect is related to software.
Use the software documentation to analyze software problems. Check the RETAIN® database for known problems with your type of system unit or software.
Software-caused system crashes
Check the following software items: v Is the problem only with one application program? v Is the problem only with one device? v Does the problem occur on a recently installed program? v Was the program recently patched or modified in any way? v Is the problem associated with any communication lines? v Check for static discharge occurring at the time of the failure. MAP 0050: SCSI bus problems Use this MAP to analyze problems with a SCSI bus. Purpose of this MAP Use this MAP to analyze problems with a SCSI bus. Considerations v To prevent hardware damage or erroneous diagnostic results from a system with its power turned on, use ?PCI hot-plug manager? subtask ?replace/remove PCI hot plug adapter? before connecting or disconnecting cables or devices. v Also, use this MAP for SCSI adapters that are built into system backplanes or I/O backplanes. If this procedure calls for replacing a SCSI adapter and the SCSI adapter is built into the system backplane or I/O backplane, replace the system backplane or I/O backplane as appropriate. v If the failure is a terminator power failure (SRNs xxx-226, xxx-240, xxx-800), always allow five minutes for the PTC to cool. v The differential version of the adapter has socket-type terminators to support high-availability. If this is the adapter’s configuration, the terminators would have been removed from the adapter. MAP steps requiring the removal of the cable from the adapter do not apply, because an adapter that is not terminated always fails diagnostics. Proper SCSI diagnostics require proper termination. If the Chapter 2. Isolation Procedures 269
configuration involves a Y-cable, leave it, with the appropriate terminator, attached to the adapter. Or, place an external differential terminator on the external port. v If the system uses shared disk-drive hardware or a high-availability configuration, ensure that the other system that is sharing the devices is not using the devices. For additional information concerning high-availability configurations, see “SCSI service hints” on page 252. v For intermittent problems that cannot be resolved with this MAP, refer to “SCSI service hints” on page 252. v If the SCSI bus is attached to a RAID subsystem, refer to the RAID subsystem documentation for any problem determination. If the RAID adapter is a PCI-X RAID adapter, refer to the PCI-X SCSI RAID Controller Reference Guide for AIX.Follow the steps in this MAP to isolate a SCSI bus problem.Note: This procedure removes devices and components from a SCSI bus until a problem or a symptom or problem is eliminated. If you follow the entire procedure, you will remove all components of a SCSI bus in the following order: 1. Hot-swap devices 2. Devices that are not hot-swap 3. SCSI Enclosure Services (SES) device or enclosures 4. SCSI cables 5. SCSI adapterDo the following: v Step 0050-1 Have changes been made recently to the SCSI configuration? NO Go to Step 0050-2. YES Go to Step 0050-5.v Step 0050-2 Are there any hot-swap devices (SCSI disk drives or media devices) controlled by the adapter? NO Go to Step 0050-3. YES Go to Step 0050-11.v Step 0050-3 Are there any devices other than hot-swappable devices controlled by the adapter? NO Go to Step 0050-4. YES Go to Step 0050-13.v Step 0050-4 Is an enclosure or drawer that supports hot-swap devices controlled by the adapter? NO Go to Step 0050-22. YES Go to Step 0050-15.v Step 0050-5 This step handles cases where recent changes have been made to the SCSI configuration. Using the first three digits of the SRN, refer to the FFC listing and determine if the adapter is single-ended or differential. Is the adapter a single-ended adapter? NO Go to Step 0050-6. YES Go to Step 0050-7.
270 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Step 0050-6 The adapter’s termination jumper settings may be incorrect. Power off the system, and inspect jumper J7. Are the jumpers correct? NO Go to Step 0050-8. YES Go to Step 0050-9.v Step 0050-7 If the adapter is not being used in a high-availability configuration, be sure sockets RN1, RN2, and RN3 are populated. If the adapter is being used in a high-availability configuration, be sure sockets RN1, RN2, and RN3 are not populated. Go to Step 0050-9. v Step 0050-8 1. Correct the jumper settings and reinstall the adapter and all cables. 2. Power on the system, and run diagnostics in system verification mode on the adapter. Did the diagnostic pass? NO Go to Step 0050-9. YES Go to Step 0050-10.v Step 0050-9 Check for the following problems: – Address conflicts between devices. – Cabling problems such as configurations that exceed the maximum cable lengths, missing termination, or excessive termination. Did you find a problem? NO Go to Step 0050-2. YES Go to Step 0050-10.v Step 0050-10 1. Correct the problem. 2. Power on the system, and run diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0050-2.v Step 0050-11 This step determines if a hot-swap device is causing the failure. 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all hot-swap devices attached to the adapter. 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-12. YES Go to Step 0050-3.v Step 0050-12 Chapter 2. Isolation Procedures 271
Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. Reconnect the hot-plug devices one at a time. After reconnecting each device, do the follow: 1. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 2. Rerun the diagnostics on the adapter. 3. If the adapter fails, the problem may be with the last device reconnected. Perform these substeps: a. Follow repair procedures for that last device. b. Rerun diagnostics on the adapter. c. If diagnostics fail, replace the SES backplane corresponding to the slot for the device. d. Rerun diagnostics. e. If diagnostics fail, replace the last device. f. Rerun diagnostics on the adapter. g. If diagnostics pass, go to Chapter 5, “Verifying a repair,” on page 557. Otherwise, contact your support center. Note: A device problem can cause other devices attached to the same SCSI adapter to go into the defined state. Ask the system administrator to make sure that all devices attached to the same SCSI adapter as the device that you replaced are in the available state.4. If no errors occur, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information.v Step 0050-13 This step determines if a device other than a hot-swappable device is causing the failure. Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all devices attached to the adapter (except for the device from which you boot to run diagnostics; you may want to temporarily move this device to another SCSI port while you are trying to find the problem). 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-14. YES Go to Step 0050-4.v Step 0050-14 Reconnect the devices one at time. After reconnecting each device, follow this procedure: 1. Rerun the diagnostics in system verification mode on the adapter. 2. If there is a failure, the problem should be with the last device reconnected. Follow the repair procedures for that device, then go to Chapter 5, “Verifying a repair,” on page 557. 3. If no errors occur, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information.v Step 0050-15 This step determines whether the SCSI Enclosure Services (SES) controller, which provides hot-plug capability for SCSI drives in the server, is causing the problem. Note: In most cases the SES controller is integrated on the backplane that is used to connect SCSI devices, for example a disk drive backplane. If your system has hot-plug capability and the SES controller is separate from the SCSI drive backplane, there will be an intermediate card on the 272 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
SCSI bus between the SCSI adapter and the device or SCSI backplane. You will have to make a visual check to see if there are any intermediate cards on the SCSI bus that is displaying a problem. Does a separate SES controller plug into the SCSI device backplane? NO Go to Step 0050-18. YES Go to Step 0050-16.v Step 0050-16 Follow these steps: 1. Power off the system. 2. Remove the intermediate SES controller card. Locate the SES controller part number under Chapter 8, “Part Information,” on page 691. 3. Power on the system. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-17. YES Go to Step 0050-18.v Step 0050-17 Follow these steps: 1. Power off the system. 2. Replace the intermediate SES controller card. 3. Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-18 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all cables attached to the SCSI adapter. For SCSI differential adapters in a high-availability configuration, see Considerations. 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-19. YES Replace the adapter, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-19 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Reconnect the cables to the adapter. Does the SES controller (an intermediated SES controller) plug into the backplane? NO Go to Step 0050-20. YES Go to Step 0050-21.v Step 0050-20 Follow these steps: Chapter 2. Isolation Procedures 273
1. Replace the SES controller. Locate the intermediate SES controller part number under Chapter 8, “Part Information,” on page 691. 2. Power on the system. 3. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 4. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0050-21.v Step 0050-21 One of the cables remaining in the system is defective. Refer to Chapter 8, “Part Information,” on page 691 for the cable part numbers. Replace the parts one at a time in the order listed. Follow these steps for each FRU replaced: 1. Rerun the diagnostics for the adapter. 2. If there is any failure, continue with the next FRU. 3. If there is no failure, go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-22 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration.” 2. Disconnect all cables attached to the adapter (except for the cable to the device from which you boot to run diagnostics; you may want to temporarily move this device to another SCSI port while you are trying to find the problem). 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics on the adapter. Did a failure occur? NO Go to Step 0050-23. YES Replace the adapter, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-23 One of the cables remaining in the system is defective. Refer to Chapter 8, “Part Information,” on page 691 for the cable part numbers. Replace the parts one at time in the order listed. Follow these steps for each FRU replaced: 1. Rerun the diagnostics for the adapter. 2. If there is any failure, continue with the next FRU. 3. If there is no failure, go to Chapter 5, “Verifying a repair,” on page 557. “Preparing for hot-plug SCSI device or cable deconfiguration”Use this procedure when you are preparing to deconfigure a hot-plug SCSI device or cable. This procedure will help determine if a SCSI device or SCSI device cable is causing your system problem. “After hot-plug SCSI device or cable deconfiguration” on page 275Use this procedure after you deconfigure or hot-plug a SCSI device to ensure that the replaced component was successfully installed.Preparing for hot-plug SCSI device or cable deconfiguration Use this procedure when you are preparing to deconfigure a hot-plug SCSI device or cable. This procedure will help determine if a SCSI device or SCSI device cable is causing your system problem. Purpose of this procedure: 274 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Use this procedure to determine if a SCSI device or SCSI device cable is causing the symptom. Disconnect all cables attached to the adapter, (except for the cable to the device from which you boot to run diagnostics; temporarily move these device cables to another SCSI port while you are trying to find the problem). 1. Go to “Running the online and eServer stand-alone diagnostics” on page 672 and perform the prerequisite tasks described in the Before you begin topic. 2. Determine which SCSI adapter you plan to remove the cables or devices from. 3. Adapter slots are numbered on the rear of the system unit, record the slot number and location of each adapter being removed. 4. Ensure that any processes or applications that might use the adapter are stopped. 5. Enter the system diagnostics by logging in as ?root? user or as the ?celogin? user. Type the diag command on the AIX command line. 6. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. The FUNCTION SELECTION menu appears. 7. From the FUNCTION SELECTION menu, select Task Selection, then press Enter. 8. From the Task Selection list, select PCI Hot Plug Manager. 9. From the PCI Hot Plug Manager menu, select Unconfigure a Device, then press Enter. 10. Press F4 or ESC 4 to display the Device Names menu. 11. Select the adapter from which you are removing the cables or devices in the Device Names menu. 12. In the Keep Definition field, use the tab key to answer Yes. 13. In the Unconfigure Child Devices field, use the Tab key to answer Yes, then press Enter. The ARE YOU SURE screen displays. 14. Press Enter to verify the information. A successful deconfiguration is indicated by the ?OK? message displayed next to the Command field at the top of the screen. 15. Press F4 or ESC 4 twice to return to the Hot Plug Manager menu. 16. Select replace/remove PCI Hot Plug adapter. 17. Select the slot that has the adapter you want to remove the cables or devices from in the system. 18. Select Remove. Note: A fast blinking amber LED located at the back of the machine indicates the slot that you selected. 19. Press Enter. This places the adapter in the ?action? state, meaning it is ready to be removed from the system. (You do not need to remove the adapter, unless it makes removing the cables attached to it easier).After hot-plug SCSI device or cable deconfiguration Use this procedure after you deconfigure or hot-plug a SCSI device to ensure that the replaced component was successfully installed. 1. Press Enter, then continue to follow the screen instructions until you receive a message that the replacement is successful. A successful replacement is indicated by the OK message displayed next to the command field at the top of the screen. 2. Press the F3 or ESC 3 key to return to the PCI Hot-Plug Manager menu. 3. Press the F3 or ESC 3 key to return to the TASK selection list. 4. Select Log Repair Action. 5. Select the adapter you just removed the cables or devices from, then press Enter. 6. Press Commit (F7 or ESC 7), then press Enter. 7. Press the F10 or the ESC 0 key to exit diagnostics. 8. Type the diag -a command on the command line.
Chapter 2. Isolation Procedures 275
MAP 0054: PCI-X SCSI Bus Problems Use this MAP to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. Purpose of This MAP Use the following to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. Considerations v Remove power from the system before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage or erroneous diagnostic results. v Note that some systems have SCSI and PCI-X bus interface logic integrated onto the system boards and use a pluggable RAID enablement card (a non-PCI form factor card) for these SCSI and PCI-X buses. An example of such a RAID enablement card is FC 5709. For these configurations, replacement of the RAID enablement card is unlikely to solve a SCSI bus-related problem, because the SCSI bus interface logic is on the system board. v Some adapters provide two connectors, one internal and one external, for each SCSI bus. For this type of adapter, it is not acceptable to use both connectors for the same SCSI bus at the same time. SCSI bus problems are likely to occur if this is done. However, it is acceptable to use an internal connector for one SCSI bus and an external connector for another SCSI bus. The internal and external connectors are labeled to indicate which SCSI bus they correspond to.Attention: RAID adapters should not be replaced when SCSI bus problems exist, except with assistance from your service support structure. Because the adapter may contain non-volatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter when SCSI bus problems exist. Attention: Do not remove functioning disks in a disk array without assistance from your service support structure. A disk array may become degraded or failed if functioning disks are removed, and additional problems may be created. Follow the steps in this MAP to isolate a PCI-X SCSI bus problem. v Step 0054-1 Identify the SCSI bus on which the problem is occurring on by examining the hardware error log. To view the hardware error log, do the following: 1. Invoke diagnostics and select Task Selection on the Function Selection screen. 2. Select Display Hardware Error Report. 3. Choose one of the following options: – If the type of adapter is not known, select Display Hardware Errors for Any Resource. – If the adapter is a PCI-X SCSI adapter, select Display Hardware Errors for PCI-X SCSI Adapters. – If the adapter is a PCI-X SCSI RAID adapter, select Display Hardware Errors for PCI-X SCSI RAID Adapters.4. Select the resource, or select All Resources if the resource is not known. If you had previously selected Display Hardware Errors for Any Resource, then select All Resources. 5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent you here, and select it. 276 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: If multiple entries exist for the SRN, some entries might be old or the problem has occurred on multiple entities (adapters, disk arrays, or devices). Older entries can be ignored; however, this MAP may need to be used multiple times if the same problem has occurred on multiple entities. 6. Select the hardware error log to view. While viewing the hardware error log, under the Detail Data and SENSE DATA headings, identify the first four bytes of the hexadecimal data (for example, nnnn nnnn nnnn nnnn ...). The four bytes identified in the error log can be interpreted as: 00bb ssLL where: – bb, when not FF, identifies the adapter’s SCSI bus – ss, when not FF, identifies the SCSI ID of a device – LL, when not FF, identifies the logical unit number (LUN) of a device Go to Step 0540-2. v Step 0054-2 Are the last two bytes of the four bytes identified in Step 0540-1, equal to FFFF (for example, 00bb FFFF, where bb identifies the adapter’s SCSI bus)? NO Go to Step 0540-4. YES Go to Step 0540-3.v Step 0054-3 While the error persists, replace the components of the failing SCSI bus in the following order: 1. Cable on bus bb (if present) 2. Adapter (if SCSI bus interface logic is on the adapter) or system board (if SCSI bus interface logic is on the system board) To replace a component, and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557. v Step 0054-4 Are the last two bytes of the four bytes identified in Step 0540-1, equal to FF00 (for example, 00bb FF00, where bb identifies the adapter’s SCSI bus)? NO Go to Step 0540-6. YES Go to Step 0540-5.v Step 0054-5 While the error persists, replace the components of the failing SCSI bus in the following order: 1. Cable on bus bb (if present) 2. Adapter (if SCSI bus interface logic is on the adapter) or system board (if the SCSI bus interface logic is on the system board) 3. DASD backplane attached to bus bb (if present) To replace a component, and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557. v Step 0054-6 Chapter 2. Isolation Procedures 277
While the error persists, replace the components of the failing SCSI bus in the following order: 1. Device on bus bb with SCSI ID ss 2. Cable on bus bb (if present) 3. Adapter (if SCSI bus interface logic is on the adapter) or system board (if SCSI bus interface logic is on the system board) To replace a component and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557.MAP 0070: 888 sequence in operator panel display Use this MAP when you receive an 888 sequence on the operator panel display or monitor. Purpose of this MAP An 888 sequence in operator panel display suggests that either a hardware or software problem has been detected and a diagnostic message is ready to be read. Note: The 888 will not necessarily flash on the operator panel display. v Step 0070-1 Perform the following steps to record the information contained in the 888 sequence message. 1. Wait until the 888 sequence displays. 2. Record, in sequence, every code displayed after the 888. On systems with a 3-digit or a 4-digit operator panel, you may need to press the system’s ?reset? button to view the additional digits after the 888. Stop recording when the 888 digits reappear. 3. Go to Step 0070-2.v Step 0070-2 Using the first code that you recorded, use the following list to determine the next step to use. Type 102 Go to Step 0070-3. Type 103 Go to Step 0070-4.v Step 0070-3 A type 102 message generates when a software or hardware error occurs during system execution of an application. Use the following information to determine the content of the type 102 message. The message readout sequence is: 102 = Message type RRR = Crash code (the three-digit code that immediately follows the 102) SSS = Dump status code (the three-digit code that immediately follows the Crash code). Record the crash code and the dump status code from the message you recorded in Step 0070-1. Are there additional codes following the dump status? No Go to Step 0070-5. Yes The message also has a type 103 message included in it. Go to Step 0070-4 to decipher the SRN and field replaceable unit (FRU) information in the type 103 message.Note: Type 102 messages have no associated SRNs. v Step 0070-4 278 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
A type 103 message is generated by the hardware when certain types of hardware errors are detected. Use the following steps and information you recorded in Step 0070-1 to determine the content of the type 103 message. The message readout sequence is: 103 = Message type (x)xxx (y)yyy = SRN (where (x)xxx = the three- or four-digit code following the 103 and (y)yyy is the three- or four-digit code following the (x)xxx code). 1. Record the SRN and FRU location codes from the recorded message. 2. Find the SRN in the Service Request Number List and perform the indicated action.Note: The only way to recover from an 888 type of halt is to turn off the system unit. v Step 0070-5 Perform the following steps: 1. Turn off the system unit power. 2. Turn on the system unit power, and load the online diagnostics in service mode. 3. Wait until one of the following conditions occurs: – You are able to load the diagnostics to the point where the Diagnostic Mode Selection menu displays. – The system stops with an 888 sequence. – The system appears to hang. Is the Diagnostic Mode Selection menu displayed? No Go to Chapter 1, “Starting a Service Call,” on page 1. Yes Go to Step 0070-6.v Step 0070-6 Run the All Resources options under Advanced Diagnostics in Problem Determination Mode. Was an SRN reported by the diagnostics? No This is possibly a software-related 888 sequence. Follow the procedure for reporting a software problem. Yes Record the SRN and its location code information. Find the SRN in the SRN Listing and do the indicated action.MAP 0220: Hot-Swap FRU problem resolution Use this procedure to exchange hot-swappable field replaceable units (FRUs). Note: The FRU you want to hot plug might have a defect on it that can cause the hot-plug operation to fail. If, after following the hot-plug procedure, you continue to get an error message that indicates that the hot-plug operation has failed, schedule a time for deferred maintenance when the system containing the FRU can be powered off. Then go to MAP 0210: General problem resolution, Step 0210-2 and answer NO to the question Do you want to exchange this FRU as a hot-plug FRU? Attention: If the FRU is a disk drive or an adapter, ask the system administrator to perform the steps necessary to prepare the device for removal. v Step 0220-1 1. If the system displayed a FRU part number on the screen, use that part number to exchange the FRU. If there is no FRU part number displayed on the screen, refer to the SRN listing. Record the SRN source code and the failing function codes in the order listed. 2. Find the failing function codes in the FFC listing, and record the FRU part number and description of each FRU. Chapter 2. Isolation Procedures 279
3. To determine if the part is hot-swappable, refer to the Chapter 3, “Locating FRUs,” on page 335 procedure for the part. Does this system unit support hot-swapping of the first FRU listed? NO Go to “MAP 0210: General problem resolution” on page 246. YES Go to Step 0220-2.v Step 0220-2 Is the FRU a hot-swap power supply or fan? NO Go to Step 0220-4. YES Go to Step 0220-3.v Step 0220-3 Note: Refer to Chapter 3, “Locating FRUs,” on page 335 for the part. 1. Remove the old FRU. 2. Install the new FRU. 3. Enter the diag command. Go to Step 0220-14. v Step 0220-4 Is the FRU a hot-plug PCI adapter? NO Go to Step 0220-5. YES Go to Step 0220-12.v Step 0220-5 Is the FRU a SCSI hot-plug device? NO Go to Step 0220-11. YES Go to Step 0220-6.v Step 0220-6 Is the hot-plug drive located within a system unit? NO Go to Step 0220-8. YES Go to Step 0220-7.v Step 0220-7 Refer to the removal and replacement procedures for your system in Chapter 3, “Locating FRUs,” on page 335. Go to Step 0220-13. v Step 0220-8 Does the hot-plug drive’s enclosure have procedures for removing and replacing SCSI disk drives? NO Go to Step 0220-9. YES If a hot-plug procedure exists, use that procedure to remove the old hot-plug SCSI disk drive and replace it with a new hot-plug SCSI disk drive. Otherwise, if no hot-plug procedure exists, use the power off procedure to remove the old SCSI disk drive and replace it with a new SCSI disk drive. Go to Step 0220-13.v Step 0220-9 1. Ask the customer to back up the data on the drive that you intend to replace onto another drive. 2. Verify that the disk drive is in the defined state. The amber LED on the hot-plug disk drive should be off. Is the hot-plug disk drive’s amber LED unlit? 280 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
NO Ask the customer to remove the hot-plug disk drive from the operating system configuration (refer the customer to the system management guide for more information). YES Go to Step 0220-10.v Step 0220-10 Using the hot-plug task service aid, replace the hot-plug drive using the following procedure: 1. Use the option List the SES Devices to show the configuration of the hot-plug slots. Identify the slot number of the adapter for the FRU you want to replace. 2. Select the option REPLACE/REMOVE a Device Attached to an SES Device. 3. Select the slot which contains the SCSI hot-plug drive you want to replace. Press Enter. You will see a fast blinking green light on the front on the hot-plug drive indicating that it is ready for removal. Note: Refer to the ?Installing hardware? section of the information; locate the server information that you are servicing and follow the tables to locate the correct removal or replacement procedure. 4. Remove the old hot-plug drive. 5. Install the new hot-plug drive. After the hot-plug drive is in place, press Enter. 6. Press Exit. Wait while configuration is done on the drive, until you see the ?hot-plug task? on the service aid menu. Go to Step 0220-15. v Step 0220-11 Attention: Do not remove functioning disks in a disk array attached to a PCI-X SCSI RAID controller without assistance from your service support structure. A disk array may become degraded or failed if functioning disks are removed and additional problems may be created. If you still need to remove a RAID array disk attached to a PCI-X SCSI RAID controller, use the SCSI and SCSI RAID hot-plug manager. Using the hot-plug task service aid, replace the hot-plug drive using the hot plug RAID service aid: Note: The drive you want to replace must be either a SPARE or FAILED drive. Otherwise, the drive would not be listed as an ?Identify and remove resources selection? within the RAID HOT-PLUG DEVICES screen. In that case you must ask the customer to put the drive into FAILED state. Refer the customer to the Operating System and Device management in the AIX library for more information. Ask the customer to back up the data on the drive that you intend to replace. 1. Select the option RAID HOT-PLUG DEVICES within the HOT-PLUG TASK under DIAGNOSTIC SERVICE AIDS. 2. Select the RAID adapter that is connected to the RAID array containing the RAID drive you want to remove, then select COMMIT. 3. Choose the option IDENTIFY in the IDENTIFY AND REMOVE RESOURCES menu. 4. Select the physical disk which you want to remove from the RAID array and press Enter. 5. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive. Verify that it is the physical drive you want to remove, then press Enter. 6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the option REMOVE and press Enter. 7. A list of the physical disks in the system that may be removed will be displayed. If the physical disk you want to remove is listed, select it and press Enter. The physical disk will go into the REMOVE state, as indicated by the LED on the drive. If the physical disk you want to remove is not listed, it is not a SPARE or FAILED drive. Ask the customer to put the drive in the FAILED state before you can proceed to remove it. Refer the customer to the Operating System and Device management in the AIX library for more information. 8. Refer to the service guide for the system unit or enclosure that contains the physical drive for removal and replacement procedures for the following substeps: a. Remove the old hot-plug RAID drive. Chapter 2. Isolation Procedures 281
b. Install the new hot-plug RAID drive. After the hot-plug drive is in place, press Enter. The drive will exit the REMOVE state, and will go to the NORMAL state after you exit diagnostics. Note: There are no elective tests to run on a RAID drive itself under diagnostics (the drives are tested by the RAID adapter).9. This completes the repair. Return the system to the customer. Ask the customer to add the physical disk drive to the original configuration within the RAID. Refer them to system management guide for more information.v Step 0220-12 1. Remove the old adapter FRU and replace it with the new adapter FRU. Refer to the Chapter 3, “Locating FRUs,” on page 335 procedure for the part. 2. Enter the diag command. 3. Go to the FUNCTION SELECTION menu, and select the option Advanced Diagnostics Routines. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. 5. Go to Step 0220-14.v Step 0220-13 1. If not already running diagnostics, enter the diag command. Note: If you are already running service mode diagnostics and have just performed the task Configure Added/Replaced Devices (under the SCSI Hot Swap manager of the Hot Plug Task service aid), you must use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS menu before proceeding with the next step, or else the drive might not appear on the resource list. 2. Go to the FUNCTION SELECTION menu, and select the option Advanced Diagnostics Routines. 3. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. Does the hot-plug SCSI disk drive you just replaced appear on the resource list? NO Verify that you have correctly followed the procedures for replacing hot-plug SCSI disk drives in the system service guide. If the disk drive still does not appear in the resource list, go to “MAP 0210: General problem resolution” on page 246 to replace the resource that the hot-plug SCSI disk drive is plugged into. YES Go to Step 0220-14.v Step 0220-14 Run the diagnostic test on the FRU you just replaced. Did the diagnostics run with no trouble found? NO Go to Step 0220-15. YES Go to Chapter 5, “Verifying a repair,” on page 557. Before returning the system to the customer, if a hot-plug disk has been removed, ask the customer to add the hot-plug disk drive to the operating system configuration. Refer to Operating System and Device Management in the AIX library for more information.v Step 0220-15 1. Use the option Log Repair Action in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with your repair action. If it is not displayed on the resource list, select sysplanar0. Note: On systems with a fault indicator LED, this changes the fault indicator LED from the fault state to the normal state. 2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the option Advanced Diagnostics Routines. 282 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the optionSystem Verification. Run the diagnostic test on the FRU you just replaced, or sysplanar0. Did the diagnostics run with no trouble found? NO Go to Step 0220-16. YES If you changed the service processor or network settings, restore the settings to the value they had prior to servicing the system. If you performed service on a PCI RAID subsystem involving changing of the RAID adapter cache card or changing the configuration on RAID disks, ask the customer to run ?PCI SCSI disk array manager? using smitty to resolve the PCI SCSI RAID adapter configuration. The following is an example of how the customer would resolve the configuration: 1. At the AIX command line, type smitty pdam. 2. On the PCI SCSI Disk Array Manager screen, select RECOVERY OPTIONS. 3. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI RAID Adapter Configuration. Press F3. 4. On the Recovery Options screen, select RESOLVE PCI SCSI RAID ADAPTER CONFIGURATION. 5. On the Resolve PCI SCSI RAID Adapter Configuration screen, select ACCEPT CONFIGURATION on DRIVES. 6. On the PCI SCSI RAID Adapter selection menu, select the adapter that you changed. 7. On the next screen, press Enter. 8. On the ?Are You Sure?? selection menu, press Enter to continue. 9. You receive an OK status message when the recovery is complete. If you get a Failed status message, verify that you are preforming recovery on the correct adapter, then do this complete procedure. When you complete the recovery, exit smitty to return to the AIX command line.Go to Chapter 5, “Verifying a repair,” on page 557. v Step 0220-16 Does the original problem persist? NO If a FRU was replaced, run the log repair action service aid under the online diagnostics for the resource that was replaced. If the resource associated with your action is not displayed on the resource list, select sysplanar0. If steps were taken to make the device ready for removal, inform the system administrator of the steps required to return the system to the original state. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0220-17.v Step 0220-17 Have you exchanged all the FRUs that correspond to the failing function codes? NO Go to Step 0220-18. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to “MAP 0030: Additional Problem Determination” on page 266.v Step 0220-18 Note: Before proceeding, remove the FRU you just replaced and install the original FRU in its place. Does the system unit support hot-swapping of the next FRU listed? NO Go to “MAP 0210: General problem resolution” on page 246. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to Step 0220-14.
Chapter 2. Isolation Procedures 283
MAP 0230: Platform error problem resolution Use this MAP to resolve problems reported by SRNs A00-xxx to A25-xxxx. Step 0230-1 1. The last character of the SRN is bit-encoded as follows:8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot-swap is supported | Software or Firmware could be the cause Reserved 2. Refer to the last character in the SRN. A 4, 5, 6, or 7 indicates a possible software or firmware problem. Does the last character indicate a possible software or firmware problem? NO Go to Step 0230-4YES Go to Step 0230-2. Step 0230-2 Ask the customer if any software or firmware has been installed recently. Has any software or firmware been installed recently? NO Go to Step 0230-4. YES Go to Step 0230-3. Step 0230-3 Suspect the new software or firmware. Check with your support center for any known problems with the new software or firmware. Are there any known problems with the software or firmware? NO Go to Step 0230-4. YES Obtain and follow the procedure to correct the software problem. This completes the repair.Step 0230-4 Were any FRUs or location code reported with the SRN? NO Go to Step 0230-5. YES Go to Step 0230-9 Step 0230-5 Run the diagnostics in problem determination mode on sysplanar0. Were there any FRUs reported with the SRN? NO Go to Step 0230-6. YES Go to Step 0230-9
284 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Step 0230-6 Did the system display: ?Previous Diagnostic Results - Do you want to review the previously displayed error?? NO Go to Step 0230-7. YES You have a pending item in the error log for which there is no corresponding log repair action. To see this error, select YES at the prompt. Information from the error log displays in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to Step 0230-7 Step 0230-7 Were there any other SRNs that begin with an A00 to A1F reported? NO Go to Step 0230-8. YES Go to Step 0230-1 and use the new SRN.Step 0230-8 Attempt to boot the system on slow boot mode. If the system boots, run the diagnostics in problem determination mode on sysplanar0 Were any new error codes or SRNs reported? NO Call your support center. YES Follow the procedure for the new error code or SRN. Step 0230-9 1. Obtain the list of physical location codes and FRU numbers that were listed on the Problem Report Screen. The list can be obtained by running the sysplanar0 diagnostics or using the task Display Previous Diagnostic Results. 2. Record the physical location codes and FRU numbers. 3. Refer to the last character in the SRN. A 2, 3, 6, or 7 indicates that hot-swap is possible. Does the last character indicate that hot-swap is possible? NO Go to Step 0230-10. YES Go to Step 0230-14 Step 0230-10 Note: If necessary, refer to “Powering on and powering off” on page 588 for information on system shutdown and powering the system on and off. 1. If the operating system is running, perform the operating system’s shutdown procedure. 2. Turn off power to the system. 3. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen need to be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Use the appropriate procedure under Chapter 4, “Removing and replacing FRUs,” on page 349 to replace any FRUs indicated. 4. Turn on power to the system. Chapter 2. Isolation Procedures 285
5. Load “Running the online diagnostics in service mode” on page 673.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the option Task Selection , check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. Select the option Task Selection. 12. Select the option Log Repair Action. 13. Log a repair action for each replaced resource. 14. If the resource associated with your repair action is not displayed on the resource list, select sysplanar0. 15. Return to the Task Selection Menu. 16. If the FRU that was replaced was memory and the system is running as a full system partition, select Run Exercisers and run the short exerciser on all the resources, otherwise proceed to Step 0230-15. 17. If you ran the exercisers in Step 0230-10, substep 16, return to the Task Selection menu. 18. Select Run Error Log Analysis and run analysis on all the resources. Was a problem reported? NO The repair is complete. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0230-11. Step 0230-11 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0230-12. Step 0230-12 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to Step 0230-13. YES The SRN did not identify the failing FRU. Call your support person for assistance. Step 0230-13 1. After performing a shutdown of the operating system, turn off power to the system. 2. Remove the new FRU and install the original FRU. 3. Exchange the next FRU in list. 4. Turn on power to the system. 286 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Load “Running the online diagnostics in service mode” on page 673.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the option Task Selection, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding. 11. Select the option Task Selection. 12. Select the option Log Repair Action. 13. Log a repair action for each replaced resource. 14. If the resource associated with your action does not appear on the Resource List, select sysplanar0. 15. Return to the Task Selection Menu. 16. If the FRU that was replaced was memory and the system is running as a full system partition, select Run Exercisers and run the short exerciser on all the resources, otherwise proceed to Step 0230-15. 17. If you ran the exercisers in Step 0230-13, substep 16, return to the Task Selection menu. 18. Select Run Error Log Analysis and run analysis on all the resources. Was a problem reported? NO The repair is complete. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0230-11. Step 0230-14 The FRUs can be hot-swapped. If you do not want to use the hot-swap, go to Step 0230-10.Note: See the hot-swap procedures in the Chapter 4, “Removing and replacing FRUs,” on page 349 section. 1. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen must be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed. 2. If available, use the CE Login and enter the diag command.Note: If CE Login is not available, have the system administrator enter superuser mode and then enter the diag command. 3. After the Diagnostics Operating Instructions display, press Enter. 4. Select the option Task Selection. 5. Select the option Log Repair Action. 6. If the resource associated with your action is not displayed on the Resource List, select sysplanar0. 7. Log a repair action for each replaced resource. 8. Return to the Task Selection menu. 9. For systems running as a full system partition, select Run Exercisers and run the short exerciser on all resources. Chapter 2. Isolation Procedures 287
10. Use the option Log Repair Action in the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with your repair action. If it is not displayed on the resource list, select sysplanar0.Note: On systems with a Fault Indicator LED, this changes the Fault Indicator LED from the fault state to the normal state.Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to Step 0230-15. Step 0230-15 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0230-16. Step 0230-16 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to Step 0230-17. YES The SRN did not identify the failing FRU. Call your support person for assistance. Step 0230-17 1. Remove the new FRU and install the original FRU. 2. Exchange the next FRU in the list. 3. Return to the Task Selection Menu. 4. Select the option Log Repair Action. 5. Log a repair action for each replaced resource. 6. If the resource associated with your action is not displayed on the Resource List, select sysplanar0. 7. Return to the Task Selection Menu. 8. For systems running as a full system partition, select Run Exercisers and run the short exerciser on all resources. 9. If you ran the exercisers in substep Step 0230-17, substep 8. 10. Select Run Error Log Analysis and run analysis on all exchanged resources. Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to Step 0230-15.MAP 0260: System hangs during resource configuration Use this MAP when the system unit hangs while configuring a resource. 288 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Purpose of this MAP This MAP handles problems when the system unit hangs while configuring a resource. v Step 0260-1 The last three or four digits of the SRN following the dash (-) match a failing function code number. Look at the Chapter 8, “Part Information,” on page 691 and find the failing function code that matches the last three or four digits of your SRN, following the dash. Record the FRU part number and description (use the first FRU part listed when multiple FRUs are listed). The physical location code, AIX location code, or device name is displayed on the operator panel. Do you have either a physical location code or AIX location code displayed? No Go to Step 0260-4. Yes Go to Step 0260-2.v Step 0260-2 Are there any FRUs attached to the device described by the physical location code or AIX location code? No Go to Step 0260-6. Yes Go to Step 0260-3.v Step 0260-3 Remove this kind of FRU attached to the device described in the location code one at a time. Note whether the system still hangs after each device is removed. Repeat this step until you no longer get a hang, or all attached FRUS have been removed from the adapter or device. Has the symptom changed? No Go to Step 0260-4. Yes Use the location code of the attached device that you removed when the symptom changed, and go to Step 0260-6.v Step 0260-4 Does your system unit contain only one of this kind of FRU? No Go to Step 0260-5. Yes Go to Step 0260-6.v Step 0260-5 One of the FRUs of this kind is defective. Remove this kind of FRU one at a time. Test the system unit after each FRU is removed. Stop when the test completes successfully or when you have removed all of the FRUs of this kind. Were you able to identify a failing FRU? No Go to “PFW1540: Problem isolation procedures” on page 301. Yes Go to Step 0260-6.v Step 0260-6 1. Turn off the system unit. 2. Exchange the FRU identified by the location code or Step 0260-5. Is this system capable of running online diagnostics in service mode? No Go to Step 0260-7. Yes Go to Step 0260-8.v Step 0260-7 1. Turn on the system unit. Chapter 2. Isolation Procedures 289
2. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 3. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to Step 0260-9. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0260-8 1. Turn on the system unit. 2. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 3. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to Step 0260-9. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0260-9 Look at the operator panel display. Is the number displayed the same as the last three or four digits after the dash (-) of your SRN? NO The symptom changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0260-10.v Step 0260-10 Was the FRU you exchanged an adapter or a planar? NO Go to Step 0260-11. YES Go to “PFW1540: Problem isolation procedures” on page 301.v Step 0260-11 Was the FRU you exchanged a device? NO Go to “PFW1540: Problem isolation procedures” on page 301. YES Go to Step 0260-12.v Step 0260-12 The adapter for the device may be causing the problem. 1. Turn off the system unit. 2. Exchange the adapter for the device. Note: If the AIX operating system is not used on the system, start diagnostics from an alternate source. 3. Turn on the system unit. If c31 is displayed, follow the instructions to select a console display. 4. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 5. Wait until the DIAGNOSTIC OPERATING INSTRUCTIONS display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to “PFW1540: Problem isolation procedures” on page 301. YES Go to Chapter 5, “Verifying a repair,” on page 557.
290 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
MAP 0270: SCSI RAID problem resolution and verification Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. Purpose of This MAP Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. Notes: 1. This MAP assumes that the RAID adapter and drive microcode is at the correct level. 2. This MAP applies only to PCI, not PCI-X, RAID adapters.Attention: If the FRU is a disk drive or an adapter, ask the system administrator to perform the steps necessary to prepare the device for removal. v Step 0270-1 1. If the system displayed a FRU part number on the screen, use that part number. If there is no FRU part number displayed on the screen, refer to the SRN listing. Record the SRN source code and the failing function codes in the order listed. 2. Find the failing function codes in the FFC listing, and record the FRU part number and description of each FRU. Go to Step 0270-2. v Step 0270-2 Is the FRU a RAID drive? NO Go to Step 0270-6. YES Go to Step 0270-3.v Step 0270-3 If the RAID drive you want to replace is not already in the failed state, then ask the customer to run the PCI SCSI Disk Array Manager using smit to fail the drive that you want to replace. An example of this procedure is: 1. Log in as root user. 2. Type smit pdam. 3. Select Fail a Drive in a PCI SCSI Disk Array. 4. Select the appropriate disk array by placing the cursor over that array and press Enter. 5. Select the appropriate drive to fail based on the Channel and ID called out in diagnostics. The Fail a Drive screen will appear. 6. Verify that you are failing the correct drive by looking at the Channel ID row. Press Enter when verified correct. Press Enter again. 7. Press F10 and type smit pdam 8. Select Change/Show PCI SCSI RAID Drive Status ? Remove a Failed Drive. 9. Select the drive that just failed. Go to Step 0270-4. v Step 0270-4 Replace the RAID drive using the RAID HOT PLUG DEVICES service aid: Note: The drive you want to replace must be either a SPARE or FAILED drive. Otherwise, the drive would not be listed as an IDENTIFY AND REMOVE RESOURCES selection within the RAID HOT PLUG DEVICES screen. In that case you must ask the customer to put the drive into FAILED state. For information on putting the drive in a FAILED state, refer the customer to the PCI Dual Channel Ultra320 SCSI RAID Adapter Installation and Using Guide, order number SC23-1324.
Chapter 2. Isolation Procedures 291
1. Select the option RAID HOT PLUG DEVICES within the HOT PLUG TASK under DIAGNOSTIC SERVICE AIDS. 2. Select the RAID adapter that is connected to the RAID array containing the RAID drive you want to remove, then select COMMIT. 3. Choose the option IDENTIFY in the IDENTIFY AND REMOVE RESOURCES menu. 4. Select the physical disk which you want to remove from the RAID array and press Enter. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive. 5. Verify that it is the physical drive you want to remove, then press Enter. 6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the option REMOVE and press Enter. A list of the physical disks in the system that may be removed will be displayed. 7. If the physical disk you want to remove is listed, select it and press Enter. The physical disk will go into the REMOVE state, as indicted by the LED on the drive. If the physical disk you want to remove is not listed, it is not a SPARE or FAILED drive. Ask the customer to put the drive in the FAILED state before you can proceed to remove it. For information on putting the drive in a FAILED state, refer the customer to the PCI Dual Channel Ultra320 SCSI RAID Adapter Installation and Using Guide, order number SC23-1324. 8. Refer to the service guide for the system unit or enclosure that contains the physical drive for removal and replacement procedures for the following substeps: a. Remove the old hot-swap RAID drive. b. Install the new hot-swap RAID drive. After the hot-swap drive is in place, press Enter. The drive will exit the REMOVE state, and will go to the NORMAL state after you exit diagnostics. Note: There are no elective tests to run on a RAID drive itself under diagnostics (the drives are tested by the RAID adapter).Go to Step 0270-5. v Step 0270-5 If the RAID did not begin reconstructing automatically, perform the following steps. Adding a Disk to the RAID array and Reconstructing: Ask the customer to run the PCI SCSI Disk Array Manager using smit. An example of this procedure is: 1. Log in as root user. 2. Type smit pdam. 3. Select Change/Show PCI SCSI RAID Drive Status. 4. Select Add a Spare Drive. 5. Select the appropriate adapter. 6. Select the channel and ID of the drive that was replaced. 7. Press Enter when verified. 8. Press F3 until you return to the Change/Show PCI SCSI RAID Drive Status screen. 9. Select Add a Hot Spare. 10. Select the drive you just added as a spare. If there was no hot spare previously installed in the array, the array will begin reconstructing immediately. Reconstruction time will vary based on the size of the RAID array. Allow 1-2 hours for completion. To check the progress of the reconstruction: 1. Log in as root user. 2. Type smit pdam. 3. Select List PCI SCSI RAID Arrays. 4. Choose the array containing the drive you replaced. 292 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If the state of the RAID array is reconstructing, then it is in process of reconstructing. If it is optimal, then reconstruction has completed. 5. Press F10 to exit. Go to Step 027017. v Step 0270-6 Is the FRU a RAID adapter base card, RAID adapter cache card, or RAID adapter battery? NO Go to Step 0270-15. YES Go to Step 0270-7.v Step 0270-7 Do you want to change the FRU using a hot-swap operation? NO Power off the system, and remove the RAID adapter. Go to Step 0270-8. YES Remove the RAID adapter. Go to Step 0270-8.v Step 0270-8 Is the FRU you want to replace a RAID adapter cache card or RAID adapter battery? NO Go to Step 0270-10. YES Go to Step 0270-9.v Step 0270-9 Replace the FRU onto the existing base card. Go to Step 0270-11. v Step 0270-10 After physically removing the base card from the system, remove any other good FRUs (RAID cache card or cache battery) from the RAID base card adapter. Plug these FRUs on to the replacement RAID base card adapter FRU. Go to Step 0270-11. v Step 0270-11 Did you change the FRU using a hot-swap operation? NO Install the RAID adapter assembly into the system. Power on the system and log in to AIX. Go to Step 0270-12. YES Install the RAID adapter assembly into the system. Go to Step 0270-12.v Step 0270-12 Was the replacement FRU a RAID base card? NO Go to Step 0270-14. YES Go to Step 0270-13.v Step 0270-13 Attention: Prior to cabling the SCSI RAID adapter to the subsystem, check for preexisting configurations on the replacement SCSI RAID base card. The replacement base card can overwrite your system’s configuration data if it already has a configuration written to it. Check it before cabling the SCSI RAID subsystem array. Ask to customer to check for preexisting configuration on the SCSI RAID base card. Below is an example of this procedure: 1. Log in as root (if not already root). 2. Type smit pdam. 3. Select List PCI SCSI RAID Arrays. 4. If no RAID arrays are listed, then there are no preexisting configurations on the base card. Chapter 2. Isolation Procedures 293
5. Press F10 key to exit. If a preexisting configuration exists on the base card, ask the customer to run the PCI SCSI Disk Array Manager using smitty. 1. Log in as root (if not already root). 2. Type smit pdam from the AIX command prompt (if not already in the RAID manager). 3. Select Recovery Options. 4. Select Clear PCI SCSI RAID Adapter Configuration. Select the adapter that you just installed. Press Enter to confirm. 5. Return to the Recovery Options menu (if not already there). Select Resolve PCI SCSI RAID Adapter Configuration. Select Accept Configuration on Drives. Select the adapter that you just installed. Press Enter to confirm. The configuration on the new adapter should now match the configuration existent on the drives. 6. Press F10 to exit. You may now proceed to cable the RAID system array. Go to Step 0270-16. v Step 0270-14 Ask the customer to resynchronize the RAID array configuration: 1. Log in as root (if not already root). 2. Type smit pdam. 3. Select Recovery Options. 4. Select Resolve PCI SCSI RAID Adapter Configuration. 5. Select Retry Current Configuration. 6. Select the appropriate scraid (SCSI RAID) adapter. A message will be displayed as to the success of the operation. 7. Press F10 to exit. Go to Step 0270-16. v Step 0270-15 Other RAID FRUs require that the system be shut down prior to replacement. 1. If the operating system is running, perform the operating system shutdown procedure (get help if needed). 2. Turn off the system power. 3. Replace the FRU indicated by the FFC. Go to Step 0270-16. v Step 0270-16 Run the diagnostics in system verification mode on the RAID subsystem. Did the diagnostics run with no trouble found? NO Go to Step 0270-18. YES Go to Step 0270-17.v Step 0270-17 1. Use the option Log Repair Action in the TASK SELECTION menu to update the AIX error log. Select scraidX (where X is the RAID adapter number of the RAID subsystem you have been working on). Note: On systems with fault indicator LED, this changes the fault indicator LED from the Fault state to the Normal state. 2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the option Advanced Diagnostics Routines. 294 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. Run the diagnostic test on scraidX (where X is the RAID adapter number). Did the diagnostics run with no trouble found? NO Go to the Step 0270-18. YES If you changed the service processor or network settings, restore the settings to the value they had prior to servicing the system.This completes the repair; return the system to the customer. Go to Chapter 6, “Closing a service call,” on page 565. v Step 0270-18 Have you exchanged all the FRUs that correspond to the failing function codes? NO Go to Step 0270-19. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to “MAP 0030: Additional Problem Determination” on page 266.v Step 0270-19 Note: Note: Before proceeding, remove the FRU you just replaced and install the original FRU in its place. Use the next FRU on the list and go to Step 0270-2.MAP 0280: Console and Keyboard Problem Resolution Use this MAP to resolve console and keyboard problems when the system is booting. Purpose of This MAP Use this MAP to resolve console and keyboard problems when the system is booting. For other boot problems and concerns, go to “Problems with loading and starting the operating system (AIX and Linux)” on page 246.
Entry Table
Keyboard problem
Go to Step 0280-1.
Graphic adapter problem
Go to Step 0280-2.
Terminal problem
Go to Step 0280-3. v Step 0280-1 The system fails to respond to keyboard entries. This problem is most likely caused by a faulty keyboard, keyboard adapter, or keyboard cable. Try the FRUs in the following order. Test each FRU by retrying the failing operation. 1. Keyboard. 2. Keyboard adapter, typically located on the system board. 3. Keyboard cable, if not included with the keyboard. Were you able to resolve the problem? NO Go to “MAP 0030: Additional Problem Determination” on page 266. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0280-2 1. Some systems have a graphic adapter POST. If a graphic adapter POST is supported and it indicates a failure, follow the procedures in the system guide to resolve the problem. Chapter 2. Isolation Procedures 295
2. If a graphic adapter POST is supported and it does not indicate a failure, suspect the display or display cable. 3. If the system does not have a graphic adapter POST, go to the display’s problem determination procedures. If you do not find a problem, replace the graphics adapter. Were you able to resolve the problem? NO Go to “MAP 0030: Additional Problem Determination” on page 266. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0280-3 Go to the problem determination procedure for the terminal. If you do not find a problem, suspect the serial port adapter or terminal cable. Were you able to resolve the problem? NO Call your support person. YES Go to Chapter 5, “Verifying a repair,” on page 557.MAP 0285: Multipath I/O (MPIO) Problem Resolution Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit Failing Function Code (FFC) of an SCSD drive) to check the path from adapter to device. Purpose of this MAP Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit FFC of an SCSD drive) to check the path from adapter to device. Note: Not all devices support MPIO. Before proceeding with this MAP, make sure that the devices on both ends of the missing path support MPIO. v Step 0285-1 Look at the problem report screen for the missing path. After the resource name and FRU, the next column identifies the missing path between resources (for example, scsi0 -> hdisk1). This indicates the missing path between the two resources, scsi0 (the parent resource) and hdisk1 (the child resource). Is the cabling present between the two resources? NO Go to Step 0285-2. YES Go to Step 0285-4.Note: In the following MAP steps, if no path previously existed between a parent and child device, the child device will have to be changed from the defined to the available state, otherwise you will be unable to select the child device to which you want to establish a path. v Step 0285-2 1. Power off the system. 2. Connect the correct cable between the two resources. 3. Power on the system, rebooting AIX. 4. At the AIX command line, type smitty mpio. 5. Choose MPIO Path Management. 6. Select Enable Paths. 7. Select Enable Paths for a Device. 8. In the Device Name selection, choose the name of the child device (the device at the end of the path). 9. In the Paths to Enable selection, select the missing path (between the parent and child device). 296 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-4. YES Go to Step 0285-3.v Step 0285-3 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-4.v Step 0285-4 1. Power off the system. 2. Reseat the cables between the device and the adapter that have the missing path. 3. Power on the system, and reboot AIX. 4. At the AIX command line, type smitty mpio. 5. Choose MPIO Path Management. 6. Select Enable Paths. 7. Select Enable Paths for a Device. 8. In the Device Name selection, choose the name of the child device (the device at the end of the path). 9. In the Paths to Enable selection, select the missing path (between the parent and child device). 10. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-6. YES Go to Step 0285-5.v Step 0285-5 To verify that the device path is present, rerun diag -a When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-6.v Step 0285-6 You may have a problem with the driver interface or connector on either the parent or child device. 1. Power off the system (refer to the system service guide if necessary). 2. Remove the cables to the parent device. 3. Replace the cable(s) that go between the parent and child device (if present). 4. Reattach the cables to the parent device. 5. Power on the system, rebooting AIX (refer to the system service guide if necessary). 6. At the AIX command line, type smitty mpio. Chapter 2. Isolation Procedures 297
7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-7. YES Go to Step 0285-8.v Step 0285-7 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If the letter P is not shown in front of the resource, or if the system returns to the command prompt, or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-8.v Step 0285-8 You may have a problem with the driver interface or connector on either the parent or child device. 1. Power off the system. 2. Remove the cables to the parent device. 3. Replace the parent device. 4. Reattach the cables to the parent device. 5. Power on the system, rebooting AIX. 6. At the AIX command line, type smitty mpio. 7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-9. YES Go to Step 0285-10.v Step 0285-9 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. 298 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
YES Go to Step 0285-10.v Step 0285-10 You may have a problem with the driver interface or connector on the child device. 1. Power off the system. 2. Remove the cables to the child device. 3. Replace the child device (in the case of a SCSI or SAF-TE backplane, replace the backplane first, followed by the child device). 4. Reattach the cables to the child device. 5. Power on the system, rebooting AIX. 6. At the AIX command line, type smitty mpio. 7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Neither the cabling, nor the parent, nor the child seem to be causing the problem. Contact your service support structure. YES Go to Step 0285-11.v Step 0285-11 Rerun diag -a to verify that the device path is present. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Neither the cabling, nor the parent, nor the child seem to be causing the problem. Contact your service support structure.MAP 0291: Missing Device or Bus Problem Resolution Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. Purpose of this MAP Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. v Step 0291-1 The device may be missing because of a power problem. If the missing device is located in a drawer or enclosure, do the following: 1. Check for any environmental problem indicators such as power or cooling that may indicate a problem. 2. If a problem is indicated, follow the service documentation to resolve the problem. Go to Step 0291-2. Chapter 2. Isolation Procedures 299
v Step 0291-2 Inspect the cables (signal and power) of the missing device. Be sure all connections are in place and power is present. Refer to the system or enclosure documentation containing the power wiring diagram or system cable diagram to locate specific cables, determine the cable numbering, and check for a problem-determination procedure. Look for obvious power cabling problems, such as missing or loose cable connectors. Power problems can sometimes be identified by checking other devices that use the same power source (such as a diskette drive and a SCSI tape drive, even though they have different controllers). If other devices that share a power source are reported as missing devices, suspect the power source as the problem. If there is a power problem, refer to “Power problems” on page 129. Did you find a problem? NO Go to Step 0291-3. YES Correct the problem, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0291-3 Is the missing device a SCSI device installed in a SCSI Enclosure Services (AIX resource SESx) or SAF-TE (AIX resource SAFTEx), or a SCSI device installed in either type of enclosure? NO Go to “MAP 0210: General problem resolution” on page 246. YES Go to Step 0291-4.v Step 0291-4 Run the advanced diagnostics in problem determination mode on the SCSI Enclosure Services or SAF-TE device.Note: A no trouble found result, or if you get another SRN with the same digits before the dash as you previously had from the diagnostics, indicates that you did not get a different SRN. Did you get a different SRN than when you ran the diagnostics previously? NO Go to Step 0291-5. YES Take the following action: 1. Look up the SRN.Note: If the SRN is not listed in the Service Request Number Lists, look for additional information in the following: – Any supplemental service manual for the device. – The diagnostic Problem Report screen. – The Service Hints service aid in “Running the online and eServer stand-alone diagnostics” on page 672.2. Perform the action listed.v Step 0291-5 Power off the system. Disconnect all hot-swap devices attached to the adapter. Reconnect the hot-swap devices one at time. After reconnecting each device, do the following: 1. Power on the system and boot the system in the same mode that you were in when you received the symptom that led you to this MAP “Powering on and powering off” on page 588. 2. At an AIX command prompt, run the diag -a command to check for missing options. 3. Verify that the device you just added to the system is present in the system configuration. If other devices are now missing, the problem may be with the last device that was reconnected or reinstalled. Perform these substeps: a. If the device that was just reinstalled in the system is missing, or if additional devices are now missing, replace the last device that was reinstalled. 300 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
b. Rerun diag -a to check for missing options. c. If devices are no longer missing, go to Chapter 5, “Verifying a repair,” on page 557. Otherwise, contact your support center. Note: A device problem can cause other devices attached to the same SCSI adapter to go into the defined state. Ask the system administrator to make sure that all devices attached to the same SCSI adapter as the device that you replaced are in the available state.4. If no devices were missing, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information. If you have not replaced FFCs B88, 190, and 152 go to “MAP 0210: General problem resolution” on page 246, using FFCs (in order): B88, 190, and 152.PFW1540: Problem isolation procedures The PFW1540 procedures are used to locate problems in the processor subsystem or I/O subsystem(s). If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table; then follow the instructions given in the Action column.
Problem Isolation Procedures
Symptom/Reference Code/Checkpoint
Action
You have or suspect an I/O card or I/O subsystem failure.You received one of the following SRNs or reference codes: 101-000, 101-517, 101-521, 101-538, 101-551 to 101-557, 101-559 to 101-599, 101-662, 101-727, 101-c32, 101-c33, 101-c70
Go to “PFW1542: I/O problem isolation procedure” on page 302.
You have or suspect a memory or processor subsystem problem. You received the following SRN or reference code: 101-185
Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316.
If you were directed to the PFW1540 procedure by an SRN and that SRN is not listed in this table.
Go to “PFW1542: I/O problem isolation procedure” on page 302. FRU identify LEDs Your system is configured with an arrangement of LEDs that help identify various components of the system. These include but are not limited to: v Rack identify beacon LED (optional rack status beacon) v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED v Power subsystem FRUs v Processor subsystem FRUs v I/O subsystem FRUs v I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed; refer to “Managing the Advanced System Management Interface (ASMI)” on page 645. Chapter 2. Isolation Procedures 301
Any identify LED in the system may also be flashed by using the AIX diagnostic programs task “Identify and Attention Indicators”. The procedure to use the AIX diagnostics task “Identify and Attention Indicators” is outlined in “Tasks and Service Aids” in “Running the online and eServer stand-alone diagnostics” on page 672. PFW1542: I/O problem isolation procedure This I/O problem-determination procedure isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards that are failing will have been replaced or reseated. Notes: 1. To avoid damage to the system or subsystem components, unplug the power cords before removing or installing a part. 2. This procedure assumes that either: v An optical drive is installed and connected to the integrated EIDE adapter, and an AIX diagnostic CD-ROM is available. OR v AIX standalone diagnostics can be booted from a NIM server.3. If a power-on password or privileged-access password is set, you are prompted to enter the password before the AIX diagnostic CD-ROM can load. 4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test (POST). 5. The service processor might have been set by the user to monitor system operations and to attempt recoveries. You might want to disable these options while you diagnose and service the system. If these settings are disabled, make notes of their current settings so that they can be restored before the system is turned back over to the customer. The following settings may be of interest. Monitoring (also called surveillance) From the ASMI menu, expand the System Configuration menu, then click Monitoring. Disable both types of surveillance. Auto power restart (also called unattended start mode) From the ASMI menu, expand Power/Restart Control, then click Auto Power Restart, and set it to disabled. Wake on LAN® From the ASMI menu, expand Wake on LAN, and set it to disabled. Call Out From the ASMI menu, expand the Service Aids menu, then click Call-Home/Call-In Setup. Set the call-home system port and the call-in system port to disabled.6. Verify that the system has not been set to boot to the SMS menus or to the open firmware prompt. From the ASMI menu, expand Power/Restart Control to view the menu, then click Power On/Off System. The AIX/Linux partition mode boot indicates Continue to Operating System.Use this procedure to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. Perform the following procedure: v PFW1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 302 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Select slow system boot speed on the power on/off system menu under the power/restart control menu on the ASMI. 4. Turn on the power. 5. Insert the AIX diagnostic CD-ROM into the optical drive. Does the optical drive appear to operate correctly? NO Go to “Problems with loading and starting the operating system (AIX and Linux)” on page 246. YES Continue to PFW1542-2.v PFW1542-2 1. When the keyboard indicator is displayed (the word ?keyboard?), if the system or partition gets that far in the IPL process, press the 5 key on the firmware console. 2. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Continue toPFW1542-3. YES Go to PFW1542-4.v PFW1542-3 The system is unable to boot standalone AIX diagnostics. Check the service processor error log (using the ASMI) and the operator panel for additional error codes resulting from the slow boot that was performed in PFW1542-1. Did the slow boot generate a different error code or partition firmware hang from the one that originally sent you to PFW1542? NO If you were sent here by an error code, and the error code did not change as the result of a slow boot, you have a processor subsystem problem. Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. If you were sent here because the system is hanging on a partition firmware checkpoint, and the hang condition did not change as a result of the slow boot, go to PFW1542-5. YES Restore fast boot on the power on/off system menu from the ASMI. Look up the new error code in the reference code index and perform the listed actions.v PFW1542-4 The system stopped with the Please define the System Console prompt on the system console. Standalone diagnostics can be booted. Perform the following: 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX operating system environment before you can continue with the diagnostics. This is a separate operation from selecting the firmware console. 4. Select Advanced Diagnostic Routines. 5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run diagnostics on all resources. Did running diagnostics produce a different symptom? NO Continue with the following sub-step. YES Return to the Problem Analysis procedures with the new symptom.6. Record any devices missing from the list of all adapters and devices. Continue with this procedure. When you have fixed the problem, use this record to verify that all devices appear when you run system verification. Are there any devices missing from the list of all adapters and devices? Chapter 2. Isolation Procedures 303
NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original configuration. Be sure to select fast boot on the power on/off system menu on the ASMI. Go to Chapter 5, “Verifying a repair,” on page 557. YES The boot attempts that follow will attempt to isolate any remaining I/O subsystem problems with missing devices. Ignore any codes that may display on the operator panel unless stated otherwise. Continue to PFW1542-5.v PFW1542-5 Examine RIO port 0 of the first RIO bus card (Un-P1-C8-T2) in the base system unit drawer. Are there any I/O subsystems attached to this RIO card? NO Go to PFW1542-29. YES Continue to PFW1542-6.v PFW1542-6 There may be devices missing from one of more of the I/O subsystems, or one or more devices in the I/O subsystems may be causing the system or a partition to hang during IPL.Note: Various types of I/O subsystems might be connected to this system. The RIO and InfiniBand® ports on these subsystems are shown in the following table. Use this table to determine the physical location codes of the RIO or InfiniBand™ connectors that are mentioned in the remainder of this MAP.
Table 42. RIO ports location table
8203-E4A, 9407-M15, or 9408-M25 (base system) adapters
7314-G30 or 5796
7311-D20 or 5095
RIO port 0
Un-P1-C8-T2 Un-P1-C6-T2
Un-P1-C7-T2
Un-P1-C05-00 or Un-P1-C05-T2 (bottom connector)
RIO port 1
Un-P1-C8-T1 Un-P1-C6-T1
Un-P1-C7-T1
Un-P1-C05-01 or Un-P1-C05-T1 (top connector) Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that the system is cabled correctly. Refer to the cabling information for your I/O enclosure for valid configurations. Record the current cabling configuration and then continue with the following steps. In the following steps, the term RIO cable means either RIO cable or InfiniBand cable. 1. Turn off the power. Record the location and machine type and model number, or feature number, of each I/O drawer or tower. In the following steps, use this information to determine the physical location codes of the RIO connectors that are referred to by their logical names. For example, if I/O subsystem #1 is a 7311-D20 drawer, RIO port 0 is Un-P1-C05-T2. 2. At the base system drawer, disconnect the cable connection at RIO port 0 (Un-P1-C8-T2). 3. At the other end of the RIO cable referred to in step 2 of PFW1542-6, disconnect the I/O subsystem port connector 0. The RIO cable that was connected to RIO port 0 in the base system should now be loose; remove it. Record the location of this I/O subsystem and call it ?subsystem #1?. 4. Examine the connection at the I/O port connector 1 of the I/O subsystem recorded in step 3 of PFW1542-6. If the RIO cable attached to I/O port connector 1 connects to the I/O port connector 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #1, then go to step 8 of PFW1542-6. 304 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 6. At the I/O subsystem recorded in step 3 of PFW1542-6, disconnect the I/O port connector 1 and reconnect to I/O port 0. 7. Verify that a single RIO cable connects base system RIO port 0 (Un-P1-C8-T2) to the I/O subsystem recorded in step 4 port 0. Go to step 28 of PFW1542-6. 8. Record the location of the next I/O subsystem and call it ?subsystem #2?. This is the I/O subsystem that is connected to I/O port 1 of subsystem #1. 9. Examine the connection at the I/O port 1 of subsystem #2 recorded in step 8 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #2 and call it ?subsystem #3?. Go to step 13 of PFW1542-6. 10. The RIO cable attached to the I/O port 1 of subsystem #2 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 11. On subsystem #2, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 12. Verify that a single RIO cable connects base system RIO port 0 to one or two I/O subsystems. Go to step 28 of PFW1542-6. 13. Examine the connection at the I/O port 1 of subsystem #3 recorded in step 9 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #3 and call it ?subsystem #4?. Go to step 17 of PFW1542-6. 14. The RIO cable attached to the I/O port 1 of subsystem #3 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 15. On subsystem #3, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 16. Verify that a single RIO cable connects base system RIO port 0 to three I/O subsystems. Go to step 28 of PFW1542-6. 17. Examine the connection at the I/O port 1 of subsystem #4 recorded in step step 13 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #4 and call it ?subsystem #5?. Go to step step 21 of PFW1542-6. 18. The RIO cable attached to the I/O port 1 of subsystem #4 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 19. On subsystem #4, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 20. Verify that a single RIO cable connects base system RIO port 0 to four I/O subsystems. Continue to step 28 of PFW1542-6. 21. Examine the connection at the I/O port 1 of subsystem #5 recorded in step step 17 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #5 and call it ?subsystem #6?. Go to step step 25 of PFW1542-6 22. The RIO cable attached to the I/O port 1 of subsystem #5 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 23. On subsystem #5, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. Chapter 2. Isolation Procedures 305
24. Verify that a single RIO cable connects base system RIO port 0 to five I/O subsystems. Go to step 28 of PFW1542-6 25. The RIO cable attached to the I/O port 1 of subsystem #6 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 26. On subsystem #6, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 27. Verify that a single RIO cable connects base system RIO port 0 to six I/O subsystems. Continue with step 28 of PFW1542-6 28. Turn on the power to boot the standalone AIX diagnostics from CD-ROM. 29. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 30. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 31. Verify that all adapters and the attached devices are listed. Did the ?Please define the System Console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-7. YES The RIO cable that was removed in step 3 above is defective. Replace this RIO cable.– If six I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #6 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If five I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #5 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If four I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #4 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If three I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #3 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If two I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #2 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If one I/O subsystem is chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #1 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). Restore the system to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-7 The I/O attached to the RIO ports on the other GX adapter will now be isolated. Power off the system. Disconnect the cable connection at RIO port 0 (Un-P1-C8-T2) of the base system. v PFW1542-8 1. Turn on the power to boot the standalone diagnostic CD-ROM. 2. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 3. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 4. Check that all adapters and attached devices in the base system are listed. If the ?Please define the system console? screen was not displayed or all adapters and attached devices are not listed, the problem is in the base system. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-29. YES Go to PFW1542-21.
306 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v PFW1542-9 For subsystem #1: Are there any adapters in the I/O subsystem? NO Go to PFW1542-10. YES Go to PFW1542-15.v PFW1542-10 For subsystem #2: Are there any adapters in the I/O subsystem? NO Go to PFW1542-11. YES Go to PFW1542-16.v PFW1542-11 For subsystem #3: Are there any adapters in the I/O subsystem? NO Go to PFW1542-12. YES Go to PFW1542-17.v PFW1542-12 For subsystem #4: Are there any adapters in the I/O subsystem? NO Go to PFW1542-13. YES Go to PFW1542-18.v PFW1542-13 For subsystem #5: Are there any adapters in the I/O subsystem? NO Go to PFW1542-14. YES Go to PFW1542-19.v PFW1542-14 For subsystem #6: Are there any adapters in the I/O subsystem? NO Go to PFW1542-23. YES Go to PFW1542-20.v PFW1542-15 (Subsystem #1) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-10. YES Go to PFW1542-21.v PFW1542-16 (Subsystem #2) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. Chapter 2. Isolation Procedures 307
3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-11. YES Go to PFW1542-21.v PFW1542-17 (Subsystem #3) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-12. YES Go to PFW1542-21.v PFW1542-18 (Subsystem #4) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-13. YES Go to PFW1542-21.
308 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v PFW1542-19 (Subsystem #5) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-14. YES Go to PFW1542-21.v PFW1542-20 (Subsystem #6) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-23. YES Go to PFW1542-21.v PFW1542-21 If the ?Please define the system console? screen was displayed and all adapters and attached devices were not listed, the problem is with one of the adapters or attached devices that was removed or disconnected from the I/O subsystem. 1. Turn off the power. 2. Reinstall one adapter or device that was removed. Use the original adapters in their original slots when reinstalling adapters. 3. Turn on the power to boot the standalone diagnostic CD-ROM. 4. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 5. Use the option Display Configuration and Resource List to list all adapters and attached devices. 6. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? Chapter 2. Isolation Procedures 309
NO Go to PFW1542-22. YES Reinstall the next adapter and device and return to the beginning of this step. Repeat this process until an adapter or device causes the ?Please define the System Console? screen to not display, or all attached devices and adapters to not be listed. After installing all of the adapters and the ?Please define the System Console? screen does display and all attached devices and adapters are listed, return the system to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1542-22 Replace the adapter you just installed with a new adapter and retry booting AIX standalone diagnostics from CD-ROM. 1. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 2. Use the option Display Configuration and Resource List to list all adapters and attached devices. 3. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO The I/O subsystem backplane is defective. Replace the I/O subsystem backplane. In all 4 subsystem types, the I/O subsystem backplane is Un-CB1. Then go to PFW1542-24. YES The adapter was defective. Go to PFW1542-24.v PFW1542-23 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, if present, in the sequence listed: a. I/O subsystem #1 backplane b. I/O subsystem #2 backplane c. I/O subsystem #3 backplane d. I/O subsystem #4 backplane e. I/O subsystem #5 backplane f. I/O subsystem #6 backplane g. The RIO interface in the base system to which the RIO cables are presently attached.4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone AIX diagnostics from CD. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all attached devices and adapters are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the ?Please define the System Console? screen to be displayed and all adapters and attached devices to be listed. If you have replaced all the items listed above and the ?Please define the System Console? screen does not display or all attached devices and adapters are not listed, check all external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to PFW1542-22.v PFW1542-24 310 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The item you just replaced fixed the problem. 1. Turn off the power. 2. If a display adapter with keyboard and mouse were installed, reinstall the display adapter, keyboard, and mouse. 3. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable. 4. Plug in all adapters that were previously removed but not reinstalled. 5. Reconnect the I/O subsystem power cables that were previously disconnected. Return the system to its original condition. Go to Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-25 1. Turn off the power. 2. At the base system, reconnect the cable connection at RIO port 0 (Un-P1-C8-T2) recorded in PFW1542-7. 3. At the base system, reconnect the cable connection at RIO port 1 (Un-P1-C8-T1) recorded in PFW1542-7. 4. Reconnect the power cables to the I/O subsystems that were found attached to the base system RIO ports mentioned in step 2 and step 3 of PFW1542-25. All I/O subsystems that were attached to the base system RIO port 0 and RIO port 1 should now be reconnected to the base system. 5. Make sure the I/O subsystem(s) are cabled correctly. See the cabling information for your I/O subsystem. 6. Turn on the power to boot standalone AIX diagnostics from CD-ROM. 7. If the ?Please define the System Console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-9 to isolate a problem in an I/O subsystem attached to the base system RIO bus on the system backplane. YES Go to PFW1542-26.v PFW1542-26 Is there a second GX adapter plugged into Un-P1-C6 in the base system, and if so, is there at least one I/O subsystem attached to it? NO Go to PFW1542-29. YES Continue to PFW1542-27.v PFW1542-27 1. Turn off the power. 2. At the base system, reconnect the cable connection at RIO port 0 on the second GX adapter (Un-P1-C6-T2) recorded in PFW1542-7. 3. At the base system, reconnect the cable connection at RIO port 1 on the second GX adapter (Un-P1-C6-T1) recorded in PFW1542-7. 4. Reconnect the power cables to the I/O subsystems that were attached to the second GX adapter’s ports mentioned in substeps 2 and 3 in this step. All I/O subsystems that were attached to RIO port 0 on the second GX adapter (Un-P1-C6-T2) and RIO port 1 on the second GX adapter (Un-P1-C6-T1) in the base system should now be reconnected to the system. 5. Make sure that the I/O subsystem(s) are cabled correctly as shown in the cabling information for your I/O expansion unit. 6. Turn on the power to boot the standalone AIX diagnostic CD-ROM. Chapter 2. Isolation Procedures 311
7. If the Please define the system console screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Verify that all adapters and attached devices are listed. Did the ?Please define the system console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-28 to isolate the problems in the I/O subsystems that are attached to the second GX adapter in the base system. YES Go to PFW1542-29.v PFW1542-28 At the base system, reconnect the second I/O subsystem to the RIO ports on the base system’s GX adapter at Un-P1-C6-T1 and Un-P1-C6-T2. The RIO ports on these subsystems are shown in the following table. Use this table to determine the physical location codes of the RIO connectors that are mentioned in the remainder of this MAP.
Table 43. RIO ports location table
8203-E4A, 9407-M15, or 9408-M25 (base system) adapters
7314-G30 or 5796
7311-D20 or 5095
RIO port 0
Un-P1-C8-T2 Un-P1-C6-T2
Un-P1-C7-T2
Un-P1-C05-00 or Un-P1-C05-T2 (bottom connector)
RIO port 1
Un-P1-C8-T1 Un-P1-C6-T1
Un-P1-C7-T1
Un-P1-C05-01 or Un-P1-C05-T1 (top connector) Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that the system is cabled correctly. Record the current cabling configuration and then continue with the following steps. 1. Turn off the power. Record the location and machine type and model number, or feature number, of each I/O drawer or tower. In the following steps, use this information to determine the physical location codes of the RIO connectors that are referred to by their logical names. For example, if I/O subsystem #1 is a 7311/D20 drawer, RIO port 0 is Un-P1-C05-T2. 2. At the base system, disconnect the cable connection at RIO port 0 (Un-P1-C6-T2). 3. At the other end of the RIO cable referred to in step 2 of PFW1542-28, disconnect the I/O subsystem port connector 0. The RIO cable that was connected to RIO port 0 on the expansion card should now be loose; remove it. Record the location of this I/O subsystem and call it ?subsystem #1?. 4. Examine the connection at the I/O port connector 1 of the I/O subsystem recorded in step 3 of PFW1542-28. If the RIO cable attached to I/O port connector 1 connects to the I/O port connector 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #1, then go to step 8 of PFW1542-28. 5. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 6. At the I/O subsystem recorded in step 3 of PFW1542-28, disconnect the I/O port connector 1 and reconnect to I/O port 0. 7. Verify that a single RIO cable connects base system RIO port 0 (Un-P1-C6-T2) to the I/O subsystem recorded in step 4 of PFW1542-28 port 0. Go to step 28 of PFW1542-28. 8. Record the location of the next I/O subsystem and call it ?subsystem #2?. This is the I/O subsystem that is connected to I/O port 1 of subsystem #1. 312 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Examine the connection at the I/O port 1 of subsystem #2 recorded in step 8 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #2 and call it ?subsystem #3?. Go to step 13 of PFW1542-28. 10. The RIO cable attached to the I/O port 1 of subsystem #2 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 11. On subsystem #2, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 12. Verify that a single RIO cable connects base system RIO port 0 to two I/O subsystems. Go to step 28 of PFW1542-28. 13. Examine the connection at the I/O port 1 of subsystem #3 recorded in step 9 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #3 and call it ?subsystem #4?. Go to step 17 of PFW1542-28. 14. The RIO cable attached to the I/O port 1 of subsystem #3 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 15. On subsystem #3, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 16. Verify that a single RIO cable connects base system RIO port 0 to three I/O subsystems. Go to step 28 of PFW1542-28. 17. Examine the connection at the I/O port 1 of subsystem #4 recorded in step 13 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #4 and call it ?subsystem #5?. Go to step 21 of PFW1542-28 18. The RIO cable attached to the I/O port 1 of subsystem #4 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 19. On subsystem #4, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 20. Verify that a single RIO cable connects base system RIO port 0 to four I/O subsystems. Go to step 27 of PFW1542-28. 21. Examine the connection at the I/O port 1 of subsystem #5 recorded in step 17 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #5 and call it ?subsystem #6?. Go to step 25 of PFW1542-28. 22. The RIO cable attached to the I/O port 1 of subsystem #5 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 23. On subsystem #5, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 24. Verify that a single RIO cable connects base system RIO port 0 to five I/O subsystems. Go to step 28 of PFW1542-28. 25. The RIO cable attached to the I/O port 1 of subsystem #6 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 26. On subsystem #6, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 27. Verify that a single RIO cable connects base system RIO port 0 to six I/O subsystems. Go to step 28 of PFW1542-28. Chapter 2. Isolation Procedures 313
28. Turn on the power to boot the standalone AIX diagnostics from CD-ROM. 29. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 30. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 31. Verify that all adapters and the attached devices are listed. Did the ?Please define the system console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-9. YES The RIO cable that was removed in step 3 above is defective. Replace this RIO cable.– If six I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #6 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If five I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #5 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If four I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #4 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If three I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #3 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If two I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #2 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If one I/O subsystem is chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #1 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). Restore the system back to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-29 Are there any adapters in the PCI slots in the base system? NO Go to PFW1542-30. YES Go to PFW1542-32.v PFW1542-30 Replace the system backplane, Un-P1. Continue to PFW1542-31. v PFW1542-31 1. Boot standalone AIX diagnostics from CD. 2. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all adapters and attached devices. 4. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-35. YES Go to PFW1542-24.v PFW1542-32 1. If it is not already off, turn off the power. 2. Label and record the location of any cables attached to the adapters. 3. Record the slot number of the adapters. 4. Remove all adapters from slots 1, 2, 3, 4, 5, and 6 in the base system that are not attached to the boot device. 314 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Turn on the power to boot standalone AIX diagnostics from CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 key on the ASCII terminal’s keyboard. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-35. YES Continue to PFW1542-33.v PFW1542-33 If the ?Please define the System Console? screen does display and all adapters and attached devices are listed, the problem is with one of the adapters or devices that was removed or disconnected from the base system. 1. Turn off the power. 2. Reinstall one adapter and device that was removed. Use the original adapters in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone AIX diagnostics from the optical drive. 4. If the ?Please define the System Console? screen is displayed, follow the directions to select the system console. 5. Use the Display Configuration and Resource List to list all adapters and attached devices. 6. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Continue to PFW1542-34. YES Return to the beginning of this step to continue reinstalling adapters and devices.v PFW1542-34 Replace the adapter you just installed with a new adapter and retry the boot to standalone AIX diagnostics from CD-ROM. 1. If the ?Please Define the System Console? screen is displayed, follow directions to select the system console. 2. Use the option Display Configuration and Resource List to list all adapters and attached devices. 3. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-30. YES The adapter you just replaced was defective. Go to PFW1542-24.v PFW1542-35 1. Turn off the power. 2. Disconnect the base system power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Optical drive b. Removable media backplane and cage assembly c. Disk drive backplane and cage assembly d. I/O backplane, location Un-P1 Chapter 2. Isolation Procedures 315
e. Service processor4. Reconnect the base system power cables. 5. Turn on the power. 6. Boot standalone AIX diagnostics from CD. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all adapters and attached devices listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the Please define the System Console screen to be displayed and all adapters and attached devices to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all adapters and attached devices are not listed, check all external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to PFW1542-24.PFW1548: Memory and processor subsystem problem isolation procedure Use this problem isolation procedure to aid in solving memory and processor problems that are not found by normal diagnostics. Notes: 1. To avoid damage to the system or subsystem components, unplug the power cords before removing or installing any part. 2. This procedure assumes that either: v An optical drive is installed and connected to the integrated EIDE adapter, and an AIX diagnostic CD-ROM is available. OR v AIX standalone diagnostics can be booted from a NIM server.3. If a power-on password or privileged-access password is set, you are prompted to enter the password before the AIX diagnostic CD-ROM can load. 4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test (POST). 5. The service processor might have been set by the user to monitor system operations and to attempt recoveries. You might want to disable these options while you diagnose and service the system. If these settings are disabled, make notes of their current settings so that they can be restored before the system is turned back over to the customer. The following settings may be of interest. Monitoring (also called surveillance) From the ASMI menu, expand the System Configuration menu, then click Monitoring. Disable both types of surveillance. Auto power restart (also called unattended start mode) From the ASMI menu, expand Power/Restart Control, then click Auto Power Restart, and set it to disabled. Wake on LAN From the ASMI menu, expand Wake on LAN, and set it to disabled. 316 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Call Out From the ASMI menu, expand the Service Aids menu, then click Call-Home/Call-In Setup. Set the call-home system port and the call-in system port to disabled.6. Verify that the system has not been set to boot to the System Management Services (SMS) menus or to the open firmware prompt. From the ASMI menu, expand Power/Restart Control to view the menu, then click Power On/Off System. The AIX/Linux partition mode boot should say ?Continue to Operating System?. 7. The service processor might have recorded one or more symptoms in its error/event log. Use the Advanced System Management Interface (ASMI) menus to view the error/event log. v If you arrived here after performing a slow boot, look for a possible new error that occurred during the slow boot. If there is a new error, and its actions call for a FRU replacement, perform those actions. If this does not resolve the problem, go to PFW1548-1. v If an additional slow boot has not been performed, or if the slow boot did not yield a new error code, look at the error that occurred just before the original error. Perform the actions associated with that error. If this does not resolve the problem, go to PFW1548-1. v If a slow boot results in the same error code, and there are no error codes before the original error code, go to PFW1548-1.8. To continue with this procedure, select one of the following files according to the system you are servicing: v “PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached” v “PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached” on page 324 “PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached” “PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached” on page 324PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached This procedure is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17, and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. Perform the following procedure: v PFW1548-1 1. Ensure that the diagnostics and the operating system are shut down. Is the system at ?service processor standby?, indicated by 01 in the control panel? NO Replace the system backplane, location: Un-P1. Return to step PFW1548-1. YES Continue with substep 2.2. Turn on the power using either the white button or the ASMI menus. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-2.3. Insert the AIX diagnostic CD-ROM into the optical drive. Chapter 2. Isolation Procedures 317
Note: If you cannot insert the diagnostic CD-ROM, go to PFW1548-2. 4. When the word keyboard is displayed on an ASCII terminal or a directly attached keyboard, press the number 5 key. 5. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-2. YES Go to PFW1548-14.v PFW1548-2 1. Insert the AIX diagnostic CD-ROM into the optical drive.Note: If you cannot insert the diagnostic CD-ROM, go to step PFW1548-3. 2. Turn on the power using either the white button or the ASMI menus. If the diagnostic CD-ROM is not in the optical drive, insert it now. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-14.v PFW1548-3 1. Turn off the power. 2. If you have not already done so, configure the service processor (using the ASMI menus) with the instructions in note 6 on page 317 at the beginning of this procedure, then return here and continue. 3. Exit the service processor (ASMI) menus and remove the power cords. 4. Disconnect all external cables (parallel, system port 1, system port 2, keyboard, mouse, USB devices, SPCN, Ethernet on the system planar, and so on). Also disconnect all of the external cables attached to the service processor.Go to the next step. v PFW1548-4 1. If this is a deskside system, remove the service access cover. If this is a rack-mounted system, place the drawer into the service position and remove the service access cover. Also remove the front cover. 2. Record the slot numbers of the PCI adapters and I/O expansion cards if present. Label and record the locations of all cables attached to the adapters. Disconnect all cables attached to the adapters and remove all of the adapters. 3. Remove the removable media/disk drive enclosure assembly by pulling out the blue tabs at the bottom of the enclosure, then sliding the enclosure out approximately three centimeters. 4. Remove and label the disk drives from the media/disk drive enclosure assembly. 5. Remove a memory DIMM quad. Notes: a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.6. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 7. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? 318 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
NO Go to PFW1548-7. YES Go to the next step.v PFW1548-5 Were any memory DIMMs removed from system backplane? NO Go to PFW1548-8. YES Go to the next step.v PFW1548-6 1. Turn off the power, and remove the power cords. 2. Replug the memory DIMMs that were removed from the system backplane in PFW1548-2 in their original locations. Notes: a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.3. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 4. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the memory DIMMs quad with new or previously removed memory DIMM quad. Repeat this step until the defective memory DIMM quad is identified, or both memory DIMM quads have been exchanged. Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17 and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. If your symptom did not change and both the memory DIMM quads have been exchanged, call your service support person for assistance. If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-7 One of the FRUs remaining in the system unit is defective.Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed as the original memory DIMM. 1. Turn off the power, remove the power cords, and exchange the following FRUs, one at a time, in the order listed: a. Memory DIMMs. Exchange one quad at a time with new or previously removed DIMM quads b. System backplane, location: Un-P1 c. Power supplies, locations: Un-E1 and Un-E2.2. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 3. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? Chapter 2. Isolation Procedures 319
NO Reinstall the original FRU. Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-8 1. Turn off the power. 2. Reconnect the system console. Notes: a. If an ASCII terminal has been defined as the firmware console, attach the ASCII terminal cable to the S1 connector on the rear of the system unit. b. If a display attached to a display adapter has been defined as the firmware console, install the display adapter and connect the display to the adapter. Plug the keyboard and mouse into the keyboard connector on the rear of the system unit.3. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 4. If the ASCII terminal or graphics display (including display adapter) is connected differently from the way it was previously, the console selection screen appears. Select a firmware console. 5. Immediately after the word keyboard is displayed, press the number 1 key on the directly attached keyboard, or an ASCII terminal. This action activates the system management services (SMS). 6. Enter the appropriate password if you are prompted to do so. Is the SMS screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. If you are using an ASCII terminal, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the system backplane, location: Un-P1.2. If you are using a graphics display, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the display adapter. b. Replace the backplane in which the graphics adapter is plugged. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom.YES Go to the next step.v PFW1548-9 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 320 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Use the cam levers to reconnect the disk drive enclosure assembly to the I/O backplane. 4. Reconnect the removable media/disk drive enclosure assembly by sliding the media enclosure toward the rear of the system, then pressing the blue tabs. 5. Plug in the power cords and wait for 01 in the upper-left corner of the operator panel display. 6. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Optical drive 2. Removable media enclosure. 3. System backplane, Un-P1.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-10 The system is working correctly with this configuration. One of the disk drives that you removed from the disk drive backplanes may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Install a disk drive in the media/disk drive enclosure assembly. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step with all disk drives that were installed in the disk drive backplane. After all of the disk drives have been reinstalled, go to the next step.
Chapter 2. Isolation Procedures 321
v PFW1548-11 The system is working correctly with this configuration. One of the devices that was disconnected from the system backplane may be defective. 1. Turn off the power and remove the power cords. 2. Attach a system backplane device (for example: system port 1, system port 2, USB, keyboard, mouse, Ethernet) that had been removed. After all of the I/O backplane device cables have been reattached, reattached the cables to the service processor one at a time. 3. Plug in the power cords and wait for 01 in the upper-left corner on the operator panel display. 4. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO The last device or cable that you attached is defective. To test each FRU, exchange the FRUs in the following order: 1. Device and cable (last one attached) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step until all of the devices are attached. Go to the next step.v PFW1548-12 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may be defective. 1. Turn off the power and remove the power cords. 2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Go to the next step. YES Repeat this step until all of the FRUs (adapters) are installed. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-13 The last FRU installed or one of its attached devices is probably defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 322 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Starting with the last installed adapter, disconnect one attached device and cable. 4. Plug in the power cords and wait for the 01 in the upper-left corner on the operator panel display. 5. Turn on the power using either the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 6. If the Console Selection screen is displayed, choose the system console. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES The last device or cable that you disconnected is defective. Exchange the defective device or cable and then go to the next step.v PFW1548-14 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. Select Advanced Diagnostics Routines. 4. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue with the diagnostics. This is a separate operation from selecting the console display. 5. If the NEW RESOURCE screen is displayed, select an option from the bottom of the screen. Note: Adapters and devices that require supplemental media are not shown in the new resource list. If the system has adapters or devices that require supplemental media, select option 1. 6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter. 7. Select All Resources. If you were sent here from step PFW1548-18, select the adapter or device that was loaded from the supplemental media. Did you get an SRN? NO Go to step PFW1548-16. YES Go to the next step.v PFW1548-15 Look at the FRU part numbers associated with the SRN. Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)? NO Exchange the FRU with the highest failure percentage that has not been changed. Chapter 2. Isolation Procedures 323
Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics run with no trouble found. Run diagnostics after each FRU is exchanged. Go to Chapter 5, “Verifying a repair,” on page 557. YES If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.v PFW1548-16 Does the system have adapters or devices that require supplemental media? NO Go to step the next step. YES Go to step PFW1548-18.v PFW1548-17 Consult the PCI adapter configuration documentation for your operating system to verify that all adapters are configured correctly. Go to Chapter 5, “Verifying a repair,” on page 557. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. v PFW1548-18 1. Select Task Selection. 2. Select Process Supplemental Media and follow the on-screen instructions to process the media. Supplemental media must be loaded and processed one at a time. Did the system return to the TASKS SELECTION SCREEN after the supplemental media was processed? NO Go to the next step. YES Press F3 to return to the FUNCTION SELECTION screen. Go to step PFW1548-14, substep 4 on page 323.v PFW1548-19 The adapter or device is probably defective. If the supplemental media is for an adapter, replace the FRUs in the following order: 1. Adapter 2. System backplane, location: Un-P1. If the supplemental media is for a device, replace the FRUs in the following order: 1. Device and any associated cables 2. The adapter to which the device is attached Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. Go to Chapter 5, “Verifying a repair,” on page 557. End of procedure.PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached This procedure is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. 324 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17, and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. Perform the following procedure: v PFW1548-1 1. Ensure that the diagnostics and the operating system are shut down. Is the system at ?service processor standby?, indicated by 01 in the control panel? NO Replace the system backplane, location: Un-P1. Return to step PFW1548-1. YES Continue with substep 2.2. Turn on the power using either the white button or the ASMI menus. Does the system reach hypervisor standby as indicated on the HMC? NO Go to PFW1548-3. YES Go to PFW1548-2.3. Insert the AIX diagnostic CD-ROM into the optical drive. Note: If you cannot insert the diagnostic CD-ROM, go to PFW1548-2. 4. When the word keyboard is displayed on an ASCII terminal, a directly attached keyboard, or hardware management console (HMC), press the number 5 key. 5. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-2. YES Go to PFW1548-14.v PFW1548-2 Insert the AIX diagnostic CD-ROM into the optical drive.Note: If you cannot insert the diagnostic CD-ROM, go to step PFW1548-3. Turn on the power using either the white button or the ASMI menus. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-14.v PFW1548-3 1. Turn off the power. 2. If you have not already done so, configure the service processor (using the ASMI menus) with the instructions in note 6 on page 317 at the beginning of this procedure, then return here and continue. 3. Exit the service processor (ASMI) menus and remove the power cords. 4. Disconnect all external cables (parallel, system port 1, system port 2, keyboard, mouse, USB devices, SPCN, Ethernet on the system planar, and so on). Also disconnect all of the external cables attached to the service processor except the Ethernet cable going to the HMC.Go to the next step. v PFW1548-4 Chapter 2. Isolation Procedures 325
1. If this is a deskside system, remove the service access cover. If this is a rack-mounted system, place the drawer into the service position and remove the service access cover. Also remove the front cover. 2. Record the slot numbers of the PCI adapters and I/O expansion cards if present. Label and record the locations of all cables attached to the adapters. Disconnect all cables attached to the adapters and remove all of the adapters. 3. Remove the removable media/disk drive enclosure assembly by pulling out the blue tabs at the bottom of the enclosure, then sliding the enclosure out approximately three centimeters. 4. Remove and label the disk drives from the media/disk drive enclosure assembly. 5. Remove one of the two memory DIMMs quads. Notes: a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.6. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 7. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO Go to PFW1548-7. YES Go to the next step.v PFW1548-5 Were any memory DIMMs removed from system backplane? NO Go to PFW1548-8. YES Go to the next step.v PFW1548-6 1. Turn off the power, and remove the power cords. 2. Replug the memory DIMMs that were removed from system backplane in PFW1548-2 in their original locations. Notes: a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.3. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 4. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the memory DIMM quad with new or previously removed memory DIMM quad. Repeat this step until the defective memory DIMM quad is identified, or both memory DIMM quads have been exchanged. Note: the system backplane has two memory DIMM quads: Un-P1-C1x-C1 to Un-P1-C1x-C4, and Un-P1-C1x-C6 to Un-P1-C1x-C9. See “Locations” on page 339. If your symptom did not change and both the memory DIMM quads have been exchanged, call your service support person for assistance. 326 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-7 One of the FRUs remaining in the system unit is defective.Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed as the original memory DIMM. 1. Turn off the power, remove the power cords, and exchange the following FRUs, one at a time, in the order listed: a. Memory DIMMs. Exchange one quad at a time with new or previously removed DIMM quads b. System backplane, location: Un-P1 c. Power supplies, locations: Un-E1 and Un-E2.2. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 3. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO Reinstall the original FRU. Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-8 1. Turn off the power. 2. Reconnect the system console. Notes: a. If an ASCII terminal has been defined as the firmware console, attach the ASCII terminal cable to the S1 connector on the rear of the system unit. b. If a display attached to a display adapter has been defined as the firmware console, install the display adapter and connect the display to the adapter. Plug the keyboard and mouse into the keyboard connector on the rear of the system unit.3. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 4. If the ASCII terminal or graphics display (including display adapter) is connected differently from the way it was previously, the console selection screen appears. Select a firmware console. 5. Immediately after the word keyboard is displayed, press the number 1 key on the directly attached keyboard, an ASCII terminal or HMC. This activates the system management services (SMS). 6. Enter the appropriate password if you are prompted to do so. Is the SMS screen displayed? NO One of the FRUs remaining in the system unit is defective. Chapter 2. Isolation Procedures 327
Exchange the FRUs that have not been exchanged, in the following order: 1. If you are using an ASCII terminal, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the system backplane, location: Un-P1.2. If you are using a graphics display, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the display adapter. b. Replace the backplane in which the graphics adapter is plugged. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom.YES Go to the next step.v PFW1548-9 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Use the cam levers to reconnect the disk drive enclosure assembly to the I/O backplane. 4. Reconnect the removable media/disk drive enclosure assembly by sliding the media enclosure toward the rear of the system, then pressing the blue tabs. 5. Plug in the power cords and wait for 01 in the upper-left corner of the operator panel display. 6. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Optical drive 2. Removable media enclosure. 3. System backplane, Un-P1.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-10 The system is working correctly with this configuration. One of the disk drives that you removed from the disk drive backplanes may be defective. 328 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Install a disk drive in the media/disk drive enclosure assembly. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step with all disk drives that were installed in the disk drive backplane. After all of the disk drives have been reinstalled, go to the next step.v PFW1548-11 The system is working correctly with this configuration. One of the devices that was disconnected from the system backplane may be defective. 1. Turn off the power and remove the power cords. 2. Attach a system backplane device (for example: system port 1, system port 2, USB, keyboard, mouse, Ethernet) that had been removed. After all of the I/O backplane device cables have been reattached, reattached the cables to the service processor one at a time. 3. Plug in the power cords and wait for 01 in the upper-left corner on the operator panel display. 4. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO The last device or cable that you attached is defective. To test each FRU, exchange the FRUs in the following order: 1. Device and cable (last one attached) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. Chapter 2. Isolation Procedures 329
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step until all of the devices are attached. Go to the next step.v PFW1548-12 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may be defective. 1. Turn off the power and remove the power cords. 2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Go to the next step. YES Repeat this step until all of the FRUs (adapters) are installed. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-13 The last FRU installed or one of its attached devices is probably defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Starting with the last installed adapter, disconnect one attached device and cable. 4. Plug in the power cords and wait for the 01 in the upper-left corner on the operator panel display. 5. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the Advanced activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 6. If the Console Selection screen is displayed, choose the system console. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. System backplane, location: Un-P1.
330 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES The last device or cable that you disconnected is defective. Exchange the defective device or cable then go to the next step.v PFW1548-14 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. Select Advanced Diagnostics Routines. 4. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue with the diagnostics. This is a separate operation from selecting the console display. 5. If the NEW RESOURCE screen is displayed, select an option from the bottom of the screen. Note: Adapters and devices that require supplemental media are not shown in the new resource list. If the system has adapters or devices that require supplemental media, select option 1. 6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter. 7. Select All Resources. (If you were sent here from step PFW1548-18, select the adapter or device that was loaded from the supplemental media). Did you get an SRN? NO Go to step PFW1548-16. YES Go to the next step.v PFW1548-15 Look at the FRU part numbers associated with the SRN. Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)? NO Exchange the FRU with the highest failure percentage that has not been changed. Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics run with no trouble found. Run diagnostics after each FRU is exchanged. Go to Chapter 5, “Verifying a repair,” on page 557. YES If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.v PFW1548-16 Does the system have adapters or devices that require supplemental media? NO Go to step the next step. YES Go to step PFW1548-18.v PFW1548-17 Consult the PCI adapter configuration documentation for your operating system to verify that all adapters are configured correctly. Go to Chapter 5, “Verifying a repair,” on page 557. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. v PFW1548-18 1. Select Task Selection. Chapter 2. Isolation Procedures 331
2. Select Process Supplemental Media and follow the on-screen instructions to process the media. Supplemental media must be loaded and processed one at a time. Did the system return to the TASKS SELECTION SCREEN after the supplemental media was processed? NO Go to the next step. YES Press F3 to return to the FUNCTION SELECTION screen. Go to step PFW1548-14, substep 4 on page 331.v PFW1548-19 The adapter or device is probably defective. If the supplemental media is for an adapter, replace the FRUs in the following order: 1. Adapter 2. System backplane, location: Un-P1. If the supplemental media is for a device, replace the FRUs in the following order: 1. Device and any associated cables 2. The adapter to which the device is attached Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. Go to Chapter 5, “Verifying a repair,” on page 557. End of procedure. 332 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Collect serviceable events in Integrated Virtualization Manager Use this procedure when using the Integrated Virtualization Manager (IVM) to check for serviceable events on your system. 1. Did you receive a reference code on the IVM console? Yes: Record the reference code, and return to the procedure that sent you here to further isolate the problem. No: Continue with the next step to check for serviceable events.2. In the IVM navigation bar, under Service Management, select Manage Serviceable Events. 3. In the Serviceable Event Selection Criteria field, select the criteria for the serviceable events that you want to view and click Apply. The serviceable events are displayed in a table. The table contains a list that shows all serviceable events matching your selection criteria. 4. Select one of the following options to perform an action on your serviceable events. v To view properties, continue with step 5. v To view associated FRUs, continue with step 6. v To close a serviceable event, continue with step 7. v To view additional details for a serviceable event, continue with step 8.5. To view properties for a serviceable event, select the serviceable event that you want to view properties for and select Properties. The Serviceable Event Comments window opens. 6. To view associated field replaceable units (FRUs), select the serviceable event that you want to view FRUs for, and select View associated FRUs. The serviceable event window opens. 7. To close a serviceable event, select one or more of the serviceable events that you want to close, and select Close Events. The Serviceable Event Comments window opens. a. Enter comments if necessary, and click Close Events. b. On the Close Serviceable Event dialog box, click Yes to close the serviceable event.8. To view additional details for a serviceable event, select the serviceable event that you want to view additional details for and select View additional details. The Serviceable Event Comments window opens. This ends this procedure. Chapter 2. Isolation Procedures 333
334 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 3. Locating FRUs You can identify the enclosure in which a FRU is plugged by its location code. The first character of the location code is always U followed by a four character feature code or enclosure type as shown in the following example: U789C.001.10ABCDE-P3-C31 In this example, the enclosure type is 789C. The next three characters of the location code indicate the model of the enclosure (001 in the example). The next string of characters provides the enclosure serial number (10ABCDE in the example). Using this information, locate the enclosure with the FRU you want to replace. Find the enclosure type in the following table and go to the service guide for that enclosure:
Feature Code (Utttt)
Model
Enclosure Service Guide
789C
8203-E4A, 9407-M15, or 9408-M25
Go to “Physical Locations” on page 339.
1300
7311-D11
Refer to the Service Guide for the 7311 Model D11, SA76-0115.
5796
7314-G30
Refer to the Service Guide for the 7314 Model G30, SA76-0083.
0595
7311-D20
Refer to the Service Guide for the 7311 Model D20, SA76-0116.
5786 5787
7031-T24
Refer to the Service Guide for the 7031 Models D24 and T24, SA76-0118.
5788 5789
7031-D24
Refer to the Service Guide for the 7031 Models D24 and T24, SA76-0118.
5886
5886
Refer to the Service Guide for the 5886 Expansion Unit, SA76-0121. Locate the FRU The string of characters following the enclosure serial number identifies the FRU location within the enclosure: U7879.001.10ABCDE-P3-C31 In this example, P3-C31 is the location of the FRU to be replaced. Use the graphics and tables to locate the FRU and link to its Removal and Replacement procedure. “Using location codes” on page 336The information provided in this topic defines specific sections of a location code string. Use this topic when you need help in understanding the meaning of a location code. “Physical Locations” on page 339Use the following illustrations to help you map a location code to a position on the server. © Copyright IBM Corp. 2008 335
Using location codes The information provided in this topic defines specific sections of a location code string. Use this topic when you need help in understanding the meaning of a location code. Use the following table to link to a specific topic that you need additional information on when reading your location code.
Location code topics that might be of interest
Description
“Location code overview”
Contains background information on the use of location codes.
“Physical location codes” on page 337
Provides a definition for physical location code.
“Logical location codes” on page 337
Provides a definition of what a logical location code is.
“Location code format” on page 337
Provides descriptive information of the Un value in the location code string. For example U789C.001.
“Location code labels” on page 338
Provides a table that identifies and defines the location code labels. The location code labels begin with an alpha character and follow the system serial number. For example, U789C.001.10ABCDE-P3-C31-T2-L23. (The system serial number is the ?10ABCDE? in the previous example.) The P3, C31, T2, and L23 all contain an alpha character that is identified in the Location code labels table.
“Worldwide unique identifier” on page 338
Provides a definition for the world unique identifier. This group of digits follows the resource code labels and always begins with the letter W. Location code overview Servers (system unit and expansion units) use physical location codes to provide mapping of replaceable units. Location codes are produced by the server’s firmware, which structures them so that they can be used to identify specific parts in a system. The location code format is the same for all servers. If you are working with a specific location code, the unit type and model immediately follow the first character (Utttt.mmm). Match the unit type and model to a link, as shown in the Unit type and locations table. If the location code ends with -Txx-Lxx, the server’s firmware could not identify the physical location. When a physical location cannot be identified, a logical location code is provided. Where logical location codes occur in enclosures, the locations article for the enclosure has the known conversions listed. For logical location codes with no conversion, contact your next level of support. If the location code begins with UTMPx, the expansion I/O unit’s machine type, model and serial number have not been set yet and this is a temporary unit identifier. To identify the unit, examine the display panels on all of the expansion I/O units connected to the server until you find one with the same characters in the first 5 digits of the top line in the unit’s display. Record the unit’s real machine type and model from the unit label. Match the unit’s machine type and model in the Unit type and locations table and follow the link to determine the service information. Note: Locations for units that are not in the preceding list are either not supported or there is a problem in the firmware. Contact your next level of support.
336 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Physical location codes Physical location codes provide a mapping of logical functions and components, such as backplanes, removable modules, connectors, ports, cables, and devices, to their specific locations within the physical structure of the server. Logical location codes If the physical location cannot be mapped to a physical location code, the server’s firmware will generate a logical location code. A logical location code is a sequence of location labels that identify the path that the system uses to communicate with a given resource. Note: A resource has as many logical location codes as it has logical connections to the system. For example, an external tape device connected to two I/O adapters will have two logical location codes. An example of a logical location code is: U789C.001.10ABCDE-P3-C31-T2-L23 The first part of the location code (through the T2 label) represents the physical location code for the resource that communicates with the target resource. The remainder of the logical location code (L23) represents exactly which resource is indicated. Location code format The location code is an alphanumeric string of variable length, consisting of a series of location identifiers, separated by a dash. An example of a physical location for a fan is Un-A1. The first position, represented by Un (where n is equal to any string contained between the U and the hyphen) in the preceding example, is displayed in one of the following forms: Note: In location codes the U is a constant digit, however, the numbered positions following the U are variables and are dependent on your server. Each column defines the numbers following the U in the beginning of the location code.
Machine type and model number in its location codes
Feature codes and sequence numbers in its location code
Utttt.mmm.sssssss-A1
Uffff.ccc.sssssss-A1
The leftmost code is always U
The leftmost code is always U
tttt represents the unit type of the enclosure (drawer or node)
ffff represents the feature code of the enclosure (drawer or node)
mmm represents the model of the enclosure
ccc represents the sequence number of the enclosure
sssssss represents the serial number for the enclosure
sssssss represents the serial number of the enclosure
Note: The mmm or ccc number might not be displayed on all location codes for all servers. If the mmm value is not displayed, the location code is displayed in one of the following forms: v Utttt.sssssss-A1 v Uffff.sssssss-A1 The location code is hierarchical; that is, each location identifier in the string represents a physical part. The order (from left to right), in which each identifier is shown, allows you to determine which parts contain other parts in the string. Chapter 3. Locating FRUs 337
The - (dash) separator character represents a relationship between two components in the unit. In the example of the fan, whose location code is Un-A1, the - (dash) shows that the fan (A1) is contained in the base unit (or Un). Modules, adapters, cables, and devices are all parts that are plugged into another part. Their location codes will always show that they are plugged into another part as components of the server. Another example, Un-P1-C9 is a memory DIMM, with (C9) plugged into a backplane (P1), which is inside the unit (Un). For more information about the various location code label prefixes, refer to Location code labels. Location code labels The following table describes the location code label prefixes. Note: These labels apply to system units only.
Table 44. Location code label prefixes for system units
Prefix
Description
Example
A
Air-moving device
Fan, blower
C
Card connector
IOP, IOA, DIMM, processor card
D
Device
Diskette, control panel
E
Electrical
Battery, power supply, ac charger
L
Logical path SAS target
IDE address, Fibre Channel LUN
N
Horizontal placement for an empty rack location
P
Planar
System backplane
T
Port, external cable
U
Unit
V
Virtual planar
W
Worldwide unique ID
X
EIA value for an empty rack location
Y
Firmware FRU Worldwide unique identifier A worldwide unique identifier location label consists of the prefix ?W? followed by a maximum of 16 uppercase hexadecimal digits with no leading zeros. A location code may or may not consist of a worldwide unique identifier. When present, the worldwide unique identifier location label follows the location label of the resource that interfaces with the resource having the worldwide unique identifier, usually a port. 338 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Physical Locations Use the following illustrations to help you map a location code to a position on the server. “Locations” “Addresses” on page 347You can find part locations using location codes.Locations Use this information to help you map a location code to a position on the unit. Notes: v To activate the identify LED, see “Activating and deactivating LEDs” on page 583. v After replacing any part on a server or expansion unit, go to Chapter 5, “Verifying a repair,” on page 557. v Information regarding parts can be found in Chapter 8, “Part Information,” on page 691.The following diagrams show field replaceable unit (FRU) layouts in the system. Use these diagrams with the following tables. Rack views Figure 4. Rack front view with 3.5 inch DASD
Chapter 3. Locating FRUs 339
Figure
5. Rack top view, without fans Figure 6. Rack top view, with fans
340 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Deskside views Figure 7. Rack rear view Figure 8. Deskside front view with 3.5 inch DASD
Chapter 3. Locating FRUs 341
Figure
9. Deskside side view without fans Figure 10. Deskside side view with fans
342 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The following table provides location codes for parts that comprise the server.
Table 45. FRU Location Table.
Failing item name
Physical location code
Identify LED
Failing item removal and replacement procedures
System unit
Un
Fans
Fan 1
Un-A1
Yes
“Fan” on page 409
Fan 2
Un-A2
Yes
“Fan” on page 409
Fan 3
Un-A3
Yes
“Fan” on page 409
Fan 4
Un-A4
Yes
“Fan” on page 409
Power supplies
Power supply 1
Un-E1
Yes
“Power supply” on page 471
Power supply 2
Un-E2
Yes
“Power supply” on page 471
Backplanes Figure 11. Deskside rear view
Chapter 3. Locating FRUs 343
Table 45. FRU Location Table (continued)
.
Failing item name
Physical location code
Identify LED
Failing item removal and replacement procedures
Input/output backplane with embedded: v RIO Hub/HSL NIC v RIO/HSL link v Ethernet controller v USB controller (AIX or Linux only) v SAS controllers v Logic oscillator v Service processor v Processor 1 v Processor 2 v SPCN v Time-of-day
Un-P1
Yes
“System backplane” on page 493
Time-of-day battery
Un-P1-E1
“Battery” on page 353
DASD and media backplane
Un-P2
Yes
“Disk unit backplane” on page 391
DVD
Un-P2-D2
“Slimline media” on page 489
System backplane cables
System serial cable 1 (top connector)
Un-P1-T3
“External cables” on page 352
System serial cable 2 (bottom connector)
Un-P1-T4
“External cables” on page 352
Embedded USB front cable
Un-D1-T1
“External cables” on page 352
Embedded USB rear cable 1
Un-P1-T7
“External cables” on page 352
Embedded USB rear cable 2
Un-P1-T8
“External cables” on page 352
SPCN cable 1 (top connector)
Un-P1-T1
“External cables” on page 352
SPCN cable 2 (bottom connector)
Un-P1-T2
“External cables” on page 352
HMC 1 (left connector)
Un-P1-T5
“External cables” on page 352
HMC 2 (right connector)
Un-P1-T6
“External cables” on page 352
Dual 1 GB Ethernet cable 1
Un-P1-C7-T1
“External cables” on page 352
Dual 1 GB Ethernet cable 2
Un-P1-C7-T2
“External cables” on page 352
Quad 1 GB Ethernet port 1
Un-P1-C7-T1
“External cables” on page 352
Quad 1 GB Ethernet cable 2
Un-P1-C7-T2
“External cables” on page 352
Quad 1 GB Ethernet cable 3
Un-P1-C7-T3
“External cables” on page 352
Quad 1 GB Ethernet cable 4
Un-P1-C7-T4
“External cables” on page 352
External SAS cable
Un-P2-T1
“External cables” on page 352 344 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 45. FRU Location Table (continued)
.
Failing item name
Physical location code
Identify LED
Failing item removal and replacement procedures
Internal SAS cable
Un-P1-T9
Processor and processor regulator
Voltage regulator 1A for processor 1
Un-P1-C13
Yes
Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531
Voltage regulator 1B for processor 1
Un-P1-C13
Yes
Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531
Voltage regulator 2 for processor 1
Un-P1-C18
Yes
“Voltage regulator module - single” on page 539
Voltage regulator 1A for processor 2
Un-P1-C19
Yes
Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531
Voltage regulator 1B for processor 2
Un-P1-C19
Yes
Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531
Voltage regulator 2 for processor 2
Un-P1-C20
Yes
“Voltage regulator module - single” on page 539
VPD card
Un-P1-C11
Yes
“Vital product data (VPD) card” on page 547
Adapters
PCI-E adapter in slot 1
Un-P1-C1
Yes
“PCI adapter” on page 438
PCI-E adapter in slot 2
Un-P1-C2
Yes
“PCI adapter” on page 438
PCI-E adapter in slot 3
Un-P1-C3
Yes
“PCI adapter” on page 438
PCI-X adapter in slot 4
Un-P1-C4
Yes
“PCI adapter” on page 438
PCI-X adapter in slot 5
Un-P1-C5
Yes
“PCI adapter” on page 438
Ethernet card
Un-P1-C7
Yes
“PCI adapter” on page 438
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1
Un-P1-C8
Yes
“GX adapter” on page 415
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2
Un-P1-C6
Yes
“GX adapter” on page 415
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1 port 1
Un-P1-C8-T1
Yes, for RIO adapter controller No, for 12X Channel Attach adapter
“External cables” on page 352 Chapter 3. Locating FRUs 345
Table 45. FRU Location Table (continued)
.
Failing item name
Physical location code
Identify LED
Failing item removal and replacement procedures
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1 port 2
Un-P1-C8-T2
Yes, for RIO adapter controller No, for 12X Channel Attach adapter
“External cables” on page 352
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2 port 1
Un-P1-C6-T1
Yes, for RIO adapter controller No, for 12X Channel Attach adapter
“External cables” on page 352
RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2 port 2
Un-P1-C6-T2
Yes, for RIO adapter controller No, for 12X Channel Attach adapter
“External cables” on page 352
Auxiliary cache card for system backplane
Un-P1-C9
Yes
“Auxiliary cache card” on page 483
Auxiliary cache card battery
Un-P1-C9-E1
No
“Cache battery pack” on page 366
RAID enablement card for system backplane
Un-P1-C10
Yes
“RAID enablement card” on page 477
Thermal management card
Un-P1-C12
Yes
“Thermal power management device (TPMD) card” on page 525
Memory modules
Memory module 1
Un-P1-C14
Yes
“Memory DIMM” on page 430
Memory module 2
Un-P1-C15
Yes
“Memory DIMM” on page 430
Memory module 3
Un-P1-C16
Yes
“Memory DIMM” on page 430
Memory module 4
Un-P1-C17
Yes
“Memory DIMM” on page 430
Memory module 5
Un-P1-C21
Yes
“Memory DIMM” on page 430
Memory module 6
Un-P1-C22
Yes
“Memory DIMM” on page 430
Memory module 7
Un-P1-C23
Yes
“Memory DIMM” on page 430
Memory module 8
Un-P1-C24
Yes
“Memory DIMM” on page 430
Device physical locations
Removable media
Un-P2-D1
No
“Media device” on page 421
Removable media
Un-P2-D2
No
“Media device” on page 421
Disk drive 1
Un-P2-D3
Yes
“Disk Drive” on page 379 346 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 45. FRU Location Table (continued)
.
Failing item name
Physical location code
Identify LED
Failing item removal and replacement procedures
Disk drive 2
Un-P2-D4
Yes
“Disk Drive” on page 379
Disk drive 3
Un-P2-D5
Yes
“Disk Drive” on page 379
Disk drive 4
Un-P2-D6
Yes
“Disk Drive” on page 379
Disk drive 5
Un-P2-D7
Yes
“Disk Drive” on page 379
Disk drive 6
Un-P2-D8
Yes
“Disk Drive” on page 379
Control panel
Control panel
Un -D1
“Control panel” on page 375
Temperature sensor
Un -D1
“Control panel” on page 375
Server firmware
Server firmware
Un-Y1 Addresses You can find part locations using location codes. Use the address to find the location for the 9407-M15 and the 9408-M25, then go to “Physical Locations” on page 339 to find additional information.
Table 46. IOA, and device address information for the 9407-M15 and the 9408-M25 running i5/OS
Position
Possible failing item
DSA (BBBBCcbb)
Unit address
Un -P1
Embedded SAS controller
02001000 (primary)
7FFFFFFF
Un -P1-C1
PCI-E adapter
02040000
Not applicable
Un -P1-C2
PCI-E adapter
02050000
Not applicable
Un -P1-C3
PCI-E adapter
02070000
Not applicable
Un -P1-C4
PCI-X adapter
02021000
Not applicable
Un -P1-C5
PCI-X adapter
02031000
Not applicable
Un -P1-C9
PCI auxiliary cache card
02060000
7FFFFFFF
Un -P1-C10
RAID card
02001000
7FFFFFFF
Un -P2-D1
Removable media
02001000
020000FF
Un -P2-D2
Removable media
02001000
030000FF
Un -P2-D3
Disk drive 1
02001000
040200FF
Un -P2-D4
Disk drive 2
02001000
040300FF
Un -P2-D5
Disk drive 3
02001000
040400FF
Un -P2-D6
Disk drive 4
02001000
040500FF
Un -P2-D7
Disk drive 5
02001000
040600FF
Un -P2-D8
Disk drive 6
02001000
040700FF Chapter 3. Locating FRUs 347
348 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 4. Removing and replacing FRUs Use these procedures to remove and replace failing parts. Before you begin a replacement, perform these tasks: 1. If you are performing a replacement procedure that might put your data at risk, ensure, if possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 2. Review the installation or replacement procedure for the feature or part. 3. Note the significance of color on your system. Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 4. Ensure that you have access to a medium, flat-blade screwdriver. 5. If parts are incorrect, missing, or visibly damaged, contact the provider of your parts, or your next level of support. Attention: Failure to follow the step-by-step sequence for FRU removal and installation may result in FRU or system damage. Use the following precautions whenever you handle electronic components or cables. v Attach a wrist strap to an unpainted metal surface of your hardware to prevent Electrostatic Discharge (ESD) from damaging your hardware. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. v Keep all electronic components in the shipping container or envelope until you are ready to install them. v If you remove, then reinstall an electronic component, temporarily place the component on an ESD pad or blanket. DANGER
© Copyright IBM Corp. 2008 349
When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “External cables” on page 352Use this procedure to service the external cables. “Battery” on page 353Use this procedure to service the battery. “Cache battery pack” on page 366Use this procedure to service the cache battery pack. “Control panel” on page 375Use this procedure to service the control panel. “Disk Drive” on page 379Use this procedure to replace the disk drive. “Disk unit backplane” on page 391Use this procedure to service the disk unit backplane. “Ethernet card” on page 403Use this procedure to service the Ethernet card. “Fan” on page 409Use this procedure to service the fan. “GX adapter” on page 415Use this procedure to service the GX adapter. 350 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“Media device” on page 421Use this procedure to service the media device. “Memory DIMM” on page 430Use this procedure to service a memory DIMM. “PCI adapter” on page 438Use this procedure to service the PCI adapter. “Power supply” on page 471Use this procedure to service the power supply. “RAID enablement card” on page 477Use this procedure to service the RAID enablement card. “Auxiliary cache card” on page 483Use this procedure to service the auxiliary cache card. “Slimline media” on page 489Use this procedure to service the Slimline media. “System backplane” on page 493Use this procedure to replace the system backplane. “Thermal power management device (TPMD) card” on page 525Use this procedure to service the thermal power management device (TPMD) card. “Voltage regulator module pair” on page 531Use this procedure to service the voltage regulator module pair. “Voltage regulator module - single” on page 539Use this procedure to service a single voltage regulator module. “Vital product data (VPD) card” on page 547Use this procedure to service the vital product data (VPD) card. Chapter 4. Removing and replacing FRUs 351
External cables Use this procedure to service the external cables. 1. Remove the external cable
Trace the cable and record the system location at the other end, and then remove the cable. 2. Install the external cable
Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.Install the cable, then route and install the other end of the cable to the system location that was previously recorded. 3.
Go to Chapter 5, “Verifying a repair,” on page 557. 352 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Battery Use this procedure to service the battery. 1. Access the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. To access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 4. Log on to the ASMI using the administrator userid and password.When you have accessed the ASMI, continue with the next step. 2. Check and record the server settings
Prior to replacing this FRU, check and record all server settings as you may need to update the system after you update this FRU. This can be done through the Advanced System Management Interface (ASMI) . 1. Check and record the server firmware level indicated on the right edge of the status frame, the area where the Log out button is located. You may need to update the system to the latest server firmware code level after you update this FRU. 2. Check and record the service processor settings you previously set using ASMI. You will need to reset these service processor settings after you update this FRU or the default settings will be used. Settings to record include the following: v Power/Restart Control settings. v System Service Aids settings. v System Configuration settings, System Name setting. v Network Services settings. v Performance Setup settings. v Login Profile settings.3. Record any service processor settings you may have set using operating system commands. 3. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 353
4. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
5. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
354 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
7. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
Chapter 4. Removing and replacing FRUs 355
8. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
9. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
356 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Remove the time-of-day battery
Attention: When removing the battery, do not use metallic tool to disengage it from its slot. Remove the time-of-day battery (A) from the system backplane, being careful to observe the polarity (+/-) of the battery. v Do not: v Burn or charge the battery (this to avoid possible explosion, battery contains lithium). v Throw or immerse into water. v Heat to more than 100 degrees C (212 degrees F). v Repair or disassemble.Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call.
11. Insert the time-of-day battery
Insert the time-of-day battery (A) in the slot on the system backplane, being sure to insert the battery with the polarity (+/-) the same as the battery that was removed.
Chapter 4. Removing and replacing FRUs 357
12. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
13. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
358 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
14. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
15. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
Chapter 4. Removing and replacing FRUs 359
16. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
17. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
18. Restore network connectivity
The service processor must connect to the network to be accessible from the HMC. 360 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
How will the IP Address of the Service Processor be managed?
Obtain an IP Address automatically from a DHCP server
Configure a static IP Address
?
Go to step20. 19. Restore service processor settings
Enable network access to the service processor by proceeding as follows: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. Perform the following steps: 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the system backplane. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time.
Do you have network access to the service processor?
Yes
No
Go to step 22 on page 362.
Go to step 24 on page 362 20. Restore service processor settings through the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, controlling the system power, and setting service processor network ports. The ASMI may also be referred to as the service processor menus. The ASMI can be accessed through https. As the network connection uses static IP address assignments, perform the following: 1. To know your service processor network ports, if necessary, you can use the control panel in manual mode. Refer to service functions to get control panel documentation in that case. 2. Connect a client with a Web browser directly to the service processor network ports you previously noted. It could be something like the following URLs: v https://169.254.2.147 v https://169.254.3.1473. If not already connected, connect all system power cables by plugging them into power outlets.Note: Do not start the system at this time. 4. Log on to the ASMI with the ?admin? User ID and ?admin? default Password. 5. Change the ?admin? User ID’s Password and the ?general? User ID’s Password when prompted. 6. To configure network access perform the following: a. Click on Network Configuration under the Network Services node. b. On the right pane, configure eth0 and eth1 network interfaces, choose for Type of IP Address ’Static’, set a Host name, an IP address, a Subnet mask. Chapter 4. Removing and replacing FRUs 361
21. Restore service processor settings
With the network connection now configured to use static IP address assignments, try to access the service processor network ports.
Do you have network access to the service processor?
Yes
No
?
Go to step 25 on page 363 22. Access the ASMI
If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 23.
Go to step 27 on page 363. 24. Restore service processor settings
If the network connection uses DHCP to establish an IP address, but does not use an HMC as the DHCP server, perform the following: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time.
Do you have network access to the service processor?
No
Yes
?
Go to step 22 362 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
25. Restore service processor settings
You have indicated that you are still not able to access the service network. You will need to reset the service processor by pressing the pin-hole switch (E) on the operator panel.
26.
Go to step 18 on page 360. 27. Restore service processor settings
Change the ?admin? User ID’s password when prompted by performing the following steps: 1. In the navigation area of the ASMI, expand Login Profile. 2. Select Change Password under Login Profile. 3. Specify the required information, and click Continue.As a security measure, you are required to enter your current user password in the Current password for current user ID field. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the ?admin? User ID is ?admin?. After your initial login to the ASMI, the ?admin? password must be changed. Chapter 4. Removing and replacing FRUs 363
28. Restore service processor settings
Set the system name. The system name can be changed to any valid ASCII string. It does NOT have to follow the initialized MachineType-Model#-Serial# format. To change the system name, perform the following: 1. In the navigation area of the ASMI, expand System Configuration. 2. Select System Name under System Configuration. 3. Enter the desired system name. 4. Click Save settings to update the system name to the new value.The new system name is displayed in the status frame, the area where the Log out button is located. If another method, such as the HMC, is used to change the system name, the status frame does not reflect the change. 29. Restore service processor settings
Set the time-of-day. 1. In the navigation area of the ASMI, expand System Configuration. 2. Select Time of Day under System Configuration. 3. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, seconds). 4. Change either the date or the time or both, and click Save settings. 30. Restore service processor settings
Reenter any of the following settings that were previously changed through the ASMI, unless you want to use the default settings. Settings to be set include the following: 1. Power/Restart Control settings. 2. System Service Aids settings. 3. System Configuration settings, if not already done. 4. Network Services settings. 5. Performance Setup settings. 6. Login Profile settings. 364 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
31. Restore service processor settings
1. Reset any service processor settings that you may have set using operating system commands. You recorded these settings at the beginning of this procedure. 2. If you choose to reset the HMC Access password, perform the following: 3. a. From the HMC GUI (preferred method): b. 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Use the checkbox to select a server, in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information.c. d. Or, from the HMC command line, type: e. /usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword where: v The value for managedsystem is the new service processor’s managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpasswd is the newpassword value. 32. Restore partition data on the service processor, if you had a partition in your system
Profile data stored in the managed server has been cleared or corrupted. To recover profile data using the HMC, proceed as follows: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the checkbox to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. 33.
Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 365
Cache battery pack Use this procedure to service the cache battery pack. 1. Cache battery pack replacement procedure information
Attention: To prevent possible data loss, ensure that the cache battery pack is in an error state before replacing it. This will ensure all cache data is written to disk and that it is safe to replace the cache battery pack. By following this procedure, you are going to: 1. Force the cache battery pack into an error state. 2. Perform the cache battery pack replacement.You will be allowed to resume this procedure later if you cannot ensure that the cache battery pack is in an error state. Forcing the cache battery pack into an error state will result in the following: v The system logs an error. v Data caching becomes disabled on the selected controller. v System performance could become significantly degraded until the cache battery pack is replaced on the selected controller. v The battery pack can be safely replaced field on the controller rechargeable battery information screen will show Yes. v The cache data present LED will stop flashing. v The cache battery pack requires replacement.Ensure that you have the correct type of cache battery pack to perform the replacement. 2.
Table 47.
Is the system powered off?
No
Yes
?
Go to step9 on page 368 3.
Select the operating system:
AIX
Linux
i5/OS
?
Go to step 8 on page 368.
Go to step6 on page 367. 366 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
4. Force the cache battery pack into an error state on AIX
To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Navigate to the IBM SAS Disk Array Manager as described below: a. At the command prompt, type smit, and press Enter. b. Select Devices. c. Select Disk Array. d. Select IBM SAS Disk Array. e. Select IBM SAS Disk Array Manager from the menu with options for configuring and managing the IBM SAS RAID Controller.2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Force Controller Rechargeable Battery Error. 5. Select the IBM SAS RAID Controller whose battery you want to replace, using this option places the battery into the error state, which requires it to be replaced. 6. Press Enter. 7. Determine that it is safe to replace the cache battery pack. Refer to How to display rechargeable battery information below. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You may need to reselect the option to Display Controller Rechargeable Battery Information multiple times as it may take several minutes before it is safe to replace the cache battery pack.How to display rechargeable battery information 1. Navigate to the IBM SAS Disk Array Manager as described above. 2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Display Controller Rechargeable Battery Information. 5. Select the IBM SAS RAID Controller. 5.
Go to step 9 on page 368. 6. To prevent data loss, ensure that the cache battery pack is in an error state
To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Be sure that you are signed on to the system with at least service level authority. 2. Type strsst on the command line and press Enter. 3. Type your service tools userid and service tools password on the System Service Tools (SST) Sign On display. Press Enter. 4. Select Start a Service Tool from the System Service Tools (SST) display. Press Enter. 5. Select Hardware Service Manager from the Start a Service Tool display. Press Enter. 6. Select Work with resources containing cache battery packs from the Hardware Service Manager display. Press Enter. 7. Select Force battery pack into error state for the I/O card you are working with from the Work with Resources containing cache battery packs display. Press Enter. 8. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been selected and press the function key that confirms your choice. 9. Return to the Work with Resources containing cache battery packs display and select Display battery information. Ensure that the field Safe to replace cache battery = YES. Note: This may take several minutes and you may need to press the refresh key to see the field update.
Chapter 4. Removing and replacing FRUs 367
7.
Go to step 9. 8. Force the cache battery pack into an error state on Linux
To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Run the iprconfig utility by typing iprconfig. 2. Select Work with disk unit recovery. 3. Select Work with resources containing cache battery packs. 4. Select your adapter and type 2. Then press Enter to force the battery error. Note: Using this option places the battery into the error state, which requires it to be replaced. 5. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a battery error, type q to cancel. 6. Determine that it is safe to replace the cache battery pack. Refer to How to display rechargeable battery information below. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You may need to reselect the option to Display Controller Rechargeable Battery Information multiple times as it may take several minutes before it is safe to replace the cache battery pack.How to display rechargeable battery information 1. Run the iprconfig utility by typing iprconfig. 2. Select Work with disk unit recovery. 3. Select Work with resources containing cache battery packs. 4. Select your adapter and type 1. 5. Then press Enter to display battery information. 9. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
368 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
11. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
Chapter 4. Removing and replacing FRUs 369
12. Check cache data present LED
Examine the LED (A) near the cache battery pack on the RAID enablement card. This LED is called the cache data present LED. If the LED is flashing, there is cache data that has not been written to disk. If the LED is not flashing, all cache data has been written to disk.
Is the Cache Battery LED flashing?
Yes, the LED is flashing
No, the LED is not flashing
?
Go to step 14 on page 371 13. Stop removing the cache battery pack
Attention: v The flashing LED indicates that there is cache data that has not been written to disk. v This data will be lost if the battery is removed at this time. v Do not replace the cache battery pack at this time.Complete this service procedure without replacing the cache battery pack. If the system is powered on: 1. Retry the procedure from the beginning to ensure that all cache data is written to disk before the battery is replaced.If the system is powered off: 1. Power on the system and activate the partitions. 2. Retry the procedure from the beginning but this time with power on to ensure that all cache data is written to disk before the battery is replaced.
Go to step 21 on page 372. 370 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
14. Remove the cache battery pack
Attention: Do not remove battery if LED (C) is flashing; cache data exists. 1. Move the cache battery lever (A) away from the connector to disengage the battery from the connector. 2. Slide the cache battery pack (B) out of the mounting guides and remove it from the controller.
15. Install the cache battery pack
Attention: When installing the cache battery pack, ensure that the cache battery pack is disconnected for at least 60 seconds before connecting the new battery. This is the minimum amount of time needed for the card to recognize that the battery has been replaced. 1. Slide the cache battery pack (B) into the mounting guides on the controller until it is seated in the battery connector (C). 2. Move the lever (A) to the latched position to fully seat the battery into the connector.
16.
Table 48.
Is the system powered off?
No
Yes
?
Go to step 21 on page 372. Chapter 4. Removing and replacing FRUs 371
17. Re-enable the cache
Select the operating system:
AIX
Linux
i5/OS
?
The write cache is automatically enabled. Go to step 21.
Go to step 20. 18. Re-enabling caching on AIX
Restart the adapter’s write cache by doing the following: 1. Navigate to the IBM SAS Disk Array Manager by using the steps in Using the IBM SAS Disk Array Manager. 2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Start Adapter Cache. 5. Select your IBM SAS RAID Controller whose battery you just replaced. 6. Press Enter. 19.
Go to step 21. 20. Re-enabling caching on i5/OS
Restart the adapter’s write cache by doing the following: 1. Return to the Work with Resources containing Cache Battery Packs display and select Start IOA cache. Press Enter. 2. Verify that a message stating that the Cache was started has been received. 21. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
372 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
22. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
23. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 373
24. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
25.
Go to Chapter 5, “Verifying a repair,” on page 557. 374 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Control panel Use this procedure to service the control panel. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 375
3. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
4. Remove the operator panel
1. Release the operator panel by pushing the release tab (A) to the left. 2. Grasp the edges of the operator panel and pull it partially out of the bay. 3. Press the locking pins (C) inward and gently pull the operator panel completely out of the bay, taking care not to pull on the cables (B). 4. Rotate the operator panel downward if needed, and disconnect the operator panel cables (B) from the operator panel. 5. Record the operator panel cables orientation, for reconnecting them the same way.
376 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Install the operator panel
1. Carefully guide both operator panel cables (A) through the cables conduit in the extension of the operator panel while gently sliding the operator panel into the operator panel bay. 2. Connect the operator panel cables (A) to the operator panel. 3. Rotate the front of the operator panel upward and continue sliding it into the bay until the operator panel engages and locks into place.
6. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
Chapter 4. Removing and replacing FRUs 377
7. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
8.
Go to Chapter 5, “Verifying a repair,” on page 557. 378 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Disk Drive Use this procedure to replace the disk drive.
Your system can be powered off or powered on when you replace the disk drive. Do one of the following: v If the operating system is not running, or if the disk drive to be replaced is in the AIX or Linux root volume group (rootvg) and it is not protected with either a Redundant Array of Independent Disks (RAID) or mirroring, use the procedure for replacing the disk drive with the power off. The power-off procedures are also known as nonconcurrent in the following information. v If the operating system is running, and if the disk drive to be replaced is in not the AIX or Linux root volume group (rootvg) and it is protected with either a Redundant Array of Independent Disks (RAID) or mirroring, you can replace a disk drive with the power on. The power-on procedures are also known as concurrent in the following information. 1.
Do you want to replace the disk drive concurrently with system operations and the unit powered on?
Yes
No
?
Go to “Replace the disk drive nonconcurrently” on page 388. 2.
Choose the procedure that is appropriate for the operating system running in the partition that the disk drive resource is assigned to. For AIX, go to “Replace the disk drive using AIX.” For Linux, go to “Replace the disk drive using Linux” on page 382. “Replace the disk drive using AIX”Use this procedure to remove and replace a disk drive concurrently using AIX. Replace the disk drive using LinuxUse this procedure to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux. “Replace the disk drive nonconcurrently” on page 388 “Rebuild data on a replacement disk drive using Linux” on page 390To rebuild data on the replacement disk drive, complete the steps listed here.Replace the disk drive using AIX Use this procedure to remove and replace a disk drive concurrently using AIX. Chapter 4. Removing and replacing FRUs 379
1.
Remove the disk drive as a resource: 1. Log in as root user or use CE Login. 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. Select Hot Plug Task. 6. Select RAID Hot Plug Devices. 7. Select the adapter that is connected to the array that contains the disk drive you want to replace and press Enter. 8. Select Commit. 9. On the Identify and Remove Resources display, select Identify. 10. Select the disk drive you want to remove and press Enter. The disk drive slot enters the identify state. The concurrent maintenance light for the slot begins flashing. The following figure shows the locations of the concurrent maintenance lights. Verify that the flashing light corresponds to the location for the disk drive you want to remove, and press Enter. The disk drive slot exits the identify state. 11. On the Identify and Remove Resources display, select Remove. A list of the disk drives that you can remove is shown. If the disk drive you want to remove does not appear on the list, ask your system administrator to put the disk drive in the failed state before continuing with this procedure. For information, see AIX System Management Guide: Operating System and Devices. 12. Select the disk drive you want to remove and press Enter. The disk drive slot enters the remove state and power is removed from the slot. The concurrent maintenance light for the slot begins flashing rapidly.
Note: If the disk drive slot is not in the Remove state, contact your next level of support. 380 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
2. Remove the Front Cover
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, rack-specific FRU orientation, and so on. 1. Open the front rack door. 2. Remove the two thumbscrews (A) that secure the system unit (B) to the rack. 3. Push the cover release latches (C) in the direction of the arrows to release the cover from the system unit. 4. Pull the cover out and away from the system unit.
3.
Remove the disk drive: 1. Unlock the disk drive handle (A) and pull the disk drive out toward you as shown. 2. Support the bottom of the disk drive as you slide it out of the system unit. Do not hold the disk drive by the handle. The concurrent maintenance light for the slot turns off when you remove the disk drive.
Chapter 4. Removing and replacing FRUs 381
4.
Install the disk drive: 1. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 2. Slide the disk drive into the system until it stops. 3. Push the disk drive handle in until it locks. Note: When you install a disk drive, ensure that the drive is fully seated and all the way into the system.
5.
Add the new disk drive as a resource: 1. Press Enter. The disk drive slot exits the remove state and enters the normal state. 2. Exit to the RAID Hot-Plug Devices menu. Press the F3 or ESC 3 key to return. 3. Exit to the Task selection display. 4. Select Log Repair Action. 5. Select the disk drive that you replaced and then press Enter. 6. Select Commit after you have made your selection and then press Enter. 7. Exit to the command line. 6.
To rebuild data on the replacement disk drive, refer to the information in the AIX System Management Guide: Operating System and Devices. 7. Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure.Replace the disk drive using Linux Use this procedure to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux. Prepare to remove the disk drive 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 382 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown. 4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter. 5. Find the entry in the log for the disk drive you want to replace. 6. Record the location information for the disk drive. Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0 is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit). 7. Return to the command line. 8. Type the following: ls -ld /sys/class/scsi_host/host#/device where # is the SCSI host number. Press Enter. 9. Record the PCI location information. Note: The PCI location information has the form of 61:01:0:2. 10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown. Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 12. Kernel Messages Log
Chapter 4. Removing and replacing FRUs 383
12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status. 13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. 14. Type option 1 (Display hardware resource information details) next to the disk drive you want to replace. Press Enter. A Disk Hardware Resource Information Details display similar to the following is shown. 15. Record the physical location information. 16. Return to the IBM Power RAID Configuration Utility display. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 13. Example Display Hardware Status Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 14. Example Disk Hardware Resource Information Details display
384 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Remove the disk drive 1. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 2. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, similar to the following display. 3. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter. 4. The Verify Device Concurrent Remove display is shown. The concurrent maintenance light turns on for that disk drive slot. 5. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive you want to replace, then press Enter. The identify light turns on for the disk drive. Attention: To prevent loss of data, ensure that the disk drive is not in use. 6. The Complete Device Concurrent Remove display is shown. Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 15. Example Concurrent Device Remove display
Chapter 4. Removing and replacing FRUs 385
Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.7. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as shown in Figure 16. If the handle is not all the way out, the disk drive will not slide out of the system or expansion unit. 8. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 9. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive.Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle them with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle and pull the disk drive out toward you before you install the disk drive. If the handle is not all the way out, the disk drive will not slide into the system or expansion unit. 4. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 5. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter. Figure 16. 386 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
A Concurrent Device Add display similar to the following is shown. 6. Type option1 (Select) next to the location from which you removed the disk drive. The Verify Device Concurrent Add display is shown. 7. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add display is shown. 8. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive all the way into the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 18. 10. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. 11. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuild data on a replacement disk drive using Linux” on page 390. 12. Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 17. Example Concurrent Device Add display Figure 18. Chapter 4. Removing and replacing FRUs 387
Replace the disk drive nonconcurrently
Use this procedure to remove and replace a disk drive nonconcurrently. 1. Power off the system and remove the power cords
The component will not be serviced concurrently. If this system is powered on, power it off before continuing. If possible, shut down any running applications and the operating system before powering off the system. Once the system is powered off, remove all power cords from all of the power supplies.
2. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
388 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3.
Remove the disk drive: 1. Unlock the disk drive handle (A) and pull the disk drive out toward you as shown. 2. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle.
4.
Install the disk drive: 1. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 2. Slide the disk drive into the system until it stops. 3. Push the disk drive handle in until it locks. Note: When you install a disk drive, ensure that the drive is fully seated and all the way into the system.
Chapter 4. Removing and replacing FRUs 389
5.
If you removed the power cords, reinstall the power cords for all of the power supplies.
6.
Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure. Rebuild data on a replacement disk drive using Linux To rebuild data on the replacement disk drive, complete the steps listed here. For an unprotected disk drive If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, perform these tasks: 1. Re-create the disk array. 2. Re-create the file systems on the disk array. 3. Copy the data back to the restored disk array from your backup media. For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux, SA23-1327. 390 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Disk unit backplane Use this procedure to service the disk unit backplane. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 391
3. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
4. Remove the slimline media or filler panel device
1. If there is a slimline media or filler panel device in the system, push the terra cotta plastic retaining tab (A) approximately 5mm to the right away from the device. 2. While depressing the terra cotta plastic retaining tab, grasp the device (B) and pull it out of the system.
392 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the operator panel
1. Release the operator panel by pushing the release tab (A) to the left. 2. Grasp the edges of the operator panel and pull it partially out of the bay. 3. Press the locking pins (C) inward and gently pull the operator panel completely out of the bay, taking care not to pull on the cables (B). 4. Rotate the operator panel downward if needed, and disconnect the operator panel cables (B) from the operator panel. 5. Record the operator panel cables orientation, for reconnecting them the same way.
6. Remove the disk units (DASD) and fillers
Note: Support the bottom of the disk unit while you slide it out. Do not hold the disk unit by the handle. 1. Open the disk units handles by pinching the two surfaces of the handles together. 2. Pull all of the disk units out of the disk unit cage and backplane. 3. Repeat these steps for any disk unit fillers that are present.
Chapter 4. Removing and replacing FRUs 393
7. Place into service position
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
8. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
394 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
10. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
Chapter 4. Removing and replacing FRUs 395
11. Disconnect the media device cables, if you have a media device
1. Media device cable (A) does not need to be touched. 2. Squeeze the media device cable (B) as shown and disconnect it from its connector on the disk unit backplane. 3. Disconnect the media device cable (C) from its connector on the disk unit backplane.
12. Remove the disk unit cage and backplane
1. Rotate the handles outward in the direction shown to release the disk unit cage and backplane. 2. Slide the disk unit cage and backplane out of the system unit.
396 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13. Install the disk unit cage and backplane
1. Slide the replacement disk unit cage and backplane into the system until you feel it engages and locks into place. 2. Rotate the handles in the direction shown to secure the disk unit cage and backplane.
14. Connect the media device cables, if you have a media device
1. Media device cable (A) does not need to be touched. 2. Connect the media device cables (B) and (C) to their connectors on the disk unit backplane.
Chapter 4. Removing and replacing FRUs 397
15. Install the disk units (DASD) and fillers
1. Slide the disk units back into the disk unit cage and backplane. 2. Push in the handles of each disk unit to lock them into place. 3. Repeat these steps for any disk unit fillers that are present.
16. Install the operator panel
1. Carefully guide both operator panel cables (A) through the cables conduit in the extension of the operator panel while gently sliding the operator panel into the operator panel bay. 2. Connect the operator panel cables (A) to the operator panel. 3. Rotate the front of the operator panel upward and continue sliding it into the bay until the operator panel engages and locks into place.
398 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
17. Install the slimline media or filler panel device
1. If there was a slimline media or filler panel device present in the system, align the device (A) with the slimline media bay. 2. Push it into the system until the retaining tab (B) locks the device into place.
18. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
Chapter 4. Removing and replacing FRUs 399
19. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
20. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
400 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
21. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
22. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
Chapter 4. Removing and replacing FRUs 401
23. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
24.
Go to Chapter 5, “Verifying a repair,” on page 557. 402 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Ethernet card Use this procedure to service the Ethernet card. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
3. Disconnect the external cables
1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. Chapter 4. Removing and replacing FRUs 403
4. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
5. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
404 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
7. Remove the Ethernet adapter
1. Slide the latch (A) in the direction shown to release the Ethernet adapter (B). 2. Carefully grasp the Ethernet adapter by its blue handle, and pull it straight out of its connector on the system backplane.
Chapter 4. Removing and replacing FRUs 405
8. Install the Ethernet adapter
1. Check the latch (A) is in the open position to let the Ethernet adapter slot accessible. 2. Carefully grasp the Ethernet adapter (B) by its blue handle, and align the card with its connector on the system backplane. 3. Press the Ethernet adapter firmly into its connector. 4. Slide the latch (A) back in the direction of the arrow to secure the Ethernet adapter.
9. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
406 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
11. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 407
12. Reconnect the external cables
Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 13. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
14.
Go to Chapter 5, “Verifying a repair,” on page 557. 408 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Fan Use this procedure to service the fan. 1. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
2. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
Chapter 4. Removing and replacing FRUs 409
3. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
4. Can repair be concurrent?
You must determine if the repair can continue concurrently (i.e. with power on). To continue the repair concurrently, the following conditions must be true: v Fans must already be installed at the other three fan locations. v Each of these fans has a green LED and an amber LED. The LEDs (A) must be set as follows: – Green - on – Amber - offIf any of the conditions are not true, the repair can continue only after powering off the unit containing the FRU that is being repaired. If the unit is already powered off, you may proceed with a non-concurrent repair.
410 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5.
Select an action
Nonconcurrent repair. Power off the unit and continue the repair.
Concurrent repair. Continue the repair with unit power on.
Delay the repair.
?
Go to step 9.
This ends the procedure. 6. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 7. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
8.
Go to step 10 on page 412. 9. Locate the hot-swap cooling fan
Locate the fan you are servicing. Note: Each fan unit has two LEDs. The green LED indicates the fan is operating properly. The amber LED indicates a fan failure. Chapter 4. Removing and replacing FRUs 411
10. Remove the fan
1. Locate the failing fan to be removed. 2. Push the latch (A) upward to disengage the fan from the fan cage. 3. Pull the fan out of the fan cage.
11. Install the fan
1. Lower the fan into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan (A) by closing the latch.
412 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
12. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
13. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
Chapter 4. Removing and replacing FRUs 413
14. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
15. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
16.
Go to Chapter 5, “Verifying a repair,” on page 557. 414 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
GX adapter Use this procedure to service the GX adapter. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 415
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
416 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Disconnect the external cables
1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. 7. Remove the GX adapter
1. Slide the latch in the direction of the arrow (A) to release the GX adapter. 2. Pinch the GX adapter latches (C) and rotate them in the direction shown to release the GX adapter from the slot. 3. Gently pull the GX adapter straight up and out of the system.
Chapter 4. Removing and replacing FRUs 417
8. Install the GX adapter
1. Align the GX adapter with its connector on the system backplane. 2. Gently push the GX adapter straight into the system. 3. Rotate the latches (C) in the direction shown until they latch. 4. Slide the latch back in the direction of the arrow (B) to secure all adapters.
9. Reconnect the external cables
Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 418 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
11. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
Chapter 4. Removing and replacing FRUs 419
12. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
13. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
14.
Go to Chapter 5, “Verifying a repair,” on page 557. 420 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Media device Use this procedure to service the media device. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 421
3. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
4. Place into service position
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
422 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
Chapter 4. Removing and replacing FRUs 423
7. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
8. Disconnect the media device cables
1. Disconnect the media device cable (A) from its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched.
424 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Remove the media device
While pushing the media device release latches (A) on both the left and right sides, grasp the device (B) and pull it out of the system unit.
10. Install the media device
Push the media device (A) into the system unit until you feel the latches (B) lock.
Chapter 4. Removing and replacing FRUs 425
11. Connect the media device cables
1. Connect the media device cable (A) to its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched here.
12. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
426 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
14. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
Chapter 4. Removing and replacing FRUs 427
15. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
16. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
428 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
17. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
18.
Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 429
Memory DIMM Use this procedure to service a memory DIMM. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
430 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
Chapter 4. Removing and replacing FRUs 431
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
432 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
7. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
8. Remove the memory DIMM
1. Locate the memory DIMM you want to remove. 2. Remove the memory DIMM by pushing the tabs (A) out and then down. The tabs’ lever action forces the memory DIMM out of the connector. 3. Pull the memory DIMM (B) out of the connector.
Chapter 4. Removing and replacing FRUs 433
9. Install the memory DIMM
Note: Memory DIMMs are keyed to prevent a DIMM from being installed incorrectly. Note the locations of the key tabs within the DIMM connector before attempting to install the DIMM. 1. Ensure that the connector locking tabs (A) are pushed out in the unlocked position before installing a new memory DIMM. 2. Carefully grasp the memory DIMM (B) along two edges and align the connector. 3. Insert the memory DIMM firmly into the connector. 4. Secure the memory DIMM by pushing in the locking tabs (A).
10. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
434 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
11. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
12. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
Chapter 4. Removing and replacing FRUs 435
13. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
14. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
436 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
15. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
16.
Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 437
PCI adapter Use this procedure to service the PCI adapter. 1. Are you doing a concurrent repair?
You may service the PCI adapter concurrently or nonconcurrently. For more information, see PCI Adapters, SA76-0152. If you are servicing the PCI adapter concurrently, continue with step 4. Otherwise, continue with the next step. 2. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 3. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
4. Disconnect the external cables
1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. 438 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
6. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
Chapter 4. Removing and replacing FRUs 439
7. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
8. Remove the PCI adapter
1. For a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. Slide the latch in the direction of the arrow (A) to release the PCI adapters. 3. Locate the PCI adapter you want to remove. 4. Carefully grasp the PCI adapter (C) by its top edge or upper corners, and pull the PCI adapter up and out of its connector on the system backplane. 5. If you opened the terra cotta latch on top of the front support, close it now.
440 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Install the PCI adapter
1. Place the PCI adapter, component-side up, on a flat, static-protective surface. 2. Set any jumpers or switches on the PCI adapter as instructed by the adapter’s manufacturer. 3. For a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 4. Carefully grasp the PCI adapter (C) by its top edge, and align the PCI adapter with its connector on the system backplane. 5. Press the PCI adapter firmly into its connector. 6. Slide the latch back in the direction of the arrow (B) to secure all adapters. 7. If you opened the terra cotta latch on top of the front support, close it now.
10. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
Chapter 4. Removing and replacing FRUs 441
11. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
12. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
442 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13. Reconnect the external cables
Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 14. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
15.
Go to Chapter 5, “Verifying a repair,” on page 557. “Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX” “Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux” on page 457Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. To remove and replace an adapter, do the following steps: 1. Determine the location of the PCI adapter in the system. Chapter 4. Removing and replacing FRUs 443
2. Record the slot number and location of each adapter being removed. Adapter slots are numbered on the rear of the system unit. 3. Ensure that any processes or applications that might use the adapter are stopped. 4. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at AIX command line. 5. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. 6. At the FUNCTION SELECTION menu, select Task Selection, then press enter. 7. At the Task Selection list, select PCI Hot Plug Manager. 8. Select Unconfigure a Device, then press Enter. 9. Press F4 (or Esc +4) to display the Device Names menu. 10. Select the adapter you are removing in the Device Names menu. 11. Use the Tab key to answer YES to Keep Definition. Use the Tab key again to answer YES to Unconfigure Child Devices, then press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 (or Esc +3) twice to return to the Hot Plug Manager menu. 14. Select replace/remove PCI Hot Plug adapter. 15. Select the slot that has the device to be removed from the system. 16. Select Replace. Or, If you need to remove an adapter without replacing it, select Remove. A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. 17. Press Enter. This places the adapter in the action state, meaning it is ready to be removed from the system. 18. Label, and then disconnect all cables attached to the adapter you plan to remove. 19. Lift up on the PCI adapter EMC shield A as shown in Figure 19 on page 445 and then rotate it up and away from the cassette as shown in Figure 20 on page 446. 444 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Figure
19. Lift up on the EMC shield
Chapter 4. Removing and replacing FRUs 445
20. Remove the cassette. Lift up the lower cassette handle B as shown in the following figure. Pull the PCI cassette C out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. Figure 20. Rotate the EMC shield into the open position
446 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
21. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 22. Install the replacement adapter into the PCI adapter cassette using the following instructions. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette. Follow these steps: a. Remove the cassette cover by doing the following steps: 1) Slide the cover latch A to disengage it from the pivot pin C as shown in the following figure. 2) Lift the cover B off of the pivot pin. 3) Slide the cover off of the cassette. Figure 21. PCI adapter cassette removed from the system unit
Chapter 4. Removing and replacing FRUs 447
b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip A into the horizontal position. See Figure 23 on page 449. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock. v Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. v When the retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide away from the card. v If the corner support retainer is used, unlock it, and then slide the corner support retainer away from the card.2) Push the adapter retainers B away from the adapter. 3) Unlock the adapter tailstock clamp C. 4) Rotate the adapter out of the cassette by grasping the edge of the adapter opposite the tailstock, and then firmly rotate the adapter toward the bottom of the cassette. 5) Lift the adapter out of the tailstock retaining channel. Figure 22. PCI adapter cassette cover removed
448 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette. 1) Remove any shipping handles or brackets attached to the adapter. 2) With the tailstock clamp in the open position, insert the adapter firmly into the tailstock retaining channel A. See Figure 24 on page 450. 3) Rotate the adapter toward the top of the cassette and into place. 4) Close the tailstock clamp. Figure 23. Adapter removed from the PCI adapter cassette
Chapter 4. Removing and replacing FRUs 449
5) Position the adapter retainers to support the adapter, and then rotate the retainer clip B into the closed position. See Figure 24. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. When the adapter retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide toward the adapter. Place the retainers on the adapter according to the length of the adapter being used. Select the appropriate instructions: v Adapter-cassette retainer placement for large adapters v Adapter-cassette retainer placement for mid-sized adapters v Adapter-cassette retainer placement for small adaptersAdapter-cassette retainer placement for large adapters v Place and lock the retainers B. See Figure 25 on page 451. Attention: Use of the lower corner support retainer might interfere with the docking of the PCI card when positioned within the system. Ensure the retainer does not interfere with the adapter connectors on the system backplane. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 24. Adapter removed from the PCI adapter single-width cassette
450 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Adapter-cassette retainer placement for mid-sized adapters v Remove the adapter stabilizer C. See Figure 26 on page 452. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 25. Long adapter in the PCI adapter cassette with the supports and stabilizer in place
Chapter 4. Removing and replacing FRUs 451
Adapter-cassette retainer placement for small adapters v Remove the adapter stabilizer C. See Figure 27 on page 453. v Place the hookarm D into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 26. Medium-length adapter in the PCI adapter cassette with the supports in place
452 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
d. Replace the cassette cover by doing the following steps: 1) Slide the cover B into position on the cassette. 2) While holding the cover latch A in the open position, place the cover over the pivot pin C. 3) Release the cover latch to lock the cover into place. Figure 27. Short adapter in the PCI adapter cassette with the supports and the hookarm in place
Chapter 4. Removing and replacing FRUs 453
23. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 24. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system. 25. Lift and hold the PCI adapter EMC shield in the open position. See Figure 19 on page 445 and Figure 20 on page 446. 26. Slide the cassette C into the cassette slot as shown in the following figure. 27. When the cassette is fully inserted into the system, firmly press downward on the lower cassette handle B to lock the adapter in its connector. Figure 28. PCI adapter cassette cover replaced
454 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
28. Connect the adapter cables. 29. Lower the PCI adapter EMC shield A into the closed position, close the shield latch, then close the rear rack door. Figure 29. PCI adapter cassette removed from the system unit
Chapter 4. Removing and replacing FRUs 455
30. Return to the AIX screen. Choose one of the following options:. v If you are replacing an adapter, go to step 31. v If you removing an adapter without replacing it, follow these steps: a. Press enter and continue to follow the screen instructions until you receive a message that the removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. b. Press F10 to exit the Hot-Plug Manager. c. Run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the configuration. d. Continue at step 40 on page 457.31. Press enter and continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 32. Press the F3 (or Esc+3) key to return to the PCI Hot-Plug Manager menu. 33. Press the F3 (or Esc+3) key to return to the TASK selection list. 34. Select Log Repair Action. 35. Select the resource just replaced, press Enter, press Commit (F7 or ESC 7), then press Enter. 36. Press F3 (or Esc+3) to return to TASK Selection List. Figure 30. PCI adapter EMC shield in the closed position
456 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
37. Select Hot Plug Task, press enter. 38. Select PCI Hot Plug Manager, then select Configure a defined device, then press Enter. 39. Select the device just replaced from the list, then press Enter. The device is now configured. 40. Press the F10 key to exit the diagnostic program. Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux Procedures performed on a PCI adapter with the system power on in Linux, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before performing this procedure, do the following: v Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page 469 v “Verify that the Linux, hot-plug PCI tools are installed” on page 470.To remove and replace an adapter, do the following steps: 1. Determine the location of the PCI adapter in the system. 2. Record the slot number and location of each adapter being removed. Adapter slots are numbered on the rear of the system unit. 3. Ensure that any processes or applications that might use the adapter are stopped. 4. Run the drslot_chrp_pci command to enable an adapter to be replaced: For example, to replace the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command: drslot_chrp_pci -R -s U7879.001.DQD014E-P1-C3 Follow the instructions on the display to complete the task. 5. Label, and then disconnect all cables attached to the adapter you plan to remove. 6. Lift up on the PCI adapter EMC shield A as shown in Figure 31 on page 458 and then rotate it up and away from the cassette as shown in Figure 32 on page 459. Chapter 4. Removing and replacing FRUs 457
Figure
31. Lift up on the EMC shield
458 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
7. Remove the cassette. Lift up the lower cassette handle B as shown in the following figure. Pull the PCI cassette C out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. Figure 32. Rotate the EMC shield into the open position
Chapter 4. Removing and replacing FRUs 459
8. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 9. Install the replacement adapter into the PCI adapter cassette using the following instructions. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette. Follow these steps: a. Remove the cassette cover by doing the following steps: 1) Slide the cover latch A to disengage it from the pivot pin C as shown in the following figure. 2) Lift the cover B off of the pivot pin. 3) Slide the cover off of the cassette. Figure 33. PCI adapter cassette removed from the system unit
460 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip A into the horizontal position. See Figure 35 on page 462. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock. v Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. v When the retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide away from the card. v If the corner support retainer is used, unlock it, and then slide the corner support retainer away from the card.2) Push the adapter retainers B away from the adapter. 3) Unlock the adapter tailstock clamp C. 4) Rotate the adapter out of the cassette by grasping the edge of the adapter opposite the tailstock, and then firmly rotate the adapter toward the bottom of the cassette. 5) Lift the adapter out of the tailstock retaining channel. Figure 34. PCI adapter cassette cover removed
Chapter 4. Removing and replacing FRUs 461
c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette. 1) Remove any shipping handles or brackets attached to the adapter. 2) With the tailstock clamp in the open position, insert the adapter firmly into the tailstock retaining channel A. See Figure 36 on page 463. 3) Rotate the adapter toward the top of the cassette and into place. 4) Close the tailstock clamp. Figure 35. Adapter removed from the PCI adapter cassette
462 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5) Position the adapter retainers to support the adapter, and then rotate the retainer clip B into the closed position. See Figure 36. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. When the adapter retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide toward the adapter. Place the retainers on the adapter according to the length of the adapter being used. Select the appropriate instructions: v Adapter-cassette retainer placement for large adapters v Adapter-cassette retainer placement for mid-sized adapters v Adapter-cassette retainer placement for small adaptersAdapter-cassette retainer placement for large adapters v Place and lock the retainers B. See Figure 37 on page 464. Attention: Use of the lower corner support retainer might interfere with the docking of the PCI card when positioned within the system. Ensure the retainer does not interfere with the adapter connectors on the system backplane. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 36. Adapter removed from the PCI adapter single-width cassette
Chapter 4. Removing and replacing FRUs 463
Adapter-cassette retainer placement for mid-sized adapters v Remove the adapter stabilizer C. See Figure 38 on page 465. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 37. Long adapter in the PCI adapter cassette with the supports and stabilizer in place
464 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Adapter-cassette retainer placement for small adapters v Remove the adapter stabilizer C. See Figure 39 on page 466. v Place the hookarm D into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 38. Medium-length adapter in the PCI adapter cassette with the supports in place
Chapter 4. Removing and replacing FRUs 465
d. Replace the cassette cover by doing the following steps: 1) Slide the cover B into position on the cassette. 2) While holding the cover latch A in the open position, place the cover over the pivot pin C. 3) Release the cover latch to lock the cover into place. Figure 39. Short adapter in the PCI adapter cassette with the supports and the hookarm in place
466 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 11. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system. 12. Lift and hold the PCI adapter EMC shield in the open position. See Figure 31 on page 458 and Figure 32 on page 459. 13. Slide the cassette C into the cassette slot as shown in the following figure. 14. When the cassette is fully inserted into the system, firmly press downward on the lower cassette handle B to lock the adapter in its connector. Figure 40. PCI adapter cassette cover replaced
Chapter 4. Removing and replacing FRUs 467
15. Connect the adapter cables. 16. Lower the PCI adapter EMC shield A into the closed position, close the shield latch, then close the rear rack door. Figure 41. PCI adapter cassette removed from the system unit
468 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
17. Run the lsslot command to verify that the slot is occupied. For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0 Prerequisites for hot-plugging PCI adapters in LinuxIn the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. Verify that the Linux, hot-plug PCI tools are installedIn the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed.Prerequisites for hot-plugging PCI adapters in Linux In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. Figure 42. PCI adapter EMC shield in the closed position
Chapter 4. Removing and replacing FRUs 469
The Linux, system administrator needs to take the PCI adapter offline prior to removing, replacing, or installing a PCI adapter with the system power on (hot-plugging). Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before hot-plugging adapters for storage devices, ensure file systems on those devices are unmounted. After hot-plugging adapters for storage devices, ensure the file systems on those devices are remounted. Before hot-plugging an adapter, ensure that the server or partition is at the correct level of the Linux operating system (Linux 2.6 or later). Install the POWER™ Linux Service Aids. These service aids enable system serviceability, as well to improve system management. If you are using a Linux on POWER distribution with Linux kernel version 2.6 or greater, you can install the Service Aids that gives you access to more capabilities, which can help you diagnose problems on your system. This software is available at the Service and productivity tools for Linux on POWER Web site. (http://techsupport.services.ibm.com/server/lopdiags) Verify that the Linux, hot-plug PCI tools are installed In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed. 1. Enter the following command to verify that the hot-plug PCI tools are installed: rpm -aq | grep rpa-pci-hotplug If the command does not list any rpa-pci-hotplug packages, the PCI Hot Plug tools are not installed. 2. Enter the following command to ensure that the rpaphp driver is loaded: ls -l /sys/bus/pci/slots/ The directory should contain data. If the directory is empty, the driver is not loaded or the system does not contain hot-plug PCI slots. The following is an example of the information displayed by this command: drwxr-xr-x 15 root root 0 Feb 16 23:31 . drwxr-xr-x 5 root root 0 Feb 16 23:31 .. drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.6 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.6 If the directory does not exist, run the following command to mount the filesystem: mount -t sysfs sysfs /sys 3. Ensure the following tools are available in the /usr/sbin directory. v lsslot v drslot_chrp_pci4. Return to the procedure that sent you here. 470 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Power supply Use this procedure to service the power supply. 1. Can repair be concurrent?
You must determine if the repair can continue concurrently. To continue the repair concurrently, the following conditions must be true: v Another power supply must already be installed. v Its three LEDs must be set as follows: – AC Input - on, not blinking – DC Output - on, not blinking – Fault/Identify - offIf any of the conditions are not true, the repair can continue only after powering off the unit containing the FRU that is being repaired. If the unit is already powered off, you may proceed with a non-concurrent repair.
2.
Select an action:
Nonconcurrent repair. Power off the unit and continue the repair.
Concurrent repair. Continue the repair with unit power on.
Delay the repair.
?
Go to step 8 on page 473.
This ends the procedure. 3. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 471
4. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
5. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
472 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Remove the power supply
Attention: Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. 1. Disconnect the power cable (A) from the power supply you are removing. 2. While pushing the lever (B) down, pull the power supply straight out of the system.
7.
Go to step 10 on page 474. 8. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
Chapter 4. Removing and replacing FRUs 473
9. Remove the power supply
Attention: Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. 1. The power supply you are servicing should have its Fault/Identify LED (amber LED) blinking. 2. Disconnect the power cable (A) from the power supply you are removing. 3. While pushing the lever (B) down, pull the power supply straight out of the system.
10. Install the power supply
1. While pushing the lever (A) down, carefully slide the power supply into its location in the system. 2. Connect the power cable (B) to the power supply.
11.
Are you doing a concurrent repair?
Yes
No 474 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Are you doing a concurrent repair?
?
Go to step 13. 12. Is power supply operational?
You must determine if the new power supply is operational. The new power supply is operational if its LEDs are set as follows: v AC Input - on, not blinking v DC Output - on, not blinking v Fault/Identify - blinking or offIf the LEDs are set differently, you must determine why. Make sure the power supply is plugged into a working AC power source, try installing a different power supply, or call your next level of support.
13. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
Chapter 4. Removing and replacing FRUs 475
14. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
15.
Go to Chapter 5, “Verifying a repair,” on page 557. 476 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
RAID enablement card Use this procedure to service the RAID enablement card. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 477
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
478 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the RAID enablement card
1. Push the tab (A) in the direction shown to release the card. 2. Gently pull the card (B) straight up and out of the enclosure.
Chapter 4. Removing and replacing FRUs 479
7. Install the RAID enablement card
1. Push the card (B) down in the direction shown, along the sliders of the middle divider. 2. Align the card with its connector on the system backplane. 3. Push the tab (A) in the direction shown and gently push the card down until it engages into its slot. 4. Release the tab (A) to lock the card into its slot.
8. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
480 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
10. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 481
11. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
12.
Go to Chapter 5, “Verifying a repair,” on page 557. 482 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Auxiliary cache card Use this procedure to service the auxiliary cache card. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 483
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
484 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the auxiliary cache card
Note: When removing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Push the latch (B) in the direction shown to release the card. 3. Gently pull the card (C) straight up and out of the enclosure.
Chapter 4. Removing and replacing FRUs 485
7. Install the auxiliary cache card
Note: When installing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Align the card (C) with its connector on the system backplane. 2. Push the latch (B) in the direction shown, and gently push the card (C) down until it engages into its slot. 3. Release the latch (B) to lock the card into its slot. 4. Close the terra cotta latch on top of the front support (A).
8. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
486 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
10. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 487
11. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
12.
Go to Chapter 5, “Verifying a repair,” on page 557. 488 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Slimline media Use this procedure to service the Slimline media. 1. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
2. Can repair be concurrent?
You must determine if the repair can continue concurrently (i.e. with power on). If the top LED is green as shown in the figure on the right, you may proceed with a concurrent repair. Otherwise, the repair can continue only after powering off the unit. If the unit is already powered off, you may proceed with a non-concurrent repair.
Chapter 4. Removing and replacing FRUs 489
3.
Select an action:
Nonconcurrent repair. Power off the unit and continue the repair.
Concurrent repair. Continue the repair with unit power on.
Delay the repair.
?
Go to step 6 on page 491.
This ends the procedure. 4. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 5. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
490 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Remove the slimline media device
1. Push the terra cotta plastic retaining tab (A) approximately 5mm to the right away from the slimline media device. 2. While depressing the terra cotta plastic retaining tab, grasp the device (B) and pull it out of the system.
7. Install the slimline media device
1. Align the replacement device (A) with the slimline media bay. 2. Push it into the system until the retaining tab (B) locks the device into place.
Chapter 4. Removing and replacing FRUs 491
8. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
9. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
10.
Go to Chapter 5, “Verifying a repair,” on page 557. 492 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
System backplane Use this procedure to replace the system backplane. 1. Access the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. To access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 4. Log on to the ASMI using the administrator userid and password.When you have accessed the ASMI, continue with the next step. 2. Check and record the server settings
Prior to replacing this FRU, check and record all server settings as you may need to update the system after you update this FRU. This can be done through the Advanced System Management Interface (ASMI) . 1. Check and record the server firmware level indicated on the right edge of the status frame, the area where the Log out button is located. You may need to update the system to the latest server firmware code level after you update this FRU. 2. Check and record the service processor settings you previously set using ASMI. You will need to reset these service processor settings after you update this FRU or the default settings will be used. Settings to record include the following: v Power/Restart Control settings. v System Service Aids settings. v System Configuration settings, System Name setting. v Network Services settings. v Performance Setup settings. v Login Profile settings.3. Record any service processor settings you may have set using operating system commands. 3. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 493
4. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
5. Remove the front cover
Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit.
494 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Remove the power supply or both power supplies
1. Disconnect the power cable (A) from the first power supply you are removing, either E1 or E2. 2. While pushing the lever (B) down, pull the power supply straight out of the system. 3. Repeat these steps to remove the other power supply, if present.
7. Disconnect all the external cables
1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. This can include cables on the following ports: a. Serial b. SPCN c. GX d. Ethernet e. PCI f. USB3. 4. Then disconnect the external cables. Chapter 4. Removing and replacing FRUs 495
8. Place into service position
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
9. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
496 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
11. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
Chapter 4. Removing and replacing FRUs 497
12. Disconnect the media device cables
1. Disconnect the media device cable (A) from its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched.
13. Remove the disk unit cage and backplane
1. Rotate the handles outward in the direction shown to release the disk unit cage and backplane. 2. Slide the disk unit cage and backplane out of the system unit.
498 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
14. Remove the Ethernet adapter
1. Slide the latch (A) in the direction shown to release the Ethernet adapter (B). 2. Carefully grasp the Ethernet adapter by its blue handle, and pull it straight out of its connector on the system backplane.
15. Remove the GX adapter divider
Grasp the GX adapter divider by its top edge, and slide it up and out of the system.
Chapter 4. Removing and replacing FRUs 499
16. Remove all GX adapters, if present
Note: Record the slot location of all adapters that are going to be removed, so that they can be reinstalled in exactly the same place after the repair. 1. If the latch is in position (B), slide it in the direction of the arrow (A) to release the GX adapters. 2. Pinch the GX Adapter latches (C) and rotate them in the direction shown to release the GX adapter from the slot. 3. Gently pull the GX adapter straight up and out of the system. 4. Repeat these steps to remove a second GX adapter, if present.
17. Remove the PCI adapter dividers
1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Flex the front edge (B) of the PCI adapter divider out of the bracket and toward the rest of the divider. 3. Pull the back edge of the divider away from the retention notches (C) in the system chassis. 4. Repeat these steps for the other PCI adapter dividers that need to be removed.
500 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
18. Remove all PCI adapters
Note: Record the slot location of all adapters that are going to be removed, so that they can be reinstalled in exactly the same place after the repair. 1. If there is a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. If the latch is in position (B), slide it in the direction of the arrow (A) to release the PCI adapters. 3. Carefully grasp the PCI adapter (C) by its top edge or upper corners, and pull the PCI adapter up and out of its connector on the system backplane. 4. Store the PCI adapter in a safe place. 5. Repeat these steps until all PCI adapters are removed.
19. Remove the auxiliary cache card, if present
Note: When removing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. If needed, squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Push the latch (B) in the direction shown to release the card. 3. Gently pull the card (C) straight up and out of the enclosure.
Chapter 4. Removing and replacing FRUs 501
20. Remove the PCI front support
1. Loosen the blue retaining tab (A) that secures the PCI front support to its enclosure. 2. Lift the PCI front support (B) up and out of the enclosure.
21. Remove the RAID enablement card, if present
1. Push the tab (A) in the direction shown to release the card. 2. Gently pull the card (B) straight up and out of the enclosure.
502 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
22. Remove the TPMD card, if present
Gently pull the TPMD card (A) straight up and out of its slot on the system backplane.
23. Remove the middle system divider
1. Loosen the two blue retaining tabs (A) that secure the middle system divider from the system chassis. 2. Grasp the middle system divider (B) and lift it out of the system.
Chapter 4. Removing and replacing FRUs 503
24. Remove the VPD card
1. Press gently on the handle (A) to unlock the VPD card (B) from its slot on the system backplane. 2. Pull the VPD card out of its slot
25. Remove the voltage regulator module pairs
1. Locate the first voltage regulator module pair. 2. Push the connector tabs (A) out and then down to unlock one module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other module in the pair. 5. Keep the two modules of the pair together. 6. Repeat these steps to remove the other module pair.
504 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
26. Remove the single voltage regulator modules
1. Locate the first single voltage regulator module. 2. Push the connector tabs (A) out and then down to unlock one module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other voltage regulator module.
27. Remove all memory DIMMs
1. Remove the memory DIMM by pushing the tabs (A) out and then down. The tabs’ lever action forces the memory DIMM out of the connector. 2. Pull the memory DIMM (B) out of the connector. 3. Repeat these steps until all memory DIMMs have been removed from the system backplane being replaced.
Chapter 4. Removing and replacing FRUs 505
28. Remove the line cord assembly
1. Loosen the upper right thumbscrew (A) located on the back of the system chassis. 2. Slide the line cord assembly (B) toward the front of the system to unlock it. 3. Without removing the line cords, grasp the line cord assembly, disengage it, pull it upward, and place it next to the system chassis.
29. Remove the system backplane
Note: Take care when removing the system backplane. Do not lift the system backplane by any of the attached modules. 1. Loosen the two thumbscrews (A) located on the back of the system chassis, as shown in the figure. 2. Push the system backplane (B) toward the front of the system - about 10 mm (half inch) - to unlock it from the system chassis. 3. Grasp the system backplane by the metal frame (C) located at the back of the backplane. 4. Carefully lift the system backplane out of the chassis.
506 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
30. Install the system backplane
Note: Take care when installing the system backplane. Do not lift the system backplane by any of the attached modules. 1. Carefully grasp the replacement system backplane (A) along two edges. 2. Lower the system backplane at an angle, with the front of the backplane connecting with the system chassis first. 3. When the system backplane lays at the bottom of the system chassis, slide it toward the back of the system chassis - about 10 mm (half inch) - until it locks. 4. Tighten the two thumbscrews (B) located on the back of the system chassis, as shown in the figure, to secure the backplane to the chassis.
31. Install the line cord assembly
1. Place the line cord assembly (A) into position, ensuring the slots (B) on the line cord assembly align with the pins on the system chassis. 2. Slide the line cord assembly toward the back of the system to lock it into place. 3. Insert and tighten the upper right thumbscrew (C) located on the back of the system chassis, to secure the line cord assembly.
Chapter 4. Removing and replacing FRUs 507
32. Install all memory DIMMs
Attention: Memory DIMMs are keyed to prevent a DIMM from being installed improperly. Note the locations of the key tabs within the DIMM connector before attempting to install the DIMM. 1. Ensure that the connector locking tabs (A) are pushed out in the unlocked position before installing a memory DIMM. 2. Carefully grasp the memory DIMM (B) along two edges and align the connector. 3. Insert the memory DIMM firmly into the connector. 4. Secure the memory DIMM by pushing in the locking tabs (A). 5. Repeat these steps until all memory DIMMs have been installed.
33. Install the two single voltage regulator modules
1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new voltage regulator module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position. 5. Repeat these steps to install the other single voltage regulator module.
508 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
34. Install the two voltage regulator module pairs
1. Install the voltage regulator modules, pair by pair. 2. a. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing the module (B). b. Carefully grasp the module along two edges and align it with the connector. c. Insert the module into the connector. d. Push the connector tabs into the locked position. e. Repeat these steps to install the other module in the pair.3. Repeat these steps to install the other pair of modules.
35. Install the VPD card
1. Press gently on the handle (A) to insert the VPD card (B) into its slot on the system backplane. 2. Push the VPD card into place until it is fully seated.
Chapter 4. Removing and replacing FRUs 509
36. Install the middle system divider
1. Align the two blue tabs (B) that secure the middle system divider (A) with the two holes. 2. Slide the middle system divider down and into its position, and align it with the pin (C) in the middle on the system backplane. 3. Tighten the two blue tabs (B) that secure the middle system divider with the system chassis.
37. Install the TPMD card, if previously removed
Gently push the TPMD card (A) straight into its slot on the system backplane.
510 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
38. Install the RAID enablement card, if previously removed
1. Push the card (B) down in the direction shown, along the sliders of the middle divider. 2. Align the card with its connector on the system backplane. 3. Push the tab (A) in the direction shown and gently push the card down until it engages into its slot. 4. Release the tab (A) to lock the card into its slot.
39. Install the PCI front support
1. Lower the PCI front support (A) into its location on the enclosure, aligning the tab with the hole on the back of the front chassis. 2. Tighten the blue tab (B) that secures the PCI front support.
Chapter 4. Removing and replacing FRUs 511
40. Install the auxiliary cache card, if previously removed
Note: When installing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Align the card (C) with its connector on the system backplane. 3. Push the latch (B) in the direction shown, and gently push the card (C) down until it engages into its slot. 4. Release the latch (B) to lock the card into its slot.
41. Install all PCI adapters, if previously removed
Note: Reinstall the cards in the same slot locations as before the repair. 1. If there is a long PCI adapter and if needed, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. If needed, slide the latch in the direction of the arrow (A) to let the PCI adapter slot accessible. 3. Carefully grasp the PCI adapter (C) by its top edge, and align the PCI adapter with its connector on the system backplane. 4. Press the PCI adapter firmly into its connector. 5. Repeat these steps until all PCI adapters are installed.
512 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
42. Install the PCI adapter dividers
1. If needed, squeeze the terra cotta latch (C) on top of the front support, and swing it up in the open position. 2. Grasp the PCI adapter divider by its top edge and align the back edge of the divider with the retention notches (A) in the system chassis. 3. Slide the front edge of the divider (B) into its slot. 4. Repeat these steps for the other PCI adapter dividers that need to be installed. 5. Close the terra cotta latch (C) on top of the front support.
43. Install all GX adapters, if previously removed
Note: Reinstall the cards in the same slot locations as before the repair. 1. If needed, slide the latch in the direction of the arrow (A) to let the GX adapter slot accessible. 2. Align the GX adapter with its connector on the system backplane. 3. Gently push the GX adapter straight into the system. 4. Rotate the latches (C) in the direction shown until they latch. 5. Repeat these steps to install a second GX adapter, if previously removed.
Chapter 4. Removing and replacing FRUs 513
44. Install the GX adapter divider
1. Align the GX adapter divider (A) with its sliders (B) and (C). 2. Slide the GX adapter divider down and into the system.
45. Install the Ethernet adapter
1. Check the latch (A) is in the open position to let the Ethernet adapter slot accessible. 2. Carefully grasp the Ethernet adapter (B) by its blue handle, and align the card with its connector on the system backplane. 3. Press the Ethernet adapter firmly into its connector. 4. Slide the latch (A) back in the direction of the arrow to secure the Ethernet adapter.
514 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
46. Install the disk unit cage and backplane
1. Slide the replacement disk unit cage and backplane into the system until you feel it engages and locks into place. 2. Rotate the handles in the direction shown to secure the disk unit cage and backplane.
47. Connect the media device cables
1. Connect the media device cable (A) to its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched here.
Chapter 4. Removing and replacing FRUs 515
48. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
49. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
516 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
50. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
51. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
Chapter 4. Removing and replacing FRUs 517
52. Install the power supply or both power supplies
1. While pushing the lever (A) down, carefully slide the power supply into its location either E1 or E2 in the system. 2. Connect the power cable (B) to the power supply. 3. Repeat these steps to install the other power supply, if present.
53. Install the front cover
1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door.
518 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
54. Reconnect all the external cables
Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect all external cables to the rear of the machine you are servicing. This can include cables on the following ports: 1. Serial 2. SPCN 3. GX 4. Ethernet 5. PCI 6. USB 55. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
56. Restore network connectivity
The service processor must connect to the network to be accessible from the HMC.
How will the IP Address of the Service Processor be managed?
Obtain an IP Address automatically from a DHCP server
Configure a static IP Address
?
Go to step58 on page 520. Chapter 4. Removing and replacing FRUs 519
57. Restore service processor settings
Enable network access to the service processor by proceeding as follows: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. Perform the following steps: 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the system backplane. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time.
Do you have network access to the service processor?
Yes
No
Go to step 60 on page 521.
Go to step 62 on page 521 58. Restore service processor settings through the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, controlling the system power, and setting service processor network ports. The ASMI may also be referred to as the service processor menus. The ASMI can be accessed through https. As the network connection uses static IP address assignments, perform the following: 1. To know your service processor network ports, if necessary, you can use the control panel in manual mode. Refer to service functions to get control panel documentation in that case. 2. Connect a client with a Web browser directly to the service processor network ports you previously noted. It could be something like the following URLs: v https://169.254.2.147 v https://169.254.3.1473. If not already connected, connect all system power cables by plugging them into power outlets.Note: Do not start the system at this time. 4. Log on to the ASMI with the ?admin? User ID and ?admin? default Password. 5. Change the ?admin? User ID’s Password and the ?general? User ID’s Password when prompted. 6. To configure network access perform the following: a. Click on Network Configuration under the Network Services node. b. On the right pane, configure eth0 and eth1 network interfaces, choose for Type of IP Address ’Static’, set a Host name, an IP address, a Subnet mask. 59. Restore service processor settings
With the network connection now configured to use static IP address assignments, try to access the service processor network ports.
Do you have network access to the service processor?
Yes
No
?
Go to step 63 on page 522 520 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
60. Access the ASMI
If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 61.
Go to step 65 on page 522. 62. Restore service processor settings
If the network connection uses DHCP to establish an IP address, but does not use an HMC as the DHCP server, perform the following: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time.
Do you have network access to the service processor?
No
Yes
?
Go to step 60 Chapter 4. Removing and replacing FRUs 521
63. Restore service processor settings
You have indicated that you are still not able to access the service network. You will need to reset the service processor by pressing the pin-hole switch (E) on the operator panel.
64.
Go to step 56 on page 519. 65. Restore service processor settings
Change the ?admin? User ID’s password when prompted by performing the following steps: 1. In the navigation area of the ASMI, expand Login Profile. 2. Select Change Password under Login Profile. 3. Specify the required information, and click Continue.As a security measure, you are required to enter your current user password in the Current password for current user ID field. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the ?admin? User ID is ?admin?. After your initial login to the ASMI, the ?admin? password must be changed. 522 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
66. Restore service processor settings
Set the system name. The system name can be changed to any valid ASCII string. It does NOT have to follow the initialized MachineType-Model#-Serial# format. To change the system name, perform the following: 1. In the navigation area of the ASMI, expand System Configuration. 2. Select System Name under System Configuration. 3. Enter the desired system name. 4. Click Save settings to update the system name to the new value.The new system name is displayed in the status frame, the area where the Log out button is located. If another method, such as the HMC, is used to change the system name, the status frame does not reflect the change. 67. Restore service processor settings
Set the time-of-day. 1. In the navigation area of the ASMI, expand System Configuration. 2. Select Time of Day under System Configuration. 3. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, seconds). 4. Change either the date or the time or both, and click Save settings. 68. Restore service processor settings
Reenter any of the following settings that were previously changed through the ASMI, unless you want to use the default settings. Settings to be set include the following: 1. Power/Restart Control settings. 2. System Service Aids settings. 3. System Configuration settings, if not already done. 4. Network Services settings. 5. Performance Setup settings. 6. Login Profile settings. Chapter 4. Removing and replacing FRUs 523
69. Restore service processor settings
1. Reset any service processor settings that you may have set using operating system commands. You recorded these settings at the beginning of this procedure. 2. If you choose to reset the HMC Access password, perform the following: 3. a. From the HMC GUI (preferred method): b. 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Use the checkbox to select a server, in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information.c. d. Or, from the HMC command line, type: e. /usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword where: v The value for managedsystem is the new service processor’s managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpasswd is the newpassword value. 70. Restore partition data on the service processor, if you had a partition in your system
Profile data stored in the managed server has been cleared or corrupted. To recover profile data using the HMC, proceed as follows: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the checkbox to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. 71.
Go to Chapter 5, “Verifying a repair,” on page 557. 524 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Thermal power management device (TPMD) card Use this procedure to service the thermal power management device (TPMD) card. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 525
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
526 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the TPMD card
Gently pull the TPMD card (A) straight up and out of its slot on the system backplane.
Chapter 4. Removing and replacing FRUs 527
7. Install the TPMD card
Gently push the TPMD card (A) straight into its slot on the system backplane.
8. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
528 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
10. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 529
11. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
12.
Go to Chapter 5, “Verifying a repair,” on page 557. 530 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Voltage regulator module pair Use this procedure to service the voltage regulator module pair. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 531
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
532 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
Chapter 4. Removing and replacing FRUs 533
7. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
8. Remove the voltage regulator module pair
1. Locate the voltage regulator module pair. You will need to remove both voltage regulator modules of the pair. 2. Push the connector tabs (A) out and then down to unlock the module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other module of the pair.
534 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Install a voltage regulator module pair
1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position. 5. Repeat these steps to install the other voltage regulator module in the pair.
10. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
Chapter 4. Removing and replacing FRUs 535
11. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
12. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
536 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
14. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 537
15. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
16.
Go to Chapter 5, “Verifying a repair,” on page 557. 538 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Voltage regulator module - single Use this procedure to service a single voltage regulator module. 1. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 539
3. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
540 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
6. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
Chapter 4. Removing and replacing FRUs 541
7. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
8. Remove a single voltage regulator module
1. Locate the voltage regulator module. 2. Push the connector tabs (A) out and then down to unlock the module. 3. Pull the module (B) out of the connector.
542 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
9. Install a single voltage regulator module
1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new voltage regulator module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position.
10. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
Chapter 4. Removing and replacing FRUs 543
11. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
12. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
544 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
14. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 545
15. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
16.
Go to Chapter 5, “Verifying a repair,” on page 557. 546 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Vital product data (VPD) card Use this procedure to service the vital product data (VPD) card. 1. Save the system identifiers
1. On the ASMI Welcome pane, if you have not already logged in, specify your userid and password, and click Log in. Note: To perform this operation, your authority level must be Administrator or Authorized service provider. 2. In the navigation area, expand System Configuration and then Program Vital Product Data. 3. In the navigation area, select System Brand under Program Vital Product Data. 4. Manually record the value for the System Brand, which appears in the right pane. 5. In the navigation area, select System Keywords under Program Vital Product Data. 6. Manually record the values for the Machine type-model, System serial number and System unique ID, which appear in the right pane. 2. Shut down the partitions and power off the system
This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 3. Disconnect the power cords
1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing.
Chapter 4. Removing and replacing FRUs 547
4. Place into service position
Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
5. Place into service position (continued)
Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked.
548 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Remove the service access cover
1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components.
7. Remove the fans
1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage.
Chapter 4. Removing and replacing FRUs 549
8. Remove the fan cage
1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure.
9. Remove the VPD card
1. Press gently on the handle (A) to unlock the VPD card (B) from its slot on the system backplane. 2. Pull the VPD card out of its slot
550 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
10. Install the VPD card
1. Press gently on the handle (A) to insert the VPD card (B) into its slot on the system backplane. 2. Push the VPD card into place until it is fully seated.
11. Install the fan cage
1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure.
Chapter 4. Removing and replacing FRUs 551
12. Install the fans
1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed.
13. Install the service access cover
1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover.
552 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
14. Place into operating position
Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position.
15. Place into operating position (continued)
1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing.
Chapter 4. Removing and replacing FRUs 553
16. Install the power cords
If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door.
17. Power on the system.
Power on the system. 18. Access the ASMI
If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 19. Set the system identifiers
1. On the ASMI Welcome pane, if you have not already logged in, specify your userid and password, and click Log In. Note: To perform this operation, your authority level must be Administrator or Authorized service provider. 2. In the navigation area, expand System Configuration and Program Vital Product Data. 3. Select System Brand. 4. If the system brand is not set, and only in that case, enter the correct system brand and click Continue. Note: If you do not know the correct system brand, contact your next level of support. 5. Select System Keywords under Program Vital Product Data. 6. If the backed up system keywords are correct, click Save settings. Otherwise, enter the values for the Machine type-model, System serial number, and System unique ID. Set the Reserved field to blank spaces unless directed otherwise by Level 4 technical support. Note: If you do not know the system unique ID, contact your next level of support. You cannot boot the system until valid values are entered for all fields. You can change these entries only once. 7. Click Save settings to update the system keywords and save them to the VPD. 554 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
20.
Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 555
556 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 5. Verifying a repair Use this information to verify hardware operation after making repairs to the system. Choose from the following: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3. v To verify the repair of a system that is currently powered on and an operating system is loaded, go to step 5.1. Power on the server and all attached I/O enclosures.
Did all the enclosures power on?
No
Yes
?
Go to Step 3. 2. Choose from the following:
v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to perform, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system.
Did the operating system load successfully?
No
Yes
?
Go to Step 5. 4. Choose from the following:
v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to perform, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following: “Verifying the repair in AIX” on page 559 “Verifying the repair in Linux” on page 563 © Copyright IBM Corp. 2008 557
“Verifying a repair using an i5/OS system or logical partition” on page 564If you have installed a new feature or part, verify that the system recognizes the feature or part by using the i5/OS system service tools. Use the procedure in this section to accomplish this task. 558 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Verifying the repair in AIX Use this MAP to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group?
NO Go to step 3 YES Continue with the next step. 2. Run standalone diagnostics from either a CD or from a Network Installation Management (NIM) server.
Did you encounter any problems? NO Reinstall the operating system and continue with step 5. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 3. Did you replace a FRU with power on and concurrently with system operations?
NO Go to step 5. YES Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
YES Go to step 6. NO Go to step 7 on page 560.
Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following:
1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems?NO Continue at step 6. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 560; otherwise, perform the following: Chapter 5. Verifying a repair 559
1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step.
7. Perform the following:
1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select System verification. 5. When the DIAGNOSTIC SELECTION menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU.Did the RESOURCE REPAIR ACTION menu (801015) display? NO Continue with the next step. YES Go to step 9. 8. Did the TESTING COMPLETE, no trouble was found menu (801010) display?
YES Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 561. NO If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 9. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing a FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.Note: If the system attention indicator is on, this action will set it back to the normal state.
560 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Perform the following:
1. Select the resource that has been replaced from the RESOURCE REPAIR ACTION menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections.Did another Resource Repair Action (801015) display? NO If the No Trouble Found menu displays, go to step 11. YES Continue with the next step. 10. The parent or child of the resource you just replaced may also require that you run the RESOURCE REPAIR ACTION service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing that FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following:
1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran standalone diagnostics from CD-ROM, remove the standalone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache.
NO Go to the close of call procedure. YES Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following: Chapter 5. Verifying a repair 561
1. On the PCI SCSI Disk Array Manager screen, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration. Press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you get the Are You Sure selection menu, press Enter to continue. 8. If you get a Failed status message, verify that you selected the correct adapter, then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure.
562 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Verifying the repair in Linux 1. Run standalone diagnostics from either a CD or from a Network Installation Management (NIM) server. For more information, see Running the eServer stand-alone diagnostics from CD-ROM.
Did you encounter any problems? NO Reboot the operating system and continue with the close of call procedure. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, perform problem analysis and repair the new problem. Chapter 5. Verifying a repair 563
Verifying a repair using an i5/OS system or logical partition If you have installed a new feature or part, verify that the system recognizes the feature or part by using the i5/OS system service tools. Use the procedure in this section to accomplish this task. 1. Deactivate the failing item indicator light and close the service action log entry. 2. Be sure you are signed on with at least service level authority. 3. On the command line of the i5/OS session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service Focal Point Utilities to get to the Dedicated Service Tools (DST) display. 4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Note: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display. Press Enter. 6. Select Hardware service manager from the Start a Service Tool display. Press Enter. 7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager display. This option allows you to display and work with logical resources. Logical hardware resources are the functional resources of the system used by the operating system. 8. Press Enter. When you select one of the options from the Logical Hardware Resources display, you can do the following: v Display logical hardware resource status v Display or change logical hardware resource information v Display associated packaging hardware resourcesFor more information on collecting information and performing specific functions using the options and function keys from the Logical Hardware Resources display or for details on the symbols that appear next to the Resource Description field, see the online Help information. 564 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 6. Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the HMC online repair procedures, ensure that the original serviceable event is now closed. 1.
For future reference, record the SRC or symptom and the location code of the FRU you replaced. Is the server managed by an HMC?
Yes
No
?
Do one of the following: v If the server is managed by IVM, Go to “Closing a service call using Integrated Virtualization Manager” on page 571. v If the server is not partitioned and running AIX or Linux, go to “Closing a service call using AIX or Linux” on page 577. 2.
On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3.
Are there any service action events that are open?
Yes
No
?
If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. 4.
Record the list of open service action events. © Copyright IBM Corp. 2008 565
5.
From the list of serviceable events recorded in step 4 on page 565, perform the following step (6) through step 32 on page 569 for each open service action event. 6.
Determine the error class of the serviceable event. Record for future use. 7.
Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in Step 1 on page 565?
No
Yes
?
Go to step 11 on page 567. 8.
Examine the FRU list of the service action event. Are there any FRUs listed for the service action event?
Yes
No
?
Go to step 11 on page 567. 9.
Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 565?
No
Yes
?
Go to step 11 on page 567. 10.
The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 565 in the list of FRUs for this service action event?
Yes
No
?
Go to step 32 on page 569. Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair. 566 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
11.
Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12.
Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
Yes
No
?
Go to step 17. 13.
Have you begun a list of ?Axx? partitions from prior service action events that you processed in this MAP?
No
Yes
?
Go to step 15. 14.
Begin a new list of ?Axx? partitions by copying the list of partitions obtained in step 11. Go to step 16. 15.
Add the partition list obtained in step 11 to the existing list of ?Axx? partitions obtained from processing previous service action events in this MAP. 16.
Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17. 17.
Select and highlight the service action event from the Error Associated With This Serviceable Event window. 18.
Click Close Event. 19.
Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. Chapter 6. Closing a service call 567
20.
Did you replace, add, or modify a FRU of the open service action event?
Yes
No
?
Go to step 22. 21.
From the FRU list, select a FRU that you need to update. Double-click on the FRU, and update the FRU information. Go to step 23. 22.
Select the No FRU Replaced for this Serviceable Event option. 23.
Click OK to close the service action event. 24.
Is the list of all partitions you recorded in step 11 on page 567 empty?
No
Yes
?
Go to step 32 on page 569. 25.
Does the list of all partitions you recorded in step 11 on page 567 contain more than one entry?
Yes
No
?
Go to step 32 on page 569. 26.
Is the error class recorded in step 25 AIX?
Yes
No
?
Go to step 32 on page 569. 27.
Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 567, except the partition you were using to debug the original problem. 28.
From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 568 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
29.
When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30.
Exit from diagnostics in this partition and return to the AIX prompt. 31.
Have all the partitions in the list of all partitions you recorded in step 11 on page 567 been processed?
Yes
No
?
Go to step 24 on page 568 to process the next partition in the list you recorded in step 11 on page 567. 32.
Have all the serviceable events recorded in step 4 on page 565 been processed?
Yes
No
?
Go to step 5 on page 566 and process the next service action event in the list of serviceable events recorded in step 4 on page 565. 33.
While processing all service action events, were you directed to step 14 on page 567?
Yes
No
?
If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Chapter 6. Closing a service call 569
34.
Perform the following steps for each entry in the list of ?Axx? partitions you began recording in step 14 on page 567, except the partition you were using to debug the original problem. 35.
From the HMC virtual terminal window of a partition in the list of ?Axx? partitions, type diag at the AIX command prompt. 36.
When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37.
Exit from diagnostics in this partition and return to the AIX prompt. 38.
Have all the partitions in the list of ?Axx? partitions you began recording in step 14 on page 567 been processed?
Yes
No
?
Go to step 34 to process the next partition in the list you recorded in step 14 on page 567. 39.
If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. “Closing a service call using Integrated Virtualization Manager” on page 571Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. “Closing a service call using AIX or Linux” on page 577If the server is not connected to an HMC and not using IVM, perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. “Activating and deactivating LEDs” on page 583Use this procedure to activate or deactivate LEDs.
570 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1.
For future reference, record the SRC or symptom and the location code of the FRU you replaced. 2.
On the IVM, open Manage Serviceable Events and look at existing serviceable events. 3.
Are there any service action events that are open?
Yes
No
?
If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. 4.
Record the list of open service action events. 5.
From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 575 for each open service action event. 6.
Determine the error class of the serviceable event. Record for future use. Chapter 6. Closing a service call 571
7.
Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in Step 1 on page 571?
No
Yes
?
Go to step 11. 8.
Examine the FRU list of the service action event. Are there any FRUs listed for the service action event?
Yes
No
?
Go to step 11. 9.
Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 571?
No
Yes
?
Go to step 11. 10.
The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 571 in the list of FRUs for this service action event?
Yes
No
?
Go to step 30 on page 575. Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair. 11.
Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12.
Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
Yes
No
?
Go to step 17 on page 573. 572 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
13.
Have you begun a list of ?Axx? partitions from prior service action events that you processed in this MAP?
No
Yes
?
Go to step 15. 14.
Begin a new list of ?Axx? partitions by copying the list of partitions obtained in step 11 on page 572. Go to step 16. 15.
Add the partition list obtained in step 11 on page 572 to the existing list of ?Axx? partitions obtained from processing previous service action events in this MAP. 16.
Remove all entries in the list of all partitions you recorded in step 11 on page 572. If you are referred to the list of partitions obtained in step 11 on page 572 in future steps, the list is empty. Go to step 17. 17.
Select and highlight the service action event from the Manage Serviceable Events window. 18.
Click Close Event. 19.
Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20.
Did you replace, add, or modify a FRU of the open service action event?
Yes
No
?
Go to step 22 on page 574. 21.
Click OK to close the service action event. Chapter 6. Closing a service call 573
22.
Is the list of all partitions you recorded in step 11 on page 572 empty?
No
Yes
?
Go to step 30 on page 575. 23.
Does the list of all partitions you recorded in step 11 on page 572 contain more than one entry?
Yes
No
?
Go to step 30 on page 575. 24.
Is the error class recorded in step 23 AIX?
Yes
No
?
Go to step 30 on page 575. 25.
Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 572, except the partition you were using to debug the original problem. 26.
From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27.
When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28.
Exit from diagnostics in this partition and return to the AIX prompt. 574 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
29.
Have all the partitions in the list of all partitions you recorded in step 11 on page 572 been processed?
Yes
No
?
Go to step 25 on page 574 to process the next partition in the list you recorded in step 11 on page 572. 30.
Have all the serviceable events recorded in step 4 on page 571 been processed?
Yes
No
?
Go to step 5 on page 571 and process the next service action event in the list of serviceable events recorded in step 4 on page 571. 31.
While processing all service action events, were you directed to step 14 on page 573?
Yes
No
?
If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. 32.
Perform the following steps for each entry in the list of ?Axx? partitions you began recording in step 14 on page 573, except the partition you were using to debug the original problem. 33.
From the IVM virtual terminal window of a partition in the list of ?Axx? partitions, type diag at the AIX command prompt. Chapter 6. Closing a service call 575
34.
When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35.
Exit from diagnostics in this partition and return to the AIX prompt. 36.
Have all the partitions in the list of ?Axx? partitions you began recording in step 14 on page 573 been processed?
Yes
No
?
Go to step 32 on page 575 to process the next partition in the list you recorded in step 14 on page 573. 37.
If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. 576 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Closing a service call using AIX or Linux If the server is not connected to an HMC and not using IVM, perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1.
Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
No
Yes
?
Go to step 4 on page 578 2.
Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it may be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. Do you have any FRUs (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system?
No
Yes
?
Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 3.
Is the system or logical partition that you are performing a repair action on running the AIX operating system?
Yes
No
?
Go to step 5 on page 578 Chapter 6. Closing a service call 577
4.
Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer ?NO? to this question if you have just replaced a hard disk in the root volume group.
No
Yes
?
Go to step 7 on page 579 5.
Run stand-alone diagnostics in problem determination mode from either a CD ROM or from a NIM server. Note: For instruction on running stand-alone diagnostics from a CD and not using an HMC, go to Running the eServer stand-alone diagnostics from CD on a server without an HMC attached. For instruction on running stand-alone diagnostics from a NIM server, go to Running the eServer stand-alone diagnostics from a Network Installation Management server.
Did you encounter any problems?
No
Yes
?
Go to Problem Analysis, order number SA76-0111. 6.
The system hardware is functioning correctly. If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require a reboot of the operating system.Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 578 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
7.
1. If the system supports slow boot (See “Performing a slow boot” on page 665), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped.Did the AIX Login Prompt display?
Yes
No
?
Go to Problem Analysis, order number SA76-0111. 8.
If the Resource Repair Action menu is already displayed, go to 12 on page 580; otherwise, do the following: 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue on to 9. 9.
1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select Problem determination. 5. When the ADVANCED DIAGNOSTIC SELECTION menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU.Did the RESOURCE REPAIR ACTION menu (801015) display?
No
Yes
?
Go to step 13 on page 580. 10.
Did the TESTING COMPLETE, no trouble was found menu (801010) display?
Yes
No
?
There is still a problem. Go to Problem Analysis, order number SA76-0111. Chapter 6. Closing a service call 579
11.
Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14 on page 581. 12.
When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing a FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following: 1. Select the resource that has been replaced from the RESOURCE REPAIR ACTION menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
Yes
No
?
If the No Trouble Found menu displays, go to step 14 on page 581 13.
The parent or child of the resource you just replaced may also require that you run the RESOURCE REPAIR ACTION service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing that FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. 580 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Do the following: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections.
Did the No Trouble Found menu display?
Yes
No
?
Go to Problem Analysis, order number SA76-0111. 14.
If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache.
Yes
No
?
Go to step 16 on page 582. 15.
Use the Recover Options selection to resolve the RAID configuration. To do this, do the following: 1. On the PCI SCSI Disk Array Manager screen, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration. Press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you get the Are You Sure selection menu, press Enter to continue. 8. You should get an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16 on page 582 Chapter 6. Closing a service call 581
16.
The system hardware is functioning correctly. Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned.Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 582 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Activating and deactivating LEDs Use this procedure to activate or deactivate LEDs. Choose from the following: v “Deactivate a system attention LED or partition LED using the HMC” v “Activate or deactivate identify LED using the HMC” v “Deactivate a system attention LED or partition LED using the ASMI” on page 584 v “Activate or deactivate identify LED using the ASMI” on page 584Deactivate a system attention LED or partition LED using the HMC You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED.7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED.Activate or deactivate identify LED using the HMC The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs), in the system. For this reason, they are called Identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports.
Chapter 6. Closing a service call 583
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select Identify LED. The Identify LED, Select Enclosure window opens. 6. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 7. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected ? List FRUs. 8. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off.Deactivate a system attention LED or partition LED using the ASMI The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system level and operating system error logs. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo turn off the system attention indicator, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Service Indicators. 3. Select System Attention Indicator. 4. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed.Activate or deactivate identify LED using the ASMI You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo change the current state of an indicator, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Service Indicators. 584 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Select Indicators by Location code. 4. In the right pane, enter the location code of the FRU and click Continue. 5. Select the preferred state from the list. 6. Click Save settings. Chapter 6. Closing a service call 585
586 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 7. Common Service Procedures This chapter contains procedures that may be common to various isolation and repair procedures. 1. “Powering on and powering off” on page 588Provides information for powering on and powering off the system. 2. “Determining a primary or alternative console” on page 591The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. 3. “Determining if the system has logical partitions” on page 593 4. “Separating the 571F/575B card set and moving the cache directory card” on page 594 5. “Determining which Processor is the Secondary Service Processor” on page 600Use this procedure to determine which service processor on your system is the secondary service processor. 6. “The System Reference Code (SRC) Format Description” on page 601The format of the SRC that you see depends on the system or server. 7. “Using the Service Action Log” on page 609Use this procedure to search for an entry in the Service Action Log (SAL) that matches the time, reference code, or resource of the reported problem. 8. “Service functions” on page 652Service functions provides enough detail for the service provider to gather information about hardware problems while under the direction of the next level of support. 9. “Managing dumps” on page 653After you have performed a dump, or one occurs automatically, use this information to manage the dump, including copying, reporting, and deleting it. 10. “Performing an IBM i5/OS main storage dump” on page 663A main storage dump is a collection of data from the i5/OS’s main storage. It can be manually initiated when the system administrator suspects a system wait or loop, or automatically by the server firmware as the result of a system failure. 11. “Performing a slow boot” on page 665Learn how to perform a slow boot using the HMC, control panel, or ASMI menus depending on your configuration. 12. “Determining the primary or alternate console” on page 667Use this procedure to determine the primary or alternate console. 13. “Resetting the service processor” on page 669Use this procedure to reset the service processor. 14. “Resetting the managed system connection from the HMC” on page 670Use this procedure to reset the managed system connection from the HMC. 15. “Checking for a duplicate IP address” on page 671Use this procedure to check for a duplicate IP address. 16. “Running the online and eServer stand-alone diagnostics” on page 672Use these tools to diagnose hardware problems on your AIX or Linux systems or partitions. 17. “Concurrent PCI adapter removal and replacement using AIX” on page 682 18. “Concurrent PCI adapter removal and replacement using Linux” on page 686 © Copyright IBM Corp. 2008 587
Powering on and powering off Provides information for powering on and powering off the system. Attention: If possible, shut down all applications and partitions before powering off the system. Using either the power-on button on the control panel or entering commands at the Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data files. Also, the next time you start the system, it might take longer if all applications are not ended before stopping the system. Install the power cords
Note: Before powering on a system, ensure that the power cords have been plugged in to all of the power supplies.
Power on a system using an HMC To power on a managed system, complete the following steps: 1. In the Navigation area, expand the Systems Management icon. 2. Expand the Servers icon. 3. Choose the check box next to the name of the desired server. The tasks available for that server will be enabled. 4. From the Tasks drop down menu, choose Operations -> Power on. Follow any additional on-screen instructions.Power on a system without an HMC To start a system that is not managed by an HMC, follow these steps: 1. Press the power button A, as shown in the following figure, on the control panel. Observe the following after pressing the power button: v The power-on light begins to blink visibly faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. v The power-on light on the control panel stops blinking and remains on, indicating that system power is on.2. On a rack-mounted system unit, open the front rack door, if necessary. On a deskside system unit, open the front door. 588 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. Before you press the power button on the control panel, ensure that power is connected to the system unit as follows: v All system power cables are connected to a power source. v The power-on light is slowly blinking. v The top of the display contains 01 V=F. 4. Press the power button A, as shown in the figure, on the control panel. 5. Observe the following after pressing the power button: v The power-on light begins to blink visibly faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. The power-on light on the control panel stops blinking and remains on, indicating that system power is on.Power off a system using an HMCAttention: If possible, shut down the running logical partitions on the managed system before powering off the managed system. Powering off the managed system without shutting down the logical partitions first causes the logical partitions to shut down abnormally and can cause data loss. 1. In the Navigation area, expand the Systems Management icon. 2. Expand the Servers icon. 3. Choose the check box next to the name of the desired server. A selection of tasks available for that server will be displayed. 4. From the Tasks menu, choose Operations -> Power off. Follow any additional on-screen instructions. 5. Continue with “Remove the power cords” on page 590 Figure 43. Control Panel
Chapter 7. Common Service Procedures 589
Power off a system without an HMC 1. If there is an operating system running, proceed with step 2. Otherwise, press the power button A, as shown in Figure 43 on page 589. The system power turns off. Proceed to step 4 2. Log in to the system as a user with the authority to run the shutdown command. 3. At the command line, enter one of the following commands: v If your system is running AIX, type shutdown. v If your system is running Linux, type shutdown -h now.The command stops the operating system. The system power turns off. 4. The power-on light begins to slowly blink, and the system goes into a standby state. 5. Continue with “Remove the power cords.”Remove the power cords
Unplug any power cables that are attached to the unit from electrical outlets. Remove all power cords from the server. Important: The system might be equipped with a second power supply. Before continuing with the procedure, ensure that all power sources to the system have been completely disconnected. (L003) or
590 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Determining a primary or alternative console The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. A console is a workstation that allows you to view and control system operations. The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. The system can assign up to two alternative consoles. The first alternative console can only be a twinaxial workstation that is attached to the same IOP as the primary console. The next alternative console is a workstation that is attached to the next IOA or IOP that is capable of supporting workstations. The IOA or IOP that supports a console must be on the system bus (bus 1). If a workstation is not correctly attached to the first IOA or IOP that is capable of attaching workstations, the system will not assign a primary console. If it does not assign a primary console, the system displays a reference code on the control panel. If the system is set for Manual mode, it stops during the IPL. For more information on how to determine the primary and alternative consoles, see “Identifying the consoles when the system is operational.” Primary console requirements For a workstation to be the primary console, it must be operational and attached to the system bus. It must also have the correct port and address assigned. If the workstation is a personal computer, it must also have an active workstation emulation program. The workstation requirements are: v Twinaxial workstation – Port 0, Address 0, Bus 1v ASCII workstation – Port 0, Bus 1v Personal computer attached to ASCII IOP – Port 0, Bus 1 – Personal computer software to emulate a 316x or 3151 terminalv Personal computer attached to Twinaxial IOP – Port 0, Address 0, Bus 1 – 5250 emulator software active on personal computerFor more information on how to determine the primary and alternative consoles, see “Identifying the consoles when the system is operational.” Identifying the consoles when the system is operational When the system is operational, you can determine the primary and alternative consoles by performing one of the following: v Look at the display. – If the dominant operating system is i5/OS, look for a sign-on display that shows DSP01 in the upper right corner. DSP01 is the name that the system assigns to the primary console.Note: This resource name might have been changed by the customer.
Chapter 7. Common Service Procedures 591
v Use system commands to assist in identifying the consoles. See the system operation information for more details on commands. v Use the Hardware Service Manager function to assist in identifying the consoles: 1. Select the System bus resources option on the Hardware Service Manager display. The System Bus Resources display allows you to view the logical hardware resources for the system bus. Look for a (<) symbol next to an IOP. The (<) symbol indicates that the console attaches to this IOP. 2. Select the Resources associated with IOP and the Display detail options to collect more information about the consoles. 592 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Determining if the system has logical partitions This procedure will help you determine if the system has logical partitions. 1. Is the system managed by an HMC? Yes: Continue with the next step. No: The system does not have logical partitions. You must have an HMC to have logical partitions. Return to the procedure that sent you here. This ends the procedure.2. Determine if the system has multiple logical partitions by performing the following from the HMC: a. Expand the Systems Management folder in the Navigation area. b. Expand the Servers folder. c. Click the name of the managed system you are working with. The partitions will be displayed.Is there more than one logical partition listed? No: The system does not have multiple logical partitions. Return to the procedure that sent you here. This ends the procedure. Yes: The system has multiple logical partitions. Return to the procedure that sent you here. This ends the procedure. Chapter 7. Common Service Procedures 593
Separating the 571F/575B card set and moving the cache directory card To complete this procedure, you will need a T-10 TORX bit or driver. Attention: To avoid loss of cache data, you must not remove the cache battery during this procedure. To separate the 571F/575B card set and move the cache directory card, do the following:Important: All cards are sensitive to electrostatic discharge. 1. Label both sides of the card before separating. 2. Go to the next step if you are not servicing a 571F/575B card set containing a light pipe assembly. If you are servicing a 571F/575B card set that contains a light pipe assembly, you will need to remove the light pipes. To remove the light pipes, do the following: a. Place the 571F/575B card set adapter on an ESD protective surface. b. Remove the light pipe retaining screw C from the 571F/575B card set. c. Slide the light pipe assembly D from between the 571F/575B cards. d. Put both the screw and the light pipe assembly in a safe place. 3. Place the 571F/575B card set adapter on an ESD protective surface and orient it as shown. 594 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
4.
Disconnect battery cable A and SCSI cable B from the 571F storage adapter. Leave the other end of the cables attached to the 575B auxiliary cache adapter. 5. To prevent possible card damage, first loosen all 5 retaining screws C BEFORE removing them. After all five retaining screws have been loosened, remove the screws C from the 571F storage adapter. Chapter 7. Common Service Procedures 595
6.
Carefully lift the 571F storage adapter off the standoffs and set it on the ESD protective surface. 7. Turn the 571F storage adapter over so the components are facing up and locate the cache directory card D on the 571F storage adapter. It is the small rectangular card mounted on the I/O card. 596 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
8.
Unseat the connector on the cache directory card by wiggling the two corners that are farthest from the mounting pegs. To disengage the mounting pegs, pivot the cache directory card back over the mounting peg. 9. Move the cache directory card to the replacement 571F storage adapter and seat it on the connector and mounting pegs. 10. To reassemble the cards do the above procedure in reverse order. To prevent possible card damage, insert all 5 screws C before tightening any of them. Note: If you are installing the 571F/575B card set adapter into a cassette, do the following: a. Remove the adapter handle B as shown in the following illustration. b. If applicable, remove the light-pipe assembly. c. Remove the two plastic covers from the SCSI jumper cable A as shown in the following figure: Chapter 7. Common Service Procedures 597
d.
Ensure that battery cable A and SCSI cable B are reconnected to the 575B auxiliary cache adapter. e. Go to Place an adapter in the PCI adapter double-wide cassette to complete the installation.11. If applicable, to replace the light pipes, do the following: a. Ensure that the 571F/575B cards are correctly assembled. b. If necessary, remove the 571F/575B card handle A from the back of the card set.Tip: If you misplace the light-pipe retaining screw C, use either of the screws that secure the card handle B to the card. c. Slide the light pipe assembly D between the 571F/575B cards. Ensure that the threaded screw hole located on the light pipe assembly is aligned with the screw clearance hole located on the card .
598 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
12.
Return to the procedure that sent you here. This ends this procedure. Chapter 7. Common Service Procedures 599
Determining which Processor is the Secondary Service Processor Use this procedure to determine which service processor on your system is the secondary service processor. 1. Locate the HMC1 and HMC2 port pairs for service processor zero and for service processor one. (See Chapter 3, “Locating FRUs,” on page 335 for your specific model.) 2. Is only one service processor with cables attached to its HMC ports? No: Continue to the next step. Yes: The secondary service processor is the service processor with no cables attached to the HMC ports. This ends the procedure.3. Use the HMC to access Advanced System Management Interface (ASMI) to determine which service processor is the secondary service processor. First, verify ASMI connectivity by logging onto ASMI. Then pull all HMC cables from HMC ports for service processor zero (leaving service processor one HMC cables attached). Log onto ASMI again. Were you successfully able to log onto ASMI? No: The service processor with HMC cables still attached is the secondary service processor. Reattach the HMC cable(s) to service processor zero. This ends the procedure. Yes: The cables just removed identify the secondary service processor. Reattach the HMC cables to the secondary service processor. This ends the procedure. 600 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
The System Reference Code (SRC) Format Description The format of the SRC that you see depends on the system or server. “System Reference Code (SRC) Information”Helps you understand how SRCs work. “System reference code (SRC) address formats”Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. “Hardware SRC formats” on page 605SRC format examples.System Reference Code (SRC) Information Helps you understand how SRCs work. Use this information to understand how SRCs work. The system reference code (SRC) is a sequence of data words (codes) that: v Identifies a system status v Describes a detected hardware, Licensed Internal Code (LIC), or software failure v Describes the unit that is reporting the failure and its locationSRCs can display on the control panel, appear in a system console message, or appear in the product activity log. The system uses the following procedure to gather information about errors: 1. The system gathers information about an error as the system detects the error. 2. The Product Activity Log (PAL) records and stores the error information. 3. An SRC encodes information used to evaluate or identify a system-detected hardware or software error, failure, or status. The failure information may include the failing condition or part (or unit) that can be exchanged or replaced and its location. 4. The SRC links to the problem determination procedure when the element detecting the failure cannot isolate the failing condition. 5. The SRC link completes the isolation process.In summary, the SRC provides the first-failure-data-capture information. The customer or support representative uses SRCs during problem determination, repair actions, and verification. The SRC provides a link to the problem determination procedures (hardcopy or softcopy). Use the SRC and the problem determination procedures to isolate the problem. System reference code (SRC) address formats Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. 1. Was the SRC and addressing information obtained either from a control panel display, the Main Storage Dump Manager display, or the HMC Partition Properties Reference Code words display? v No: Continue with the next step. v No: Contact your next level of support. This ends the procedure. v Yes: Addressing information is in the Bus-Bus-Bus-Bus/Card-card/board-board (BBBBCcbb) format as follows: Chapter 7. Common Service Procedures 601
– If characters 7 and 8 of the top 16-character line of function 12 (word two) are 60: - The direct select address is characters 1-8 of the bottom 16-character line of function 12 (word four) in the format ?BBBBCcbb?. - The device address is characters 9-16 of the bottom 16-character line of function 12 (word five) in the format ?abcd efgh?.– If characters 7 and 8 of the top 16-character line of function 12 (word two) are 13, 17, 27, or 29: - The direct select address is characters 9-16 of the top 16-character line of function 12 (word three) in the format ?BBBBCcbb?. - The device address is characters 1-8 of the bottom 16-character line of function 12 (word four) in the format ?abcd efgh?.– If characters 7 and 8 of the top 16-character line of function 12 (word two) are 62: - The direct select address may be characters 9-16 of the top 16-character line of function 13 (word seven) in the format ?BBBBCcbb?. - The device address may be characters 1-8 of the top 16-character line of function 13 (word six) in the format ?abcd efgh?.Go to step 9 on page 604. 2. Are you using the HMC? No: Continue with the next step. Yes: Open the server with the problem, then go to the i5/OS partition with the problem and use the Service Action Log (SAL) to find the entry related to this problem (see “Using the Service Action Log” on page 609), and then go to step 7 on page 603. 3. Are you using the Service Action Log (SAL)? No: Continue with the next step. Yes: Go to step 7 on page 603. 4. Are you using i5/OS communication-diagnostic panels that are displayed by the commands VFYCMN, WRKPRB, or ANZPRB? Yes: Go to step 11 on page 605. No: Continue with the next step. 5. Addressing information is in the Bus/Board/Card format. Note: This format shows the system bus number in decimal form. If no address information is available in the console error message, check the Product Activity Log (PAL) entry that is associated with the console error message. Below is an example of a PAL display for a storage device. Other devices (for example, workstations or communications) have similar displays. For more information about the PAL, see “Using the product activity log” on page 617 in the “Service functions” on page 652. +--------------------------------------------------------------------------------+ |Display Address Information for Resource | | | | Serial Resource | |Name Type Model Number Name | |STORAGE 6718 050 83-00834 DD001 | | | |Physical Location: | | U7879.001.10ABCDE-P3-D3 | | | | | | | |Logical Address: | |PCI Bus: Storage: |Figure 44. Product Activity Log display example
602 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
|System Bus . . . . . . : 3 I/O Adapter . . . . . . : 2 |
|System Board . . . . . : 0 I/O Bus . . . . . . . . : 1 |
|System Card . . . . . . : 32 Controller. . . . . . . : 4 |
| Device . . . . . . . . : 0 |
| | +--------------------------------------------------------------------------------+
In the above example, the contents of ?Storage? are defined as follows.
Table 49. Storage definitions
Reference
Description
I/O Bus
This is the bus to which the device is connected.
I/O Adapter
This is the adapter that the I/O bus connects to on the I/O processor. Note: For SAS I/O adapters, this field is not applicable.
Controller
This is the logical controller for the addressed device.
Device
This is the logical device that is attached to the logical device controller. 6. Use the figure below to understand how addresses in the Bus/Board/Card format relate to the Bus-Bus-Bus-Bus/Card-Card/Board-Board format. You can also use it to understand how addresses in the Bus/Board/Card format relate to the direct select addresses used in Chapter 3, “Locating FRUs,” on page 335. Note: When converting from Bus/Board/Card format to Bus-Bus-Bus-Bus/Card-Card/Board-Board format, the numbers must be converted from Decimal to Hexadecimal. Go to step 10 on page 604 for IOA and device address information. 7. Display the FRU details for the problem that you are attempting to fix. The location information in the field-replacable unit detail is only available for IOPs, IOAs, and devices. Is the FRU an IOP, IOA, or device, and is there location information in the Location field of the display? No: Continue with the next step. Yes: Use the location information and return to the procedure that sent you here. This ends the procedure. 8. There is no location information in the Location field. Are there additional words displayed at the top of the screen? Figure 45. Format conversion
Chapter 7. Common Service Procedures 603
v No: Go to “Hexadecimal Product Activity Log data” on page 617 in “Service functions” on page 652 to obtain the address information. Then return here and continue with the next step. v Yes: Use the additional words to find the address information: – If characters 7 and 8 of word two are 60: - The direct select address is word four in the format ?BBBBCcbb?. - The device address is word five in the format ?abcd efgh?.– If characters 7 and 8 of word two are 13, 17, 27, or 29: - The direct select address is word three in the format ?BBBBCcbb?. - The device address is word four in the format ?abcd efgh?.– If characters 7 and 8 of word two are 62: - The direct select address may be word seven in the format ?BBBBCcbb?. - The device address may be word six in the format ?abcd efgh?. Continue with the next step. 9. The BBBBCcbb format is the hexadecimal direct select address of a card location in the system unit. The format definition is: v BBBB—System Bus v Cc—System Card v bb—System Board Continue with the next step for IOA and device address information. 10. Unit (Device) Address Definitions are explained in an ?abcd efgh? format. For example, if the unit address is 0001 00FF, the address relates to the ?abcd efgh? format as follows: The characters ?abcd efgh? can have different meanings, depending on the subsystem that reports the problem. However, they define the information that is contained in the unit (device) address as follows.
Table 50. Character definitions
SCSI attached disk, tape, or optical
a
I/O adapter
b
I/O bus
cd
Controller
ef
Device
gh
FF
SAS attached disk, tape, or optical
Not Applicable
I/O adapter
ab
I/O bus
cd
Controller
ef
Device
gh
FF
Communications
a
I/O bus
b
Adapter
0 0 0 1 0 0 F F
(a b c d e f g h) Figure 46. Example of Unit (Device) Address definitions
604 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 50. Character definitions (continued)
SCSI attached disk, tape, or optical
c
0
d
Port
e
0
f
Channel
GH
FF
Workstation
a
I/O Bus
b
Adapter
cd
Port
e
0
f
Device
g
0
h
Session 11. i5/OS communication-diagnostic panels that are displayed by the commands VFYCMN, WRKPRB, and ANZPRB use the following address format:
x1 x2 x3x4
Table 51. Decimal address values
Value
Description
x1 = System bus
The one, two, or three digit decimal representation of the hexadecimal Bus value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603.
x2 = System board
The one or two digit decimal representation of the hexadecimal Board value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603.
x3 = System card
The one or two digit decimal representation of the hexadecimal Card value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603.
x4 = Adapter location
The one or two digit decimal representation of the hexadecimal b Adapter value of the Communications unit address value in step 9 on page 604 (See step 10 on page 604). 12. Return to the procedure that sent you here. This ends the procedure.Note: See “Hardware SRC formats” for more formatting information.Hardware SRC formats SRC format examples. Notes: 1. Other formats exist; if necessary contact your next level of support. 2. For more information on formats 60–63, see LIC-Detected Problem Reference Code Formats.
Chapter 7. Common Service Procedures 605
Table 52. SRC format examples
SRC Word Number
Panel function, digits x to y (from left to right)
IOP SRC format 13
IOP SRC format 17
Device SRC format 27
IOP detected a failure format 29
LIC SRC Format 60
LIC SRC format 61
LIC SRC format 62
LIC SRC format 63
1
11, 1-8
TTTT RRRR
TTTT RRRR
tttt rrrr
B0XX RRRR
A6xx RRRR
B6xx RRRR
B6xx RRrr or B7xx Rrrr
A600 RRRR
2
12, 1-8
MIGV EP13
MIGV EP17
MIGV EP27
MIGV EP29
MIGV EP60
MIGV EP61
MIGV EP62
MIGV EP63
3
12, 9-16
BBBB Ccbb
BBBB Ccbb
BBBB Ccbb
BBBB Ccbb
PPPP 0000
cccc cccc
cccc cccc
AAAA BBBB
4
12, 17-24
aaaa aaaa
aaaa aaaa
aaaa aaaa
SAPP UUFF
BBBB Ccbb
pppp pppp
pppp pppp
CCCC DDDD
5
12, 25-32
TTTT LMMM
TTTT LMMM
tttt lmmm
TTTT FMMV
aaaa aaaa
qqqq qqqq
qqqq qqqq
EEEE FFFF
6
13, 1-8
uuuu uuuu
tttt lmmm
TTTT LMMM
tttt lmmm
tttt mmmm
qqqq qqqq
qqqq qqqq
GGGG HHHH
7
13, 9-16
uuuu uuuu
ZZZZ rrrr
ZZZZ RRRR
zzzz rrrr
ssss ssss
0000 0000
BBBB Ccbb
JJJJ KKKK
8
13, 17-24
uuuu uuuu
ssss ssss
ssss ssss
0sss ssss
NNNN 0000
0000 0000
TTTT MMMM
TTTT 0MMM
9
13, 25-32
uuuu uuuu
uuuu uuuu
uuuu uuuu
uuuu uuuu
0000 0000
0000 0000
0000 0000
SSSS SSSS
Table 53. SRC format examples (continued)
SRC format
Description
AAAA
Type of I/O Bus (see “Logical address format” on page 607 — field ’A’).
aaaa
Unit address.
BBBB
System Bus Number (see “Logical address format” on page 607 — field ’B’).
BBBBCcbb
Bus, bus, bus, bus, card, card, and board, board address (Direct Select Address or DSA). v For PCI system buses 0001-0003 and 0018-FFFF the Cc field is a one digit multi-adapter bridge number and a one digit multi-adapter bridge function number. v For SPD system buses 0004-0017 the Cc field is a two digit card number.
CCCC
System Board Number (see “Logical address format” on page 607 — field ’C’).
cccc cccc
Component reference code. Format 62 varies by component.
DDDD
System Card Number (see “Logical address format” on page 607 — field ’D’).
EEEE
(see “Logical address format” on page 607 — Unit Address type — field ’E’).
FFFF
(see “Logical address format” on page 607 — Unit Address type — field ’F’).
GGGG
(see “Logical address format” on page 607 — Unit Address type — field ’G’).
HHHH
(see “Logical address format” on page 607 — Unit Address type — field ’H’).
JJJJ
(see “Logical address format” on page 607 — Unit Address type — field ’J’).
KKKK
(see “Logical address format” on page 607 — Unit Address type — field ’K’).
LMMM
Level indicator plus 3? digit model number (for example, 0001). 606 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 53. SRC format examples (continued) (continued)
SRC format
Description
lmmm
Level indicator plus 3? digit model number of failing device.
0MMM
Model number of failing device (i.e. MMM=’001’).
MIGV EP
General system status.
NNNN
Number of disk device facilities that are missing.
PPPP
Cause code.
pppp pppp
Programming reference code.
qqqq qqqq
Programming reference code qualifier.
RRRR
Unit reference code (URC).
rrrr
Outboard failing unit reference code (URC).
SAPP UUFF
Unit address.
SSSSSSSS
System Reference Code — this SRC is used to determine why the primary console failed to respond.
ssss
Serial number of failing unit.
TTTT
Type number or card identification number (hex).
tttt
Outboard failing unit type number (i.e. 6607).
uuuu
Unit-specific data.
ww
SRC type.
ZZZZ
Reserved. “Logical address format”Use the logical address to identify the resource entry.Logical address format Use the logical address to identify the resource entry. Use the logical address to identify the resource entry. To sort by logical address, select the Analyze log option on the Product Activity Log display. Then select the F9 key (Sort by...) and sort by logical address. For more information on the address, use the address information function. The logical address format, A/B/C/D-E/F/G/H/J/K, has the following definitions: A Type of I/O bus (transport) B System bus number C System board number (for busses that connect card enclosures) D System card (bus unit) number - Separates the bus address from the unit address E Unit address type FGHJK Unit address dataThe values of F, G, H, J, and K vary, depending on the unit address type (E). Use the following information to determine the Unit address data (FGHJK) representation. Chapter 7. Common Service Procedures 607
Table 54. Unit address (E) definitions
Unit Address Value
Unit Address Definition
Unit Address Data
E
F
G
H
J
K
0
Reserved
-
-
-
-
-
1
Communications
I/O bus
Adapter
Port
Channel
-
2
Storage
I/O adapter
I/O bus
Controller
Device
-
3
Workstation
I/O bus
Adapter
Port
Device
Session
4
Auxiliary Processor
I/O bus
Auxiliary Processor
Adapter
Port
-
5
Library
I/O adapter
I/O bus
Library
Controller
Device
6
Cryptography
I/O bus
Adapter
Device
-
- The Logical address field contains the bus address and the unit address. The bus address describes the hardware bus, board, and card information. The unit address describes the subsystem and identifying data (see Table 54). Note: To locate a device or card that is within the system unit, use the physical address information. 608 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Using the Service Action Log Use this procedure to search for an entry in the Service Action Log (SAL) that matches the time, reference code, or resource of the reported problem. 1. On the command line, enter the Start System Service Tools command (STRSST). If you cannot get to SST, use function 21 to get to DST. 2. On the Start Service Tools Sign On display, type in a user ID with QSRV authority and password. 3. Select Start a Service Tool ? Hardware Service Manager ? Work with service action log. 4. On the Select Timeframe display, change the From: Date and Time to a date and time prior to when the customer reported having the problem. 5. Search for an entry that matches one or more conditions of the problem: v Reference code v Resource v Time v Failing item list 6. Perform the following: v Choose Display the failing item information to display the SAL entry. v Use the Display details option to display part location information. All new entries in the SAL represent problems that require a service action. It may be necessary to handle any problem in the log even if it does not match the original problem symptom. The information displayed in the date and time fields are the date and time for the first occurrence of the specific reference code for the resource displayed during the time range selected. 7. Did you find an entry in the SAL? v Yes: Continue with the next step. v No: Go to “Problems with noncritical resources” on page 610. This ends the procedure. 8. Does ?See the service information system reference code tables for further problem isolation? appear near the top of the display or are there procedures in the FRU list? v Yes: Perform the following steps: a. Go to the list of reference codes and use the reference code that is indicated in the log to find the correct reference code table and unit reference code. b. Perform all actions in the Description/Action column before exchanging failing items. Note: When exchanging failing items, the part numbers and locations found in the SAL entry should be used.This ends the procedure.v No: Display the failing item information for the SAL entry. Items at the top of the failing item list are more likely to fix the problem than items at the bottom of the list. Continue with the next step. Notes: a. Some failing items are required to be exchanged in groups until the problem is solved. b. Other failing items are flagged as mandatory exchange and must be exchanged before the service action is complete, even if the problem appears to have been repaired. c. Use the ?Part Action Code? field in the SAL display to determine if failing items are to be replaced in groups or as mandatory exchanges. d. Unless the ?Part Action Code? of a FRU indicates group or mandatory exchange, exchange the failing items one at a time until the problem is repaired. Use the help function to determine the meaning of Part Action Codes. 9. Perform the following steps to help resolve the problem: Chapter 7. Common Service Procedures 609
a. To display location information, choose the function key for Additional details. If location information is available, go to Chapter 3, “Locating FRUs,” on page 335 for the model you are working on to determine what removal and replacement procedure to perform. To turn on the failing item’s identify light, use the ?indicator on? option.Note: In some cases where the failing item does not contain a physical identify light, a higher level identify light will be activated (for example, the planar or unit containing the failing item). The location information should then be used to locate the actual failing item. b. If the failing item is Licensed Internal Code, contact your next level of support for the correct PTF to apply.10. After exchanging an item, perform the following: a. Go to Chapter 5, “Verifying a repair,” on page 557 and return here. b. If the failing item indicator was turned on during the removal and replacement procedure, use the ?indicator off? option to turn off the indicator. c. If all problems have been resolved for the partition, use the ?Acknowledge all errors? function at the bottom of the SAL display. d. Close the log entry by selecting Close a NEW entry on the SAL Report display. This ends the procedure. “Problems with noncritical resources”Use this procedure to help you determine the cause of problems with noncritical resources. “System Service Tools (SST)” on page 611 “Using the product activity log” on page 617This procedure can help you learn how to use the Product Activity Log (PAL). “Hexadecimal Product Activity Log data” on page 617Use the Display Hexadecimal Report Function or follow the steps below to display hexadecimal error log dump data. “More information from hexadecimal reports” on page 620You can get additional information for IOP 90xx reference code product activity log entries at DST by using a D-IPL. “Reclaiming IOP cache storage” on page 636 “Working with Storage Dumps” on page 636 “Dedicated Service Tools (DST)” on page 637 “Printing the System Configuration List” on page 643 “Hardware Service Manager” on page 644 “Managing the Advanced System Management Interface (ASMI)” on page 645Understand how to manage the Advanced System Management Interface (ASMI).Problems with noncritical resources Use this procedure to help you determine the cause of problems with noncritical resources. 1. Is there an SRC in an 8-character format available on the Problem Summary form? Note: If the operator has not filled out the problem summary form, go to the problem reporting procedure for the operating system in use. No: Continue with the next step. Yes: Perform problem analysis using the SRC.This ends the procedure.2. Does the problem involve a workstation resource? v No: Continue with the next step. v Yes: Perform the following steps: – Check that the workstation is operational. 610 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
– Verify that the cabling and addressing for the workstation is correct. – Perform any actions indicated in the system operator message. If you need further assistance, contact your next level of support. This ends the procedure.3. Does the problem involve a removable media resource? No: Continue with the next step. Yes: Go to “Using the product activity log” on page 617 to resolve the problem. This ends the procedure.4. Does the problem involve a communications resource? v No: Contact your next level of support. This ends the procedure. v Yes: Are there any system operator messages that indicate a communications-related problem has occurred? – No: Contact your next level of support. This ends the procedure. – Yes: Perform any actions indicated in the system operator message. If you need further assistance, contact your next level of support. This ends the procedure.System Service Tools (SST) The system service tools (SST) run one or more Licensed Internal Code (LIC) or hardware service functions under the control of the operating system. SST allows you to perform service functions concurrently with the customer’s application programs. The printer that is used for output must be an SCS-type data stream printer that is attached to the same workstation input-output processor (IOP) or storage media unit. “Work with communications trace”The Work with communications trace option is available only under System Service Tools (SST). Select this option from the Start a Service Tool display.Work with communications trace The Work with communications trace option is available only under System Service Tools (SST). Select this option from the Start a Service Tool display. This option allows you to start or stop a communications line trace on a configuration object. Once you run the trace, the data can be formatted. You can view the formatted data by printing it. You should use communications trace for the following situations: v To isolate errors that you cannot isolate using the communications verification procedure v To collect more data when the problem analysis procedures do not provide enough information on the problem v If you suspect a communications protocol violation problem or some other line problem v If you suspect line noise v When the error messages indicate that there is an SNA BIND problemRunning and interpreting the communications trace requires detailed knowledge of communications protocols. To obtain the most accurate sample of your line status, whenever possible, start the communications trace before varying on the lines . Note: Online help from the Work with Communications Trace display allows you to view a list of protocols.
Chapter 7. Common Service Procedures 611
Options and Function Keys The Work with Communications Traces display has the following options and function keys: v Start trace (F6) This function key allows you to start tracing the data on a communications configuration object. The Start Trace display appears after you press this function key. For more information, see “Starting a communications trace” on page 614. v Stop trace (option 2) This option appears only on the Work with Communications Traces display. It allows you to end the trace and stop collecting data. A trace must be stopped before you can format, print, or delete the data. To ensure that the trace is stopped, press the Refresh function key and check the trace status field. v Delete trace (option 4) The trace must be stopped (option 2) before you can use this option. Select this option to delete the trace information when you no longer need the data. Traces are not automatically deleted when you leave a communications trace function. Delete trace releases system space so that you can start other traces without having to increase the maximum amount of storage provided for the communications trace. Traces that are not deleted are listed on the Work with Communications Traces display. v Format and print trace (option 6) Use this option to format and print the trace. For more information on the Format and print trace option, see page “Format and print trace” on page 615. v Display message (option 7) Use this option to view a message that indicates the status of the trace that stopped because of an error. v Restart trace (option 8) Use this option to start a trace that has been stopped. v Change size (F10) Use this function key to change the amount of storage that all traces use. You might want to increase the amount of storage, if your system has many active traces or traces that will be active for a long time. v Display buffer size or display trace status (F11) This function key allows you to select and alternate between buffer size and trace status. v Refresh (F5) This function key updates the data on the display.Status conditions After the Work with communications trace option is selected from the Start a Service Tool display, the Work with Communications Traces display appears. 612 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Details of the trace, including status, are displayed. The trace status can be one of the following: Condition Description Waiting The trace is waiting for the configuration object to be varied on (not collecting data). Active The trace data is being collected. Stopping The trace is stopping. Stopped The trace has stopped (not collecting data). Error An error occurred on the configuration object while the trace was collecting data. The data might or might not be collected. The trace stopped. Starting The trace is being started by another user. You cannot stop or delete this trace. Formatting Trace is being formatted. Work with Communications Traces Type choice, press Enter. 2=Stop trace 4=Delete trace 6=Format and print trace 7=Display message 8=Restart trace Configuration Opt Object Type Trace Description Protocol Trace Status _ LosAngeles Line Test LosAngeles Line SDLC Active _ Mpls NWI Test Mpls ISDN Stopped _ Tucson Line Test Tucson Line ASYNC Waiting F3=Exit F5=Refresh F6=Start trace F10=Change size F11=Display buffer size F12=Cancel Figure 47. Example Work with Communications Trace display
Chapter 7. Common Service Procedures 613
Starting a communications trace The following is an example of a Start Trace display: Start Trace Configuration object . . . . . . . . . _________ Type . . . . . . . . . . . . . . . . . 1 1=Line 2=Network interface 3=Network server Trace description . . . . . . . . . . ____________________ Buffer size (in kilobytes) . . . . . . 1 1=128, 2=256, 3=2M, 4=4M, 5=6M, 6=8M, 7=16M, 8=32M, 9=64M Stop on buffer full . . . . . . . . . N Y=Yes, N=No Data direction . . . . . . . . . . . 3 1=Sent, 2=Received, 3=Both Number of bytes to trace Beginning bytes . . . . . . . . . . *CALC VALUE, *CALC Ending bytes . . . . . . . . . . . *CALC VALUE, *CALC Type choices, press Enter. F3=Exit F5=Refresh F12=Cancel Enter the name of a communications configuration object description in the Configuration object prompt. Describe the trace in the Trace description field. This field can help you identify the trace. You can select a buffer size to hold the communications data that the trace collects. The default buffer size is 1=128K bytes. Base the size of the buffer on the speed of the communications line and on the amount of time necessary to trace the data. For high speed lines or long periods of tracing, a larger buffer size is recommended. If you specify Yes in the field, the trace stops when the buffer is full. This option is useful for viewing the initial data that is coming across a line. If you specify No, the trace continues until you stop it. In this case, the data in the buffer will be written over each time the buffer is full. You can select the direction of data to be traced by specifying one of the three selections in the Data direction field: v Only data that the system is sending (option 1) v Only data that the system is receiving (option 2) v Both the data sent and received by the system (option 3)Note: If you specify option 1 (Sent) for lines that are in SDLC short hold mode, the trace does not include controller names. You can select how much data is traced in a frame of data. The value that is entered is the amount that is saved as part of the trace. The minimum value that is allowed for both the beginning and the end value is 36 bytes. Those 36-byte minimum values include the protocol header. The configuration object you are tracing determines the maximum value that is allowed. Notes: 1. The BSC protocol ignores the beginning and ending byte values. 2. The SDLC, high-level data link control (HDLC), X.25, frame relay, Ethernet, distributed data interface (DDI), ATM, and token-ring protocols ignore the ending byte value. 614 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
3. The minimum value that is allowed for the beginning byte value for local area network (LAN) protocols (including frame relay protocols and ATM protocols) is 72 bytes.When all required options are complete, press the Enter key. The Work with Communications Trace display appears. If the communications object being traced is in varied off status when you start the trace, the trace remains in waiting status until you vary the communications object on. If the object is varied on, the trace will immediately go to the active state, and the system will begin to collect trace data. The trace remains active until one of the following occurs: v You select the option to stop the trace on the Work with Communications Traces display. v You vary off the configuration object being traced. v The configuration object being traced has an error. v The trace buffer is full, and the option to stop on buffer full was selected.Note: You can exit the communications trace function to do other work while the trace is active. Traces are not automatically deleted when you exit a communications trace function. To return to the communications trace: 1. Select the Start a service tool option from the service tools (SST or DST) display. 2. Select the Work with communications traces option. 3. From the Work with Communications Traces display, you can check the status of the trace or select to stop the trace.Format and print trace You must select the Stop Trace option before you format or print the trace. The Format and print trace option allows you to select various formatting options and prepare the trace data for printing. The options vary for each protocol. The following is an example of the Format Trace Data display: Notes: Format Trace Data Configuration object . . . . . . TRNLINE Type . . . . . . . . . . . . . . LINE Type choices, press Enter. Controller . . . . . . . . . . *ALL *ALL, name Data representation . . . . . 3 1=ASCII, 2=EBCDIC, 3=*CALC Format RR, RNR commands . . . N Y=Yes, N=No Format Broadcast data . . . . Y Y=Yes, N=No Format MAC or SMT data only. . N Y=Yes, N=No Format UI data only . . . . . N Y=Yes, N=No Format TCP/IP data only . . . N Y=Yes, N=No F3=Exit F5=Refresh F12=Cancel Figure 48. Example Format Trace Data Display
Chapter 7. Common Service Procedures 615
1. To view all the data associated with the trace, press the Enter key without changing to the defaults on the Format a Trace Data display . The information is not in any special format. 2. Select the option to format and print only the data that you want to see. 3. All options available for formatting depend on the protocol of the line that is being traced. Only valid choices appear for each protocol. 4. Not all combinations of options are valid for all protocols. Error messages appear, if you select an option that is not valid, . If an error message appears, you must change your selections. Use the F5 (Refresh) function key to restore all options to their default values. 5. For more information about these options, use online help .When the format of the trace data is complete, the output can be printed on the console printer (the printer that is attached to the same I/O processor as the console). Only SCS-type data streams are supported. The format of the trace data is: v An introduction page that contains, for example, configuration object, type, protocol, start and stop dates and times, trace options, and formatting options. v A help page, to assist in understanding the output. Help information for a specific protocol is given because the trace data differs for each communications protocol. v The formatted appears as follows: The width of the file is 132 characters. The data is in hexadecimal representation and either American National Standard Code for Information Interchange (ASCII) or EBCDIC character representation. The columns of the trace output common to all protocols are: Record number The number of the trace record. Shows if the record type is sent (S) or received (R). Notes: 1. The letter C in this column indicates that an X.21 short hold mode connection was cleared. 2. If an M appears, a modem change has occurred.Data length The amount of data, in decimal, that the record contains. Record status The protocol-dependent return code for the trace record. 00000000 is successful; no errors were found. Other return codes are listed in the functional specification for the protocol that is running or the port manager. Record Data Record Record Data Controller Number Number Poll/ Number S/R Length Status Timer Type Name/Number Command Sent Received Final ------ --- ------ -------- --------------- ------ ------------- ------- ------ -------- ----- 7 R 69 00000000 12:29:56.72963 EBCDIC ZSDLLC30 /01 XID ON Data . . . . . : 3245056150080000 0084C00000000000 01010B0000010900 00000007000E0DF4 *.&;...D{....................4* DADBCC4BDACBCDCE EFCGC5F31017F116 1101130011F9F4F0 F6F5F0F0F1F0F1F0 *ABC.ABCDEFG..1......94065001010* C4C6C44040 *DFD * 8 S 0 00000000 12:29:56.76081 EBCDIC ABCLLC30 /01 SNRM ON 9 R 0 00000000 12:29:56.78450 EBCDIC ABCLLC30 /01 UA ON 52 S 110 00000000 12:29:57.76210 EBCDIC ABCLLC30 /01 I 0 0 ON Data . . . . . : 2D00010200006B81 0031001307B0B051 3107878686870706 0200000000000000 *......,A..........GFFG..........* 944300000CDADBCC 4BEAEBCCD3D3C4F3 F02E000902E2D5C1 E2E5C3D4C7090300 *M....ABC.AABCLLD30....SNASVCMG...* 00BCF9CAD600000D 04DADBCC4BEAEBCC D3D3C4F3F00A1300 7B00BCF9CAD60000 *..9.O....ABC.ABCLLD30...#..9.O..* 000CDADBCC4BEAEB CCD3D3C4F4F0 *..ABC.ABCLLD40 * 53 R 110 00000000 12:29:57.80065 EBCDIC ABCLLC30 /01 I 0 1 ON Data . . . . . : 2F00010200006B81 0031001307B0B051 3107878686870706 0200000000000000 *......,A..........GFFG..........* 944300000CDADBCC 4BEAEBCCD3D3C4F4 F02E000902E2D5C1 E2E5C3D4C7090300 *M....ABC.ABCLLD40....SNASVCMG...* 00BCF9D56600000D 04DADBCC4BEAEBCC D3D3C4F4F00A1300 7B00BCF9D5660000 *..9N.....ABC.ABCLLD40...#..9N...* 000CDADBCC4BEAEB CCD3D3C4F3F0 *..ABC.ABCLLD30 * * * * * * E N D O F E X A M P L E P R I N T O U T * * * * * Figure 49. Example Formatted Trace Data Output for SDLC
616 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Record timer The time that each event occurs. Depending on the communications hardware that is being used, the record timer will be either: 1. A time of day value, HH:MM:SS.NNNNN (where H=hours, M=minutes, S=seconds, and N=subseconds), based on the system time when the trace was stopped. 2. A relative time in decimal seconds. This timer value provides the relative time between events.Data type Shows whether the traced data is printed in ASCII or EBCDIC character representation. If the character representation of the data is mostly periods, you might want to format the data again using the other option for data representation. Controller name/number Indicates which controller originated the frame or record. In some conditions, this data is not available, and the column remains blank.Note: The formatted trace output is not security protected. Customer passwords are shown going across the line. Using the product activity log This procedure can help you learn how to use the Product Activity Log (PAL). 1. To locate a problem, find an entry in the product activity log for the symptom you are seeing. a. On the command line, enter the Start System Service Tools command: STRSST If you cannot get to SST, select DST. Note: Do not IPL the system or partition to get to DST. b. On the Start Service Tools Sign On display, type in a User ID with service authority and password. c. From the System Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. d. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. e. Use the defaults on the Select Analysis Report Options display by pressing the Enter key. f. Search the entries on the Log Analysis Report display. Note: For example, a 6380 Tape Unit error would be identified as follows: System Reference Code: 6380CC5F Class: Perm Resource Name: TAP012. Find an SRC from the product activity log that best matches the time and type of the problem the customer reported. Did you find an SRC that matches the time and type of problem the customer reported? Yes: Use the SRC information to correct the problem. This ends the procedure. No: Contact your next level of support. This ends the procedure.Hexadecimal Product Activity Log data Use the Display Hexadecimal Report Function or follow the steps below to display hexadecimal error log dump data. Chapter 7. Common Service Procedures 617
Perform the following: 1. Find an entry in the Product Activity Log for the symptom (SRC) you are using to locate the problem. a. Select System Service Tools (SST). If you cannot get to SST, select DST.Note: Do not IPL the system to get to DST. b. On the Service Tools display, select the Start a service tool option. c. Select the Product activity log option on the Start a Service Tool display. d. Select the Analyze log option on the Product Activity Log display. e. On the Select Subsystem Data display, select the option to view All Logs. Note: If the SRC you are using occurred more than 24 hours ago, change the “From:” and “To:” Dates and Times from the 24-hour default. f. Use the defaults on the Select Analysis Report Options display by pressing the Enter key. g. Find the entry in the Product Activity Log that matches that SRC.2. Select the Display Detail Report for Resource display. (See the example of this display below.) 3. Press F6 to display the hexadecimal report. The direct select address (DSA) is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F Display Detail Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Log ID . . . . . . . . . : 06020132 Sequence . . . . . . . : 2248 Date . . . . . . . . . . : 05/20/95 Time . . . . . . . . . : 08:37:55 Reference code . . . . . : 3400 Secondary code . . . . : 00000000 Table ID . . . . . . . . : 65120001 IPL source/state . . . : B/7 Class . . . . . . . . . . : Permanent System Ref Code . . . . . : 65123400 I/O processor card detected device error Press Enter to continue. F3=Exit F6=Hexadecimal report F9=Address Information F10=Previous detail report F12=Cancel Figure 50. Example Hexadecimal Detail Report for Resource
618 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
4. Is offset 170 - 173 = 00000000? v Yes: Continue with step 5. v No: Continue with step 6 on page 620.5. Page forward until you locate the offset location X’000180’. If the rightmost hex digit of offset 000182 is 8, then the unit address is hexadecimal offset 18C through 18F. If the rightmost hex digit of offset 000182 is 2, then the unit address is hexadecimal offset 192 through 195 Record this address information. Return to the procedure that sent you here, if one. Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000000 C5D3F0F4 00000160 00E00180 77B7695E EL04...-.......; 000010 E8A70000 000008C8 20E00602 01320000 Y......H........ 000020 34000000 11000000 00000000 009C0000 ................ 000030 F6F5F1F2 F0F0F140 F1F060F4 F2F9F4F0 6512001 10-42940 000040 F0F9F6F5 F1F20000 00000001 00060000 096512.......... 000050 00020000 FFFFFFFF FFFFFFFF FFFF0000 ................ 000060 00000000 00000000 00000000 F4404040 ............4 000070 40000000 00000000 00000000 00000000 ............... 000080 00000000 00000000 00000000 00000000 ................ 000090 00000000 00000000 00000000 00000000 ................ 0000A0 00000000 00000000 00000000 00000000 ................ 0000B0 00000000 00000000 40404040 40404040 ........ More... Press Enter to continue. F3=Exit F12=Cancel Figure 51. Example Hexadecimal Report for ResourceDisplay Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 D0007800 65120001 34000620 1100FFFF }............... 000190 FFFFFFFF 31090000 00001100 03C00000 .............{.. 0001A0 00000000 00000000 00000000 00000000 ................ 0001B0 00000000 00000000 00000000 00000000 ................ 0001C0 00000000 00000000 00000000 E0000008 ................ 0001D0 00000000 00000000 00000000 00000000 ................ 0001E0 00000018 00000002 E00401F0 10002000 ...........0.... 0001F0 00000000 40000000 00000000 000002CC .... ........... 000200 CCCC0000 00000000 00800000 00000000 ................ 000210 08100000 00000000 F0200040 00800000 ........0.. .... 000220 00000000 04000000 08000040 E000E221 ........... ..S. 000230 800084F3 00000002 0000001B 110003C0 ...3...........{ More... Press Enter to continue. F3=Exit F12=Cancel Figure 52. Hexadecimal Report for Resource Offset Location Example (where the rightmost hex digit of offset 000182 is 8)
Chapter 7. Common Service Procedures 619
6. Using hexadecimal arithmetic, add the value in offsets 170 - 173, the value in offsets 174 - 177, and 00000002. For example, 000001C8 + 00000410 + 00000002 = 000005DA If the value of the rightmost hex digit at this offset is 8: Using hexadecimal arithmetic, add 0000000A to the offset you calculated. This will be the location of the unit address in the form abcdefgh. For example, 00005DA + 0000000A = 000005E4 If the value of the rightmost hex digit at this offset is 2: Using hexadecimal arithmetic, add 00000010 to the offset you calculated. This will be the location of the unit address in the form abcdefgh. For example, 00005DA + 00000010 = 000005EA Record this address information. If you were sent here by another procedure, return to the procedure that sent you here. This ends the procedure.More information from hexadecimal reports You can get additional information for IOP 90xx reference code product activity log entries at DST by using a D-IPL. Note: The term I/O processor may indicate an IOP or an IOA. Below are examples of how to interpret hexadecimal information for a D-IPL. Note: Formatting is available (F4=Additional Information) for IPL from disk (A- or B-IPL). Formatting is not available in DST from a D-IPL. Use the information below and the example displays on the following pages to interpret the hexadecimal dump information that begins at offset hex 000180. If the rightmost hex digit of the offset 000182 is 8, get the format from offset 00091F, then use the following table to determine what figure to use.
Table 55. Format number to figure usage
Format number
Reference the following figures
01
Figure 54 on page 622 Figure 55 on page 623 Figure 56 on page 623
Display Hexadecimal Report for Resource
Serial Resource Name Type Model Number Name STORAGE 571A 001 10-4294009 DC01 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F8407200 F5F7F1C1 F0F0F0F1 3400031D 8 ..571A0001.... 000190 00233104 00FFFFF4 F0F0F0F0 F0F0F0F0 .......400000000 0001A0 1104E255 F0F0F0F0 F0F0F0F0 40404040 ..S.00000000 0001B0 40404000 00FF0000 04440000 A1D0038D ..........}.. 0001C0 0713000C 00000000 00000000 70000200 ................ More... Press Enter to continue. F3=Exit F12=Cancel Figure 53. Hexadecimal Report for Resource Offset Location Example (where the rightmost hex digit of offset 000182 is 2)
620 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 55. Format number to figure usage (continued)
Format number
Reference the following figures
02
Figure 57 on page 624 Figure 58 on page 625 Figure 59 on page 625
03
Figure 60 on page 626 Figure 61 on page 626 Figure 62 on page 626
04
Figure 63 on page 627 Figure 64 on page 628 Figure 65 on page 628 If the rightmost hex digit of offset 000182 is 2, get the format number from offset 0001B5, then use the following table to determine what table to use.
Table 56.
Format number
Reference the following tables
01
Table 57 on page 628
02
Table 58 on page 629
03
Table 59 on page 630
04
Table 60 on page 631
11
Table 61 on page 631
12
Table 62 on page 632
13
Table 63 on page 632
14
Table 64 on page 633
17
Table 65 on page 635
20
Table 66 on page 636 Chapter 7. Common Service Procedures 621
Device formatting log example (unformatted information)
Display Hexadecimal Report for Resource
Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F8407800 65320001 90920210 010400FF 8 .......k...... 000190 66070000 00001306 87531400 123D0001 ........g....... 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 00000004 00000003 00000000 00000000 ................ 0001D0 F6F6F0F7 13068753 010400FF 00000000 6607..g......... 0001E0 00000000 65320001 07050011 47000001 ................ 0001F0 07069205 C0004000 00000021 00000030 ..k.{. ......... 000200 00000000 00000000 F6F6F0F7 00071575 ........6607.... 000210 010300FF 00000000 00000000 65320001 ................ 000220 07050011 47000001 07069205 C0004000 ..........k.{. . 000230 00000000 00000000 00000000 00000000 ................ 000240 F6F6F0F7 00259DE8 010200FF 00000000 6607...Y........ 000250 00000000 65320001 07050011 47000001 ................ 000260 07069205 C0404040 00000000 00000000 ..k.{ ........ 000270 00000000 00000000 E2E5C3C4 D6C3E2C3 ........SVCDOCSC 000280 D7D7F2F9 C1F800D5 00000000 00000000 PP29A8.N........ 000290 00000000 00000000 00000000 00000000 ................ 0002A0 00000000 00000000 00000000 00000000 ................ 0002B0 00000000 00000000 00000000 00000000 ................ 0002C0 00000000 00000000 00000000 00000000 ................ 0002D0 00000000 00000000 C3D7D7F8 F9F2F9D5 ........CPP8929N 0002E0 F6F5F3F2 F9F0F9F2 F0F0F0F0 F0F0F0F0 6532909200000000 0002F0 F0F0F0F0 F0F1F6C3 F0F1F3F3 0000D5D5 0000016C0133..NN 000300 00000000 00000000 00000000 00000000 ................ Figure 54. Example Hexadecimal Report, Device Formatting Log (unformatted information)
622 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Device formatting log template Device formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------01 0001A0 -------- -------- -------- -------- 0001B0 -------- -------- -------- -------- 0001C0 # errors # errors -------- -------- detected logged -------- -------- 0001D0 Device1 Device1 Device1 New Dev1 Type Serial UnitAddr Type 0001E0 New Dev1 IOP type IOP Cache Serial & model Serial Type/Mod 0001F0 Cache -------- -------- -------- Serial -------- -------- -------- 000200 -------- -------- Device2 Device2 -------- -------- Type Serial 000210 Device2 New Dev2 New Dev2 IOP type UnitAddr Type Serial & model 000220 IOP Cache Cache -------- Serial Type/Mod Serial -------- 000230 -------- -------- -------- -------- -------- -------- -------- -------- 000240 Device3 Device3 Device3 New Dev3 Type Serial UnitAddr Type 000250 New Dev3 IOP type IOP Cache Serial & model Serial Type/Mod 000260 Cache -------- -------- -------- Serial -------- -------- -------- Figure 55. Example Hexadecimal Report, Device Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Device Errors detected . . . . . . . : 4 Device Errors logged . . . . . . . . : 3 Device . . . . . . . . . : 1 2 3 Unit Address . . . . : 010400FF 010300FF 010200FF Type . . . . . . . . : 6607 6607 6607 Serial Number . . . : 13068753 00071575 00259DE8 New Device Information: Type . . . . . . . . : **** **** **** Serial Number . . . : 00000000 00000000 00000000 I/O Processor Information: Type . . . . . . . . : 6532 6532 6532 Serial Number . . . : 07050011 07050011 07050011 Cache Adaptor Card Information: Type . . . . . . . . : 4700 4700 4700 Serial Number . . . : 07069205 07069205 07069205 Figure 56. Example Hexadecimal Report, Device Formatting (As formatted information for A or B IPL)
Chapter 7. Common Service Procedures 623
Array member formatting log example (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F4407800 65320001 90210210 0FFFFFFF 4 .............. 000190 00000000 00000000 00001400 12400002 ............. .. 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 F6F6F0F7 0025ED09 000200FF 000200FF 6607............ 0001D0 F6F6F0F7 00158135 000100FF 000100FF 6607..a......... 0001E0 F6F6F0F7 00259DE8 010200FF 00000000 6607...Y........ 0001F0 F6F6F0F7 00008326 010100FF 00000000 6607..c......... 000200 F6F6F0F7 00071575 010300FF 00000000 6607............ 000210 F6F6F0F7 13068753 010400FF 00000000 6607..g......... 000220 00000000 00000000 00000000 00000000 ................ 000230 00000000 00000000 00000000 00000000 ................ 000240 00000000 00000000 00000000 00000000 ................ 000250 00000000 00000000 00000000 00000000 ................ 000260 000000FF 00000004 00000000 00000000 ................ 000270 00000000 E2E5C3C4 D6C3E2C3 D7D7F2F9 ....SVCDOCSCPP29 000280 C1F800D5 00000000 00000000 00000000 A8.N............ 000290 00000000 00000000 00000000 00000000 ................ 0002A0 00000000 00000000 00000000 00000000 ................ 0002B0 00000000 00000000 00000000 00000000 ................ 0002C0 00000000 00000000 00000000 00000000 ................ 0002D0 00000000 C3D7D7C5 C1F0F2D5 F6F5F3F2 ....CPPEA02N6532 0002E0 F9F0F2F1 F0F0F0F0 F0F0F0F0 F0F0F0F0 9021000000000000 0002F0 F0F0C4F4 F0F1F3F3 0000D5D5 00000000 00D40133..NN.... 000300 00000000 00000000 00000000 00000000 ................ Figure 57. Example Hexadecimal Report, Array Member Formatting Log (unformatted information)
624 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Array member formatting log template Array member formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------02 0001A0 -------- -------- -------- -------- 0001B0 -------- -------- -------- -------- 0001C0 Member0 Member0 Expected Current Type Serial Address Address 0001D0 Member1 Member1 Expected Current Type Serial Address Address 0001E0 Member2 Member2 Expected Current Type Serial Address Address 0001F0 Member3 Member3 Expected Current Type Serial Address Address 000200 Member4 Member4 Expected Current Type Serial Address Address 000210 Member5 Member5 Expected Current Type Serial Address Address 000220 Member6 Member6 Expected Current Type Serial Address Address 000230 Member7 Member7 Expected Current Type Serial Address Address 000240 Member8 Member8 Expected Current Type Serial Address Address 000250 Member9 Member9 Expected Current Type Serial Address Address Figure 58. Example Hexadecimal Report, Array Member Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Array Member . . . . . . : 0 1 2 3 4 Type . . . . . . . . : 6607 6607 6607 6607 6607 Serial Number . . . : 0025ED09 00158135 00259DE8 00008326 00071575 Unit Address: Current . . . . . . : 000200FF 000100FF 00000000 00000000 00000000 Expected . . . . . . : 000200FF 000100FF 010200FF 010100FF 010300FF Array Member . . . . . . : 5 6 7 8 9 Type . . . . . . . . : 6607 **** **** **** **** Serial Number . . . : 13068753 00000000 00000000 00000000 00000000 Unit Address: Current . . . . . . : 00000000 00000000 00000000 00000000 00000000 Expected . . . . . . : 010400FF 00000000 00000000 00000000 00000000 Figure 59. Example Hexadecimal Report, Array Member Formatting (As formatted information for A or B IPL)
Chapter 7. Common Service Procedures 625
Configuration formatting log example (unformatted information) Configuration formatting log template Configuration formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 50407800 65320001 90120210 0FFFFFFF & .............. 000190 00000000 00000000 00001400 123C0003 ................ 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 00000000 00000000 47000001 07069205 ..............k. 0001D0 F8F6C7F8 F1F4F9C3 D7D7F8F9 F8F100D5 86G8149CPP8981.N 0001E0 00000000 00000000 00000000 00000000 ................ 0001F0 00000000 00000000 00000000 00000000 ................ 000200 00000000 00000000 00000000 00000000 ................ 000210 00000000 00000000 00000000 00000000 ................ 000220 00000000 00000000 00000000 00000000 ................ 000230 C3D7D7C5 C1F0F2D5 F6F5F3F2 F9F0F1F2 CPPEA02N65329012 000240 F0F0F0F0 F0F0F0F0 F0F0F0F0 F0F0C4F4 00000000000000D4 000250 F0F1F3F4 0000E8D5 00000000 00000000 0134..YN........ Figure 60. Example Hexadecimal Report, Configuration Formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------03 0001A0 -------- -------- -------- -------- 0001B0 Current Current Current Current IOP type IOP Cache Cache & model serial type serial 0001C0 Expected Expected Expected Expected IOP type IOP Cache Cache & model serial type serial Figure 61. Example Hexadecimal Report, Configuration Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Configuration . . . . . . : Current Expected I/O Processor Information: Type . . . . . . . . : 6532 0000 Serial Number . . . : 07050011 00000000 Cache Adaptor Card Information: Type . . . . . . . . : 0000 4700 Serial Number . . . : 00000000 07069205 Figure 62. Example Hexadecimal Report, Configuration Formatting Log (As formatted information for A or B IPL)
626 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Array addendum formatting log example (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 2757 001 10-0322005 DC03 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000000 C5D3F0F5 000000A0 00E00180 82745D0F EL05........b.). 000010 1FD78000 00002CE0 20100084 049F0000 .P.........d.... 000020 902F0000 11000000 00000000 00C70000 .............G.. 000030 F2F7F5F7 F0F0F140 F1F060F0 F3F2F2F0 2757001 10-03220 000040 F0F50000 0000F2F7 F5F70002 00020000 05....2757...... 000050 00200000 0002FFFF FFFFFFFF 0004FFFF ................ 000060 00000000 00000000 00000000 40404040 ............ 000070 40404040 40400000 00000000 00000000 .......... 000080 00000000 00000000 00000000 00000000 ................ 000090 00000000 00000000 00000000 00000000 ................ 0000A0 00000000 00000000 00000000 00000000 ................ 0000B0 00000000 00000000 40404040 40404040 ........ 0000C0 40404040 40404040 40404040 40404040 0000D0 40404040 40404040 40404040 40404040 0000E0 F2F7F5F7 F0F0F0F1 0000E2D3 C9C3C9D6 27570001..SLICIO 0000F0 4040E5F5 D9F2D4F0 40400000 00000000 V5R2M0 ...... 000100 00B90000 00000000 000000C7 00000000 ...........G.... 000110 00000000 00000000 00000000 00000000 ................ 000120 00000000 00000000 00000000 00000000 ................ 000130 00000000 00000000 00000000 00000000 ................ 000140 C4C3F0F3 40404040 40404040 40404040 DC03 000150 D5C5C5C4 E5C1D3C9 C4D740D5 00070001 NEEDVALIDP N.... 000160 00000000 A0200007 00000000 00000000 ................ 000170 00000000 00000000 0003001E 000000A0 ................ 000180 A0407800 27570001 902F0200 4FFFFFFF . ...*......|... 000190 00000000 00000000 00001400 623E0004 ................ 0001A0 F6F6F0F7 0026121D 410500FF 410500FF 6607............ 0001B0 F6F6F0F7 00248B85 410400FF 410400FF 6607...e........ 0001C0 F6F6F0F7 0014BE1F 410300FF 410300FF 6607............ 0001D0 F6F6F0F7 0014C0F0 410200FF 410200FF 6607..{0........ 0001E0 F6F6F0F7 001403C8 410100FF 410100FF 6607...H........ 0001F0 F6F6F0F7 0014BEA4 410800FF 410800FF 6607...u........ 000200 00000000 00000000 00000000 00000000 ................ 000210 00000000 00000000 00000000 00000000 ................ Figure 63. Example Hexadecimal Report, Array Addendum Formatting Log (unformatted information)
Chapter 7. Common Service Procedures 627
Array addendum log template Array addendum log example (as formatted information for A or B IPL)
Table 57. Format 01
Description
Hex offset
Device errors detected
0001D6 - 0001D9
Device errors logged
0001DA - 0001DD
Device 1 type
0001E6 - 0001E9
Device 1 serial number
0001EA - 0001ED
Device 1 unit address
0001EE - 0001F1
New device 1 type
0001F2 - 0001F5
Display Hexadecimal Report for Resource
Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------04 0001A0 Member10 Member10 Expected Current Type Serial Address Address 0001B0 Member11 Member11 Expected Current Type Serial Address Address 0001C0 Member12 member12 Expected Current Type Serial Address Address 0001D0 Member13 Member13 Expected Current Type Serial Address Address 0001E0 Member14 Member14 Expected Current Type Serial Address Address 0001F0 Member15 Member15 Expected Current Type Serial Address Address 000200 Member16 Member16 Expected Current Type Serial Address Address 000210 Member17 Member17 Expected Current Type Serial Address Address Figure 64. Example Hexadecimal Report, Array Addendum Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 2757 001 10-0322005 DC03 Array Member . . . . . . : 10 11 12 13 14 Type . . . . . . . . : 6607 6607 6607 6607 6607 Serial Number . . . : 0026121D 00248B85 0014BE1F 0014C0F0 001403C8 Unit Address: Current . . . . . . : 410500FF 410400FF 410300FF 410200FF 410100FF Expected . . . . . . : 410500FF 410400FF 410300FF 410200FF 410100FF Array Member . . . . . . : 15 16 17 Type . . . . . . . . : 6607 **** **** Serial Number . . . : 0014BEA4 00000000 00000000 Unit Address: Current . . . . . . : 410800FF 00000000 00000000 Expected . . . . . . : 410800FF 00000000 00000000 Figure 65. Example Hexadecimal Report, Array Addendum Formatting (as formatted information for A or B IPL)
628 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 57. Format 01 (continued)
Description
Hex offset
New device 1 serial number
0001F6 - 0001F9
Device 1 IOA type and model
0001FA - 0001FD
Device 1 IOA serial number
0001FE - 000201
Device 1 Cache type and model
000202 - 000205
Device 1 Cache serial number
000206 - 000209
Device 2 type
00021E - 000221
Device 2 serial number
000222 - 000225
Device 2 unit address
000226 - 000229
New device 2 type
00022A - 00022D
New device 2 serial number
00022E - 000231
Device 2 IOA type and model
000232 - 000235
Device 2 IOA serial number
000236 - 000239
Device 2 cache type and model
00023A - 00023D
Device 2 cache serial number
00023E - 000241
Device 3 type
000256 - 000259
Device 3 serial number
00025A - 00025D
Device 3 unit address
00025E - 000261
New device 3 type
000262 - 000265
New device 3 serial number
000266 - 000269
Device 3 IOA type and model
00026A - 00026D
Device 2 IOA serial number
00026E - 000271
Device 3 cache type and model
000272 - 000275
Device 3 cache serial number
000276 - 000279
Table 58. Format 02
Description
Hex offset
Array member 0 type
0001D6 - 0001D9
Array member 0 serial number
0001DA - 0001DD
Array member 0 expected unit address
0001DE - 0001E1
Array member 0 current unit address
0001E2 - 0001E5
Array member 1 type
0001E6 - 0001E9
Array member 1 serial number
0001EA - 0001ED
Array member 1 expected unit address
0001EE - 0001F1
Array member 1 current unit address
0001F2 - 0001F5
Array member 2 type
0001F6 - 0001F9
Array member 2 serial number
0001FA - 0001FD
Array member 2 expected unit address
0001FE - 000201
Array member 2 current unit address
000202 - 000205
Array member 3 type
000206 - 000209
Array member 3 serial number
00020A - 00020D Chapter 7. Common Service Procedures 629
Table 58. Format 02 (continued)
Description
Hex offset
Array member 3 expected unit address
00020E - 000211
Array member 3 current unit address
000212 - 000215
Array member 4 type
000216 - 000219
Array member 4 serial number
00021A - 00021D
Array member 4 expected unit address
00021E - 000221
Array member 4 current unit address
000222 - 000225
Array member 5 type
000226 - 000229
Array member 5 serial number
00022A - 00022D
Array member 5 expected unit address
00022E - 000231
Array member 5 current unit address
000232 - 000235
Array member 6 type
000236 - 000239
Array member 6 serial number
00023A - 00023D
Array member 6 expected unit address
00023E - 000241
Array member 6 current unit address
000242 - 000245
Array member 7 type
000246 - 000249
Array member 7 serial number
00024A - 00024D
Array member 7 expected unit address
00024E - 000251
Array member 7 current unit address
000252 - 000255
Array member 8 type
000256 - 000259
Array member 8 serial number
00025A - 00025D
Array member 8 expected unit address
00025E - 000261
Array member 8 current unit address
000262 - 000265
Array member 9 type
000266 - 000269
Array member 9 serial number
00026A - 00026D
Array member 9 expected unit address
00026E - 000271
Array member 9 current unit address
000272 - 000275
Table 59. Format 03
Description
Hex offset
Current I/O card type
0001C6 - 0001C7
Current I/O card serial number
0001CA - 0001CD
Current cache type
0001CE - 0001CF
Current cache serial number
0001D2 - 0001D5
Expected I/O card type
0001D6 - 0001D7
Expected I/O card serial number
0001DA - 0001DD
Expected cache type
0001DE - 0001DF
Expected cache serial number
0001E2 - 0001E5 630 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 60. Format 04
Description
Hex offset
Array member 10 type
0001B6 - 0001B9
Array member 10 serial number
0001BA - 0001BD
Array member 10 expected unit address
0001BE - 0001C1
Array member 10 current unit address
0001C2 - 0001C5
Array member 11 type
0001C6 - 0001C9
Array member 11 serial number
0001CA - 0001CD
Array member 11 expected unit address
0001CE - 0001D1
Array member 11 current unit address
0001D2 - 0001D5
Array member 12 type
0001D6 - 0001D9
Array member 12 serial number
0001DA - 0001DD
Array member 12 expected unit address
0001DE - 0001E1
Array member 12 current unit address
0001E2 - 0001E5
Array member 13 type
0001E6 - 0001E9
Array member 13 serial number
0001EA - 0001ED
Array member 13 expected unit address
0001EE - 0001F1
Array member 13 current unit address
0001F2 - 0001F5
Array member 14 type
0001F6 - 0001F9
Array member 14 serial number
0001FA - 0001FD
Array member 14 expected unit address
0001FE - 000201
Array member 14 current unit address
000202 - 000205
Array member 15 type
000206 - 000209
Array member 15 serial number
00020A - 00020D
Array member 15 expected unit address
00020E - 000211
Array member 15 current unit address
000212 - 000215
Array member 16 type
000216 - 000219
Array member 16 serial number
00021A - 00021D
Array member 16 expected unit address
00021E - 000221
Array member 16 current unit address
000222 - 000225
Array member 17 type
000226 - 000229
Array member 17 serial number
00022A - 00022D
Array member 17 expected unit address
00022E - 000231
Array member 17 current unit address
000232 - 000235
Table 61. Format 11
Description
Hex offset
SCSI sense data
0001C4 - 0001E3
Worldwide ID of device with problem
0001E4 - 001EB
Unit address of device with problem
0001EC - 0001EF
SCSI command descriptor block
0001F4 - 000203
Problem description
000204 - 000223 Chapter 7. Common Service Procedures 631
Table 62. Format 12
Description
Hex offset
Current I/O card type and model
0001C4 - 0001CB
Current I/O card serial number
0001CC - 0001D3
Current I/O card worldwide ID
0001D4 - 0001DB
Current cache type and model
0001DC - 0001E3
Current cache serial number
0001E4 - 0001EB
Current cache worldwide ID
0001EC - 0001F3
Previous I/O card type and model
0001FC - 000203
Previous I/O card serial number
000204 - 00020B
Previous I/O card worldwide ID
00020C - 000213
Previous cache type and model
000214 - 00021B
Previous cache serial number
00021C - 000223
Previous cache worldwide ID
000224 - 00022B
Table 63. Format 13
Description
Hex offset
Device errors detected
0001FC - 0001FF
Device errors logged
000200 - 000203
Device 1 type
000204 - 000207
Device 1 serial number
000208 - 00020F
Device 1 worldwide ID
000210 - 000217
Device 1 unit address
000218 - 00021B
New device 1 type
00021C - 00021F
New device 1 serial number
000220 - 000227
New device 1 worldwide ID
000228 - 00022F
Device 1 IOA type and model
000230 - 000237
Device 1 IOA serial number
000238 - 00023F
Device 1 IOA worldwide ID
000240 - 000247
Device 1 cache type and model
000248 - 00024F
Device 1 cache serial number
000250 - 000257
Device 1 cache worldwide ID
000258 - 00025F
Device 2 type
000260 - 000263
Device 2 serial number
000264 - 00026B
Device 2 worldwide ID
00026C - 000273
Device 2 unit address
000274 - 000277
New device 2 type
000278 - 00027B
New device 2 serial number
00027C - 000283
New device 2 worldwide ID
000284 - 00028B
Device 2 IOA type and model
00028C - 000293
Device 2 IOA serial number
000294 - 00029B
Device 2 IOA worldwide ID
00029C - 0002A3 632 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 63. Format 13 (continued)
Description
Hex offset
Device 2 cache type and model
0002A4 - 0002AB
Device 2 cache serial number
0002AC - 0002B3
Device 2 cache worldwide ID
0002B4 - 0002BB
Device 3 type
0002BC - 0002BF
Device 3 serial number
0002C0 - 0002C7
Device 3 worldwide ID
0002C8 - 0002CF
Device 3 unit address
0002D0 - 0002D3
New device 3 type
0002D4 - 0002D7
New device 3 serial number
0002D8 - 0002DF
New device 3 worldwide ID
0002E0 - 0002E7
Device 3 IOA type and model
0002E8 - 0002EF
Device 3 IOA serial number
0002F0 - 0002F7
Device 3 IOA worldwide ID
0002F8 - 0002FF
Device 3 cache type and model
000300 - 000307
Device 3 cache serial number
000308 - 00030F
Device 3 cache worldwide ID
000310 - 000317
Table 64. Format 14
Description
Hex offset
RAID number
000204 - 00020B
Array member 0 type
00020C - 00020F
Array member 0 serial number
000210 - 000217
Array member 0 worldwide ID
000218 - 00021F
Array member 0 expected unit address
000220 - 000223
Array member 0 current unit address
000224 - 000227
Array member 1 type
000228 - 00022B
Array member 1 serial number
00022C - 000233
Array member 1 worldwide ID
000234 - 00023B
Array member 1 expected unit address
00023C - 00023F
Array member 1 current unit address
000240 - 000243
Array member 2 type
000244 - 000247
Array member 2 serial number
000248 - 00024F
Array member 2 worldwide ID
000250 - 000257
Array member 2 expected unit address
000258 - 00025B
Array member 2 current unit address
00025C - 00025F
Array member 3 type
000260 - 000263
Array member 3 serial number
000264 - 00026B
Array member 3 worldwide ID
00026C - 000273
Array member 3 expected unit address
000274 - 000277
Array member 3 current unit address
000278 - 00027B Chapter 7. Common Service Procedures 633
Table 64. Format 14 (continued)
Description
Hex offset
Array member 4 type
00027C - 00027F
Array member 4 serial number
000280 - 000287
Array member 4 worldwide ID
000288 - 00028F
Array member 4 expected unit address
000290 - 000293
Array member 4 current unit address
000294 - 000297
Array member 5 type
000298 - 00029B
Array member 5 serial number
00029C - 0002A3
Array member 5 worldwide ID
0002A4 - 0002AB
Array member 5 expected unit address
0002AC - 0002AF
Array member 5 current unit address
0002B0 - 0002B3
Array member 6 type
0002B4 - 0002B7
Array member 6 serial number
0002B8 - 0002BF
Array member 6 worldwide ID
0002C0 - 0002C7
Array member 6 expected unit address
0002C8 - 0002CB
Array member 6 current unit address
0002CC - 0002CF
Array member 7 type
0002D0 - 0002D3
Array member 7 serial number
0002D4 - 0002DB
Array member 7 worldwide ID
0002DC - 0002E3
Array member 7 expected unit address
0002E4 - 0002E7
Array member 7 current unit address
0002E8 - 0002EB
Array member 8 type
0002EC - 0002EF
Array member 8 serial number
0002F0 - 0002F7
Array member 8 worldwide ID
0002F8 - 0002FF
Array member 8 expected unit address
000300 - 000303
Array member 8 current unit address
000304 - 000307
Array member 9 type
000308 - 00030B
Array member 9 serial number
00030C - 000313
Array member 9 worldwide ID
000314 - 00031B
Array member 9 expected unit address
00031C - 00031F
Array member 9 current unit address
000320 - 000323
Array member 10 type
000324 - 000327
Array member 10 serial number
000328 - 00032F
Array member 10 worldwide ID
000330 - 000337
Array member 10 expected unit address
000338 - 00033B
Array member 10 current unit address
00033C - 00033F
Array member 11 type
000340 - 000343
Array member 11 serial number
000344 - 00034B
Array member 11 worldwide ID
00034C - 000353
Array member 11 expected unit address
000354 - 000357
Array member 11 current unit address
000358 - 00035B 634 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 64. Format 14 (continued)
Description
Hex offset
Array member 12 type
00035C - 00035F
Array member 12 serial number
000360 - 000367
Array member 12 worldwide ID
000368 - 00036F
Array member 12 expected unit address
000370 - 000373
Array member 12 current unit address
000374 - 000377
Array member 13 type
000378 - 00037B
Array member 13 serial number
00037C - 000383
Array member 13 worldwide ID
000384 - 00038B
Array member 13 expected unit address
00038C - 00038F
Array member 13 current unit address
000390 - 000393
Array member 14 type
000394 - 000397
Array member 14 serial number
000398 - 00039F
Array member 14 worldwide ID
0003A0 - 0003A7
Array member 14 expected unit address
0003A8 - 0003AB
Array member 14 current unit address
0003AC - 0003AF
Array member 15 type
0003B0 - 0003B3
Array member 15 serial number
0003B4 - 0003BB
Array member 15 worldwide ID
0003BC - 0003C3
Array member 15 expected unit address
0003C4 - 0003C7
Array member 15 current unit address
0003C8 - 0003CB
Array member 16 type
0003CC - 0003CF
Array member 16 serial number
0003D0 - 0003D7
Array member 16 worldwide ID
0003D8 - 0003DF
Array member 16 expected unit address
0003E0 - 0003E3
Array member 16 current unit address
0003E4 - 0003E7
Array member 17 type
0003E8 - 0003EB
Array member 17 serial number
0003EC - 0003F3
Array member 17 worldwide ID
0003F4 - 0003FB
Array member 17 expected unit address
0003FC - 0003FF
Array member 17 current unit address
000400 - 000403
Table 65. Format 17
Description
Hex offset
Associated IOA type and model
0001C4 - 0001CB
Associated IOA serial number
0001CC - 0001D3
Associated IOA worldwide ID
0001D4 - 0001DB
Type of adapter connection (1 = Dual storage IOA pair, 2 = Storage IOA to auxiliary cache)
0001DC - 0001DC
Problem description
0001E0 - 00021F Chapter 7. Common Service Procedures 635
Table 66. Format 20
Description
Hex offset
Problem description
0001C4 - 000203
Path 1 adapter port
00020A
Path active (00 = Unknown, 01 = Yes, 10 = No, 11 = Unknown)
00020D, bit 0 - 1
Path state (0000 = Unknown, 0001 = Operational, 0010 = Degraded, 0011 = Failed)
00020D, bit 4 -7
Node 1 port type (0000 = Empty, 0001 = Adapter, 0010 = Expander, 0011 = Device, 0100= LUN)
000210, bit 0 - 3
Node 1 status (0000 = Operational, 0001 = Degraded, 0010 = Failed, 0011 = Suspect, 0100 = Missing, 0101 = Not valid)
000210, bit 4 - 7
Node 1 phy
000212
Node 1 information (1000 = 1.5 Gbps, 1001 = 3.0 Gbps)
000213, bit 4 - 7
Path 1 Node 1 SAS address
000214 - 00021B
Additional nodes are self defining Reclaiming IOP cache storage Use this procedure to reclaim IOP cache storage. 1. If you are not already using Dedicated Service Tools, perform an IPL to DST. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. Did you get to DST with a Type D IPL? v No: Continue with the next step. v Yes: Perform the following: a. Select Work with disk units ? Reclaim IOP cache storage. b. Follow the on-line instructions to reclaim cache storage. The system will automatically perform an IPL to a Disk Configuration Attention Report or a Disk Configuration Warning Report display. Select the option to display the report. Because the Reclaim Cache Storage was selected after a D-IPL, the system will now require another Reclaim Cache Storage to be performed.2. Select Work with disk units ? Work with disk unit recovery ? Reclaim IOP Cache Storage. 3. Follow the on-line instructions to reclaim cache storage. Depending on the reason for performing the Reclaim IOP Cache procedure, there may be a message requiring the ASP be cleared before performing this option, or a message stating the number of lost sectors.Note: If there are lost sectors, some objects may be damaged. The customer may have to load the damaged objects again.When the IOP performs a Reset/Reload, if there is a storage IOA attached that is running the local load source, Cxxx xxxx codes will be displayed in the system control panel. The DST console may then have a blank screen with 01B displayed on the system control panel for up to 15 minutes before it is restored. This ends the procedure. Working with Storage Dumps A main storage dump (MSD) is a process of collecting data from the system’s main storage. For more information on main storage dumps, including POWER5 platform system dumps, see Performing a platform or main storage dump. A main storage dump can be performed in the following ways: 636 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Automatically - by the service processor as the result of a system failure. v Manually - by using control panel functions when the system waits, loops, or appears to have an operating system failure.Dedicated Service Tools (DST) Dedicated service tools (DST) is used to service Licensed Internal Code (LIC), work with disk units, work with configuration and resources, verify devices and communications, and display logs on the system. DST operates in stand-alone, limited, and full paging environments. The DST tools and functions vary depending on the paging environment and the release level of the operating system. Dedicated Service Tools requirementsNote: It is a requirement to change the password the first time anyone signs on to DST or SST. The authorized service provider needs to give the new (changed) password to the customer or have the customer change the password. To use DST, you must have the following items: v The disk that contains the Licensed Internal Code (the load-source disk). Note: When you perform an alternate initial program load (IPL) to DST, you do not need the load-source disk. v An operational workstation on bus 1 as either a primary console or an operations console. v One workstation for each system partition. v A valid service tools user ID and password to sign on to DST. IBM supplies service tools user IDs that have different levels of privileges.To make a printout, attach the printer to the workstation I/O processor or storage media unit that is performing the service function. The printer that is used with DST for service tool output must be an SCS-type data stream printer. “Accessing Dedicated Service Tools”Covers how to start Dedicated Service Tools. “Start a service tool” on page 642 “IPL type, mode, and speed options” on page 642Set the IPL options from function 02 on the control panel. To permanently change the IPL speed use the CHGIPLA command.Accessing Dedicated Service Tools Covers how to start Dedicated Service Tools. You can start dedicated service tools (DST) in several ways: v Performing an IPL to DST (see “Performing an IPL to DST” on page 640). v Pressing the System Request key while the system is operational. v Selecting function 21 while the system is operational (see “Selecting function 21 from the control panel” on page 638). This is available from the primary partition only. v Performing an alternate IPL to DST (type D IPL) (see “Performing an alternate IPL to DST (type D IPL)” on page 641). After performing a type D IPL on a primary partition, the first step in DST must be configuration recovery. v Using the function key F16, which is not displayed, while in DST debug mode during a step-mode IPL.
Chapter 7. Common Service Procedures 637
“Selecting function 21 from the control panel”Provides detailed instructions on selecting function 21 while the system is operational. “Control panel function codes on the HMC”Learn about the control panel function codes used to indicate status and function options for a system that is managed by a Hardware Management Console (HMC). “Performing an IPL to DST” on page 640This procedure is for the entire system, or if logical partitions are installed, for the primary partition. “Performing an alternate IPL to DST (type D IPL)” on page 641An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives.Selecting function 21 from the control panel: Provides detailed instructions on selecting function 21 while the system is operational. Perform the following steps: 1. Select control panel Function 21 (Make DST available). Select Manual mode, then push the Increment (?) or Decrement (?) button on the control panel until 21 appears in the Function/Data display. Press the Enter button on the control panel.Notes: a. If you enter Function 21 and the primary console is powered off or not usable, reference code A600 500x appears on the control panel. Enter Function 21 again to force the DST Sign On display to appear on one of the alternative consoles.2. Primary console or alternate console will display the DST sign on. Note: For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 3. Sign-on to DST. Type QSRV as the valid user ID and get the password from the customer. If the customer has changed the full DST authority user ID or password, ask the customer for the correct values. Note: v If prompted for a password, be sure to give the new (changed) password to the customer. v Since V5R1, you are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. The system administrator with QSECOFR authority will need to reset the ID.4. The Use Dedicated Service Tools (DST) display appears. The system is in the full paging environment. This ends the procedure.Control panel function codes on the HMC: Learn about the control panel function codes used to indicate status and function options for a system that is managed by a Hardware Management Console (HMC). If you are managing the system with the Hardware Management Console (HMC), use the HMC to perform control panel functions. Servers without physical panels require the HMC to perform the equivalent control panel functions. The HMC affects the physical control panel in the following ways: v Except for some limited auto-platform override function, such as Auto Power On Restart and Timed Power On, the system operating mode value no longer has meaning. v The operating system IPL-type value is disabled in functions 01 and 02. 638 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Functions 11-19 do not display partition system reference codes (SRCs). They continue to display SRCs from the platform Licensed Internal Code (LIC). v Functions 21, 22, 34, and 65-70 are not selectable on the physical control panel.The following table lists the control panel functions that you can perform on the HMC.
Function
Description
01
1. Displays the currently selected IPL type (and logical key mode on some system types). 2. Displays the currently selected IPL speed override for the next IPL.This function is available in both normal and manual operating mode.
02
Performs a slow boot. This function is available in both normal and manual operating mode.
03
Performs a re-IPL. The IPL uses the selected IPL options. This function is available only in manual operating mode and when the system power is on.
04
Performs a lamp test; all displays and indicators are switched on. This function is available in both normal and manual operating mode.
05-06
Reserved.
07
Allows you to set SPCN configuration IDs. This function is available only in manual operating mode when the system is in standby.
08
Causes a fast power off. This function is available only when the system is in manual operating mode and the system power is on.
09-10
Reserved.
11-19
Displays a system reference code (SRC) on the control panel. These functions are available in both normal and manual operating mode when an SRC is available.
20
Displays the machine type, model, processor feature code, processor class indicator, and IPL path description. This function is available in both normal and manual operating mode.
21
Causes the Use Dedicated Service Tool (DST) display to appear on the system console. This function is available only in the manual operating mode and when activated by the operating system.
22
Forces a partition dump. This function is available only in the manual operating mode and when activated by the operating system.
23-24
Reserved.
25-26
Not applicable.
27-29
Reserved.
30
Displays CEC FSP IP address and location. This function is available only in the manual operating mode and when activated by the FSP.
31-33
Reserved.
34
Retries the partition dump. This function is available only in the manual operating mode and when activated by the operating system.
35-41
Reserved.
42-43
Not applicable.
44-49
Reserved.
50-52
Reserved.
53
Unconfigures an intermittently failing processor (repeat GARD function). This function is available only in the manual operating mode.
54
Reserved.
55
Platform dump override. This function is available only in the manual operating mode when activated by the FSP and when CE scroll ranges are enabled. Chapter 7. Common Service Procedures 639
Function
Description
56-62
Reserved.
63
Progress indicator history. This function is available only in the manual operating mode.
64
Diagnostic status SRC trace. This function is available only in the manual operating mode.
65
Deactivates remote service. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled.
66
Activates remote service. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled.
67
Disk unit I/O processor (IOP) reset/reload is enabled only by specific disk unit SRCs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled.
68
Turns off power domains during concurrent maintenance of IOPs and IOAs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled.
69
Turns on power domains during concurrent maintenance of IOPs and IOAs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled.
70
Dump service processor control storage. Initiates an I/O processor (IOP) dump; enabled by specific error conditions. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. Performing an IPL to DST: This procedure is for the entire system, or if logical partitions are installed, for the primary partition. Perform the following steps: 1. Select a type B IPL in Manual mode. Attention: For systems with logical partitions, performing an IPL on the primary partition causes the secondary partitions to be powered down. Failing to power down these secondary partitions will cause an abnormal power-down on the secondary partitions and possible loss of data. 2. Perform one of the following: v If there is a problem log entry after powering on the system, perform a delayed power-off. (This is the data for problem analysis.) When the system is off, press the Power button on the control panel to perform an IPL. Select Function 03 if the delayed power-off takes longer than 40 minutes to complete. Attention: Performing Function 03 may cause damage to objects, and it may also cause data loss. v If there is an attention light after powering on the system, use the Increment (?) or Decrement (?) button to select control panel Function 03. Press Enter on the control panel to perform an IPL. v When the system is powered off, press the Power button on the control panel to perform an IPL. The IPL or Install the System display appears on the primary console.3. Select the Use Dedicated Service Tools option. The DST Sign On display appears. For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 4. Sign on to DST with a valid user ID and password. Remember: Enter a valid user ID and get the password from the customer. If the customer has changed the full DST authority, user ID, or password, ask the customer for the correct values.Note: 640 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v If prompted for a password, be sure to give the new (changed) password to the customer. v You are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. A user ID with a service tool security privilege must reset the ID.The Use Dedicated Service Tools (DST) display appears. Restriction: The system is in the limited paging environment. This ends the procedure.Performing an alternate IPL to DST (type D IPL): An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives. An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives. Perform the following steps: 1. To use an alternate installation device, select the DST option to Work with alternate installation device. This option is available in DST from control panel function 21 or a type D IPL. 2. Determine the removable media unit or units for an alternate IPL. Power on the removable media units. 3. Place the media that contains the Licensed Internal Code (first tape of the customer’s system save, the SAVLIC, ISMD tapes, or optical device) in the removable media units. Load the media and make the media units ready. See the device information for instructions on loading the removable media. 4. Power off the system. If the system is running, end all jobs and perform the power down procedure. 5. Select a type D IPL in Manual mode. For information on how to select IPL options, see “IPL type, mode, and speed options” on page 642. 6. Power on the console. For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 7. Load the media and make the media unit ready. 8. Press the Power button to power on the system. Notes: a. Some types of removable media units automatically reset. In this condition, SRC A100 1933 (Media device not ready) may be displayed. b. If the installation device is an optical device, you may need to IPL the system again. This action will make the unit ready. pwrdwnsys restart(*yes) IPL(D) c. If the installation device is a tape unit, system reference code (SRC) A1001933 is displayed until the device automatically makes itself ready. d. The removable media becomes active. There is a delay while the system loads information from the load source. While running a process, the system continuously updates SRCs on the control panel that show the status. 9. Wait for the Install Licensed Internal Code display to appear on the console. The wait varies depending on the speed of the removable media unit and the processor speed for the specific system model.Notes: a. If SRC A600 500x is displayed on the control panel, the system was not able to locate the console. Chapter 7. Common Service Procedures 641
Ensure that the console is powered on (repair if necessary). Select Function 21 on the control panel (see “Selecting function 21 from the control panel” on page 638) to make DST available. b. If an SRC other than A600 500x is displayed on the control panel, go to “Starting Point for All Problems” in the Problem Analysis information for your system.10. The Install Licensed Internal Code display appears on the console. v If you want to access DST, select the Work with Dedicated Service Tools (DST) option and continue to the next step of this procedure. v If you want to install or restore Licensed Internal Code, select the Install Licensed Internal Code option and continue with the install or restore instructions. v If you want to define the device from which the system LIC should be installed, select the Define alternate installation device option. This option also allows you to enable or disable the alternate installation device.11. Reapply PTFs. 12. Sign on to DST as QSRV. Type QSRV as the valid user ID and get the password from the customer. If the customer has changed the full DST authority user ID or password, ask the customer for the correct values. Note: a. If prompted for a password, be sure to give the new (changed) password to the customer. b. You are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. The system administrator with QSECOFR authority will need to reset the ID.13. The Use Dedicated Service Tools (DST) display appears. Options are limited because the system is in the stand-alone environment. This ends the procedure.Start a service tool Select this option from the Use Dedicated Service (DST) Tools display. Note: The Start a service tool option is also available from the System Service Tools (SST) display. The following options are available from the Start a Service Tool display: v Display / alter dump v Licensed Internal Code log v Licensed Internal Code trace v Hardware service manager v Main storage dump manager v Product activity log v Operator panel functions v Performance data collector v (available only on systems with SST)IPL type, mode, and speed options Set the IPL options from function 02 on the control panel. To permanently change the IPL speed use the CHGIPLA command. Note: Systems managed by the Hardware Management Console (HMC) should use the HMC to perform control panel functions. v Use function 01 to display the current IPL option, including IPL speed. This is the option and speed that the system will use at the next IPL. 642 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v Use the HDWDIAG (hardware diagnostics) parameter of the CHGIPLA command to permanently change the IPL speed. The values are: The default shipped value *MIN (minimal) for fast diagnostics and *ALL for all (full) diagnostics.IPL speed recommendations v Use CHGIPLA *MIN, or Function 02 F or V=F for most IPLs (to perform a fast IPL that uses minimum diagnostics). v Use CHGIPLA *ALL, or Function 02 S or V=S, if you change hardware or suspect an intermittent hardware failure (to perform a slow IPL that uses all diagnostics). v Use CHGIPLA *ALL, or Function 02 S or V=S during the first Install IPL. You can set this at the panel with Function 02; it affects only the one IPL because the next IPL will again pick up the value set with CHGIPLA.Printing the System Configuration List The System Configuration List printout consists of the following sections (80-character width): 1. System information v System type, model, serial number, and release2. Packaging hardware resource information v Resource name, type-model, serial number, part number, frame ID, card position, and device position3. Logical hardware resource information v Resource name, type-model, serial number, and logical address4. Legend v Descriptions of the indicators v Logical address format information 132-character width printouts consist of the following: v description v type-module v serial number v location data – frame ID – device position – card positionv logical resource name v part number v logical address You can sort the printout by location or logical address data.Print the system configuration list To print the system configuration list, perform the following procedure: 1. On the Start a Service Tool display, select the Hardware service manager option. 2. Select the Print (F6) function key on the Hardware Service Manager display to request a printout. 3. The Print Format Options display appears. You can sort the information by location or logical address.Note: The logical address sort and location sort are available only in 132–character width. 4. Look for the system configuration list printout. Notes: Chapter 7. Common Service Procedures 643
a. If you select the Print function key from a display other than the Hardware Service Manager display, only the information that appears on the display will print. b. Use the last page of the printout (the legend) to assist in interpreting the information. c. The logical and packaging hardware resources have unique resource names. d. You can also view logical address information online from the product activity log.This ends the procedure. To collect more details on the information that is listed on the system configuration list, select the Print function key from a display other than the Hardware Service Manager display. Note: The entire system configuration list will not print. Only the information that appears on the display will print. Details on the system bus, main storage, or processor For details (that include descriptions) on the system bus, main storage, or processor, perform the following procedure: 1. Select the Logical hardware resources option on the Hardware Service Manager display. 2. On the Logical Hardware Resources display, select one of the following options to display details: v System bus resources v Processor resources v Main storage resources v High-speed link resources3. Use the Print function key to print the information. This ends the procedure.Hardware Service Manager You can select the Hardware service manager option from the Start a Service Tool display. It allows you to display and work with system hardware (logical or packaging) and debug input-output processors (IOP) and devices. 644 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: The system power control network (SPCN) option appears only on systems with SPCN. The Hardware Service Manager display provides the following information: v System type, model, and serial number v Licensed Internal Code and operating system release information v A list of the hardware service manager options The options available vary depending on the system type and paging environment. v Function keys The function keys vary depending on the display that is shown.Managing the Advanced System Management Interface (ASMI) Understand how to manage the Advanced System Management Interface (ASMI). The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. Use the following information to help you manage the ASMI: v “Accessing the ASMI using a Web browser” on page 646 v “Accessing the ASMI using an ASCII terminal” on page 648 v “Accessing the ASMI using the HMC” on page 648 “Accessing the ASMI” on page 646You can access the ASMI through a Web browser or an ASCII console. “Displaying error and event logs” on page 649Display a list of all of the error and event logs in the service processor. “Setting the system enclosure type” on page 650Set values that uniquely identify the type of enclosures attached to the system. “Setting the system identifiers” on page 650Set the system-unique ID, system serial number, machine type, and machine model. Hardware Service Manager Attention: This utility is provided for service representative use only. System unit . . . . . . . : 9406-570 10-0020A Release . . . . . . . . . : V5R3M0 Select one of the following: 1. Packaging hardware resources (systems, frames, cards,...) 2. Logical hardware resources (buses, IOPs, controllers,...) 3. Locate resource by resource name 4. Failed and non-reporting hardware resources 5. System power control network (SPCN) 6. Work with service action log 7. Display label location work sheet 8. Device Concurrent Maintenance 9. Work with resources containing cache battery packs Bottom Selection F3=Exit F6=Print configuration F9=Display card gap information F10=Display resources requiring attention F12=Cancel Figure 66. Example Hardware Service Manager display (in the full and limited paging environments)
Chapter 7. Common Service Procedures 645
“Viewing vital product data” on page 651View selected or all manufacturer’s VPD, such as serial numbers and part numbers. “Clearing all deconfiguration errors” on page 651Clear error records for specific or all resources in the system.Accessing the ASMI You can access the ASMI through a Web browser or an ASCII console. “Accessing the ASMI using a Web browser”Connect a PC or mobile computer to the server to access the ASMI. The Web interface to the ASMI is available during all phases of system operation including the initial program load (IPL) and run time. “Accessing the ASMI using an ASCII terminal” on page 648Connect the ASCII console to the server to access the ASMI. “Accessing the ASMI using the HMC” on page 648Access the ASMI through the HMC interface.Accessing the ASMI using a Web browser: Connect a PC or mobile computer to the server to access the ASMI. The Web interface to the ASMI is available during all phases of system operation including the initial program load (IPL) and run time. The Web interface to the Advanced System Management Interface (ASMI) is accessible through Microsoft® Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 running on a PC or mobile computer connected to the service processor. The Web interface is available during all phases of system operation including the initial program load (IPL) and run time. However, some of the menu options in the Web interface are unavailable during IPL or run time to prevent usage or ownership conflicts if the system resources are in use during that phase. Note: To perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power, you must set up the ASMI. The following instructions apply to systems that are not connected to an HMC. If you are managing the server using an HMC, “Accessing the ASMI using the HMC” on page 648. To set up the Web browser for direct or remote access to the ASMI, complete the following tasks: 1. Connect the power cord from the server to a power source, and wait for the control panel to display 01. 2. Select a PC or mobile computer that has Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 to connect to your server. If you do not plan to connect your server to your network, this PC or mobile computer will be your ASMI console. If you plan to connect your server to your network, this PC or mobile computer will be temporarily connected directly to the server for setup purposes only. After setup, you can use any PC or mobile computer on your network that is running Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 as your ASMI console. 3. Connect an Ethernet cable from the PC or mobile computer to the Ethernet port labeled HMC1 on the back of the managed system. If HMC1 is occupied, connect an Ethernet cable from the PC or mobile computer to the Ethernet port labeled HMC2 on the back of the managed system. 4. Configure the Ethernet interface on the PC or mobile computer to an IP address and subnet mask within the same subnet as the server so that your PC or mobile computer can communicate with the server. Use the following table to help you determine these values: 646 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 67. PC or mobile computer network configuration information
Server connector
Subnet mask
IP address
HMC1
255.255.255.0
169.254.2.147
HMC2
255.255.255.0
169.254.3.147 For example, if you connected your PC or mobile computer to HMC1, the IP address for your PC or mobile computer could be 169.254.2.1 and the subnet mask would be 255.255.255.0. Set the gateway IP address to the same IP address as the PC or mobile computer. 5. Use Table 67 to determine the IP address of the Ethernet port that your PC or mobile computer is connected to, and type the IP address in the Address field of your PC’s or mobile computer’s Web browser. For example, if you connected your PC or mobile computer to HMC1, type https://169.254.2.147 in your PC’s or mobile computer’s Web browser. 6. When the login display appears, enter one of the following default user IDs and passwords:
Table 68. Default user IDs and passwords for the Web interface
User ID
Password
general
general
admin
admin 7. Change the default password when prompted. 8. If you plan to connect your PC or mobile computer to your network, complete the following steps: a. From the navigation area, expand Network Services. b. Click Network Configuration. c. If your PC or mobile computer is connected to HMC1, fill in the section labeled Network interface eth0. If your PC or mobile computer is connected to HMC2, fill in the section labeled Network interface eth1. Ensure that the following fields are filled in correctly.
Table 69. Fields and values for network configuration
Field
Value
Configure this interface?
Selected
Type of IP address
Static
IP address
This is the same address that you used in step 5.
Subnet mask
255.255.255.0 d. Click Save Settings. e. Disconnect your PC or mobile computer from the server. f. Connect the server to your company network. g. Go to any PC or mobile computer on your company network that is running Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23, and type the following in the Address field of your browser:
https://host name.domain name where host name and domain name are the host name and domain name you entered when completing step 8c. h. Log in to the ASMI.9. Set the time of day on the server.
Chapter 7. Common Service Procedures 647
Accessing the ASMI using an ASCII terminal: Connect the ASCII console to the server to access the ASMI. The ASCII interface to the ASMI provides a subset of the Web interface functions. The ASCII console is available only when the system is in platform standby state. It is not available during the initial program load (IPL) or run time. The ASMI on an ASCII console is not available during some phases of system operation, such as the initial program load (IPL) and run time. To set up the ASCII console for direct or remote access to the ASMI, complete the following tasks: 1. Complete the following steps to cable an ASCII console or equivalent to the server: Using a cable equipped with a null modem, connect the ASCII console to system connector S1 1 on the back of the server or to system port FS1 on the control panel using an RJ-45 connector. Note: Both system port 1 connections are not available simultaneously; when one is connected, the other is deactivated. 2. Connect the power cord from the server to a power source. 3. Wait for the control panel to display 01. 4. Ensure that your ASCII console is set to the following general attributes. These attributes are the default settings for the diagnostic programs. Be sure your terminal is set to these attributes before proceeding to the next step.
General setup attributes
3151 /11/31/41 settings
3151 /51/61 settings
3161 /64 settings
Description
Line speed
19,200
19,200
19,200
Uses the 19,200 (bits per second) line speed to communicate with the system unit.
Word length (bits)
8
8
8
Selects eight bits as a data word length (byte).
Parity
No
No
No
Does not add a parity bit and is used together with the word length attribute to form the 8–bit data word (byte).
Stop bit
1
1
1
Places a bit after a data word (byte). 5. Press a key on the ASCII console to allow the service processor to confirm the presence of the ASCII console. 6. When the login display appears, enter one of the following default user IDs and passwords:
Table 70. Default user IDs and passwords for the Web interface
User ID
Password
general
general
admin
admin 7. Change the default password when you are prompted. 8. Set the time of day on the server.Accessing the ASMI using the HMC: Access the ASMI through the HMC interface. To access the Advanced System Management Interface (ASMI) using the Hardware Management Console (HMC), complete the following steps: 648 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. Ensure that the HMC is set up and configured. 2. In the navigation area, ensure that the server you are working with is selected. 3. Expand and click Operations. 4. In the content area, click Advanced System Management (ASMI). 5. Log on to the ASMI using the administrator userid and password.Displaying error and event logs Display a list of all of the error and event logs in the service processor. You can view error and event logs that are generated by various service processor firmware components. The content of these logs can be useful in solving hardware or server firmware problems. To perform this operation, your authority level must be one of the following: v General v Administrator v Authorized service providerInformational and error logs can be viewed by all authority levels. Hidden error logs can be viewed by authorized service providers. The following table shows error log types that might be displayed, the conditions that make an error log specific to that error log type, and the user authority level that will allow you to view specific types of error logs:
Error log type
Conditions
User availability
Severity
Action
Informational logs
Informational
Report to operating system (OS) but not hidden
Available to all users
Error logs
Not informational
Report to OS but not hidden
Available to all users
Hidden logs
Not informational and informational
Report to OS, hidden, or both
Available only to the authorized service provider and users with higher authority. To view error and event logs in summary or full detailed format, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Service Aids and click Error/Event Logs. If log entries exist, a list of error and event log entries is displayed in a summary view. 3. To view the full detail format of any of the logs listed, select the log’s corresponding check box and click Show details. When multiple logs are selected, any action applies to each selected log. The full detail information might span several pages. The contents and layout of the full detail output is defined by the event or error logging component. 4. Click Mark as reported to mark platform error entries whose underlying causes have been resolved. By doing so, these entries are not reported to the operating system again when the system reboots. After they are marked, these errors can be overwritten by other errors logged in the service processor history log. Note: The Mark as reported button is available only when your authority level is an authorized service provider.
Chapter 7. Common Service Procedures 649
Setting the system enclosure type Set values that uniquely identify the type of enclosures attached to the system. When setting the system enclosure type, ensure that the enclosure serial number field matches the original value, which can be found on a label affixed to the unit. Updating the enclosure serial field keeps the configuration and error information synchronized, and this information is used by the system when creating the location codes. This task must be done using the ASMI, not with the control panel. However, if you do not have access to the ASMI, the system will still operate without updating this information. For example, when replacing the I/O backplane, you must re-enter the original enclosure serial number into the enclosure serial number field to overwrite the serial number that is recorded for the new I/O backplane. Failure to enter the correct enclosure serial number will result in logical partition mappings being incorrect. Note: You cannot boot the system until valid values are entered for all fields in the enclosure-type information. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo change the system enclosure type, complete the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Program Vital Product Data. 3. Select System Enclosures. In the right pane, the current system enclosures are displayed. 4. Enter the desired settings for the Enclosure location, Feature Code/Sequence Number, and Enclosure serial number fields using the naming convention described in the ASMI help. The value of the Enclosure serial number field is different than the serial number of the system. Set the Reserved field to blank spaces unless directed otherwise by Level 3 technical support. 5. Click Save settings to update the system enclosure type information and save it to the VPD.Setting the system identifiers Set the system-unique ID, system serial number, machine type, and machine model. You can set the system-unique ID, serial number, machine type, and machine model. If you do not know the system-unique ID, contact your next level of support. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerNote: You cannot boot the system until valid values are entered for all fields. You can change these entries only once. To set the system keywords, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Program Vital Product Data. 3. Select System Keywords. 4. In the right pane, enter the desired values for the system serial number, machine type, and machine model using the naming convention shown in the ASMI help. Set the Reserved field to blank spaces unless directed otherwise by service and support. 5. Click Continue. The data validation panel is displayed, which includes your entered settings. 650 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
6. Click Save settings to update the system keywords and save them to the VPD.Viewing vital product data View selected or all manufacturer’s VPD, such as serial numbers and part numbers. You can view manufacturer’s vital product data (VPD) stored from the system boot prior to the one in progress now. To perform this operation, your authority level must be one of the following: v General v Administrator v Authorized service providerTo view the VPD, complete the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Information and select Vital Product Data. 3. A list of field replaceable units (FRUs) that exist on the system and their descriptions are displayed. Select a single FRU or multiple FRUs from this list that you would like to view. 4. Click Display Details to display the details for selected FRUs, or click Display all details to display details for all VPD entries.Clearing all deconfiguration errors Clear error records for specific or all resources in the system. The ASMI allows you to clear error records for all or individual system hardware resources that include the processor, memory, L2 and L3 cache, I/O hubs, service processor card, and clock card. To clear all deconfiguration errors, your authority level must be an authorized service provider. Note: v Before performing this operation, record error messages or ensure that the error record data is no longer needed; otherwise, you will lose all error data from the hardware resources. v Upon completion of this procedure, the system must be powered off, and in standby mode, for the deconfiguration errors to be cleared.To clear all deconfiguration errors, do the following: 1. If the server is not powered off and in standby state, ask the customer to power off the server. When the server is in standby state (the power on LED on the control panel is blinking slowly), continue with the next step. 2. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 3. In the navigation area, expand System Configuration and Hardware Deconfiguration. 4. Select Clear All Deconfiguration Errors. 5. In the right pane, select the desired hardware resource from the menu. You can select All hardware resources or an individual resource. 6. Click Clear errors for selected hardware resource. 7. Power on the server after clearing the deconfiguration errors. This will cause the FSP to retest the hardware as it powers on. Chapter 7. Common Service Procedures 651
Service functions Service functions provides enough detail for the service provider to gather information about hardware problems while under the direction of the next level of support. This information provides basic information about some of the service functions that are commonly used by authorized service providers. Before using this information, you should know how to service this particular hardware model. You must also know how to safely work with electrical components. Attention: Misuse of service tools or using them with inappropriate security safeguards might result in loss of data, programs, or other objects on the system. 652 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Managing dumps After you have performed a dump, or one occurs automatically, use this information to manage the dump, including copying, reporting, and deleting it. Dumps are deleted automatically when a new dump occurs, except for log dumps. “Performing dumps”Learn more about the different types of dumps that can be performed and how to manage them, including copying, reporting, and deleting them. “Copying a dump” on page 656Use this procedure to copy a current platform dump or current main storage dump when the Main Storage Dump auto copy option is disabled. Copying the current dump allows another dump of the same type to be created. “Reporting a dump” on page 659Find out how to report a dump using the HMC, i5/OS service tools, AIX command line, or Linux command line. “Deleting a dump” on page 661Use this procedure to delete a dump when dump copies are no longer needed by your next level of support or your hardware service provider.Performing dumps Learn more about the different types of dumps that can be performed and how to manage them, including copying, reporting, and deleting them. Dumps can occur automatically after a failure or external reset, or after a manual request. The dump information provides valuable information on system status and should be reported to your next level of support or your hardware service provider. v Platform system dump: A platform system dump is a collection of data from server hardware and firmware, either after a system failure or a manual request. v Service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request.In addition, there are other dumps that generate automatically. These include: v Platform event log entry dumps (which are generated by server firmware when a platform event log entry is truncated) v Resource dumps v SMP dumps “Performing a platform system or service processor dump”A platform system dump or service processor dump is performed when your next level of support or your service provider suspects a wait or loop in the server firmware. “Performing a service processor dump” on page 655A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request.Performing a platform system or service processor dump A platform system dump or service processor dump is performed when your next level of support or your service provider suspects a wait or loop in the server firmware. Only perform a platform system or service processor dump under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Chapter 7. Common Service Procedures 653
Note: Before performing a platform system dump, collect any reference code information from the control panel or the HMC. “Using the HMC to perform a system dump”If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the HMC to perform a system dump, which is a collection of data from the server hardware and firmware. “Using the control panel to perform a system dump”If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. If you do not have an HMC, you can use the control panel to perform a system dump, which is a collection of data from the server hardware and firmware. “Using the ASMI to perform a system dump” on page 655If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the ASMI to perform a system dump, which is a collection of data from the server hardware and firmware.Using the HMC to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the HMC to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Follow these steps to initiate a system dump. 1. Perform a controlled shutdown of all partitions. Note: A system dump with abnormally terminate any running partitions. 2. In the navigation area, open Systems Management. 3. Select the server and open it. 4. Select Serviceability. 5. Select Manage Dumps. 6. Select Action > Initiate System Dump. The dump is automatically saved to the HMC. 7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. If you do not have an HMC, you can use the control panel to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. If you do not have an HMC, follow these steps to perform a system dump: 1. Perform a controlled shutdown of the operating system. Note: A platform system dump with abnormally terminate any running partitions. 2. On the control panel, select Manual mode. 3. Use the increment or decrement buttons to select function 42 ( system dump). 4. Press Enter on the control panel. 654 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
5. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx3042 is displayed, which indicates that function 42 has been selected.6. Perform the following steps: a. Reselect function 42 and press Enter on the control panel. b. Switch back to Normal mode and wait for the dump to complete. c. Wait for the system to IPL again.7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the ASMI to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the ASMI to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. To perform a system dump, see Initiating a system dump. Performing a service processor dump A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. Only perform a service processor dump under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. “Using the HMC to perform a service processor dump”A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the HMC to perform a service processor dump. “Using the control panel to perform a service processor dump” on page 656A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. If you do not have an HMC, you can use the control panel to perform a service processor dump. “Using the ASMI to perform a service processor dump” on page 656A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the ASMI to perform a service processor dump.Using the HMC to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the HMC to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Follow these steps to initiate a service processor dump. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. Chapter 7. Common Service Procedures 655
5. Select Action > Initiate System Dump. The dump is automatically saved to the HMC. 6. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. If you do not have an HMC, you can use the control panel to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. If you do not have an HMC, follow these steps to perform a service processor dump: 1. On the control panel, select Manual mode. 2. Use the increment or decrement buttons to select function 43 (service processor dump). 3. Press Enter on the control panel. 4. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx 3043 is displayed, which indicates that function 43 has been selected.5. Perform the following steps: a. Reselect function 43 and press Enter on the control panel. b. Switch back to Normal mode, and wait for the dump to complete.6. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the ASMI to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the ASMI to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. To perform a service processor dump, see Initiating a service processor dump. Copying a dump Use this procedure to copy a current platform dump or current main storage dump when the Main Storage Dump auto copy option is disabled. Copying the current dump allows another dump of the same type to be created. Note: If you have an HMC, you must manage your platform dumps with the HMC. “Using an HMC to copy a dump” on page 657Learn how to copy a dump using an HMC. “Using the AIX command line to copy a dump” on page 657Learn how to use the AIX command line to copy a dump. “Using the Linux command line to copy a dump” on page 657Learn how to use the Linux command line to copy a dump. “Using the IBM i5/OS service tools to copy a dump” on page 658Learn how to use the i5/OS service tools to copy a dump.
656 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Using an HMC to copy a dump Learn how to copy a dump using an HMC. Note: If you have an HMC, you must manage your platform dumps with the HMC. However, you will need to manage i5/OS main storage dumps with the IBM i5/OS service tools. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Copy Dump. 6. Select the dump to copy. 7. Verify that HMC containing the dump is the HMC being used to copythe dump. 8. To copy the dump to DVD media, select Selected > Copy Dump File to DVD Media. 9. Load the media and follow the instructions on the display. 10. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider. 11. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure.Using the AIX command line to copy a dump Learn how to use the AIX command line to copy a dump. Copy the dump only if you are directed to do so by your next level of support or service provider. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump.” Platform dumps are saved in AIX under the following directory: /var/adm/platform Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs This command will create the /var/adm/ras/platform file system. This is done automatically during installation if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to copy a dump Learn how to use the Linux command line to copy a dump. Copy the dump only if you are directed to do so by your next level of support or service provider. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump.” Platform dumps are saved in Linux under the following directory: /var/log/dump/ Chapter 7. Common Service Procedures 657
Using the IBM i5/OS service tools to copy a dump Learn how to use the i5/OS service tools to copy a dump. Choose from the following options: v To copy a platform system or SP™ dump, go to “Copying a platform system or SP dump.” v To copy a main storage dump, go to “Copying a main storage dump.”Copying a platform system or SP dump: Use this procedure to copy a platform system or SP dump with the i5/OS tools. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump” on page 657. 1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). Did option 7 (Work with current platform dump) appear on the Main Storage Dump Manager display? v Yes: There is a current platform dump. Continue with the next step. v No: There is no current platform dump. A dump might have been automatically copied to the Auxiliary Storage Pool (ASP). Go to “Reporting a dump” on page 659. This ends the procedure. 5. Select option 7 (Work with current platform dump). 6. Select option 1 (Display summary). The Platform Dump Summary display shows the platform dump type, date and time of the dump, and reference code. 7. Record the summary information and report it to your next level of support or your hardware service provider. 8. Press F12 (Cancel). 9. Select option 3 (Copy to ASP). The platform dump will be copied to disk storage. After the dump is copied, a message will be displayed indicating whether the copy completed. 10. Did a message indicate ?Copy completed normally?? v Yes: There no longer is an option 7 (Work with current platform dump) listed. This ends the procedure. v No: Continue with the next step.11. Has your next level of support or your hardware service provider requested a media copy of the platform dump? v Yes: Continue to the next step. v No: Work with your next level of support or your hardware service provider on the problem. This ends the procedure.12. If the copy to disk failed, copy a current platform dump to removable media by performing the following steps: a. Select option 2 (Copy to media). b. Load the media and follow the instructions on the display. c. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider.13. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure.Copying a main storage dump: Use this procedure to copy an i5/OS main storage dump using i5/OS tools. 1. On any command line, enter STRSST. 658 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. From the Main Storage Dump Manager display, press Enter. Note: Copying a main storage dump (MSD) to disk storage on the system prevents the MSD from being overwritten in the event that another MSD occurs. 6. Select option 1 (Display/Print). 7. Select option 1 (MSD summary). The Main Storage Dump Summary display shows the system reference code, date and time of the MSD, and Licensed Internal Code (LIC) level. 8. Record the summary information and report it to your next level of support or your hardware service provider. 9. Press F12 (Cancel) twice to return to the Work with Current Main Storage Dump display. 10. Select option 3 (Copy to ASP). 11. Type a dump description and then press Enter to start copying the dump. After the dump is copied, a message is displayed indicating whether the MSD copy completed. 12. Did a message indicate ?Copy completed normally?? v Yes: This ends the procedure. v No: Continue with the next step.13. Has your service provider requested a media copy of the MSD? v Yes: Continue to the next step. v No: Work with your next level of support or your hardware service provider on the problem. This ends the procedure.14. To copy a current MSD to a removable media, perform the following steps: a. Select option 2 (Copy to media). b. Load the media and follow the instruction on the display. c. When the copy procedure completes successfully, remove the media and process it according to your service provider’s instructions. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure. Reporting a dump Find out how to report a dump using the HMC, i5/OS service tools, AIX command line, or Linux command line. “Using an HMC to report a dump”Learn how to report a dump using an HMC. “Using the AIX command line to report a dump” on page 660Learn how to report a dump using the AIX command line. “Using the Linux command line to report a dump” on page 660Learn how to report a dump using the Linux command line. “Using the i5/OS service tools to report a main storage or platform system or SP dump” on page 660Learn how to report a dump using the i5/OS service tools.Using an HMC to report a dump Learn how to report a dump using an HMC. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Call Home Dump. Chapter 7. Common Service Procedures 659
6. Verify that HMC containing dump is the HMC being used to report the dump. 7. Select Selected ? Call Home Dump. The dump summary display shows the date, time and size of the dump. Report the summary information to your next level of support or your hardware service provider. 8. Has your next level of support or your hardware service provider requested a copy of the dump? v Yes: Continue to the next step. v No: Contact your next level of support or your hardware service provider on the problem. 9. To copy a dump to removable media, perform the following steps: a. Select Copy Dump File to DVD Media. b. Load the media and follow the instructions on the display. c. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider.10. To copy a dump to another system, perform the following steps: a. Select Selected ? Copy Dump File to Other System. b. Follow the instructions on the display. c. When the copy procedure is complete, contact your next level of support or your hardware service provider.Using the AIX command line to report a dump Learn how to report a dump using the AIX command line. Report the dump only if you are directed to do so by your next level of support or service provider. Platform dumps are saved in AIX under the following directory: /var/adm/platform Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs This command will create the /var/adm/ras/platform file system. This is done automatically during installation if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to report a dump Learn how to report a dump using the Linux command line. Platform dumps are saved in AIX under the following directory: /var/log/dump/ Report the dump only if you are directed to do so by your next level of support or service provider. Using the i5/OS service tools to report a main storage or platform system or SP dump Learn how to report a dump using the i5/OS service tools. If your system has the Main Storage Dump enabled for auto copy, your system might have automatically copied the current MSD to the Auxiliary Storage Pool (ASP) using the dump description Auto Copy. Then it might have performed another IPL of the system, or automatically copied a current platform system or SP dumps to ASP using the dump type as part of the dump description. 660 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. Select option 2 (Work with copies of main storage dumps). 6. Find a new dump near the bottom of the list, type 5 (Display/Print) next to it, and press Enter. 7. Did the Display Main Storage Dump display appear? v Yes: This is a main storage dump. Select option 1 (MSD Summary). Continue with the next step. v No: This is a platform dump. Continue with the next step. 8. Record the summary information so that you can report it to your next level of support or your hardware service provider. The Dump Summary display shows the date and time of the dump, system reference code (SRC) data, and Licensed Internal Code (LIC) level. 9. Press F3 (Exit). 10. If the dump has a description of Auto Copy, rename it so that another auto copy and IPL can occur if necessary. a. Select option 7 (Rename). b. Type a new dump description, and press Enter.11. Has your next level of support or your hardware service provider requested a media copy of the dump? v Yes: Continue with the next step. v No: Contact your next level of support or your hardware service provider on the problem. Continue with “Deleting a dump.”12. To copy an MSD or a platform dump to removable media, perform the following steps: a. Select option 8 (Copy to media). b. Load the media and follow the instructions on the display. c. When the copy procedure completes successfully, remove the media and process it according to the instructions from your next level of support or your hardware service provider. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. 13. Continue with “Deleting a dump.”Deleting a dump Use this procedure to delete a dump when dump copies are no longer needed by your next level of support or your hardware service provider. “Using an HMC to delete a dump”Learn how to use an HMC to delete a dump. “Using the AIX command line to delete a dump” on page 662Learn how to delete a dump using the AIX command line. “Using the Linux command line to delete a dump” on page 662Learn how to delete a dump using the Linux command line. “Using the i5/OS service tools to delete a dump” on page 662Learn how to delete a dump using the i5/OS service tools.Using an HMC to delete a dump Learn how to use an HMC to delete a dump. Dumps are stored in the HMC’s Manage Sirviceable Events tool and are automatically deleted when room is required for new dumps. Use this procedure when you want to manually delete a dump. 1. In the navigation area, open Systems Management. Chapter 7. Common Service Procedures 661
2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Delete Dump. 6. Select the dump to delete. 7. Verify that HMC containing the dump is the HMC being used to delete the dump. 8. Select Selected ? Delete Dump.Using the AIX command line to delete a dump Learn how to delete a dump using the AIX command line. Platform dumps are saved in AIX under the following directory: /var/adm/platform Only delete the dump if directed to do so by your next level of support or service provider. Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs to create the /var/adm/ras/platform file system. During an installation, this directory is created automatically if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to delete a dump Learn how to delete a dump using the Linux command line. Platform dumps are saved in AIX under the following directory: /var/log/dump/ Only delete the dump if directed to do so by your next level of support or service provider. Using the i5/OS service tools to delete a dump Learn how to delete a dump using the i5/OS service tools. 1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. Select option 2 (Work with copies of main storage dumps). 6. If you want to delete any dump copies, type 4 next to the dump copies, and press Enter twice. 7. To exit SST, press F3 (Exit) three times, and then press Enter. 662 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Performing an IBM i5/OS main storage dump A main storage dump is a collection of data from the i5/OS’s main storage. It can be manually initiated when the system administrator suspects a system wait or loop, or automatically by the server firmware as the result of a system failure. Note: Before performing a main storage dump, collect any reference code information from the control panel or the HMC. “Using the HMC to perform an IBM i5/OS main storage dump”Learn how to use the HMC to perform an IBM i5/OS main storage dump. “Using the control panel to perform an IBM i5/OS main storage dump”Learn how to use the control panel to perform an IBM i5/OS main storage dump.Using the HMC to perform an IBM i5/OS main storage dump Learn how to use the HMC to perform an IBM i5/OS main storage dump. Note: Before performing a or main storage dump, collect any reference code information from the control panel or the HMC. Follow these steps to perform a main storage dump from an HMC:Note: If a main storage dump occurred automatically, go to “Managing dumps” on page 653 for information on how to copy, report, or delete a dump. 1. In the navigation area, open Server and Partition. 2. Select Server Management. 3. In the contents area, expand Partitions. 4. Right-click the logical partition and select Restart Partition. 5. Select Dump. 6. Click OK. 7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform an IBM i5/OS main storage dump Learn how to use the control panel to perform an IBM i5/OS main storage dump. Note: Before performing a or main storage dump, collect any reference code information from the control panel or the HMC. If you do not have an HMC, follow these steps to perform a platform system dump: 1. Verify that no interactive jobs are running. 2. On the control panel, select Manual mode. 3. Use the increment or decrement buttons to select function 22 (main storage dump). 4. Press Enter on the control panel. 5. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx 3022 is displayed, which indicates that function 22 has been selected. Continue with the next step.6. Perform the following steps on the control panel: a. Reselect function 22. Chapter 7. Common Service Procedures 663
b. Press Enter. c. Switch back to Normal mode and wait for the dump to complete. If function 22 was selected, the appearance of an A2D0 3000 on the console indicates that a successful user-initiated main storage dump was performed. (A B6xx xxxx reference code indicates that a successful i5/OS initiated main storage dump was performed.7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653. 664 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Performing a slow boot Learn how to perform a slow boot using the HMC, control panel, or ASMI menus depending on your configuration. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. “Using the HMC to perform a slow boot”Learn how to perform a slow boot using the HMC. “Using the control panel to perform a slow boot”Learn how to use the control panel to perform a slow boot. “Using the ASMI menus to perform a slow boot” on page 666Learn how to use the ASMI menus to perform a slow boot.Using the HMC to perform a slow boot Learn how to perform a slow boot using the HMC. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel or HMC. 2. Shut down all logical partitions and the server. 3. In the navigation area, open Systems Management. 4. Open Servers. 5. Select the server you want to perform a slow boot on. 6. In the Tasks area, select Properties. 7. Click the Power-On Parameters tab. 8. Record the current boot setting in the Current column next to the Power-on speed override box. 9. Select slow from the list in the Power-on speed override box in the Next column. Note: These settings will apply to all future boots. After you complete the service action, change the settings back to what you recorded in step 8. 10. Click OK. 11. After you are done, return to the step in the procedure that caused you to perform a slow boot. This ends the procedure.Using the control panel to perform a slow boot Learn how to use the control panel to perform a slow boot. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel. 2. Stop the operating system by performing the following steps: Chapter 7. Common Service Procedures 665
a. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down System) command. b. At the command line, enter one of the following commands: Note: These commands stop the operating system. The system power turns off, the power-on light begins to slowly blink, and the system goes into a standby state. v If your system is running AIX, type shutdown v If your system is running IBM i5/OS, type pwrdwnsys *immed v If your system is running Linux, type shutdown -h now 3. On the control panel, use the increment or decrement buttons to scroll to function 02. 4. Press Enter to start function 02. The current IPL type (with pointer) and speed is displayed. 5. Use the increment or decrement buttons to scroll through the IPL types and speeds. 6. When manual is shown, press Enter to select the IPL type of manual. The current IPL type and speed (with pointer) is displayed. 7. Use the increment or decrement buttons to scroll through the IPL speeds. 8. When S is shown, press Enter to select the IPL speed of slow. 9. To exit function 02, press Enter multiple times until just 02 is displayed. 10. Power on the server by pressing the white power button. Then return to the procedure that caused you to perform a slow boot. This ends the procedure.Using the ASMI menus to perform a slow boot Learn how to use the ASMI menus to perform a slow boot. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel or HMC. 2. Shut down all logical partitions and the server. 3. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 4. Expand Power/Restart Control and select Power On/Off System. 5. Record the settings so that you can re-enter them after you complete the service actions. 6. On the Power on/off system menu, select the following: v Select Slow in the Set System Boot Speed field. v Select Running in the Boot to system server firmware field.7. Select Save Settings. Note: These settings will apply to all future boots. After you complete the service action, change the settings back to what you recorded in step 5. 8. After you are done, return to the step in the procedure that caused you to perform a slow boot. This ends the procedure. 666 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Determining the primary or alternate console Use this procedure to determine the primary or alternate console. If the HMC has been configured as the primary console, the system will start the HMC. If the Operations Console has been configured as the primary console, the system will start the Operations Console. If the Operations Console has not been configured, the primary console is a workstation that is attached to the first input/output processor (IOP) that is capable of supporting workstations. In addition to the primary console, the system can assign up to two alternative consoles. The first alternative console can only be a twinax workstation that is attached to the same IOP as the primary console. The second alternative console is a workstation that is attached to the next IOP or input/output adapter (IOA) that is capable of supporting workstations. The IOP that supports the console must be on the first system bus (bus 1). If a workstation is not correctly attached to the first IOP that is capable of attaching workstations, then the system will not assign a primary console. The system will display a reference code on the operator’s panel. In addition, if the IPL (initial program load) mode is set to Manual, the system will stop. “Primary console workstation requirements”Learn about the primary console workstation requirements. “Find the primary console when the system is operational” on page 668Learn how to find the primary console when the system is operational. “Find the primary console when the system power is off” on page 668Learn how to find the primary console when the system power is off.Primary console workstation requirements Learn about the primary console workstation requirements. In order to be the primary console, the workstation must be operational and have the correct port and address. If the workstation is a PC, it must also have an active emulation program on the workstation. The workstation requirements are: v HMC v twinax workstation - Port 0 Address 0 v ASCII workstation - Port 0 v PC attached to ASCII IOP or IOA – Port 0 – PC software to emulate a 316x or 3151 terminalv PC attached to twinax IOP – Port 0 Address 0 – 5250 emulator software active on PCv PC attached to a LocalTalk IOA (6054) – SNAps 5250 Version 1.2 (or above) application – Console capable selected on MacIntosh (IOA converts to Port 0 Address 0)v PC attached to a 2609, 2612, 2699, or 2721 communications IOA – Client Access Console cable attached to the 2609 or 2612 P2 port (part number 46G0450 or 46G0479), 2699 (part number 21H3779), or 2721 (part number 44H7504) Chapter 7. Common Service Procedures 667
– Operations Console cable attached to the 2609 or 2612 (part number 97H7555), 2699 (part number 97H7556), or 2721 (part number 97H7557) - 5250 emulation or Rumba active on PCFind the primary console when the system is operational Learn how to find the primary console when the system is operational. The following methods are available to find the primary console: v Method 1: Look for a sign-on display with a DSP01 in the upper-right corner. v Method 2: If the device name (DSP01) for the console has been changed, you can verify the device name for the primary console by doing the following: 1. Enter DSPCTLD QCTL on any command line. The Display Controller Description display is displayed. 2. Find the Resource name parameter (such as CTL01) and record it. 3. Enter PRTDEVADR rrrrr on any command line, where rrrr is the resource name you recorded.Note: If the printer is active, the data will print. v Method 3: 1. Enter STRSST on any command line. The System Service Tools display is displayed. 2. Select option 1 (Start a service tool). 3. Select option 7 (Hardware service manager). 4. Select option 2 (Logical hardware resources). 5. Select option 1 (System bus resources). On the Logical Hardware Resources on System Bus display, the ? symbol indicates the IOP that the console is attached to. 6. Use option 9 (Resource associate with IOP and display detail) to find the location of the system bus, board, and card.Find the primary console when the system power is off Learn how to find the primary console when the system power is off. Use one of the following: v Turn on the system in Manual mode and look for the IPL and Install System display. v Turn on the system in Normal mode and look for DSP01 on the sign on display. Note that the name might have been changed. See “Find the primary console when the system is operational” to determine the display name. 668 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Resetting the service processor Use this procedure to reset the service processor. Note: Use this procedure only under the direction of your next level of support or your hardware service provider. During this procedure you will be powering off the system and removing and reapplying ac power. 1. Inform anyone using the server that it will be shut down. 2. Stop all applications that are running on the server. 3. Stop the operating system and server by choosing from the following options: v If you have an HMC, shut down all logical partitions and the server. For instructions see “Powering on and powering off” on page 588. Then continue with step 6. v If you do not have an HMC, continue with the next step.4. Log in to the system as a user with the authority to run the appropriate shutdown command. 5. At the command line, enter one of the following commands: v If your system is running AIX, type shutdown v If your system is running i5/OS, type PWRDWNSYS *IMMED v If your system is running Linux, type shutdown -h nowThese commands stop the operating system. The system power turns off, the power-on light begins to slowly blink, and the system goes into a standby state. 6. Unplug the ac power cord and replug it back in. 7. Start the server. The service processor is reset. This ends the procedure. Chapter 7. Common Service Procedures 669
Resetting the managed system connection from the HMC Use this procedure to reset the managed system connection from the HMC. Perform these steps to reset the managed system connection: 1. In the navigation area, open Systems Management. 2. Select Servers. 3. In the contents area, select the server on which you want to reset the connection. 4. In the Tasks area, open Connections. 5. Select Reset or Remove Connection. 6. Select Reset Connection. 7. Click OK. 670 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Checking for a duplicate IP address Use this procedure to check for a duplicate IP address. Perform these steps to check for a duplicate IP address. 1. On the HMC, right-click the desktop and select terminal ? rshterm. 2. At the terminal, issue this command: lssyscfg -r sys -F "name,ipaddr" A list of managed systems’ names and their IP address (or host name) will be displayed. 3. Check to see if there are duplicates. Chapter 7. Common Service Procedures 671
Running the online and eServer stand-alone diagnostics Use these tools to diagnose hardware problems on your AIX or Linux systems or partitions. Use these diagnostics only if you are directed to do so by your next level of support or your hardware service provider. Diagnostics are available for AIX and Linux systems or logical partitions which can help you perform hardware analysis. Additionally, there are various service aids in AIX diagnostics that can help you with service tasks on the system or logical partition. If there is a problem, you will receive a Service Request Number (SRN) that can help you pinpoint the problem and determine a corrective action. If you have AIX installed and it is running, you can perform online diagnostics. However, if the installed AIX cannot be started, or you have Linux installed, you will need to run the diagnostics from CD or from a NIM server. Verifying the repair can also be done using the diagnostics CD. To verify a repair in Linux, see Verify the repair in Linux. To verify the repair in AIX, see Verify the repair in AIX. “Running the online diagnostics”If you have AIX installed and it can be started, use this procedure to perform diagnostic procedures when directed from another procedure or by your next level of support. “Running the eServer stand-alone diagnostics from CD-ROM” on page 675If the system or logical partition where you would like to run diagnostics has AIX installed but it cannot be started, or you have Linux installed, use this procedure to perform diagnostic procedures from CD-ROM when directed from another procedure or by your next level of support. “Running the eServer stand-alone diagnostics from a Network Installation Management server” on page 679If the system or logical partition where you would like to run diagnostics has AIX installed but it cannot be started, or you have Linux installed, use this procedure to perform diagnostic procedures from a Network Installation Management (NIM) server when directed from another procedure or by your next level of support.Running the online diagnostics If you have AIX installed and it can be started, use this procedure to perform diagnostic procedures when directed from another procedure or by your next level of support. When you run online diagnostics, keep the following in mind: v When AIX is installed, the support for some devices might not be automatically installed. If this happens, that device will not display in the test list when online diagnostics run. v When running diagnostics in a logically partitioned system, you must run diagnostics in the logical partition containing the resource or resources that you want to test.Three modes are available for running the online diagnostics: v Service mode provides the most complete check of the system resources, but requires that no other programs are running on the system. When possible, run the diagnostics in service mode. v Maintenance mode allows you to check most of the available resources, with the exception of SCSI adapters, memory, processor, and the disk drive used for paging. v Concurrent mode allows you to run online diagnostics on some of the system resources while the system is running normal activity. “Running the online diagnostics in service mode” on page 673Use this procedure to run the online diagnostics in service mode. “Running the online diagnostics in maintenance mode” on page 674Use this procedure to run the online diagnostics in maintenance mode. 672 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
“Running the online diagnostics in concurrent mode” on page 674Use this procedure to run the online diagnostics in concurrent mode.Running the online diagnostics in service mode Use this procedure to run the online diagnostics in service mode. Using service mode will check everything except the SCSI adapter and the disk drives used for paging. However, memory and processor are only tested during POST. Running in service mode ensures that the error state of the system that has been captured in NVRAM is available for your use in analyzing the problem. “Running the online diagnostics in service mode with an HMC attached”Use this procedure to run the online diagnostics in service mode with an HMC attached. “Running the online diagnostics in service mode without an HMC attached”Use this procedure to run the online diagnostics in service mode without an HMC attached.Running the online diagnostics in service mode with an HMC attached: Use this procedure to run the online diagnostics in service mode with an HMC attached. Perform the following steps: 1. Stop all programs including the AIX operating system. For details, see Start or stop the system. 2. From the HMC, right-click Partition Manager and select Open Terminal Window. 3. From the Service Processor Menu on the VTERM, select option 2 (System Power Control). 4. Select option 6. 5. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. 6. Select option 98 to exit the system power control menu. 7. From the HMC, start the managed system in a full system partition by following these steps: a. In the Contents area, select the managed system. b. Right-click, and select Power On. 8. Select Power on Diagnostic Stored Boot list. 9. Make sure that there is no media in the devices in the media subsystem. 10. Enter any passwords, if requested. 11. When the Diagnostic Operating Instructions screen displays, follow the online instructions to check the desired resources. Note: If you do not receive the Diagnostic Operating Instructions display, try to run the standalone diagnostics. For details, see “Running the eServer stand-alone diagnostics from CD-ROM” on page 675. 12. When finished, contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs).Running the online diagnostics in service mode without an HMC attached: Use this procedure to run the online diagnostics in service mode without an HMC attached. Perform the following steps: 1. Stop all programs including the AIX operating system. For details, see Start or stop the system. 2. Remove all tapes, diskettes, and CDs. 3. Turn off the system unit power. 4. Turn on the system unit power. Chapter 7. Common Service Procedures 673
5. After the keyboard POST indicator displays on the firmware console, and before the last POST indicator (speaker) displays, press 6 on the keyboard or ASCII terminal to indicate that a service mode boot should be initiated using the customized service mode boot list. 6. Enter any passwords, if requested. 7. When the Diagnostic Operating Instructions screen displays, follow the online instructions to check the desired resources. Note: If you do not receive the Diagnostic Operating Instructions display, try to run the standalone diagnostics. For details, see “Running the eServer stand-alone diagnostics from CD-ROM” on page 675. 8. When finished, contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs).Running the online diagnostics in maintenance mode Use this procedure to run the online diagnostics in maintenance mode. Maintenance mode requires that all activity on the logical partition running the AIX operating system be stopped so that the online diagnostics have most of the resources available. All of the system resources, except the SCSI adapters, memory, processor, and the disk drive used for paging can be checked. Perform the following steps to run the online diagnostics in maintenance mode: 1. Stop all programs running on the logical partition except the AIX operating system. 2. Log in to the AIX operating system as root user or use CE login. 3. Type the shutdown -m command to stop all activity on the AIX operating system and put it into maintenance mode. 4. When a message indicates that the system is in maintenance mode, enter the diag command to invoke the diagnostic controller so you can run the diagnostics. Note: It might be necessary to set TERM type again. 5. Enter any passwords, if requested. 6. When the Diagnostic Operating Instructions screen displays, follow the online instructions to check the desired resources. Note: If you do not receive the Diagnostic Operating Instructions display, try to run the standalone diagnostics. For details, see “Running the eServer stand-alone diagnostics from CD-ROM” on page 675. 7. When finished, press Ctrl+D to log off from root user or CE login. 8. Contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs).This ends the procedure.Running the online diagnostics in concurrent mode Use this procedure to run the online diagnostics in concurrent mode. Use concurrent mode to run online diagnostics on some of the system resources while the system is running normal activity. Because the system is running in normal operation, the following resources cannot be tested in concurrent mode: v SCSI adapters connected to paging devices v Disk drive used for paging v Some display adapters and graphics related devicesThe following levels of testing exist in concurrent mode: 674 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Share-test level This level tests a resource while the resource is being shared by programs running in the normal operation. This testing is mostly limited to normal commands that test for the presence of a device or adapter. Sub-test level This level tests a portion of a resource while the remaining part of the resource is being used in normal operation. For example, you could test one port of a multiport device while the other ports are being used in normal operation. Full-test level This level requires the device to not be assigned or used by any other operation. This level of testing on a disk drive might require the use of the vary off command. Use the diagnostics display menus to allow you to vary off the needed resource.Perform the following steps to run online diagnostics in concurrent mode: 1. Log in to the AIX operating system as root user, or use CE login. If you need help, contact the system administrator. 2. Enter the diag command to load the diagnostic controller, and display the online diagnostic menus. 3. If requested, enter a password. 4. When the Diagnostic Operating Instructions screen displays, follow the online instructions to check the desired resources. Note: If you do not receive the Diagnostic Operating Instructions display, try to run the standalone diagnostics. For details, see “Running the eServer stand-alone diagnostics from CD-ROM.” 5. When testing is complete, press F3 to return to the Diagnostic Operating Instructions display. 6. Press F3 again to return to the AIX operating system prompt. 7. Vary on any resources that you varied off. 8. Press Ctrl+D to log off from root user or CE login. 9. When finished, contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs).Running the eServer stand-alone diagnostics from CD-ROM If the system or logical partition where you would like to run diagnostics has AIX installed but it cannot be started, or you have Linux installed, use this procedure to perform diagnostic procedures from CD-ROM when directed from another procedure or by your next level of support. Diagnostics, which are available for AIX and Linux systems and logical partitions, can help you perform hardware analysis. If a problem is found, you will receive a service request number (SRN) or a service reference code (SRC) that can help pinpoint the problem and determine a corrective action. Additionally, there are various service aids in the diagnostics that can help you with service tasks on the system or logical partition. “Running the eServer stand-alone diagnostics from CD on a server with an HMC attached” on page 676Learn how to run the eServer stand-alone diagnostics on a system with an HMC attached. “Running the eServer stand-alone diagnostics from CD on a server without an HMC attached” on page 677Learn how to run the eServer stand-alone diagnostics on a system without an HMC attached.
Chapter 7. Common Service Procedures 675
Running the eServer stand-alone diagnostics from CD on a server with an HMC attached Learn how to run the eServer stand-alone diagnostics on a system with an HMC attached. If you have logical partitions, note the following considerations: v When running diagnostics in a logically partitioned system, you must run diagnostics in the logical partition containing the resource or resources that you want to test. v The device from which you are loading stand-alone diagnostics must be made available to the logical partition on which you want to run diagnostics. This might require moving the device (for example, the CD drive or network adapter connected to the Network Installation Management (NIM) server that has a stand-alone diagnostic image) from the logical partition that currently contains the device to the logical partition on which you want to run diagnostics.Perform the following steps from the HMC to run stand-alone diagnostics: Note: If you need help with any of these steps, contact your system operator. 1. Remove all tapes, diskettes, CDs, or DVDs, and insert the diagnostic CD into the CD drive on the managed system (not the CD drive on the HMC). 2. Shut down the operating system from the HMC by performing the following steps: a. In the navigation area, select Server and partition ? Server Management. b. In the contents pane, expand the server that contains the partition you want to test. c. Right-click the partition and select Open Terminal Window. d. In the VTerm window, log in as root user and enter any requested passwords. e. Shut down the operating system using one of the following commands: v If AIX is running, type the shutdown -F command. v If Linux is running, type the shutdown -h now commandf. Close the VTerm window.3. From the HMC, right-click the partition and select Activate. 4. Ensure the Open a terminal window or console session box is selected and click OK. 5. When the keyboard POST indicator (the word keyboard) is shown on the firmware console, and before the last POST indicator (the word speaker) is shown, press the 5 key on either the attached keyboard or the ASCII keyboard to indicate that a service mode boot should be initiated using the default service mode boot list. 6. Continue with “Selecting testing options.”Selecting testing options: 1. When the keyboard POST indicator (the word keyboard) is shown on the firmware console, and before the last POST indicator (the word speaker) is shown, press the 5 key on either the attached keyboard or the ASCII keyboard to indicate that a service mode boot should be initiated using the default service mode boot list. 2. When the Welcome screen is shown, define the following: v System console v Language to be used v Type of terminal Note: Depending on the terminal emulator selected, the function keys (Fn) might not function. In this case, use the ESC and the number in the screen menus. For example, F3 = ESC key and the #3. 3. When the Diagnostics Operating Instructions appear, press Enter. 676 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: If you are unable to load the diagnostics to the point where the ?Diagnostic Operating Instructions? display is shown, contact your next level of support or your hardware service provider. 4. From the Function Select screen, select one of the following: v If you want to run diagnostics in Problem Determination mode, continue with the next step. v If you want to run diagnostics in Task Selection (Service Aids) mode, go to step 11. 5. Select Problem determination and press Enter. 6. Check the list of resources that is displayed. Does the list of resources match what you know to be installed in your system or partition? v Yes: Continue with the next step. v No: Record any information you have about the missing resource and check to ensure that the missing resource is installed correctly. If you cannot correct the problem with a missing resource, replace the missing resource (contact your service provider if necessary). To test the available resources, continue with the next step. 7. Select All Resources, or the specific resource or resources to be tested, and press the P7 (commit) key. 8. Record any error information you receive during the diagnostics, including service request numbers (SRNs) or SRCs, to report to your service provider. 9. When testing is complete, press the F3 key to return to the Diagnostic Operating Instructions. 10. Choose from the following options: v To continue testing, return to step 7. v To exit stand-alone diagnostics, select the exit function key from the menu and press Enter. Continue with step 18.11. Select Task Selection list and press Enter. 12. To perform one of these tasks, select the Task Selection option from the Function Selection menu. After a task is selected, a resource menu might be presented showing all resources supported by the task. 13. From the Task selection list, select the service aid task you want to perform. For example, Update and manage system Flash. 14. Follow the instructions for the task selected on each menu or panel. 15. Record any information you receive during the diagnostics, including service request numbers (SRNs), to report to your service provider. 16. When testing is complete, press the F3 key to return to the Diagnostic Operating Instructions. 17. Choose from the following options: v To continue testing, return to step 13. v To exit stand-alone diagnostics, select the exit function key from the menu and press Enter. Continue with the next step.18. Remove the CD from the drive. 19. When finished, contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs) and any missing resources. This ends the procedure.Running the eServer stand-alone diagnostics from CD on a server without an HMC attached Learn how to run the eServer stand-alone diagnostics on a system without an HMC attached. 1. Choose from the following options: v If the system is powered on, continue with step 2. v If the system is powered off, continue with step 3 on page 678.2. If the system is powered on, perform these steps: Chapter 7. Common Service Procedures 677
a. Let the system administrator and system users know that the system unit will be shut down. b. Stop all programs including the operating system. For details, see “Powering on and powering off” on page 588. c. Continue with step 4.3. If the system is powered off, perform the following steps: a. Start the server so you will be able to insert the diagnostic CD into the CD drive during the next step. b. Continue with step 4.4. Insert the diagnostic CD in the CD drive. 5. Re-start the server. 6. Continue with “Selecting testing options.”Selecting testing options: 1. When the keyboard POST indicator (the word keyboard) is shown on the firmware console, and before the last POST indicator (the word speaker) is shown, press the 5 key on either the attached keyboard or the ASCII keyboard to indicate that a service mode boot should be initiated using the default service mode boot list. 2. When the Welcome screen is shown, define the following: v System console v Language to be used v Type of terminal Note: Depending on the terminal emulator selected, the function keys (Fn) might not function. In this case, use the ESC and the number in the screen menus. For example, F3 = ESC key and the #3. 3. When the Diagnostics Operating Instructions appear, press Enter. Note: If you are unable to load the diagnostics to the point where the ?Diagnostic Operating Instructions? display is shown, contact your next level of support or your hardware service provider. 4. From the Function Select screen, select one of the following: v If you want to run diagnostics in Problem Determination mode, continue with the next step. v If you want to run diagnostics in Task Selection (Service Aids) mode, go to step 11 on page 679. 5. Select Problem determination and press Enter. 6. Check the list of resources that is displayed. Does the list of resources match what you know to be installed in your system or partition? v Yes: Continue with the next step. v No: Record any information you have about the missing resource and check to ensure that the missing resource is installed correctly. If you cannot correct the problem with a missing resource, replace the missing resource (contact your service provider if necessary). To test the available resources, continue with the next step. 7. Select All Resources, or the specific resource or resources to be tested, and press the P7 (commit) key. 8. Record any error information you receive during the diagnostics, including service request numbers (SRNs) or SRCs, to report to your service provider. 9. When testing is complete, press the F3 key to return to the Diagnostic Operating Instructions. 10. Choose from the following options: v To continue testing, return to step 7. 678 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
v To exit stand-alone diagnostics, select the exit function key from the menu and press Enter. Continue with step 18.11. Select Task Selection list and press Enter. 12. To perform one of these tasks, select the Task Selection option from the Function Selection menu. After a task is selected, a resource menu might be presented showing all resources supported by the task. 13. From the Task selection list, select the service aid task you want to perform. For example, Update and manage system Flash. 14. Follow the instructions for the task selected on each menu or panel. 15. Record any information you receive during the diagnostics, including service request numbers (SRNs), to report to your service provider. 16. When testing is complete, press the F3 key to return to the Diagnostic Operating Instructions. 17. Choose from the following options: v To continue testing, return to step 13. v To exit stand-alone diagnostics, select the exit function key from the menu and press Enter. Continue with the next step.18. Remove the CD from the drive. 19. When finished, contact your next level of support or your hardware service provider with any information you received during the diagnostics, including service request numbers (SRNs) and any missing resources. This ends the procedure.Running the eServer stand-alone diagnostics from a Network Installation Management server If the system or logical partition where you would like to run diagnostics has AIX installed but it cannot be started, or you have Linux installed, use this procedure to perform diagnostic procedures from a Network Installation Management (NIM) server when directed from another procedure or by your next level of support. Diagnostics, which are available for AIX and Linux systems and logical partitions, can help you perform hardware analysis. If a problem is found, you will receive a service request number (SRN) that can help pinpoint the problem and determine a corrective action. A client system connected to a network with a NIM server can boot stand-alone diagnostics from the NIM server if the client-specific settings on both the NIM server and client are correctly configured. Note: 1. For NIM clients that have adapters that would normally require that supplemental media be installed when stand-alone diagnostics are run from CD, the support code for these adapters must be installed into the directory pointed to by the NIM SPOT from which you wish to boot that client. Before running stand-alone diagnostics on these clients from the NIM server, the NIM server system administrator must ensure that any needed support for these devices is installed on the server. 2. All operations to configure the NIM server require root user authority. 3. If you replace the network adapter in the client, the network adapter hardware address settings for the client must be updated on the NIM server. 4. The Cstate for each stand-alone diagnostics client on the NIM server should be kept in the diagnostic boot has been enabled state. 5. On the client system, the NIM server network adapter should be put in the bootlist after the boot disk drive. This allows the system to boot in stand-alone diagnostics from the NIM server if there is a problem booting from the disk drive. Refer to the Multiboot section under ?SMS? in the client system’s service guide for information on setting the bootlist.
Chapter 7. Common Service Procedures 679
Configuring the NIM server Refer to the AIX Installation Guide and Reference for information about performing the following tasks: v Registering a client on the NIM server v Enabling a client to run diagnostics from the NIM serverTo verify that the client system is registered on the NIM server and the diagnostic boot is enabled, run the command from the command line on the NIM server: Isnim -a Cstate -z ClientName. Refer to the following table for system responses. Note: The ClientName is the name of the system on which you want to run stand-alone diagnostics.
System response
Client status
#name:Cstate:ClientName:diagnostic boot has been enabled:
The client system is registered on the NIM server and enabled to run diagnostics from the NIM server.
#name:Cstate:ClientName:ready for a NIM operation:or #name:Cstate:ClientName:B0S installation has been enabled:
The client is registered on the NIM server but not enabled to run diagnostics from the NIM server. Note: If the client system is registered on the NIM server but Cstate has not been set, no data will be returned.
0042–053 Isnim: there is no NIM object named ?ClientName?
The client is not registered on the NIM server. Configuring the client and running the stand-alone diagnostics from a NIM server Perform the following steps to run stand-alone diagnostics on a client from the NIM server: 1. Let the system administrator and system users know that the system unit might be shut down. 2. Stop all programs including the AIX or Linux operating system. For details, see Start or stop the system. If you need help, contact the system administrator. 3. Remove all tapes, diskettes, and CDs. 4. Choose from the following options: v If you are running stand-alone diagnostics in a full system partition profile, verify with the system administrator and system users that the system unit can shut down using the shutdown command. Then power down the system. v If you are running on a logically partitioned system, make sure the CD drive is available to the partition used to run stand-alone diagnostics. Verify with the system administrator and system users using that partition that all applications on that partition must be stopped, and that the partition will be restarted. Stop all programs on that partition, including the operating system. 5. Choose from the following options: v If you are in a full system partition, power on the system unit to run stand-alone diagnostics. v If you are in a logically partitioned system, restart the partition to run stand-alone diagnostics. 6. When the keyboard indicator is displayed (the word keyboard on an HMC virtual terminal window or the keyboard icon on a graphical display) press the number 1 key on the keyboard to display the SMS menu. 7. Enter any requested passwords. 8. Select Set Up Remote IPL (Initial Program Load). 9. Enter the client address, server address, gateway address, if applicable, and subnet mask. If there is no gateway between the NIM server and the client, set the gateway address to 0.0.0.0. To determine if there is a gateway, either ask the system network administrator or compare the first three octets of the NIM server address and the client address. If they are the same, (for example, if the NIM server address is 9.3.126.16 and the client address is 9.3.126.42, the first 3 octets (9.3.126) are the same), then set the gateway address in the RIPL field to 0.0.0.0.
680 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Note: The RIPL is located under the Utility menu in System Management Services (SMS). Refer to it for information on setting these parameters. 10. If the NIM server is set up to allow pinging from the client system, use the ping utility in the RIPL utility to verify that the client system can ping the NIM server. 11. Under the ping utility, choose the network adapter that provides the attachment to the NIM server to do the ping operation. If the ping returns with an OK prompt, the client is prepared to boot from the NIM server. If ping returns with a FAILED prompt, the client cannot proceed with the NIM boot. Note: If the ping fails, refer to the Boot problems and concerns information. Then follow the steps for network boot problems. 12. Exit the SMS Main screen. 13. Select Select Boot Options ? Install or Boot a Device ? Network. 14. Record the current bootlist settings. You will need to set the bootlist back to the original settings after running diagnostics from the NIM server. 15. Change the bootlist so the network adapter attached to the NIM is first in the bootlist. 16. Set the network parameters for the adapter from which you want to boot. 17. Exit completely from SMS. The system will start loading packets while doing a bootp from the network. 18. Follow the on-screen instructions. v If Diagnostic Operating Instructions Version x.x.x displays, stand-alone diagnostics have installed successfully. v If the AIX login prompt displays, stand-alone diagnostics did not load. Continue with step 19.19. If the diagnostics did not load, check the following items: v The bootlist on the client might be incorrect. v Cstate on the NIM server might be incorrect. v Network problems might be preventing you from connecting to the NIM server. v Verify the settings and the status of the network. If you continue to have problems, refer to the Boot problems/concerns section for the system unit. Then follow the steps for network boot problems.20. After running diagnostics, restart the system and use SMS to change the IP settings and bootlist sequence back to the original settings. Chapter 7. Common Service Procedures 681
Concurrent PCI adapter removal and replacement using AIX The following procedure describes the removal of a PCI adapter concurrently (with the system power on) using AIX as the operating system.Note: 1. The illustrations seen in this procedure might be slightly different than what you are seeing on the system that you are servicing. 2. Use this procedure if you intend to remove a failing PCI adapter and replace it with the same type of adapter. 3. Procedures performed on a PCI adapter with the system power on, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users.To replace a failing PCI adapter concurrently, do the following: 1. Open the front rack door and back rack doors and place the system into the service position. 2. Remove or open the service access cover. 3. Determine which adapter you plan to remove. 4. Adapter slots are numbered on the rear of the system unit, record the slot number and location of each adapter being removed. 5. Ensure that any processes or applications that might use the adapter are stopped. 6. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at AIX command line. 7. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. 8. At the FUNCTION SELECTION menu, select Task Selection, then press enter. 9. At the Task Selection list, select PCI Hot Plug Manager. 10. Select Unconfigure a Device, then press Enter. 11. Press F4 (or Esc +4) to display the Device Names menu. 12. Select the adapter you are removing in the Device Names menu. 13. In the Keep Definition field, use the Tab key to answer Yes if you are replacing the adapter being removed with a same type adapter. Answer No if you will not be replacing the adapter with the same type of adapter or will be leaving the slot blank. 14. In the Unconfigure Child Devices field, use the Tab key to answer YES, then press Enter. 15. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 16. Press F4 (or Esc +4) twice to return to the Hot Plug Manager menu. 17. Select replace/remove PCI Hot Plug adapter. 18. Select the slot that has the device to be removed from the system. 19. Select remove. Note: A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. 20. Press Enter. This places the adapter in the action state, meaning it is ready to be removed from the system. 21. Label, and then disconnect all cables attached to the adapter you plan to remove. 22. Remove the adapter. If you are removing a short adapter, continue to the next step. 682 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
If you are removing a long adapter, unlatch and open the PCI adapter light-pipe plate C attached to the fan tray as shown in Figure 67. 23. Rotate the adapter locking latch A counterclockwise. 24. Lift the black tab B attached to the adapter retainer assembly, and keep the black tab in a vertical position. 25. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 26. If you are replacing the adapter, remove the replacement adapter from the antistatic package and continue to step 27. If you are not replacing the adapter, do the following: a. Seal the expansion slot using an expansion-slot cover. b. Lower the plastic retainer seat over the PCI adapter faceplate. c. Rotate the locking latch clockwise until it clicks into the locked position. d. Continue to follow the online instructions until you receive a message that the adapter removal is successful. e. If you have other adapters to remove, press the F3 key to return to the PCI Hot Plug Manager menu. f. When you are finished removing adapters, press F10 to exit the Hot-Plug Manager. g. From an AIX command line, run the diag -a command. If the host system responds with a menu or prompt, follow the instruction to complete the device configuration. Figure 67. Detail of retaining latch used to secure a PCI adapter
Chapter 7. Common Service Procedures 683
h. Go directly to step 44 to complete this procedure.Attention: Avoid touching the components and gold-edge connectors on the adapter. 27. Carefully grasp the replacement adapter by its top edge, and align it with the expansion slot and its connector on the system backplane. 28. Press the adapter firmly into its connector. Ensure that it is completely and correctly seated in its connector. 29. If you are replacing a short adapter, continue to the next step. If you are replacing a long adapter, do the following: a. Close and latch the PCI adapter light-pipe plate C attached to the fan tray as shown in Figure 67 on page 683.Note: The light pipes below the light-pipe plate must fit through the holes in the plate for it to latch correctly. b. Note the guide grooves located toward the front of the system in the disk drive backplane, and align the adapter properly.30. Secure the adapter. Lower the tab A onto the PCI adapter faceplate as shown in the following figure. Rotate the adapter locking latch B clockwise until it covers the tab at approximately a 45-degree angle. Figure 68. Detail of the retaining latch being secured
684 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
31. Connect the adapter cables. 32. Press enter and continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 33. Press the F3 (or Esc+3) key to return to the PCI Hot-Plug Manager menu. 34. Press the F3 (or Esc+3) key to return to the TASK selection list. 35. Select Log Repair Action. 36. Select the resource just replaced, press Enter, press Commit (F7 or Esc+7), then press Enter. 37. Press F3 (or Esc+3) to return to TASK Selection List. 38. Select Hot Plug Task, press enter. 39. Select PCI Hot Plug Manager, then select Configure a defined device, then press Enter. 40. Select the device just replaced from the list, then press Enter. The device is now configured. 41. Press the F10 key to exit the diagnostic program. Note: If you are running the standalone diagnostics, do not exit the program completely. 42. Verify the PCI adapter by using the following instructions: a. Did you replace the adapter with the system power on? v Yes - Go to the next step. v No - Load the diagnostic program by doing the following: – If AIX is available, boot AIX, login as root or CELOGIN, then enter the diag command. – If AIX is not available, boot the standalone diagnosticsb. Type the diag command if you are not already displaying the diagnostic menus c. Select Advance Diagnostic Routines, then select Problem Determination. d. Select the name of the resource just replaced from the menu. If the resource just replaced is not shown, choose the resource associated with it. Press Enter, then press Commit ((F7 or Esc+7)). e. Did the Problem Determination identify any problems? v No: Continue to the next step. v Yes: A problem is identified. – If you are a customer, record the error information, then contact your service provider. – If you are an authorized service provider, return to map 210-5.43. Press the F10 key to exit the diagnostic program. 44. Close the rack’s back door. 45. Close the service access cover. 46. Go to Verify a repair in your host server’s service guide. This completes this procedure. Chapter 7. Common Service Procedures 685
Concurrent PCI adapter removal and replacement using Linux The following procedure describes the removal of PCI adapters with the system power on in a server or a partition running linux. Before performing this procedure, do the following: v Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page 469 v “Verify that the Linux, hot-plug PCI tools are installed” on page 470.To remove a PCI adapter with the system power on, do the following: 1. Open the front and rear rack doors. 2. If you are servicing a system drawer installed in a rack, put the drawer into the service position, then go to the step 3. If you are servicing a system drawer that gets serviced from either the front or the back of the drawer, go to the step 3. If you are servicing a deskside system, rest the side opposite the service access cover on the floor or desktop. 3. Depending on your system configuration, either remove or open the service access cover. 4. Determine which adapters you plan to remove. 5. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system unit. 6. Ensure that any processes or applications that might use the adapter are stopped. 7. Label, and then disconnect all cables attached to the adapter you plan to remove. Note: Before performing a PCI hot-plug removal of storage devices, ensure file systems on those devices are unmounted. 8. Run the drslot_chrp_pci command to enable the adapter to be removed. For example, to remove the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command: drslot_chrp_pci -r -s U7879.001.DQD014E-P1-C3 9. Follow the instructions on the display to complete the task. 10. If you are removing a short adapter, continue to step 11. If you are removing a long adapter, do the following: a. Unlatch and open the PCI adapter light-pipe plate C attached to the fan tray as shown in the following figure. b. Rotate the adapter locking latch A counterclockwise. c. Lift the black tab B attached to the adapter retainer assembly, and keep the black tab in a vertical position. 686 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
11. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 12. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not, continue to step 13. 13. If you plan to install a long adapter into the vacated slot, go to step 14. If you plan to install a short adapter or do not plan to install another adapter into the vacated slot do the following: a. Seal the expansion slot using an expansion-slot cover. b. Lower the plastic retainer seat over the PCI adapter faceplate. c. Rotate the locking latch clockwise until it clicks into the locked position. d. Go to 15.14. If you do not plan to install another adapter into the vacated slot, do the following: a. Run the drslot_chrp_pci command to enable an adapter to be installed. For example, to install the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command: drslot_chrp_pci -a -s U7879.001.DQD014E-P1-C3 b. Follow the instructions on the display to complete the task. c. When you are instructed to install the adapter in the adapter slot, do the following: 1) Carefully grasp the adapter by the edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. Figure 69. Detail of adapter retaining tab and latch.
Chapter 7. Common Service Procedures 687
2) Press the adapter firmly into its connector.Attention: After installing the adapter into the system, ensure that it is correctly seated into the connector. 3) Close and latch the PCI adapter light-pipe plate C attached to the fan tray as shown in Figure 70.Tip: The light pipes below the light-pipe plate must fit through the holes in the plate for it to latch correctly. 4) Secure the adapter. Lower the tab A onto the PCI adapter faceplate as shown in Figure 70. Rotate the adapter locking latch B clockwise until it covers the tab at approximately a 45-degree angle. 5) d. Connect any adapter cables. e. Run the lsslot command to verify that the slot is occupied. For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0 Figure 70. Detail of the locking tab and latch and an adapter being replaced into system or I/O enclosure
688 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
15. Replace or close the service access cover. 16. If necessary, on rack-mounted system’s, put the system drawer into the operating position and close the front and back rack doors. 17. Go to Verify a repair in your host server’s service guide. This completes this procedure. “Prerequisites for hot-plugging PCI adapters in Linux” on page 469In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. “Verify that the Linux, hot-plug PCI tools are installed” on page 470In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed.Prerequisites for hot-plugging PCI adapters in Linux In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. The Linux, system administrator needs to take the PCI adapter offline prior to removing, replacing, or installing a PCI adapter with the system power on (hot-plugging). Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before hot-plugging adapters for storage devices, ensure file systems on those devices are unmounted. After hot-plugging adapters for storage devices, ensure the file systems on those devices are remounted. Before hot-plugging an adapter, ensure that the server or partition is at the correct level of the Linux operating system (Linux 2.6 or later). Install the POWER Linux Service Aids. These service aids enable system serviceability, as well to improve system management. If you are using a Linux on POWER distribution with Linux kernel version 2.6 or greater, you can install the Service Aids that gives you access to more capabilities, which can help you diagnose problems on your system. This software is available at the Service and productivity tools for Linux on POWER Web site. (http://techsupport.services.ibm.com/server/lopdiags) Verify that the Linux, hot-plug PCI tools are installed In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed. 1. Enter the following command to verify that the hot-plug PCI tools are installed: rpm -aq | grep rpa-pci-hotplug If the command does not list any rpa-pci-hotplug packages, the PCI Hot Plug tools are not installed. 2. Enter the following command to ensure that the rpaphp driver is loaded: ls -l /sys/bus/pci/slots/ The directory should contain data. If the directory is empty, the driver is not loaded or the system does not contain hot-plug PCI slots. The following is an example of the information displayed by this command: Chapter 7. Common Service Procedures 689
drwxr-xr-x 15 root root 0 Feb 16 23:31 .
drwxr-xr-x 5 root root 0 Feb 16 23:31 .. drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.6 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.6
If the directory does not exist, run the following command to mount the filesystem: mount -t sysfs sysfs /sys 3. Ensure the following tools are available in the /usr/sbin directory. v lsslot v drslot_chrp_pci4. Return to the procedure that sent you here. 690 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Chapter 8. Part Information Assembly diagrams and part listings. Rack assembly
Table 71. Rack assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
1
32P186544V4345
1
Left EIA bracket
2
09P3744
1
Screw, left EIA bracket
3
43W4518
1
Left slide assembly
4
39J4515
1
Drawer top cover 8203-E4A
4
53P0308
1
Drawer top cover 9408-M25
4
39J4515
1
Drawer top cover 9407-M15
5
43W4519
1
Right slide assembly © Copyright IBM Corp. 2008 691
Table 71. Rack assembly part numbers (continued)
Index number
CCIN
Part number
Units per assembly
Description
6
32P186444V4342
1
Right EIA bracket
7
09P3744
1
Screw, right EIA bracket
8
39J4530
1
Drawer front bezel
9
04N6587
2
Drawer shipping screws Tower assembly
Table 72. Tower assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
1
44V3121
1
Left tower cover 8203-E4A
1
53P0285
1
Left tower cover 9408-M25
1
44V3121
1
Left tower cover 9407-M15 692 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 72. Tower assembly part numbers (continued)
Index number
CCIN
Part number
Units per assembly
Description
2
0
Not applicable
3
42R8604
1
Handle cap
4
44V3127
1
Handle bracket
5
09P3744
4
Screw, handle bracket
6
44V3125
1
Right tower cover 8203-E4A
6
39J1179
1
Right tower cover 9408-M25
6
44V3125
1
Right tower cover 9407-M15
7
09P3744
1
Screw, right tower cover
8 and 9
42R6911
1
One piece front tower bezel
10
44V3120
1
Tip foot
11
09P3744
4
Screw, tip foot Rack assembly continued Chapter 8. Part Information 693
Table 73. Rack assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
1
44V4270 (high function)07P6910 (low function)
1
Disk drive cage assembly
2
1
Removable media. See Managing Devices, SA76-0107
3
10N7200
1-6
73 GB SAS disk drive
3
10N7230
1-6
73 GB SAS disk drive
3
10N7204
1-6
146 GB SAS disk drive
3
10N7232
1-6
146 GB SAS disk drive
3
10N7208
1-6
300 GB SAS disk drive
3
10N7234
1-6
300 GB SAS disk drive
4
10N9737
1
Control panel
5
44V4195
2
Power supply
6
Tape drive. See Managing Devices, SA76-0107 694 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Rack assembly continued
Table 74. Rack assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
7
44V3298
1
SAS RAID enablement card
8
44V4192
1
RAID auxiliary cache card
9
10N9719
1
Thermal management card
10
10N9622 (1 GB, 4 port) 10N9627 (1 GB, 2 port) 10N9669 (10 GB
1
Ethernet card
11
10N9982 (RIO) 44V4645 (InfiniBand adapter)
2
GX Dual-Port 12X Channel Attach adapter
12
39J4517
4
Blower assembly
13
39J4499
1
Air plenum Chapter 8. Part Information 695
Rack assembly continued
Table 75. Rack assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
14
39J4503
1
PCI guide
15
39J4522
1
Line cord bracket
16
39J4502
3
Long PCI divider
17
53P2733
1
Short PCI divider
18
42R5078
1
GX divider
19
39J4504
1
GX guide
20
44V3382
1
External SAS cable 696 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Rack assembly continued
Table 76. Rack assembly part numbers
Index number
CCIN
Part number
Units per assembly
Description
21
42R7898
1
System backplane
22
1
Card. See Managing PCI Adapters, SA76-0092
23
44V3845
2
Processor VRM
24
44V3841
2
30 A VRM
25
44V3845
2
Processor VRM
26
87H3621
8
8 MG DIMM
26
77P6497
8
512 MG DIMM
26
77P6498
8
1 GB DIMM
26
77P6499
8
2 GB DIMM
26
77P6500
8
4 GB DIMM
26
77P7504
8
16 GB DIMM Chapter 8. Part Information 697
Table 77. Cables
CCIN
Description
Part number
1.5 meter InfiniBand cable
15R8139
3 meter InfiniBand cable
15R8140
10 meter InfiniBand cable
15R8141
Operator panel cable
42R6066
Operator panel cable
42R6064
SCSI cable
52G4233
SCSI2 cable
52G4291
3.7 meter cable
10N7158
8 meter cable
10N7159
10 meter HSL cable
39J2561
1 meter SAS cable
44V4147
3 meter SAS cable
44V4148
6 meter SAS cable
44V4149
4.5 meter SAS cable
42R5814
3 meter SAS cable
44V4154
6 meter SAS cable
44V4155
15 meter SAS cable
44V4156
1 meter SAS cable
44V4041
3 meter SAS cable
44V4163
6 meter SAS cable
44V4164
1.5 meter SAS cable
44V4161
3 meter SAS cable
44V4162
1.5 meter SAS cable
44V4157
3 meter SAS cable
44V4158
6 meter SAS cable
44V4159
15 meter SAS cable
44V4160
2 meter SPCN cable
22R5217
3 meter SPCN cable
22R5239
15 meter SPCN cable
22R5221
6 meter SPCN cable
22R5219
30 meter SPCN cable
22R5222
Table 78. Miscellaneous parts
CCIN
Description
Part number
External cables and cords
Refer to Site and Hardware Planning Guide, SA76-0091
Removable media
See Managing Devices, SA76-0107
PCI adapters
See Managing PCI Adapters, SA76-0092
Battery pack
44V4145 698 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Table 78. Miscellaneous parts (continued)
CCIN
Description
Part number
Time-of-day battery
44V4359
Power supply filler
42R6935 Chapter 8. Part Information 699
700 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Appendix. Accessibility features Accessibility features help users who have a physical disability, such as restricted mobility or limited vision, to use information technology products successfully. Accessibility features The following list includes the major accessibility features: v Keyboard-only operation v Interfaces that are commonly used by screen readers v Keys that are tactilely discernible and do not activate just by touching them v Industry-standard devices for ports and connectors v The attachment of alternative input and output devicesIBM and accessibility See the IBM Accessibility Center at http://www.ibm.com/able/ for more information about the commitment that IBM has to accessibility. If you need an accessible version of this publication, use the Feedback button at http://www.ibm.com/systems/infocenter to send your request. Alternatively, you can send an e-mail to pubsinfo@us.ibm.com. In the request, make sure that you include the publication number and the title. © Copyright IBM Corp. 2008 701
702 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to Web sites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this product and use of those Web sites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer’s future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. © Copyright IBM Corp. 2008 703
The manufacturer’s prices shown are the manufacturer’s suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer’s computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer’s support websites for updated information and fixes applicable to the system and related software. 704 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Trademarks The following terms are trademarks of International Business Machines Corporation in the United States, other countries, or both: AIX AIX 5L eServer IBM IBM (logo) iSeries POWER POWER6 pSeries RS/6000 System i System pINFINIBAND, InfiniBand Trade Association, and the INFINIBAND design marks are trademarks and/or service marks of the INFINIBAND Trade Association. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. Other company, product or service names may be trademarks or service marks of others. © Copyright IBM Corp. 2008 705
706 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Electronic emission notices Class A Notices The following Class A statements apply to the IBM System i models and IBM System p servers with the exception of those that are specifically identified as Class B. Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user’s authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d’Industrie Canada Cet appareil numérique de la classe A respecte est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 89/336/EEC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may be required to take adequate measures. VCCI Statement - Japan © Copyright IBM Corp. 2008 707
The
following is a summary of the VCCI Japanese statement in the box above. This is a Class A product based on the standard of the Voluntary Control Council for Interference by Information Technology Equipment (VCCI). If this equipment is used in a domestic environment, radio disturbance may arise. When such trouble occurs, the user may be required to take corrective actions. Electromagnetic Interference (EMI) Statement - People’s Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information Electromagnetic Interference (EMI) Statement - Korea 708 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung der IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung der IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: ?Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen.? Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A. Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Konformitätserklärung nach des EMVG ist die IBM Deutschland GmbH, 70548 Stuttgart. Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Electronic emission notices 709
710 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED ?AS-IS? AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. © Copyright IBM Corp. 2008 711
712 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25)
BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE
REFERENCE
86 A1 02FA 01
ESCALA Power7
ESCALA E5-700 removal and
replacement procedures
REFERENCE
86 A1 54FF 03
ESCALA Power7
ESCALA E5-700 removal and replacement
procedures
This publication concerns the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
May 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 54FF 03
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited
to, copying, distributing, modifying, and making derivative works.
Copyright Bull SAS 2007-2011
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this
book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors
ontained herein, or for incidental or consequential damages in connection with the use of this material.
c
Contents
Safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Removal and replacement procedures for the 8233-E8B and 8236-E8C . . . . . . . . 1
Management console-based procedures . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Non-management console-based procedures. . . . . . . . . . . . . . . . . . . . . . . . . 1
Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Removing and installing the cache battery pack . . . . . . . . . . . . . . . . . . . . . . 16
Control panel and signal cable . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Removing the control panel with the power off . . . . . . . . . . . . . . . . . . . . . 21
Replacing the control panel with the power off . . . . . . . . . . . . . . . . . . . . . 22
Removing and replacing the control panel signal cable . . . . . . . . . . . . . . . . . . . 23
Disk drive backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Removing the disk drive backplane . . . . . . . . . . . . . . . . . . . . . . . . . 25
Replacing the disk drive backplane . . . . . . . . . . . . . . . . . . . . . . . . . 32
External cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Fans. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Removing the fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Replacing the fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
GX adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Removing GX adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Replacing GX adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Host Ethernet Adapter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Removing the Host Ethernet Adapter. . . . . . . . . . . . . . . . . . . . . . . . . 45
Replacing the Host Ethernet Adapter. . . . . . . . . . . . . . . . . . . . . . . . . 47
Media devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
SAS media device . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Removing a SAS media device . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Removing and replacing a Slimline media device with the power on . . . . . . . . . . . . . . 60
Removing a Slimline media device with the power on . . . . . . . . . . . . . . . . . . 60
Replacing a Slimline media device with the power on . . . . . . . . . . . . . . . . . . 61
Removing and replacing a USB disk drive . . . . . . . . . . . . . . . . . . . . . . . 62
Removing a USB disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
Replacing a USB disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Installing an external USB docking station and removable disk drive with power on . . . . . . . . . 77
Memory modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
Removing memory modules. . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
Replacing memory modules. . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Installing, removing, and replacing PCI adapters . . . . . . . . . . . . . . . . . . . . . . 83
Model 8233-E8B and 8236-E8C PCI adapters . . . . . . . . . . . . . . . . . . . . . . 83
Installing a PCI adapter in the 8233-E8B and 8236-E8C server . . . . . . . . . . . . . . . 83
Removing a PCI adapter from an 8233-E8B and 8236-E8C server . . . . . . . . . . . . . . 93
Replacing a PCI adapter in the 8233-E8B and 8236-E8C server . . . . . . . . . . . . . . . 101
Model 8233-E8B and 8236-E8C PCI adapter dividers . . . . . . . . . . . . . . . . . . . 109
Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server . . . . . . . . . . . 109
Installing a PCI adapter divider in the 8233-E8B and 8236-E8C server . . . . . . . . . . . . 112
Related procedures for installing and removing PCI adapters . . . . . . . . . . . . . . . . 115
Before you begin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Avoiding electric shock . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
Handling static-sensitive devices . . . . . . . . . . . . . . . . . . . . . . . . . 118
Installing or replacing a PCI adapter with the system power on in Virtual I/O Server . . . . . . . 119
Shutting down logical partitions . . . . . . . . . . . . . . . . . . . . . . . . . 122
PCI hot-plug manager access for AIX . . . . . . . . . . . . . . . . . . . . . . . 122
Prerequisites for hot-plugging PCI adapters in Linux . . . . . . . . . . . . . . . . . . 125
Verifying that the hot-plug PCI tools are installed for Linux. . . . . . . . . . . . . . . . 125
Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.. . . . . . . . 126
© Copyright IBM Corp. 2010 iii
PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F,
575B) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
PCI-X DDR 1.5 GB cache SAS RAID Adapter (FC 5904 and 5908; CCIN 572F and 575C) . . . . . . 130
Power supply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
Removing the power supply with the system power off . . . . . . . . . . . . . . . . . . 134
Removing the power supply with the system power on . . . . . . . . . . . . . . . . . . 136
Replacing the power supply with the system power off . . . . . . . . . . . . . . . . . . 137
Replacing the power supply with the system power on . . . . . . . . . . . . . . . . . . 139
RAID enablement card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
Preparing the system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
Removing the RAID enablement card . . . . . . . . . . . . . . . . . . . . . . . . 141
Replacing the Redundant Array of Independent Disks enablement card. . . . . . . . . . . . . 142
System backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
System processor assembly. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172
Removing the system processor assembly . . . . . . . . . . . . . . . . . . . . . . . 172
Replacing the system processor assembly . . . . . . . . . . . . . . . . . . . . . . . 174
System VPD card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
Thermal power management device. . . . . . . . . . . . . . . . . . . . . . . . . . 183
Removing the TPMD card . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
Replacing the TPMD card . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
Voltage regulators. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
Removing the processor voltage regulator. . . . . . . . . . . . . . . . . . . . . . . 186
Replacing the processor voltage regulator . . . . . . . . . . . . . . . . . . . . . . . 190
Removing the memory voltage regulator . . . . . . . . . . . . . . . . . . . . . . . 194
Replacing the memory voltage regulator . . . . . . . . . . . . . . . . . . . . . . . 197
Verifying a repair . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203
Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . 204
Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . 210
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . 214
Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . 218
Deactivating a system attention LED or partition LED using the management console . . . . . . . . 219
Activating or deactivating an identify LED using the management console. . . . . . . . . . . . 219
Deactivating a system attention LED or logical partition LED using the Advanced System Management
Interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220
Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . 221
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228
iv IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010 v
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
vi IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Safety notices vii
CAUTION:
Removing components from the upper positions in the rack cabinet improves rack stability during
relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a
room or building:
v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack
cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you
received it. If this configuration is not known, you must observe the following precautions:
– Remove all devices in the 32U position and above.
– Ensure that the heaviest devices are installed in the bottom of the rack cabinet.
– Ensure that there are no empty U-levels between devices installed in the rack cabinet below the
32U level.
v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from
the suite.
v Inspect the route that you plan to take to eliminate potential hazards.
v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the
documentation that comes with your rack cabinet for the weight of a loaded rack cabinet.
v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.).
v Ensure that all devices, shelves, drawers, doors, and cables are secure.
v Ensure that the four leveling pads are raised to their highest position.
v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement.
v Do not use a ramp inclined at more than 10 degrees.
v When the rack cabinet is in the new location, complete the following steps:
– Lower the four leveling pads.
– Install stabilizer brackets on the rack cabinet.
– If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest
position to the highest position.
v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack
cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent.
Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the
pallet.
(R002)
(L001)
(L002)
viii IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
(L003)
or
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser
product. Consult the label on each part for laser certification numbers and approval information.
CAUTION:
This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive,
DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information:
v Do not remove the covers. Removing the covers of the laser product could result in exposure to
hazardous laser radiation. There are no serviceable parts inside the device.
v Use of the controls or adjustments or performance of procedures other than those specified herein
might result in hazardous radiation exposure.
(C026)
Safety notices ix
CAUTION:
Data processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical
fiber cable or open receptacle. (C027)
CAUTION:
This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
CAUTION:
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following
information: laser radiation when open. Do not stare into the beam, do not view directly with optical
instruments, and avoid direct exposure to the beam. (C030)
Power and cabling information for NEBS (Network Equipment-Building System)
GR-1089-CORE
The following comments apply to the IBM servers that have been designated as conforming to NEBS
(Network Equipment-Building System) GR-1089-CORE:
The equipment is suitable for installation in the following:
v Network telecommunications facilities
v Locations where the NEC (National Electrical Code) applies
The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed
wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the
interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as
intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation
from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect
these interfaces metallically to OSP wiring.
Note: All Ethernet cables must be shielded and grounded at both ends.
The ac-powered system does not require the use of an external surge protection device (SPD).
The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal
shall not be connected to the chassis or frame ground.
x IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Removal and replacement procedures for the 8233-E8B and
8236-E8C
Maintenance information for the IBM Power 750 Express® (8233-E8B) and IBM Power 755 (8236-E8C)
includes procedures for start-of-call, isolation, and removal and replacement.
Management console-based procedures
Follow these procedures to exchange one field-replaceable unit (FRU) with another.
For HMC:
1. Log into the HMC as the service representative.
2. In the navigation area, select Servers from the Systems Management navigation.
3. Select the server that requires a parts exchange.
4. Select Serviceability, then Hardware, and then Exchange FRU.
5. The Exchange FRU window opens. Select the FRU to exchange, next click on Launch Procedure, and
follow the instructions.
For SDMC:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support > Hardware > Exchange FRU.
3. Select an installed enclosure type from the drop-down list.
4. From the list of FRU types for this enclosure, select a FRU type.
5. Click Next to display a list of locations for the FRU type.
6. Select the location code for a specific FRU.
7. Click Add to add the FRU location to Pending Actions.
8. Click Launch Procedure to begin replacing the FRUs listed in Pending Actions.
9. Click Finish when you have completed the FRU installation.
Non-management console-based procedures
Use the removal and replacement procedures when you repair, maintain, or exchange your system parts.
The removal and replacement procedures are divided into field replaceable units (FRUs) and customer
replaceable units (CRUs). CRUs can be removed and replaced by customers, but FRUs can only be
removed and replaced by an IBM authorized service representative.
Use the following procedures after the server has identified a failing part. The server information details
how to activate and deactivate LEDs. After the part or parts are replaced, refer to the server's
documentation to verify the repair, and close the service call.
Battery
Use this procedure to service the battery.
1. Access the ASMI
© Copyright IBM Corp. 2010 1
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
2. Check and record the server firmware and service processor settings
1. Check and record the admin user ID's password.
2. Check and record the server firmware level prior to replacing this field replaceable unit (FRU). You might need to
update the system to the latest server firmware code level after you replace this FRU.
3. Check the service processor settings that you previously set using Advanced System Management (ASM). This
can be done by using a management console or a PC to access ASM. You need to reset these service processor
settings after you replace this FRU or the default settings will be used.
4. Record any service processor settings you might have set by using operating system commands:
v System Name setting
v System Power Control settings
v System Service Aids settings
v System Configuration settings
v Network Services settings
v Performance Option settings
v Login Profile settings
v Boot List settings
v Host Ethernet Adapter settings
3. Save the partition data.
Use the management console to save the partition data. For more information, see Backing up partition profile data.
4. Power off the unit
Power should be removed from the unit that contains the battery. To power off the unit, complete the following
steps.
1. If the system is running, stop the system as described in Stopping the system or logical partition.
2. If this unit is powered on, power it off now.
5. Disconnect the power cords
2 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
To disconnect the power cords, complete the
following steps:
1. Open the rear rack door.
2. Disconnect all power cords from the unit
you are servicing.
6. Remove the front cover
1. Open the front rack door.
2. Remove the two thumbscrews (A) that
secure the front cover to the system unit.
3. Push the cover release latches (C) in the
direction of the arrow to release the cover
from the system unit.
4. Pull the cover out and away from the
system unit.
7. Place the unit into the service position
Removal and replacement procedures for the 8233-E8B and 8236-E8C 3
Attention: When placing the model rack into
the service position, it is essential that all
stability plates are firmly in position to
prevent the rack from toppling. Ensure that
only one system unit is in the service position
at a time.
Remove two screws (A) securing the system
unit (B) to the rack.
8. Place the unit into the service position (continued)
Notes:
1. Ensure that the cables at the rear of the
system unit do not catch or bind as you
pull the unit forward in the rack.
2. When the system unit rails are fully
extended, the rail safety latches lock into
place. This action prevents the system
from being pulled out too far.
While holding the system-unit release latches
(A) down on both the left and right sides, pull
the system unit (B) out from the rack until the
rails are fully extended and locked.
9. Remove the service access cover
4 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Loosen the two thumbscrews (A) located
at the rear of the cover.
2. Slide the cover (B) toward the rear of the
system unit. When the front of the service
access cover has cleared the upper frame
ledge, lift the cover up and off the system
unit.
Attention: For proper cooling and airflow,
replace the cover before turning on the
system. Operating the system without the
cover for more than 30 minutes could damage
the system components.
10. Remove the front fan
1. While squeezing the latch (A) on the front
right fan, grasp the fan firmly and pull the
fan out of the fan cage.
2. Repeat these steps until all system fans are
removed from the fan cage.
11. Remove the air baffle
Removal and replacement procedures for the 8233-E8B and 8236-E8C 5
Grasp the right side air baffle and pull it out
of the fan cage.
12. Remove rear fan.
1. While squeezing the latch (A) on the rear right fan,
grasp the fan firmly and pull the fan out of the fan
cage.
13. Remove the time-of-day battery
6 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Remove the time-of-day battery (A) from the
system backplane.
CAUTION:
The battery contains lithium. To avoid
possible explosion, do not burn or charge
the battery.
Do not:
v Throw or immerse into water.
v Heat to more than 100°C (212°F).
v Repair or disassemble.
Exchange only with the IBM-approved part.
Recycle or discard the battery as instructed by
local regulations.
14. Install the time-of-day battery
CAUTION:
Do not use conductive tools to insert the
battery. To avoid corrosion, use latex or nitril
gloves to touch the battery. If tools are
needed, use plastic tools to avoid electrical
shorts on the battery.
Install the time-of-day battery (A) in the slot
on the system backplane.
15. Install the rear fan.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 7
1. Lower the rear fan into its location in the fan cage.
2. Push down on the fan until it locks into place.
16. Install the air baffle
1. Lower the air baffle into its location in the
fan cage, ensuring that the arrows on the
air baffle point toward the front of the
system.
2. Push down on the air baffle until it locks
into place.
17. Install the front fan
8 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Lower the fan into its location in the fan
cage.
2. Push down on the fan until it locks into
place.
18. Install the service access cover
1. Place the service access cover (A) on the
top of the system, about 25 mm (1 in.)
from the front of the system.
2. Hold the service access cover against the
system unit and slide it toward the front
of the system. The tabs of the service
access cover slide beneath the upper
chassis ledge.
3. Align the two thumbscrews (B) located on
the rear of the service access cover with
the two holes on the rear of the system
chassis.
4. Tighten the thumbscrews to secure the
service access cover.
19. Place the unit in the operating position
Removal and replacement procedures for the 8233-E8B and 8236-E8C 9
1. To unlock the rail safety latches (B), slide
the latches toward the front of the system.
2. Push the system unit (A) back into the
rack until both system-unit release latches
lock into position.
Note: Ensure that the cables at the rear of
the system do not catch or bind as you
push the system unit back into the rack.
20. Place the unit in the operating position (continued)
Replace and tighten the two screws (B) that
secure the system unit (A) to the rack.
21. Install the front cover
10 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Position the cover on the front of the
system unit so that the two thumbscrews
(C) align with the screw holes on the front
of the system unit (A).
2. Push the cover release latches (B) in the
direction of the arrows to attach the cover
to the front of the system unit as shown in
the figure.
3. Tighten the thumbscrews.
4. Close the front rack door.
22. Connect the power cords
If you removed the power cords, connect all
power cords to the unit you are servicing.
23. Power on the system
24. Restore network connectivity
Removal and replacement procedures for the 8233-E8B and 8236-E8C 11
The installed service processor must connect to the network to be accessible from the management console.
How will the IP address of the service processor be managed?
Obtain an IP address
automatically from a
DHCP server
Configure a static IP
address
? Go to step 26.
25. Restore service processor settings
Note: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address,
and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to
enable network access to the service processor.
To enable network access to the service processor, complete the following steps.
1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is
connected to the HMC port on the service processor assembly.
2. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
Yes No
Go to step 28. Go to step 26.
26. Restore service processor settings
If the network connection uses static IP address assignments, complete the following steps:
1. Connect a client with a Web browser directly to the service processor network port using one of the following
URLs:
v https://169.254.2.147
v https://169.254.3.147
Note: For additional information on IP address assignments, see Accessing the ASMI using a PC or notebook
and Web browser.
2. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
3. Log in to Advanced System Management (ASM) with the user ID admin and the default password admin.
4. Change the admin user ID's password and the general user ID's password when prompted.
5. Configure network access using the static IP address.
27. Restore service processor settings
With the network connection now configured to use static IP address assignments, access the service network.
Can you access the service network?
Yes No
? Go to step 29.
28. Access the ASMI
12 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
29.
Go to step 33.
30. Restore service processor settings
If the network connection uses DHCP to establish an IP address, but does not use a management console as the
DHCP server, do the following steps:
1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address
to the service processor.
2. Ensure that the service processor is connected to the existing service network by verifying that the network cable
is connected to the network port on the service processor assembly.
3. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
No Yes
? Go to step 28 on page
12.
31. Restore service processor settings
You have indicated that you are still not able to access the service network.
You will need to reset the service processor network interfaces (and passwords) by moving the reset jumpers.
32. Restore service processor settings
Go to step 24 on page 11.
33. Restore service processor settings
Change the admin user ID's password when prompted by following these steps:
1. In the navigation area of ASM, expand Login Profile.
2. Select Change Password.
3. Specify the required information, and click Continue.
Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the
administrator ID is admin. After your initial login to ASM, the administrator passwords must be changed.
As a security measure, you are required to enter the password for your user ID in the Current password for current
user field.
34. Restore service processor settings
Removal and replacement procedures for the 8233-E8B and 8236-E8C 13
Set the system name. The system name can be changed to any valid ASCII string. It does not have to follow the
initialized machine type, model number, and serial number format.
To change the system name, complete the followings steps:
1. In the navigation area of ASM, expand System Configuration.
2. Select System Name.
3. Enter the desired system name using the previous naming convention.
4. Click Save settings to update the system name to the new value.
The new system name is displayed in the status frame, the area where the logout button is located. If another
method, such as the management console, is used to change the system name, the status frame does not reflect the
change.
35. Restore service processor settings
Set the time of day by completing these steps:
1. In the navigation area of ASM, expand System Configuration.
2. Select Time of Day. If the system is powered off, the right pane displays a form that shows the current date
(month, day, and year) and time (hours, minutes, and seconds).
3. Change the date value, the time value, or both, and click Update Time Of Day.
36. Restore service processor settings
Reenter any of the following settings that you previously changed through ASM, unless you want to use the default
settings:
1. System Power settings
2. ASMI Service Aids settings
3. System Configuration settings
4. Network Services settings
5. Login Profile settings
37. Restore service processor settings
14 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Reset any service processor settings that you might have set by using the operating system commands. You
recorded these settings at the beginning of this procedure.
2. If you reset the management console access password, use one of the following methods:
For HMC:
a. From the HMC GUI (preferred method):
1) Expand the Systems Management folder in the navigation tree pane.
2) Double click the Servers folder.
3) Select a server in the central panel.
4) Under the Operations folder, select Change Password.
5) Provide the desired information and click OK.
b. From the HMC command line, type chsyspwd -m managed system -t access --passwd --newpasswd
newpassword
where:
v The value for managed systemis the new service processor's managed system name.
v No value for --passwd is entered thereby allowing authentication.
v The value for newpassword is the new password value.
For SDMC:
a. Go to the Resources tab and click Hosts.
b. Select a server in the right pane.
c. On the Actions menu, click Operations > Change Password.
d. Provide the desired information and click OK.
38. Recover partition data on the service processor
The profile data that is stored in the managed server has been cleared or corrupted.
To recover profile data by using the HMC, complete the following steps:
1. Expand the Systems Management folder in the navigation tree pane.
2. Double click the Servers folder.
3. Use the check box to select a server, in the central panel.
4. In the Configuration folder if you have the Manage Partition Data folder, select Restore.
To recover profile data using the SDMC, complete the following steps:
1. Go to the Resources tab and click Hosts.
2. Select a server in the right pane.
3. On the Actions menu, click System Configuration > Manage Partition Data > Restore.
4. Select the desired restore option and click OK.
39. Restore boot list and Host Ethernet Adapter settings.
The boot list might have been erased when the battery was replaced. Verify, and set if necessary, the list of boot
devices by booting into the system management services (SMS) menus. Refer to Selecting boot options for
information on setting the boot device list.
After the operating system is booted, the bootlist command can be used to verify, or make additional changes to, the
list of boot devices for a system or partition.
To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet Adapter.
40. Verify the repair.
Go to Verifying a repair.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 15
Removing and installing the cache battery pack
To service the cache battery pack, complete the following procedure:
1. Begin with power on.
To prevent data loss, the cache battery pack must be in an error state before it is removed from the
system.
If you have powered the system off, power it on before you continue.
2. Select your operating system:
v AIX: Go to step 3
v Linux: Go to step 5
v IBM i: Go to step 7
3. Force the cache battery pack into an error state on AIX® (to prevent possible data loss), as follows:
a. Navigate to the IBM SAS Disk Array Manager by using the following steps:
1) At the command prompt, type smit, and press Enter.
2) Select Devices.
3) Select Disk Array.
4) Select IBM SAS Disk Array.
5) Select IBM SAS Disk Array Manager from the menu with options for configuring and
managing the IBM SAS RAID Controller.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Force Controller Rechargeable Battery Error.
e. Select the IBM SAS RAID Controller whose battery you want to replace.
f. This option places the battery into the error state, which requires it to be replaced.
g. Determine that it is safe to replace the cache battery pack. See the following Displaying
Rechargeable Battery Information procedure. It is safe to replace the cache battery pack when Yes
is displayed next to Battery pack can be safely replaced. You might need to reselect the option
Display Controller Rechargeable Battery Information multiple times as it might take several
minutes before it is safe to replace the cache battery pack.
h. Verify that the Cache Present LED is not flashing.
Displaying Rechargeable Battery Information
a. Navigate to the IBM SAS Disk Array Manager by using the previous procedure.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Display Controller Rechargeable Battery Information.
e. Select IBM SAS RAID Controller.
4. Go to step 8.
5. Force the cache battery pack into an error state on Linux (to prevent possible data loss), as follows:
a. Run the iprconfig utility by typing iprconfig.
b. Select Work with disk unit recovery.
c. Select Work with resources containing cache battery packs.
d. Select your adapter and type 2. Then press Enter to force the battery error.
Note: This option places the battery into the error state, which requires it to be replaced.
e. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a
battery error, type q to cancel.
f. Determine that it is safe to replace the cache battery pack. See Displaying Rechargeable Battery
Information.
16 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
g. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be
safely replaced. You might need to reselect the option Display Controller Rechargeable Battery
Information multiple times as it might take several minutes before it is safe to replace the cache
battery pack.
6. Go to step 8.
7. Force the cache battery pack into an error state on IBM i (to prevent possible data loss), as follows
on the system or partition that contains the adapter:
a. Be sure that you are signed on to the system with at least service level authority.
b. Type strsst on the command line and press Enter.
c. Type your service tools user ID and service tools password on the System Service Tools (SST)
Sign On display. Press Enter.
d. Select Start a Service Tool from the System Service Tools (SST) display and press Enter.
e. Select Hardware Service Manager from the Start a Service Tool display and press Enter.
f. Select Work with resources containing cache battery packs from the Hardware Service Manager
display and press Enter.
g. Select Force battery pack into error state for the I/O card that you are working with from the
Work with Resources containing cache battery packs display and press Enter.
h. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been
selected and press the function key that confirms your choice.
i. Return to the Work with Resources containing cache battery packs display and select Display
battery information. Ensure that the field Safe to replace cache battery is set to Yes. This might
take several minutes and you might need to press the refresh key to see the field update.
8. Place a rack-mounted system into service position. Go to Placing a 8233-E8B or 8236-E8C in the
service position
9. Remove the service access cover. Go to Removing the service access cover from an 8233-E8B or
8236-E8C system
10. Check cache battery LED by examining the LED on the cache battery pack (C).
Removal and replacement procedures for the 8233-E8B and 8236-E8C 17
Is the cache battery LED flashing?
Yes, the LED is flashing No, the LED is not flashing.
? Go to step 12.
11. Stop removing the cache battery pack
The flashing LED indicates that there is cached data on the battery that has not been saved. This
data will be lost if the battery is removed at this time. Complete this service procedure without
replacing the battery, and then retry it to ensure all cache data is saved before the battery is replaced.
Go to step 18.
12. Remove the cache battery pack as follows:
18 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Attention: Do not remove the cache battery if the LED (C) is flashing because this indicates cache
data exists.
a. Move the cache battery lever (A) away from the connector to disengage the battery from the
connector.
b. Slide the cache battery pack (B) out of the mounting guides and remove it from the controller.
13. Install the cache battery pack as follows:
Removal and replacement procedures for the 8233-E8B and 8236-E8C 19
Attention:
Ensure that the cache battery pack is disconnected for at least 60 seconds before you connect the new
battery. This is the minimum amount of time needed for the card to recognize that the battery has
been replaced.
a. Slide the cache battery pack (B) into the mounting guides on the controller until it is seated in
the battery connector (C).
b. After the battery is seated in the connector, move the lever (A) to the latched position to fully
seat the battery into the connector.
14. Restart the adapter's write cache by doing the following steps:
Select your operating system:
v AIX: Go to step 15
v Linux: 16
v IBM i: Go to step 17
15. Perform the following steps:
a. Navigate to the IBM SAS Disk Array Manager by using the information in step 3a.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Start Adapter Cache.
e. Select the controller with the battery you just replaced and press Enter.
f. Go to step 18.
16. Perform the following steps:
a. Run the iprconfig utility by typing iprconfig.
b. Select Work with disk unit recovery.
c. Select Work with resources containing cache battery packs.
d. Start the I/O adapter cache on the adapter for the battery that you just replaced by typing 3 and
press Enter.
20 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
e. Go to step 18.
17. Perform the following steps:
a. Return to the Work with Resources containing Cache Battery Packs display using the information
in step 7a to 7f and select the Start IOA cache. Press Enter.
b. Ensure that you get the message Cache was started.
c. Go to step 18.
18. Install the service access cover. Go to Installing the service access cover on an 8233-E8B or 8236-E8C
system
19. Place into operating position. Go to Placing the 8233-E8B or 8236-E8C in the operating position
20. Close the back door on the unit you are servicing.
21. Verify that the new resource is working correctly, as described in Verifying the installed part.
Control panel and signal cable
You can remove or replace the control panel (operator panel) or signal cable in the server.
Removing the control panel with the power off
These procedures describe the removal of the control panel with the power off.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the control panel from the server. For instructions, see Removing a part by using the Systems
Director Management Console. If you do not have an SDMC, complete this procedure to remove a
control panel.
If your system is managed by the Hardware Management Console (HMC), you can use the HMC to
complete the steps for removing a control panel. For instructions, see Removing a part by using the
Hardware Management Console. If you do not have an HMC, complete this procedure to remove a
control panel.
To remove the control panel with the power off, complete the following procedure:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Stop the system. For instructions, see Stopping a system or logical partition.
3. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
disconnect all the power sources to the system.
4. Unlock the control panel by pressing release button (A), and pull the control panel out of the system,
as shown in the following figure. Remove the control panel by pressing the locking tabs (C) located
on either side of the panel, grasping the edges of the panel, and pulling the panel out of its bay.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 21
5. Disconnect the control-panel signal cable (B) from the back of the control panel.
6. If you are removing the control panel as part of another procedure, return to that procedure now.
7. Replace the control panel with the power off. For instructions, see Replacing the 8233-E8B or
8236-E8C control panel with the power off.
Replacing the control panel with the power off
These procedures describe the replacement of the control panel with the power off.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the control panel in the server. For instructions, see Replacing a part by using the Systems
Director Management Console. If you do not have an SDMC, complete this procedure to replace a control
panel.
If your system is managed by the Hardware Management Console (HMC), you can use the HMC to
complete the steps for replacing a control panel. For instructions, see Replacing a part by using the
Hardware Management Console. If you do not have an HMC, complete this procedure to replace a
control panel.
To replace the control panel with the power off, complete the following procedure:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Remove the control panel with power off. For more information, see Removing the 8233-E8B or
8236-E8C control panel with the power off.
3. If necessary, remove the new control panel from its packaging.
4. Connect the control-panel signal cable (A) to the back of the control panel, as shown in the following
figure.
5. Carefully slide the control panel into the control panel bay until you feel the locking tabs lock in
place.
Figure 1. Removing a control panel from a rack-mounted system
22 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: If the control panel does not lock in place, the media device cable might be in the path of the
control panel signal cable. You need to move the media device cable out of the path of the control
panel signal cable before replacing the control panel. To access the media device cable, you must
remove the fans.
6. If you replaced the control panel as part of another procedure, return to that procedure now. If you
replaced the control panel because it was not operational, verify that the new resource is functional.
For instructions, see Verifying the installed part.
7. Start the system. For instructions, see Starting the system or logical partition.
Note: If a ball or dot appears in the middle of the control panel, toggle between function 02 and
function 01 until the ball or dot disappears.
Removing and replacing the control panel signal cable
These procedures describe the removal and replacement of the control-panel signal cable.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove and replace the control-panel signal cable. For instructions, see Removing a part by using the
Systems Director Management Console and Replacing a part by using the Systems Director Management
Console. If you do not have an SDMC, complete this procedure to remove and replace a control-panel
signal cable.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing a control-panel signal cable. For instructions, see Replacing a part by using the
Hardware Management Console. If you do not have an HMC, complete this procedure to remove and
replace a control-panel signal cable.
To replace the control-panel signal cable, complete the following procedure:
Figure 2. Replacing a control panel in a rack-mounted system
Removal and replacement procedures for the 8233-E8B and 8236-E8C 23
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Remove the disk-drive backplane. For instructions, see Removing the 8233-E8B or 8236-E8C disk drive
backplane.
3. Remove and replace the control-panel signal cable for the 2.5-inch disk drive using the following
instructions:
a. To remove the control panel cable, perform the following step:
Important: When removing the cable, note where the connections are and how the cable is routed.
You can use this information to complete the replacement procedure.
1) Disconnect the signal cable (B) from port (A) on the back of the backplane and lift the cable
out of the system.
b. Install the replacement control-panel signal cable into the system by performing the following
steps:
1) Attach the signal cable (B) to port (A) on the back of the backplane.
Figure 3. 2.5-inch disk drive
Figure 4. Removing a signal cable in a rack-mounted system
24 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
2) Route the signal cable to the front of the system.
4. Replace the control panel with the power off. For instructions, see Replacing the 8233-E8B or
8236-E8C control panel with the power off.
Note: When installing a new signal cable, you can replace the control panel while the disk-drive
backplane is still outside the system. After you have replaced the control panel, replace the disk-drive
backplane as instructed in Disk drive backplanes.
Disk drive backplane
Use this procedure to service the disk drive backplane.
Removing the disk drive backplane
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for removing the disk drive backplane from the server. For instructions, see Removing
a part using the Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing the disk drive backplane from the server. For information about using the HMC to
remove the disk drive backplane, see Removing a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove the disk drive backplane:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Do one of the following steps, as applicable:
a. If you are removing the disk drive backplane for a system upgrade or as part of another
procedure, continue to the next step.
b. If you are removing the disk drive backplane because of a system failure, use the service action
log to help identify the failing part. See Identifying a part .
3. Stop the system or logical partition. For instructions, see Stopping a system or logical partition.
4. Disconnect the power source from the system by unplugging the system.
Figure 5. Installing a signal cable in a rack-mounted system
Removal and replacement procedures for the 8233-E8B and 8236-E8C 25
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been completely disconnected.
(L003)
or
5. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or
8236-E8C system.
6. Place the system into the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the
service position.
7. Remove the service access cover. For instructions, see Removing the service access cover from an
8233-E8B or 8236-E8C system.
8. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. To remove the fans and the fan cage, complete the following steps:
a. To remove the fans, squeeze the tab (A) and lift the fan out of the fan cage as shown in Figure 6
on page 27.
Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this
step. To pull the fan cage out of the system, you might need to unplug the control panel cable to
26 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
allow more space.
b. To remove the air baffles, grasp and pull the air baffle out of the fan cage, as shown in Figure 7
on page 28.
Figure 6. Removing the fan from the fan cage
Removal and replacement procedures for the 8233-E8B and 8236-E8C 27
c. To remove the fan cage, loosen the two thumbscrews (A) and lift the fan cage (B) out of the
system as shown in Figure 8.
Figure 7. Removing the air baffles from the fan cage
Figure 8. Removing the fan cage from the system
28 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
10. Disconnect the cables from the back of the disk drive backplane, as shown in Figure 9. If you have
connected an external SAS cable, ensure that you disconnect the SAS cable from the disk drive
backplane. For more information about external SAS port, refer to Installing the external SAS port on
a model 8233-E8B or 8236-E8C.
11. If you need to replace the backplane because of an upgrade or system failure, complete the following
steps to remove the disk drives, depending on your backplane:
Note: If you remove a backplane that is populated with disk drives, the backplane will be heavy.
Ensure that you can safely complete the procedure.
a. To remove the disk drive from the backplane, perform the following steps:
Note: Record the location of each disk drive you remove so the disk drive can be replaced into
the same location again.
1) Unlock the disk drive handle (A) and pull it toward you to unseat the disk drive.
2) Place one hand under the disk drive and carefully pull the drive out of the system as shown
in Figure 10 on page 30.
Figure 9. Disconnecting the media device cables from the disk drive backplane
Removal and replacement procedures for the 8233-E8B and 8236-E8C 29
12. Remove the media device by pressing in on the latch (A) and sliding the media device out of the
system, as shown in the following figure.
Figure 10. Removing the disk drive from the backplane
Figure 11. Removing the media device from the disk drive backplane
30 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
13. Unlock the disk drive backplane by pulling the handles open from the center of the backplane as
shown in Figure 12 and Figure 13.
14. Pull the backplane out of the server.
15. Remove the control panel and signal cable from the backplane to use in the replacement backplane.
For instructions, see Control panel and signal cables.
Figure 12. Removing the populated disk drive backplane
Figure 13. Removing the unpopulated disk drive backplane
Removal and replacement procedures for the 8233-E8B and 8236-E8C 31
16. If you removed the backplane as a part of another procedure, continue with that procedure. To
replace the backplane or install a new backplane, see Replacing the 8233-E8B or 8236-E8C disk drive
backplane.
Replacing the disk drive backplane
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for replacing the disk drive backplane in the server. For instructions, see Replacing a
part using the Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing the disk drive backplane in the server. For instructions, see Exchanging a part using
the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace the disk drive backplane:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Install the control panel and signal cable. For instructions, see Control panel and signal cables.
3. Check for a protective cover that might be present on the lower-right connector pins. If a protective
cover is present, remove the cover (A), as shown in Figure 14.
4. Carefully lift the backplane along two sides and align it with the slot in the server.
Important: If you remove a backplane that is populated with disk drives, the backplane will be
heavy. Ensure that you can safely perform this task before you begin.
5. Insert the backplane firmly into the server, as shown in the following figures.
Figure 14. Removing the protective cover from the connector pins
32 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
6. Secure the backplane with the locking tabs.
7. Connect the cables to the backplane, as shown in the following figure. You can also connect an
external SAS cable to the backplane. For more information about external SAS port, refer to
Installing the external SAS port on a model 8233-E8B or 8236-E8C.
Figure 15. Installing the unpopulated disk drive backplane on a rack-mounted model
Figure 16. Installing the populated disk drive backplane on a rack-mounted model
Removal and replacement procedures for the 8233-E8B and 8236-E8C 33
8. Replace the disk drives, if you removed them earlier. To replace the disk drives, perform the
following steps:
a. Hold the disk drive by the top and bottom edge as you position the drive and insert it into the
expansion unit. Do not hold the disk drive by the handle.
b. Slide the disk drive all the way into the system or expansion unit, and push the disk drive
handle (A) in until it locks, as shown in the following figure.
Figure 17. Connecting the cables to the disk drive backplane
34 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
9. Replace the Slimline media device by inserting it into the slot and pushing it in until the lever (B)
locks into place, as shown in the following figure.
10. Replace the fans, air baffles, and the fan cage.
Figure 18. Installing the backplane disk drives
Removal and replacement procedures for the 8233-E8B and 8236-E8C 35
a. To replace the fan cage (A), align the locator pins through the holes, and tighten the two
thumbscrews (B), as shown in Figure 19.
b. To install the air baffles, insert the air baffles into their locations in the fan cage, and push down
until they lock into place, as shown in Figure 20 on page 37.
Figure 19. Replacing the fan cage
36 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
c. To install the fans, insert the fans into their locations in the fan cage, and push down on each fan
until it locks into place, as shown in Figure 21.
Figure 20. Installing the air baffles in the fan cage
Figure 21. Installing the fans in the fan cage
Removal and replacement procedures for the 8233-E8B and 8236-E8C 37
11. Replace the service access cover. See Installing the service access cover on an 8233-E8B or 8236-E8C
system.
12. Replace the front cover. See Installing the front cover on an 8233-E8B or 8236-E8C system.
13. Place the system in the operating position. For more information, see Placing the 8233-E8B or
8236-E8C in the operating position.
14. Start the system or logical partition. See Starting the system or logical partition.
15. Verify the installed part. For instructions, see Verifying the installed part.
External cables
Use this procedure to service the external cables except the service processor cable or SMP cable.
1. Remove the external cables
Use the graphics in the locations table to locate the cable.
Trace the cable and record the system location at the other end, and then remove the cable.
2. Install the external cables
Use the graphic in the locations table to locate where the cable is to be installed.
Install the cable, and then route and install the other end of the cable to the system location that was previously
recorded.
3. Verify the repair
Go to Verifying a repair.
Fans
Use this procedure to service the fan.
Removing the fan
For information about using the HMC to remove fans, see Removing a part by using the Hardware
Management Console
To remove the fan from a system that is not managed by the HMC, complete the following steps:
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. Determine if the repair can continue concurrently. To continue the repair concurrently, the following
conditions must be true:
v Air moving devices (AMDs) must be installed in all four positions.
Each of these four AMD units has a green LED and an amber LED. The green LED will be on for
all four of the fans, but the fan with the failure on it will have the amber fault LED lit.
v If only amber fault LED is on, the repair can be performed concurrently.
38 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: If more than one amber fault LED is on then the repair must be done nonconcurrently.
3. Select an action
Nonconcurrent repair.
Power off the unit
and continue the
repair.
Concurrent repair.
Continue the repair
with unit power on.
Delay the repair.
? Go to step 9 on page
40.
This ends the procedure.
4. Stop the system, as described in Stopping a system or logical partition.
5. Disconnect all power cords from the unit you are servicing.
6. If you have a rack-mounted system, place it in the service position, as described in Placing a 8233-E8B
or 8236-E8C in the service position.
7. Remove the service access cover, as described in Removing the service access cover from an 8233-E8B
or 8236-E8C system.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 39
8. Attach the wrist strap
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. Remove the fan
a. Locate the failing system fan to be replaced. It is the fan indicated by the amber fault LED being
on.
b. While squeezing the latch (A), reach through the hole of the failing fan and pull the fan out of the
fan cage.
If you removed the fan as part of another service action, or if you need to replace the fan that you
removed, see Replacing the 8233-E8B or 8236-E8C system unit fan.
Replacing the fan
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following Web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Fix Central.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
40 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
If your system is managed by the HMC, use the HMC to complete the steps for replacing the fan in the
server. For information about using the HMC to replace fans, see Replacing a part by using the Hardware
Management Console. If you do not have an HMC, complete the following steps to replace the fan.
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
3. If necessary, remove the fan from its antistatic package.
4. Install the fan
a. Lower the fan into its location in the fan cage. Orient the fan so the latch is located towards the
middle of the system.
b. Push down on the fan until it locks into place.
5. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or
8236-E8C system.
6. Place the system in the operating position, as described in Placing the 8233-E8B or 8236-E8C in the
operating position.
7. If you removed the power cords, connect all power cords to the unit you are servicing.
8. Verify the installed part, as described in Verifying the installed part.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 41
GX adapters
Use this procedure to service the GX adapters.
Removing GX adapters
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove an
adapter. For instructions, see Removing a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for removing an adapter. For instructions, see Removing a part by using the Systems
Director Management Console.
To remove the adapter from a system that is not managed by the HMC or SDMC, complete the following
steps:
1. Complete the prerequisite tasks described in “Before you begin” on page 115; however do not
disconnect the power cords at this time.
2. Stop the system, as described in Stopping a system or logical partition.
3. If you are working on a rack-mounted system unit, open the rear rack door.
4. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
5. If necessary, disconnect any cables connected to the adapter at the rear of the system.
Tip: Label the cables before removing them from the adapter.
6. Place the system in the service position, as described in Placing a 8233-E8B or 8236-E8C in the
service position.
7. Remove the access cover, as described in Removing the service access cover from an 8233-E8B or
8236-E8C system.
8. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. Remove the existing adapter assembly by sliding the retainers (A) into the open position. Unlatch
the blue locking latches (B) on the adapter assembly by pushing them together and pulling them up
into the open position, and then lift the adapter out of the system.
42 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
10. Place an adapter or a filler panel in the expansion slot of the system unit for proper airflow and
cooling:
v To replace the adapter, see Replacing GX adapters in an 8233-E8B or 8236-E8C with power turned
off.
v If you are not replacing the adapter, place a filler panel in the slot and continue with the next step.
11. Seal the expansion slot by using an expansion slot cover.
12. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or
8236-E8C system.
13. If you are working on a rack-mounted system unit, ensure that the system cables are routed correctly
through the cable-management arm.
14. If you are working on a rack-mounted system unit, place the system in the operating position as
described in Placing the 8233-E8B or 8236-E8C in the operating position.
15. Reconnect all power and signal cables to their respective connectors.
16. Close the back rack door or the back system door.
17. Start the system, as described in Starting the system or logical partition.
18. Verify that the new resource is working correctly, as described in Verifying the installed part.
Replace the GX adapter with the power turned off.
Replacing GX adapters
Note: Replacing this feature is a customer task. You can perform this task yourself, or contact a service
provider to perform the task for you. You might be charged a fee by the service provider for this service.
If your system is managed by the Hardware Management Console (HMC), use the HMC to replace an
adapter. For instructions, see Exchanging a part by using the Hardware Management Console.
Figure 22. Adapter removal from a rack-mounted system
Removal and replacement procedures for the 8233-E8B and 8236-E8C 43
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for replacing an adapter. For instructions, see Replacing a part by using the Systems
Director Management Console.
To replace the adapter on a system that is not managed by the HMC or SDMC, complete the following
steps:
1. Remove the adapter from the system unit.
2. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
3. Carefully grasp the new adapter assembly by its top edge, and align the adapter assembly with the
slot and its connector on the system backplane.
4. Press the adapter assembly firmly into its connector. When you install an adapter into the system,
ensure that it is completely and correctly seated in its connector.
5. Seat the adapter. Push down evenly on both of the blue locking latches (A) at the same time until
the latches snap into place, indicating that the adapter assembly is seated correctly and locked into
position. Slide the retainers (B) into the closed position.
6. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or
8236-E8C system.
Figure 23. Adapter installation in a rack-mounted system
44 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
7. If you are working on a rack-mounted system unit, ensure that the system cables are routed through
the cable-management arm correctly.
8. If you are working on a rack-mounted system unit, place the system in the operating position as
described in Placing the 8233-E8B or 8236-E8C in the operating position.
9. Reconnect all power and signal cables to their respective connectors.
10. Close the rear rack door or the rear system door.
11. Start the system, as described in Starting the system or logical partition.
12. Verify that the new resource is working correctly, as described in Verifying the installed part.
Host Ethernet Adapter
Use this procedure to service the Host Ethernet Adapter.
Removing the Host Ethernet Adapter
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing the host Ethernet adapter from the server. For instructions, see Replacing a part by
using the Hardware Management Console.
Note: This procedure is for the internal Host Ethernet Adapter. For PCI Ethernet adapter information, see
PCI Adapters.
If you do not have an HMC, complete the following steps to remove the Host Ethernet Adapter:
Note: Removing the Host Ethernet Adapter is a nonconcurrent service action.
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. If you are removing the Host Ethernet Adapter for a system upgrade or as part of another
procedure, continue to the next step. If you are removing the Host Ethernet Adapter because of a
system failure, use the service action log to help identify the failing part. For more information, see
Identifying a part.
3. Stop the system. For instructions, see Stopping a system or logical partition.
4. Disconnect the power source from the system by unplugging the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been disconnected.
(L003)
or
Removal and replacement procedures for the 8233-E8B and 8236-E8C 45
5. Place the system in the service position. For more information, see Placing a 8233-E8B or 8236-E8C in
the service position.
6. Remove the service access cover. For more information, see Removing the service access cover from
an 8233-E8B or 8236-E8C system.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Locate the Ethernet adapter you want to remove as shown in the following figures.
8. Label and remove the Ethernet cables from the back of the system. Note cable connector locations for
future reference.
9. Unlock the Ethernet adapter by pushing the latch (A) out.
10. Lift the Ethernet adapter (B) out of the connector using the blue handle, as shown in the following
figures.
46 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
11. If you removed the adapter as a part of another procedure, continue with that procedure. To replace
the adapter or install a new adapter, see Replacing the 8233-E8B or 8236-E8C Host Ethernet Adapter
with the power off.
Replacing the Host Ethernet Adapter
To remove the Host Ethernet Adapter, see Removing the 8233-E8B or 8236-E8C Host Ethernet Adapter
with the power off.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing the host Ethernet adapter in the server. For instructions, see Replacing a part by using
the Hardware Management Console.
If you do not have an HMC, complete the following steps to replace the Host Ethernet Adapter:
Note: Replacing the Host Ethernet Adapter is a nonconcurrent service action.
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Carefully grasp the blue adapter handle. Align the adapter with the connector. Ensure that the hook
on the tail stock of the adapter slips into the slot on the back of the system.
3. Insert the adapter (A) firmly into the connector by pushing down on the blue handle, as shown in the
following figure.
4. Secure the adapter with the locking tabs (B).
Figure 24. Removing a host Ethernet adapter ( P1-C6) for 8233-E8B or 8236-E8C
Removal and replacement procedures for the 8233-E8B and 8236-E8C 47
5. Replace the service access cover. For more information, see Installing the service access cover on an
8233-E8B or 8236-E8C system .
6. Reconnect the Ethernet cables to the back of the system.
7. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in
the operating position.
8. Verify the installed part. For instructions, see Verifying the installed part.
Media devices
Use this procedure to service media devices.
Select the procedure that applies to the media device that needs to be serviced.
v Removing and replacing a SAS media device
v Removing and replacing a Slimline media device with the power on
v Removing and replacing a universal serial bus disk drive
v Installing an external universal serial bus disk drive
SAS media device
Use this procedure to service SAS media devices.
Removing a SAS media device:
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for removing media devices from the server. For instructions, see Removing a part
using the Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing media devices from the server. For information about using the HMC to remove
media devices, see Removing a part using the Hardware Management Console.
Figure 25. Install the Host Ethernet Adapter on a rack-mounted model for 8233-E8B or 8236-E8C
48 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
If you do not have an HMC or SDMC, complete the following steps to remove the media device:
Before you remove a media device, perform the prerequisite tasks described in Before you begin.
To remove a SAS media device, complete the following steps:
1. Stop the media device and eject any media.
2. Power off the system. See Stopping the system or logical partition.
3. Disconnect the power cords from the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been disconnected.
(L003)
or
4. Remove the front cover. See Removing the front cover.
5. Place the system into the service position. See Placing the system into the service position.
6. Remove the service access cover. See Removing the service access cover.
7. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
8. Remove the fans and fan cage.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 49
Note: Although the following figures show a media device, you have already removed the media
device filler and your system will have an empty space until you complete all the steps.
a. Remove the fan by squeezing or lifting the tab (A), depending on your model, and lifting it out
of the system as shown in the following figure.
b. Remove the blue air baffles as shown in the following figure.
Figure 26. Fan removal
50 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
c. Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage.
Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this
step. To pull the fan cage out of the system, you might need to unplug the control panel cable to
allow more space.
Figure 27. Air baffle removal
Removal and replacement procedures for the 8233-E8B and 8236-E8C 51
9. If you are replacing the media device cable, disconnect the media-device connections from the
system backplane.
10. Press the release latch (A) and unseat the media device (B) from the system as shown.
Note: You need to remove the media cable (C) attached at the back of the device before removing
the media device.
Figure 28. Fan cage removal
52 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
11. Pull the media device from the system.
12. If you removed the media device as a part of another procedure, return to that procedure.
13. If you removed the media device to replace the media device, see Installing a SAS media device in
8233-E8B.
Replacing a SAS media device:
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see Removing a part using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a feature in the server. For information about using the HMC to install a feature, see
Installing a feature using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to install a media device from the
system or partition that controls the media device:
Before you install a SAS media device, perform the prerequisite tasks described in Before you begin.
To install a SAS media device, do the following steps:
1. Power off the system. See Stopping the system or logical partition.
2. Disconnect the power cords from the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been completely disconnected.
(L003)
Removal and replacement procedures for the 8233-E8B and 8236-E8C 53
or
3. Place the system into the service position. See Placing the system into the service position.
4. Remove the front cover. See Removing the front cover for instructions.
5. Remove the service access cover. See Removing the service access cover for instructions.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Remove the media device filler, if present, as shown in the following figure.
54 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
8. Install the rails (A) onto the media device using the two small screws (B) included with your media
device. The rails must be placed on each side of the media device. Screw the rails into the side of the
media device using the lower screw hole.
9. Push the media device (A) into the system until you feel the latches (B) lock in place as shown.
Note: You need to attach the media cable (C) provided at the back of the device before installing the
media device.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 55
10. Remove the fans and fan cage.
Note: Although the following figures show a media device, you have already removed the media
device filler and your system will have an empty space until you complete all the steps.
a. Remove the fan by squeezing or lifting the tab (A), depending on your model, and lifting it out
of the system as shown in the following figure.
56 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
b. Remove the blue air baffles as shown in the following figure.
c. Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage.
Figure 29. Fan removal
Figure 30. Air baffle removal
Removal and replacement procedures for the 8233-E8B and 8236-E8C 57
Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this
step. To pull the fan cage out of the system, you might need to unplug the control panel cable to
allow more space.
11. Connect the SAS cable to the drive.
12. Attach the media-device connections to the system backplane.
13. Reinstall the fans and fan cage.
a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B).
Figure 31. Fan cage removal
58 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
b. Replace the blue air baffles as shown in the following figure.
c. Replace the fans by inserting them into the fan cage until they lock into place, as shown in the
following figure.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 59
14. Install the service access cover. See Installing the service access cover on a rack-mounted system for
instructions.
15. Attach the front cover. See Installing the front cover on the rack-mounted system.
16. Connect the power cords to the system.
17. Start the system. See Starting the system or logical partition.
18. Verify that the media device is installed and working correctly. See Verifying the installed part.
Removing and replacing a Slimline media device with the power on
Use this procedure to service Slimline media devices.
Removing a Slimline media device with the power on:
Before you remove a media device, perform the prerequisite tasks described in Before you begin.
To remove a Slimline media device, complete the following steps:
1. Remove any media from the drive.
2. Remove the front cover. See Removing the front cover from a rack-mounted system.
3. Press the retaining tab (A) away from the Slimline media device until it is unseated from the media
bay as shown.
60 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
4. Pull the Slimline media device (B) away from the system.
Replacing a Slimline media device with the power on:
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see Removing a part using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a feature in the server. For information about using the HMC to install a feature, see
Installing a feature using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to install a media device from the
system or partition that controls the media device:
Note: The 8233-E8B and 8236-E8C support only one Slimline media device per processor.
To install a Slimline media device for models 8233-E8B and 8236-E8C with the system or partition
powered off, follow these steps from the system or partition that controls the media device:
1. Stop the system. For instructions, see http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/
topic/p7haj/crustopsys.htm.
If you have created an IBM i logical partition on your system, see Logical partitioning. To find
instructions for powering off a system with an IBM i logical partition, see Shutting down IBM i
logical partitions.
Attention: Media devices are fragile. Handle with care.
2. Remove the system unit front cover. For instructions, see Removing the front cover from a
rack-mounted system.
3. Find the package that contains the new media device and remove it from the static-protective
package.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 61
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
4. Remove the media-device filler, if present.
5. Align the media device with the Slimline media bay, and support the bottom of the device as you
slide it halfway into the system.
6. Push the device (A) fully into the system as shown in the following figure. To ensure that the device
is held firmly in place, push in the plastic retaining tab (B).
7. Start the system. For instructions on starting the system, see Starting the system or logical partition.
8. Verify that the new resource is functional. See Verifying the installed part.
9. Replace the front cover on the system. For instructions, see Installing the front cover on the
rack-mounted system.
10. Close the front rack door, if this is a rack-mounted model.
Removing and replacing a USB disk drive
Use this procedure to service USB media devices.
Removing a USB disk drive:
Before you remove a media device, perform the prerequisite tasks described in Before you begin.
Figure 32. Installation of a Slimline media device
62 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
To remove a USB disk drive media device, complete the following steps:
1. Power off the system. See Stopping the system or logical partition.
2. Disconnect the power cords from the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been disconnected.
(L003)
or
3. Remove the front cover. See Removing the front cover.
4. Place the system into the service position. See Placing the system into the service position.
5. Remove the service access cover. See Removing the service access cover.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Remove the fans.
a. Remove the fan by squeezing the tab (A) and lifting it out of the system as shown in the
following figure.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 63
b. Remove the blue air baffles as shown in the following figure.
8. Remove the fans and fan cage.
Figure 33. Fan removal
Figure 34. Air baffle removal
64 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: Although the following figures show a media device, you have already removed the media
device filler and your system will have an empty space until you complete all the steps.
a. Remove the fan by squeezing or lifting the tab (A) and lifting it out of the system, as shown in
the following figure.
Remove the blue air baffles as shown in the following figure.
Figure 35. Fan removal
Removal and replacement procedures for the 8233-E8B and 8236-E8C 65
Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage.
Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this
step. To pull the fan cage out of the system, you might need to unplug the control panel cable to
allow more space.
Figure 36. Air baffle removal
66 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
9. Disconnect the media-device connections from the system backplane.
Note:
v When removing cables, ensure that you release any cable locks.
Figure 37. Fan cage removal
Removal and replacement procedures for the 8233-E8B and 8236-E8C 67
v Optional: If you do not intend to replace the USB disk drive, you can reconnect the control panel
USB cable and media connections.
10. Reattach any other cables that you previously disconnected from the backplane.
11. Reinstall the fans into the 8233-E8B or 8236-E8C system.
a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B).
Replace the blue air baffles as shown in the following figure.
68 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Replace the fans by inserting them into the fan cage until they lock into place, as shown in the
following figure.
12. Press the release latch (A) and unseat the media device (B) from the system as shown.
Note: You need to remove the media cables (A) and (B) provided at the back of the device before
removing the media device.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 69
13. Pull the media device from the system.
14. If you removed the media device as a part of another procedure, return to that procedure.
15. If you removed the media device to replace the media device, see Installing a universal serial bus
disk drive in 8233-E8B .
Replacing a USB disk drive:
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see Removing a part using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a feature in the server. For information about using the HMC to install a feature, see
Installing a feature using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to install a USB disk drive:
Before you install a USB disk drive, perform the prerequisite tasks described in http://
publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/beforebegin.htm.
To install a USB disk drive, do the following steps:
1. Power off the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/
p7haj/crustopsys.htm.
2. Disconnect the power cords from the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been disconnected.
(L003)
or
70 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
3. Place the system into the service position. See Placing the system into the service position.
4. Remove the front cover. See Removing the front cover for instructions.
5. Remove the service access cover. See Removing the service access cover for instructions.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Remove the media device filler, if present, as shown in the following figure.
8. Remove the fans and fan cage.
Note: Although the following figures show a media device, you have already removed the media
device filler and your system will have an empty space until you complete all the steps.
a. Remove the fan by squeezing or lifting the tab (A) and lifting it out of the system, as shown in
the following figure.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 71
Remove the blue air baffles as shown in the following figure.
Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage.
Figure 38. Fan removal
Figure 39. Air baffle removal
72 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this
step. To pull the fan cage out of the system, you might need to unplug the control panel cable to
allow more space.
9. Install the rails (A) onto the media device using the two small screws (B) included with your media
device. Place the rails on each side of the media device. Screw the rails into the side of the media
device using the lower screw hole.
10. Press the media device (A) into the system until you feel the latches (B) lock in place as shown.
Figure 40. Fan cage removal
Removal and replacement procedures for the 8233-E8B and 8236-E8C 73
Note: You need to attach the media cables (A) and (B) provided at the back of the device before
installing the media device.
11. Attach the media-device connections (A) and (B) to the system backplane. The connection for cable
(A) on the backplane is already occupied. Remove the existing cable and replace it with cable (A).
Note: The USB connection on the front panel is now inoperative.
74 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
12. Reattach any other cables that you previously disconnected from the backplane.
13. Reinstall the fans into the 8233-E8B or 8236-E8C system.
a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B).
Replace the blue air baffles as shown in the following figure.
Figure 41. Attaching the media device connections to the system backplane of a 8233-E8B or 8236-E8C system
Removal and replacement procedures for the 8233-E8B and 8236-E8C 75
Replace the fans by inserting them into the fan cage until they lock into place, as shown in the
following figure.
14. Install the service access cover. See Installing the service access cover on a rack-mounted model for
instructions.
15. Attach the front cover. See Installing the front cover on the rack-mounted system.
16. Connect the power cords to the system.
17. Start the system. See Starting the system or logical partition.
76 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
18. Configure the removable disk drive by performing the following steps:
a. For the AIX operating system, log in as root user.
b. At the command line, type cfgmgr.
c. Press Enter.
d. To verify that the system recognizes the device, run the lsdev -Cc usbms command.
Note: The Linux operating system automatically configures the drive as a disk drive with a name in
the format sdx, for example, sda, sdb, and sdc. To verify if the system recognizes the device, enter
lsusb. To find the device that is associated with the USB disk drive, enter lsscsi.
19. Verify that the media device is installed and working correctly. See Verifying the installed part.
Installing an external USB docking station and removable disk drive with power on
Use this procedure to service external USB media devices.
Keep in mind the following points when you install the USB external docking station for a removable
disk drive:
v A rack shelf is not included with feature code 1104.
v The docking station can be rested on a flat surface in a rack or on a table top next to a rack.
v Ensure that the device is on a flat surface, is right side up, and is not likely to be bumped, dropped, or
otherwise damaged or jolted.
v If the docking station is placed by itself on a rack shelf, it has space on the sides of the device. No filler
panels are provided with the docking station.
v If the docking station is placed on a rack shelf, ensure that the docking station has sufficient air flow.
Also ensure that it does not affect the air flow of the other systems in the rack.
v Do not block the fan on the back of the docking station.
v The external USB disk drive can be installed while the system is powered on and needs to be
configured following the installation.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a media enclosure. For instructions, see Removing a part using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a feature in the server. For information about using the HMC for installing a media
enclosure, see Installing a feature using the Hardware Management Console.
If you do not have an HMC or SDMC, install the docking station by completing the following steps:
To install the docking station, complete the following steps:
1. Place the docking station on a flat service. Use the previous guidelines when selecting a location.
2. Attach one end of the external USB cable (F) to the back of the external docking station.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 77
3. Attach the other end of the external USB cable (F) to an external, integrated USB port or to the USB
ports on a four-port USB PCI Express adapter (feature code 2728).
4. Attach the power supply cord (E) to the back of the external docking station and plug it into a power
source. In addition to the external power supply cord, there are also universal adapters to be used as
needed.
5. After the docking station is powered on, insert the disk drive into the docking station. A green
indicator light appears when the disk drive is placed into the dock correctly.
6. Configure the removable disk drive by performing the following steps:
a. For the AIX operating system, log in as root user.
b. At the command line, type cfgmgr.
c. Press Enter.
d. To verify that the system recognizes the device, run the lsdev -Cc usbms command.
Note: The Linux operating system automatically configures the drive as a disk drive with a name in
the format sdx, for example, sda, sdb, and sdc. To verify if the system recognizes the device, enter
lsusb. To find the device that is associated with the USB disk drive, enter lsscsi.
7. Verify that the media device is installed and working correctly. See Verifying the installed part.
Memory modules
Use this procedure to service dual inline memory modules (DIMMs).
Removing memory modules
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing memory modules from the server. For information about using the HMC to remove
memory modules, see Removing a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the memory module from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
If you do not have an HMC or SDMC, complete the following procedure to remove memory modules.
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. If you are removing the memory modules for a system upgrade or as part of another procedure,
continue to the next step. If you are removing the memory modules because of a system failure, use
the service action log to help identify the failing part as described in Identifying a part. Identification
light-emitting diodes (LEDs) for failing memory modules are located on the top of the system
processor assembly.
3. Stop the system, as described in Stopping a system or logical partition.
4. Disconnect the power source from the system by unplugging the system.
78 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
5. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
6. If you have a rack-mounted system, place it in the service position, as described in Placing a
8233-E8B or 8236-E8C in the service position.
7. Remove the service access cover, as described in Removing the service access cover from an
8233-E8B or 8236-E8C system.
8. Locate the memory module that you want to remove by observing the service indicator
light-emitting diodes (LEDs).
9. Remove the system processor assembly and protective cover, as described in Removing the system
processor assembly. Skip any steps that you have already completed.
10. Unlock the memory module cover by pushing the locking tab (A) down, as shown in Figure 42 on
page 80. Lift the cover up to access the memory modules.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 79
11. Unlock the memory module by pushing the locking tabs (B) out and down, as shown in Figure 43.
Lift the memory module (C) out of the connector. The lever action of the tabs forces the memory
module out of the connector.
Finally, replace the memory module you removed or install new memory modules.
Figure 42. Removing a memory module
Figure 43. Removing a memory module
80 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Replacing memory modules
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Fix Central.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For information about using the HMC to install a feature, see Installing a feature by using the
Hardware Management Console. If you do not have an HMC, complete this procedure to install memory
modules.
If your system is managed by the HMC, use the HMC to complete the steps for replacing the memory
modules in the server. For information about using the HMC to replace memory modules, see Replacing
a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the memory module in the server. For instructions, see Replacing a part by using the Systems
Director Management Console.
If you do not have an HMC or SDMC, use the following procedure to replace a memory module.
Restriction: Use these instructions only as a part of removing and replacing memory modules that are
identical in size, speed, and feature code. If you are installing new memory or upgrading memory, see
Installing 8233-E8B or 8236-E8C memory modules.
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
3. If necessary, unlock and lift the memory module cover.
4. If necessary, remove the new memory module from its antistatic package.
5. Ensure that the connector locking tabs are pressed out in the unlocked position before installing a
new memory module.
Attention: Memory modules are keyed to prevent a memory module from being installed
incorrectly. Note the location of the key tab within the memory module connector before attempting
to install the memory module.
6. Carefully grasp the memory module along two edges and align the connector.
7. Insert the memory module firmly into the connector (A), as shown in Figure 44 on page 82.
8. Secure the memory module with the locking tabs (B) located at each end of the memory module
connector, as shown in Figure 44 on page 82.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 81
9. Close the memory module cover (C) until it locks in place, as shown in Figure 45.
10. Replace the system processor assembly. See Replacing the system processor assembly for information
about replacing the system processor assembly and covers, restoring power, and starting the system.
Skip any steps that you have already completed.
Figure 44. Replacing a memory module
Figure 45. Closing the memory module cover
82 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
11. Verify the installed part, as described in Verifying the installed part.
Continue with any other service procedures you must perform.
Installing, removing, and replacing PCI adapters
Learn about installing, removing, and replacing Peripheral Component Interconnect (PCI), PCI-X, and PCI
Express (PCIe) adapters.
Model 8233-E8B and 8236-E8C PCI adapters
You can remove, replace, or install PCI adapters in the 8233-E8B and 8236-E8C.
If you are installing a new adapter, you will also need to refer to PCI adapter placement for slot
placement information.
Important: If you are installing a new feature, ensure that you have the software required to support the
new feature and determine whether there are any existing PTF prerequisites to install. To do this, use the
IBM Prerequisite website at http://www-912.ibm.com/e_dir/eServerPrereq.nsf .
Installing a PCI adapter in the 8233-E8B and 8236-E8C server:
You can install a PCI adapter.
Installing a PCI adapter in the 8233-E8B and 8236-E8C server with the power off:
You can install a PCI adapter with the system power off.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director
Management Console.
If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778,
CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before
installing it in the system unit. See Figure 71 on page 129.
If you do not have an HMC or SDMC, complete this procedure to install a PCI adapter with the system
power off:
1. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
2. Determine in which slot to place the PCI adapter. For system-specific adapter placement information,
see the PCI adapter placement.
3. Perform the prerequisite tasks described in “Before you begin” on page 115.
4. Stop the system or logical partition. See Stop the system or logical partition.
5. Disconnect the power source from the system by unplugging the system.
6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
7. If necessary, remove the adapter from the antistatic package.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 83
Attention: Avoid touching the components and gold connectors on the adapter.
8. Place the adapter, component-side up, on a flat, antistatic surface.
9. Some PCI adapters are shipped from the manufacturer with a blue handle or support bracket along
the back edge of the card. To use adapters of this type in this system, you must remove the blue
handle or support bracket from the card.
10. Slide the adapter latch (A) into the open position, as shown in the following figure.
11. If necessary, remove the adapter expansion slot filler.
12. Ensure that the slot is empty. If you are installing a double-wide adapter, remove the PCI adapter
divider if one is present. See “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C
server” on page 109.
13. Carefully grasp the adapter (A) by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane. See the following figure.
14. Press the adapter firmly into its connector.
15. Slide the adapter latch (B) back into place as shown in the following figure.
Figure 46. PCI adapter or filler plate removed from the rack-mounted system unit
84 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
16. Connect any adapter cables.
17. If you are servicing a rack-mounted system, route the cables through the cable-management arm.
18. Replace or close the service access cover and, if applicable, return the system to the operating
position.
19. Reconnect the power source to the system.
20. Start the system or logical partition. Refer to Start the system or logical partition.
21. Verify that the new resource is functional.
Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in an AIX environment:
You can install a PCI adapter with the system power on in an AIX environment.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director
Management Console.
If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778,
CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before
installing it in the system unit. See Figure 71 on page 129.
If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the
system power on in the AIX environment:
1. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
Figure 47. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 85
2. Determine in which slot to place the PCI adapter. For system-specific adapter placement information,
see the PCI adapter placement.
3. Perform the prerequisite tasks described in “Before you begin” on page 115.
4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
5. If necessary, remove the adapter expansion slot shield.
6. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
7. Place the adapter, component-side up, on a flat, antistatic surface.
8. Some PCI adapters are shipped from the manufacturer with a blue handle or support bracket along
the back edge of the card. To use adapters of this type in this system, you must remove the blue
handle or support bracket from the card.
9. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See
“Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109.
10. Refer to “PCI hot-plug manager access for AIX” on page 122, and follow the steps in the access
procedure to select PCI Hot Plug Manager. Then return here to continue.
11. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The
Add a Hot-Plug Adapter window displays.
12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter.
13. Slide the adapter latch (A) into the open position, as shown in the following figure.
14. Remove the adapter filler plate if one is present.
Figure 48. PCI adapter or filler plate removed from the rack-mounted system unit
86 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
15. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is
set to the Action state. See “Component LEDs” on page 124.
16. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the
edges and align the adapter (A) in the slot guides. Insert the adapter fully into the adapter slot
connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the
card guides.
17. Press the adapter firmly into its connector.
18. Slide the adapter latch (B) back into place as shown in the following figure.
19. Connect any adapter cables.
20. Replace or close the service access cover and, if applicable, return the system to the operating
position.
21. Run the cfgmgr command to configure the adapter.
22. Verify that the new resource is functional. See Verify the installed part.
Installing a PCI adapter in the 8233-E8B server, with the power on in an IBM i environment:
You can install a PCI adapter in the i environment.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director
Management Console.
Figure 49. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 87
If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778,
CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before
installing it in the system unit. See Figure 71 on page 129.
Important:
v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID
controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel
Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding
with the instructions provided here.
v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the
Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before
proceeding with the instructions provided here.
v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage
subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or
5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B,
or 5774 IOA.” on page 126.
v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take
note of the following: Depending on the configuration of the system, the storage IOA cache might have
been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are
replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the
IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with
the failed IOA.
If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the
system power on in the i environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. Determine in which slot to place the PCI adapter. For system-specific adapter placement information,
see the PCI adapter placement.
4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
5. If necessary, remove the adapter expansion slot shield.
6. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
7. Place the adapter, component-side up, on a flat, antistatic surface.
8. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support bracket
along the back edge of the card. To use adapters of this type in this system, you must remove the
blue handle or support bracket from the card.
9. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See
“Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109.
10. Type strsst on the command line of the Main menu and then press Enter.
11. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
12. Select Start a service tool from the System Service Tools (SST) display and press Enter.
13. Select Hardware service manager from the Start a Service Tool display and press Enter.
88 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
14. Select Packaging hardware resources (system, frames, cards) from the Hardware Service Manager
display. Press Enter.
15. Type 9 (Hardware contained within package) in the System Unit or Expansion Unit field of the unit
where you are replacing the card. Press Enter.
16. Select the option to Include empty positions.
17. Select Concurrent Maintenance on the card position where you want to replace the card and then
press Enter.
18. Select the option to Toggle LED blink off/on. A light-emitting diode (LED) flashes identifying the
position you chose. Physically verify that this is the slot where you want to install the adapter.
19. Select the option to Toggle LED blink off/on to stop the blinking LED.
20. Slide the adapter latch (A) into the open position, as shown in the following figure.
21. Ensure that the slot is empty. Remove the adapter filler plate if one is present.
22. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane.
23. Press the adapter firmly into its connector.
24. Slide the adapter latch (B) back into place as shown in the following figure.
Figure 50. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 89
25. Connect any adapter cables.
26. Select Power on domain on the Hardware Resource Concurrent Maintenance display and press
Enter.
27. Select Assign to on the resource that has an asterisk (*) on the Work with Controlling Resource
display. Press Enter.
28. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message:
Power on complete
29. Replace or close the service access cover and, if applicable, return the system to the operating
position.
30. Verify that the new resource is functional. See Verify the installed part.
Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in a Linux environment:
You can install a PCI adapter with the system power on in the Linux environment.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director
Management Console.
If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778,
CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before
installing it in the system unit. See Figure 71 on page 129.
If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the
system power on in the Linux environment:
Figure 51. PCI adapter replaced in the rack-mounted system unit
90 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page
125.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. “Verifying that the hot-plug PCI tools are installed for Linux” on page 125.
4. Determine in which slot to place the PCI adapter. For system-specific adapter placement information,
see the PCI adapter placement.
5. Perform the prerequisite tasks described in “Before you begin” on page 115.
6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
7. If necessary, remove the adapter expansion slot shield.
8. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
9. Place the adapter, component-side up, on a flat, antistatic surface.
10. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See
“Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109.
11. Log in to the system console as the root user.
12. Run the lsslot tool to list the hot-plug PCI slots that are available in the server or partition:
lsslot -c pci -a
The following is an example of the information displayed by this command:
# Slot Description Device(s)
U7879.001.DQD014E-P1-C1 PCI-X capable, 64 bit, 133MHz slot Empty
U7879.001.DQD014E-P1-C4 PCI-X capable, 64 bit, 133MHz slot Empty
U7879.001.DQD014E-P1-C5 PCI-X capable, 64 bit, 133MHz slot Empty
Select the appropriate empty PCI slot from the ones listed by the command.
13. Slide the adapter latch (A) into the open position, as shown in the following figure.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 91
14. Ensure that the slot is empty. Remove the adapter filler plate if one is present.
15. Run the drslot_chrp_pci command to enable an adapter to be installed. For example, to install an
adapter into PCI slot U7879.001.DQD014E-P1-C3, enter the following command:
drslot_chrp_pci -a -s U7879.001.DQD014E-P1-C3
The following displays:
The visual indicator for the specified PCI slot has been set to the
identify state. Press Enter to continue or enter x to exit.
16. Press Enter.
The following displays:
The visual indicator for the specified PCI slot has been set to the
action state. Insert the PCI card into the identified slot, connect any
devices to be configured and press Enter to continue. Enter x to exit.
17. When you are instructed to install the adapter (A) in the adapter slot, carefully grasp the adapter by
its top edge, and align the adapter with the expansion slot and its connector on the system
backplane.
18. Press the adapter firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
19. Slide the adapter latch (B) back into place as shown in the following figure.
Figure 52. PCI adapter or filler plate removed from the rack-mounted system unit
92 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
20. Connect any adapter cables.
21. Run the lsslot command to verify that the slot is occupied.
For example, enter lsslot -c pci -s U7879.001.DQD014E-P1-C3
The following is an example of the information displayed by this command:
# Slot Description Device(s)
U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0
22. If you are servicing a rack-mounted system, route the cables through the cable-management arm.
23. Replace or close the service access cover and, if applicable, return the system to the operating
position.
Removing a PCI adapter from an 8233-E8B and 8236-E8C server:
You can remove a PCI adapter.
Important:
v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel input/output adapter (IOA), the
external storage subsystem must be updated to use the worldwide port name of the new 2766, 2787,
280E, 576B, or 5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766,
2787, 280E, 576B, or 5774 IOA.” on page 126.
v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5703, 5709, or 570B storage IOA,
take note of the following: Depending on the configuration of the system, the storage IOA might have
been altered or the storage IOA cache might have been disabled to allow the attachment of OEM
storage that emulates a load source drive. If you are replacing a storage IOA that has its cache
disabled, configure the replacement IOA the same way as the IOA that you removed. If you remove
hardware from the replacement IOA, return that hardware with the failed IOA.
Removing a PCI adapter from the 8233-E8B and 8236-E8C server with the power off:
Figure 53. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 93
You can remove a PCI adapter with the system power off.
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a PCI
adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the
system power off:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If you are removing a failing PCI adapter, see Identifying a part. If you are removing the PCI
adapter for other reasons, continue to the next step.
4. Stop the system or logical partition.
5. Disconnect the power source from the system by unplugging the system.
6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
7. Determine which adapter you plan to remove, then label and disconnect all cables attached to that
adapter.
8. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system.
9. Slide the adapter latch (A) into the open position, as shown in the following figure.
94 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
10. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
11. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not,
continue to the next step.
12. If you plan to install another adapter into the vacated slot, go to “Replacing a PCI adapter in the
8233-E8B and 8236-E8C server with the power off” on page 101; otherwise, continue with the next
step.
13. Seal the expansion slot using an expansion-slot cover.
14. Replace or close the service access cover and, if applicable, return the system to the operating
position.
15. Reconnect the power source to the system.
16. Start the system or logical partition. Refer to Start the system or logical partition.
17. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on
page 101.
Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the AIX environment:
You can remove a PCI adapter with the system power on in the AIX environment.
To remove a failing adapter and replace it with the same adapter, see “Removing and replacing a PCI
adapter in the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page 103. If
the adapter that is removed will be placed into a different slot or system, complete this removal
procedure, and then install the adapter as described in “Installing a PCI adapter in the 8233-E8B and
8236-E8C server, with the power on in an AIX environment” on page 85.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing a PCI adapter.
Figure 54. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 95
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
Note: Procedures performed on a PCI adapter with the system power on in the AIX environment, also
known as hot-plug procedures, require the system administrator to take the PCI adapter offline before
performing the operation. Before taking an adapter offline, the devices attached to the adapter must be
taken offline as well. This action prevents a service representative or user from causing an unexpected
outage for system users.
If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the
system power on in the AIX environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI
adapter for other reasons, continue to the next step.
4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
5. Determine which adapters you plan to remove.
6. Record the slot number and location of each adapter being removed. Adapter slots are numbered on
the rear of the system unit.
7. Ensure that any processes or applications that might use the adapter are stopped.
8. Follow these steps to place the adapter in the action state using the PCI Hot-Plug Manager:
a. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at the
AIX command line.
b. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter.
c. At the FUNCTION SELECTION menu, select Task Selection, and then press Enter.
d. At the Task Selection list, select PCI Hot Plug Manager.
e. Select Unconfigure a Device, and then press Enter.
f. Press F4 (or Esc +4) to display the Device Names menu.
g. Select the adapter you are removing in the Device Names menu.
h. Use the Tab key to answer NO to Keep Definition. Use the Tab key again to answer YES to
Unconfigure Child Devices, and then press Enter. The ARE YOU SURE window is displayed.
i. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message
displayed next to the Command field at the top of the screen.
j. Press F4 (or Esc +4) twice to return to the Hot Plug Manager menu.
k. Select Replace/remove PCI Hot Plug adapter.
l. Select the slot that has the device to be removed from the system.
m. Select Remove. A fast-blinking amber LED located at the back of the machine near the adapter
indicates that the slot has been identified.
n. Label all cables attached to the adapter you plan to remove.
o. Press Enter. This places the adapter in the action state, meaning it is ready to be removed from
the system.
p. Disconnect all cables attached to the adapter you plan to remove.
9. Label, and then disconnect all cables attached to the adapter you plan to remove.
96 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
10. Slide the adapter latch (A) into the open position, as shown in the following figure.
11. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
12. If you plan to install another adapter into the vacated slot, go to “Installing a PCI adapter in the
8233-E8B and 8236-E8C server, with the power on in an AIX environment” on page 85; otherwise,
continue with the next step.
13. Seal the expansion slot using an expansion-slot cover.
14. Lower the plastic retainer seat over the PCI adapter faceplate.
15. Rotate the locking latch clockwise until it clicks into the locked position.
16. Continue to follow the screen instructions until you receive a message that the adapter removal is
successful. Successful removal is indicated by the OK message displayed next to the Command field
at the top of the screen.
17. If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 9 on page 96.
If you do not have other adapters to remove, continue with the next step.
18. Press F10 to exit the Hot-Plug Manager.
19. Run the diag -a command. If the system responds with a menu or prompt, follow the instructions
to complete the device configuration.
20. Replace or close the service access cover and, if applicable, return the system to the operating
position.
21. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on
page 101.
Removing a PCI adapter in the 8233-E8B server, with the power on in the IBM i environment:
You can remove a PCI adapter with the system power on in the IBM i environment.
Figure 55. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 97
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
Important:
v If the adapter is the load source I/O adapter (IOA) or the load source I/O processor (IOP), or any
other storage IOA or IOP with critical disk storage attached for the system or partition, follow the
on-screen instructions when you use HSM to power down the IOP or IOA. Instructions to use
functions 68 and 69 on the control panel will be included.
v If the adapter is the console IOA or the console IOP for the system or partition, you must perform the
maintenance from an IBM i operating system session connected through a different IOA or IOP, or you
must power down the partition to perform maintenance.
v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID
controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel
Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding
with the instructions provided here.
v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the
Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before
proceeding with the instructions provided here.
v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage
subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or
5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B,
or 5774 IOA.” on page 126.
v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take
note of the following: Depending on the configuration of the system, the storage IOA cache might have
been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are
replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the
IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with
the failed IOA.
If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the
system power on in the IBM i environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI
adapter for other reasons, continue to the next step.
4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
5. Type strsst on the command line of the Main Menu and press Enter.
6. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
7. Select Start a service tool from the System Service Tools (SST) display. Press Enter.
8. Select Hardware service manager from the Start a Service Tool display and press Enter.
98 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
9. Select Packaging hardware resources (system, frames, cards) from the Hardware Service Manager
display. Press Enter.
10. Type 9 (Hardware contained within package) in the System Unit or Expansion Unit field of the unit
where you are removing the card, and then press Enter.
11. Select the option to Include empty positions.
12. Select Concurrent Maintenance on the card position where you want to remove the card and then
press Enter.
13. Select the option to Toggle LED blink off/on. A light-emitting diode (LED) blinks identifying the
position you chose. Physically verify that this is the slot where you want to remove the adapter.
14. Select the option to Toggle LED blink off/on to stop the blinking LED.
15. Select the option to Power off domain on the Hardware Resource Concurrent Maintenance display
and press Enter.
16. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message:
Power off complete
17. Label and then disconnect all cables attached to the adapter you plan to remove.
18. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system.
19. Slide the adapter latch (A) into the open position, as shown in the following figure.
20. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
21. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not,
continue to the next step.
22. If you plan to install another adapter into the vacated slot, go to “Installing a PCI adapter in the
8233-E8B server, with the power on in an IBM i environment” on page 87; otherwise, continue with
the next step.
Figure 56. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 99
23. Seal the expansion slot using an expansion-slot cover.
24. Replace or close the service access cover and, if applicable, return the system to the operating
position.
25. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on
page 101.
Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the Linux environment:
You can remove a PCI adapter with the system power on in the Linux environment.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the
system power on in the Linux environment:
1. Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page
125.
2. “Verifying that the hot-plug PCI tools are installed for Linux” on page 125.
3. Perform the prerequisite tasks described in “Before you begin” on page 115.
4. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
5. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI
adapter for other reasons, continue to the next step.
6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
7. Determine which adapter you plan to remove, then label and disconnect all cables attached to that
adapter.
8. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system.
9. Label, and then disconnect all cables attached to the adapter you plan to remove.
Note: Before performing a PCI hot-plug removal of storage devices, ensure that the file systems on
those devices are unmounted.
10. Run the drslot_chrp_pci command to enable an adapter to be removed:
For example, to remove the PCI adapter in slot U7879.001.DQD014E-P1-C3, run this command:
drslot_chrp_pci -r -s U7879.001.DQD014E-P1-C3
Follow the instructions on the display to complete the task.
11. Slide the adapter latch (A) into the open position, as shown in the following figure.
100 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
12. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
13. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not,
continue to the next step.
14. If you plan to install another adapter into the vacated slot, go to “Replacing a PCI adapter in the
8233-E8B and 8236-E8C server with the power on in the Linux environment” on page 108; otherwise,
continue with the next step.
15. Seal the expansion slot using an expansion-slot cover.
16. Replace or close the service access cover and, if applicable, return the system to the operating
position.
17. Reconnect the power source to the system.
18. Start the system or logical partition. Refer to Start the system or logical partition.
19. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server.”
Replacing a PCI adapter in the 8233-E8B and 8236-E8C server:
You can replace a PCI adapter.
Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power off:
You can replace a PCI adapter with the system power off.
You must have already completed the procedure “Removing a PCI adapter from the 8233-E8B and
8236-E8C server with the power off” on page 93 in order to have the slot powered off.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing a PCI adapter.
Figure 57. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 101
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the
system power off:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
4. Place the adapter, component-side up, on a flat, static-protective surface.
5. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane.
6. Press the adapter (A) firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
7. Slide the adapter latch (B) back into place as shown in the following figure.
8. Connect the adapter cables.
9. If you are servicing a rack-mounted system, route the cables through the cable-management arm.
10. Replace or close the service access cover and, if applicable, return the system to the operating
position.
11. Reconnect the power source to the system.
12. Start the system or logical partition. Refer to Start the system or logical partition.
Figure 58. PCI adapter replaced in the rack-mounted system unit
102 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
13. Verify that the new resource is functional. See Verify the installed part.
Removing and replacing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in the AIX
environment:
You can replace a PCI adapter with the system power on in the AIX environment.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
Read the following notes to determine if this is the correct procedure for the task to be performed.
Notes:
v Use this procedure if you intend to remove a failing PCI adapter and replace it with the same type of
adapter.
v If you plan to remove a failing adapter and leave the slot empty, see “Removing a PCI adapter from
the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page 95.
v This procedure should not be used to remove an existing adapter and install a different type of
adapter. To install a different adapter, remove the existing adapter as described in “Removing a PCI
adapter from the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page
95, then install the new adapter as described in “Installing a PCI adapter in the 8233-E8B and 8236-E8C
server, with the power on in an AIX environment” on page 85.
v Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug
procedures, require the system administrator to take the PCI adapter offline before performing the
operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as
well. This action prevents a service representative or user from causing an unexpected outage for
system users.
If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the
system power on in the AIX environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI
adapter for other reasons, continue to the next step.
4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these
steps to remove the service access cover.
a. Open the front rack door.
b. Place the system unit in the service position.
c. Remove or open the service access cover.
5. Determine which adapters you plan to remove.
6. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system unit.
7. Ensure that any processes or applications that might use the adapter are stopped.
8. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at AIX
command line.
9. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 103
10. At the FUNCTION SELECTION menu, select Task Selection, then press enter.
11. At the Task Selection list, select PCI Hot Plug Manager.
12. Select Unconfigure a Device, then press Enter.
13. Press F4 (or Esc +4) to display the Device Names menu.
14. Select the adapter you are removing in the Device Names menu.
15. Use the Tab key to answer YES to Keep Definition. Use the Tab key again to answer YES to
Unconfigure Child Devices, then press Enter.
16. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful
unconfiguration is indicated by the OK message displayed next to the Command field at the top of
the screen.
17. Press F3 (or Esc +3) twice to return to the Hot Plug Manager menu.
18. Select Replace/remove PCI Hot Plug adapter.
19. Select the slot that has the device to be removed from the system.
20. Select Replace.
Note: A fast-blinking amber LED located at the back of the machine near the adapter indicates that
the slot has been identified.
21. Press Enter. This places the adapter in the action state, meaning that it is ready to be removed from
the system.
22. Label, and then disconnect all cables attached to the adapter you plan to remove.
23. Slide the adapter latch (A) into the open position, as shown in the following figure.
24. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
25. If necessary, remove the replacement adapter from the antistatic package.
Figure 59. PCI adapter or filler plate removed from the rack-mounted system unit
104 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Attention: Avoid touching the components and gold connectors on the adapter.
26. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane.
27. Press the adapter (A) firmly into its connector. See the following figure.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
28. Slide the adapter latch (B) back into place as shown in the following figure.
29. Connect the adapter cables.
30. Press enter and continue to follow the instructions in the system diagnostics until you receive a
message that the replacement is successful. Successful replacement is indicated by the OK message
displayed next to the Command field at the top of the menu.
31. Press the F3 (or Esc+3) key to return to the PCI Hot-Plug Manager menu.
32. Press the F3 (or Esc+3) key to return to the TASK selection list.
33. Select Log Repair Action.
34. Select the resource just replaced, press Enter, press Commit (F7 or ESC 7), then press Enter.
35. Press F3 (or Esc+3) to return to TASK Selection List.
36. Select Hot Plug Task, press enter.
37. Select PCI Hot Plug Manager, then select Configure a defined device, then press Enter.
38. Select the device just replaced from the list, then press Enter. The device is now configured.
39. Press the F10 key to exit the diagnostic program.
Note: If you are running the stand-alone diagnostics, do not exit the program completely.
40. Verify the PCI adapter by using the following instructions:
a. Did you replace the adapter with the system power on?
Figure 60. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 105
v Yes - Go to the next step.
v No - Load the diagnostic program by doing the following:
– If AIX is available, boot AIX, log in as root or CELOGIN, then enter the diag command.
– If AIX is not available, boot the stand-alone diagnostics
b. Type the diag command if you are not already displaying the diagnostic menus
c. Select Advance Diagnostic Routines, then select Problem Determination.
d. Select the name of the resource just replaced from the menu. If the resource just replaced is not
shown, choose the resource associated with it. Press Enter, then press Commit ((F7 or Esc+7)).
e. Did the Problem Determination identify any problems?
v No: Continue to the next step.
v Yes: A problem is identified
– If you are a customer, record the error information, then contact your service provider.
– If you are an authorized service provider, return to map 210-5.
41. Press the F10 key to exit the diagnostic program.
42. Replace or close the service access cover and, if applicable, return the system to the operating
position.
43. Verify that the new resource is functional. See Verify the installed part.
Replacing a PCI adapter in the 8233-E8B server, with the power on in the IBM i environment:
You can replace a PCI adapter with the system power on in the IBM i environment.
Attention: You must have already completed the procedure “Removing a PCI adapter in the 8233-E8B
server, with the power on in the IBM i environment” on page 97 in order to have the slot powered off.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
Important:
v If the adapter is the load source I/O adapter (IOA) or the load source I/O processor (IOP), or any
other storage IOA or IOP with critical disk storage attached for the system or partition, follow the
on-screen instructions when you use HSM to power down the IOP or IOA. Instructions to use
functions 68 and 69 on the control panel will be included.
v If the adapter is the console IOA or the console IOP for the system or partition, you must perform the
maintenance from an IBM i operating system session connected through a different IOA or IOP, or you
must power down the partition to perform maintenance.
v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID
controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel
Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding
with the instructions provided here.
v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the
Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before
proceeding with the instructions provided here.
v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage
subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or
5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B,
or 5774 IOA.” on page 126.
106 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take
note of the following: Depending on the configuration of the system, the storage IOA cache might have
been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are
replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the
IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with
the failed IOA.
If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the
system power on in the IBM i environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
4. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane.
5. Press the adapter (A) firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
6. Slide the adapter latch (B) back into place as shown in the following figure.
7. Connect the adapter cables.
8. Select Power on domain on the Hardware Resource Concurrent Maintenance display and press
Enter.
9. Select Assign to on the resource that has an asterisk (*) on the Work with Controlling Resource
display. Press Enter.
Figure 61. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 107
10. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message:
Power on complete
11. Replace or close the service access cover and, if applicable, return the system to the operating
position.
12. Verify that the new resource is functional. See Verify the installed part.
Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power on in the Linux environment:
You can replace a PCI adapter with the system power on in the Linux environment.
You must have already completed the procedure “Removing a PCI adapter from the 8233-E8B and
8236-E8C server, with the power on in the Linux environment” on page 100.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for installing a PCI adapter.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the
system power on in the Linux environment:
1. Perform the prerequisite tasks described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold connectors on the adapter.
4. Place the adapter, component-side up, on a flat, static-protective surface.
5. Run the drslot_chrp_pci command to enable an adapter to be replaced:
For example, to replace the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command:
drslot_chrp_pci -R -s U7879.001.DQD014E-P1-C3
Follow the instructions on the display to complete the task.
6. Press the adapter firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
7. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the system backplane.
8. Press the adapter (A) firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector.
9. Slide the adapter latch (B) back into place as shown in the following figure.
108 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
10. Connect the adapter cables.
11. Run the lsslot command to verify that the slot is occupied.
For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3
The following is an example of the information displayed by this command:
# Slot Description Device(s)
U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0
12. If you are servicing a rack-mounted system, route the cables through the cable-management arm.
13. Replace or close the service access cover and, if applicable, return the system to the operating
position.
14. Verify that the new resource is functional. See Verify the installed part.
Model 8233-E8B and 8236-E8C PCI adapter dividers
You might need to remove, replace, or install PCI adapter dividers. Use the procedures in this section to
perform these tasks.
Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server:
You can remove a PCI adapter divider.
The following procedure describes the removal of PCI adapter dividers with the system power off. This
procedure can be done with the system power on by omitting the steps related to powering off the
system.
To remove a divider, do the following steps:
1. Perform prerequisite tasks as described in “Before you begin” on page 115.
2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For
information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118.
3. Stop the system or logical partition. See Stop the system or logical partition.
Figure 62. PCI adapter replaced in the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 109
4. Disconnect the power source from the system by unplugging the system.
5. Remove the unit's cover if you are installing or removing a PCI adapter divider.
6. Slide the adapter latch (A) into the open position, as shown in the following figure.
7. Flex the front edge (A) of the PCI adapter divider out of the bracket and toward the rest of the
divider.
8. Pull the back edge of the divider away from the retention notches (B).
Figure 63. PCI adapter or filler plate removed from the rack-mounted system unit
110 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
9. If you are removing the PCI adapter divider as part of another procedure, return to that procedure
now. To replace the divider, see “Installing a PCI adapter divider in the 8233-E8B and 8236-E8C
server” on page 112. To close up the system, continue to the next step.
10. Slide the adapter latch (B) back into place as shown in the following figure.
Figure 64. PCI-adapter divider removed from the system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 111
11. Replace or close the service access cover and, if applicable, return the system to the operating
position.
12. Reconnect the power source to the system.
13. Start the system or logical partition. Refer to Start the system or logical partition.
Installing a PCI adapter divider in the 8233-E8B and 8236-E8C server:
You can install a PCI adapter divider.
The following procedure describes the installation of a PCI adapter divider with the system power off.
This procedure can be done with the system power on by omitting the steps related to powering on the
system.
To install a divider, do the following steps:
1. Locate the PCI adapter divider slot that you want to use. To access the divider slots, see the first 6
steps in “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109.
2. Slide the adapter latch (A) into the open position, as shown in the following figure.
Figure 65. PCI adapter replaced in the rack-mounted system unit
112 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
3. Carefully grasp the PCI adapter divider by its top edge and align the back edge of the divider with
the retention notches (A).
4. Insert the front edge of the divider (B) into the slots and then press the divider into place.
Figure 66. PCI adapter or filler plate removed from the rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 113
5. Slide the adapter latch (B) back into place as shown in the following figure.
Figure 67. PCI-adapter divider replaced in the system unit
Figure 68. PCI adapter replaced in the rack-mounted system unit
114 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
6. Replace or close the service access cover and, if applicable, return the system to the operating
position.
7. Reconnect the power source to the system.
8. Start the system or logical partition. Refer to Start the system or logical partition.
Related procedures for installing and removing PCI adapters
These procedures are related to installing and removing PCI adapters.
Before you begin:
Observe these precautions when you are installing, removing, or replacing features and parts.
These precautions are intended to create a safe environment to service your system and do not provide
steps for servicing your system. The installation, removal and replacement procedures provide the
step-by-step processes required to service your system.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
Removal and replacement procedures for the 8233-E8B and 8236-E8C 115
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Before you begin a replacement or installation procedure, perform these tasks:
1. If you are installing a new feature, ensure that you have the software required to support the new
feature.
To do this, go to the following Web site: IBM Prerequisite
2. If you are performing an installation or replacement procedure that might put your data at risk,
ensure, wherever possible, that you have a current backup of your system or logical partition
(including operating systems, licensed programs, and data).
3. Review the installation or replacement procedure for the feature or part.
4. Note the significance of color on your system.
116 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware
to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also
indicate that the part can be removed and replaced with the system or logical partition power on.
5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of
scissors.
6. If parts are incorrect, missing, or visibly damaged, do the following:
v If you are replacing a part, contact the provider of your parts or next level of support.
v If you are installing a feature, contact one of the following service organizations:
– The provider of your parts or next level of support.
– In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL)
at 1–800–300–8751.
In countries and regions outside of the United States, use the following Web site to locate your service
and support telephone numbers:
http://www.ibm.com/planetwide
7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller,
or your next level of support.
8. If you are installing new hardware in a logical partition, you need to understand and plan for the
implications of partitioning your system. For information, see Logical Partitioning.
Avoiding electric shock:
Learn about precautions you should take to avoid electric shock when working on our around a
computer system.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 117
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Handling static-sensitive devices:
Learn about precautions you should take to prevent damage to electronic components from static
electricity discharge.
Electronic boards, adapters, media drives, and disk drives are sensitive to static electricity discharge.
These devices are wrapped in antistatic bags to prevent this damage. Take the following precautions to
prevent damage to these devices from static electricity discharge.
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge
from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on electrical
equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing
or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.
v Do not remove the device from the antistatic bag until you are ready to install the device in the
system.
118 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v With the device still in its antistatic bag, touch it to the metal frame of the system.
v Grasp cards and boards by the edges. Avoid touching the components and gold connectors on the
adapter.
v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag.
Before picking it up again, touch the antistatic bag and the metal frame of the system at the same time.
v Handle the devices carefully to prevent permanent damage.
Installing or replacing a PCI adapter with the system power on in Virtual I/O Server:
You can install or replace a PCI adapter in the Virtual I/O Server logical partition or in the Integrated
Virtualization Manager management partition.
The Virtual I/O Server includes a PCI Hot Plug Manager that is similar to the PCI Hot Plug Manager in
the AIX operating system. The PCI Hot Plug Manager allows you to hot plug PCI adapters into the
server and then activate them for the logical partition without having to reboot the system. Use the PCI
Hot Plug Manager for adding, identifying, or replacing PCI adapters in the system that are currently
assigned to the Virtual I/O Server.
Getting started:
Prerequisites:
v If you are installing a new adapter, an empty system slot must be assigned to the Virtual I/O Server
logical partition. This task can be done through dynamic logical partitioning (DLPAR) operations.
– If you are using a Hardware Management Console (HMC), you must also update the logical
partition profile of the Virtual I/O Server so that the new adapter is configured to the Virtual I/O
Server after you restart the system.
– If you are using the Integrated Virtualization Manager, an empty slot is probably already assigned to
the Virtual I/O Server logical partition because all slots are assigned to the Virtual I/O Server by
default. You only need to assign an empty slot to the Virtual I/O Server logical partition if you
previously assigned all empty slots to other logical partitions.
v If you are installing a new adapter, ensure that you have the software required to support the new
adapter and determine whether there are any existing PTF prerequisites to install. To do this, use the
IBM Prerequisite website at http://www-912.ibm.com/e_dir/eServerPrereq.nsf
v If you need help determining the PCI slot in which to place a PCI adapter, see the PCI adapter
placement.
Follow these steps to access the Virtual I/O Server, PCI Hot Plug Manager:
1. If you are using the Integrated Virtualization Manager, connect to the command-line interface.
2. Use the diagmenu command to open the Virtual I/O Server diagnostic menu. The menus are similar
to the AIX diagnostic menus.
3. Select Task Selection, then press Enter.
4. At the Task Selection list, select PCI Hot Plug Manager.
Installing a PCI adapter:
To install a PCI adapter with the system power on in Virtual I/O Server, do the following steps:
1. From the PCI Hot Plug Manager, select Add a PCI Hot Plug Adapter, then press Enter. The Add a
Hot-Plug Adapter window is displayed.
2. Select the appropriate empty PCI slot from those listed, and press Enter. A fast-blinking amber LED
located at the back of the server near the adapter indicates that the slot has been identified.
3. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is
set to the Action state.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 119
a. Set the adapter LED to the action state so that the indicator light for the adapter slot flashes
b. Physically install the adapter
c. Finish the adapter installation task in diagmenu.
4. Run the cfgdev command to configure the device for the Virtual I/O Server.
If you are installing a PCI, Fibre Channel adapter, it is now ready to be attached to a SAN and have
LUNs assigned to the Virtual I/O Server for virtualization.
Replacing a PCI adapter:
Prerequisite: Before you can remove or replace a storage adapter, you must unconfigure that adapter. See
“Unconfiguring storage adapters” for instructions.
To replace a PCI adapter with the system power on in Virtual I/O Server, do the following steps:
1. From the PCI Hot Plug Manager, select Unconfigure a Device, then press Enter.
2. Press F4 (or Esc +4) to display the Device Names menu.
3. Select the adapter you are removing in the Device Names menu.
4. In the Keep Definition field, use the Tab key to answer Yes. In the Unconfigure Child Devices
field, use the Tab key again to answer YES, then press Enter.
5. Press Enter to verify the information on the ARE YOU SURE screen. Successful unconfiguration is
indicated by the OK message displayed next to the Command field at the top of the screen.
6. Press F4 (or Esc +4) twice to return to the Hot Plug Manager.
7. Select replace/remove PCI Hot Plug adapter.
8. Select the slot that has the device to be removed from the system.
9. Select replace. A fast-blinking amber LED located at the back of the machine near the adapter
indicates that the slot has been identified.
10. Press Enter which places the adapter in the action state, meaning it is ready to be removed from the
system.
Unconfiguring storage adapters:
Before you can remove or replace a storage adapter, you must unconfigure that adapter. Storage adapters
are generally parent devices to media devices, such as disk drives or tape drives. Removing the parent
requires that all attached child devices either be removed or placed in the define state.
Unconfiguring a storage adapter involves the following tasks:
v Closing all applications that are using the adapter you are removing, replacing, or moving
v Unmounting file systems
v Ensuring that all devices connected to the adapter are identified and stopped
v Listing all slots that are currently in use or a slot that is occupied by a specific adapter
v Identifying the adapter's slot location
v Making parent and child devices unavailable
v Making the adapter unavailable
If the adapter supports physical volumes that are in use by a client logical partition, then You can
perform steps on the client logical partition before unconfiguring the storage adapter. For instructions, see
“Preparing the client logical partitions” on page 121. For example, the adapter might be in use because
the physical volume was used to create a virtual target device, or it might be part of a volume group
used to create a virtual target device.
Follow these steps to unconfigure SCSI, SSA, and Fibre Channel storage adapters:
120 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Connect to the Virtual I/O Server command-line interface.
2. Use the oem_setup_env command to close all applications that are using the adapter you are
unconfiguring.
3. Type lsslot-c pci to list all the hot plug slots in the system unit and display their characteristics.
4. Type lsdev -C to list the current state of all the devices in the system unit.
5. Type unmount to unmount previously mounted file systems, directories, or files using this adapter.
6. Type rmdev -l adapter -R to make the adapter unavailable.
Attention: Do not use the -d flag with the rmdev command for hot plug operations because this
action removes your configuration.
Preparing the client logical partitions:
If the virtual target devices of the client logical partitions are not available, the client logical partitions can
fail or they might be unable to perform I/O operations for a particular application. If you use the HMC
to manage the system, you might have redundant Virtual I/O Server logical partitions, which allow for
Virtual I/O Server maintenance and avoid downtime for client logical partitions. If you are replacing an
adapter on the Virtual I/O Server and your client logical partition is dependent on one or more of the
physical volumes accessed by that adapter, then You can take action on the client before you unconfigure
the adapter.
The virtual target devices must be in the define state before the Virtual I/O Server adapter can be
replaced. Do not remove the virtual devices permanently.
To prepare the client logical partitions so that you can unconfigure an adapter, complete the following
steps depending on your situation.
Table 1. Situations and steps for preparing the client logical partitions
Situation Steps
You have redundant hardware on the Virtual I/O Server
for the adapter.
No action is required on the client logical partition.
HMC-managed systems only: You have redundant
Virtual I/O Server logical partitions that, in conjunction
with virtual client adapters, provide multiple paths to
the physical volume on the client logical partition.
No action is required on the client logical partition.
However, path errors might be logged on the client
logical partition.
HMC-managed systems only: You have redundant
Virtual I/O Server logical partitions that, in conjunction
with virtual client adapters, provide multiple physical
volumes that are used to mirror a volume group.
See the procedures for your client operating system. For
example, for AIX, see Replacing a disk on the Virtual
I/O Server in the IBM System p® Advanced POWER®
Virtualization Best Practices Redpaper. The procedure for
Linux is similar to this procedure for AIX.
You do not have redundant Virtual I/O Server logical
partitions.
Shut down the client logical partition.
For instructions, see the following topics about shutting
down logical partitions:
v For systems that are managed by the HMC, see
“Shutting down AIX logical partitions using the
HMC”, “Shutting down IBM i logical partitions using
the HMC”, and “Shutting down Linux logical
partitions using the HMC” in the Logical partitioning.1
v For systems that are managed by the Integrated
Virtualization Manager, see “Shutting down logical
partitions” on page 122.
1The Logical partitioning can be found on the Hardware Information website at
http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/p7hdx/power_systems.htm .
Removal and replacement procedures for the 8233-E8B and 8236-E8C 121
Shutting down logical partitions:
You can use the Integrated Virtualization Manager to shut down the logical partitions or to shut down
the entire managed system.
Use any role other than View Only to perform this task.
The Integrated Virtualization Manager provides the following types of shutdown options for logical
partitions:
v Operating System (recommended)
v Delayed
v Immediate
The recommended shutdown method is to use the client operating systems shutdown command. Use the
immediate shutdown method only as a last resort because using this method causes an abnormal
shutdown which might result in data loss.
If you choose the Delayed shutdown method, then be aware of the following considerations:
v Shutting down the logical partitions is equivalent to pressing and holding the white control-panel
power button on a server that is not partitioned.
v Use this procedure only if you cannot successfully shut down the logical partitions through operating
system commands. When you use this procedure to shut down the selected logical partitions, the
logical partitions wait a predetermined amount of time to shut down. This allows the logical partitions
time to end jobs and write data to disks. If the logical partition is unable to shut down within the
predetermined amount of time, it ends abnormally, and the next restart might take a long time.
If you plan to shut down the entire managed system, shut down each client logical partition, then shut
down the Virtual I/O Server management partition.
To shut down a logical partition, complete the following steps in the Integrated Virtualization Manager:
1. In the navigation area, select View/Modify Partitions under Partition Management. The
View/Modify Partitions page is displayed.
2. Select the logical partition that you want to shut down.
3. From the Tasks menu, click Shutdown. The Shutdown Partitions page is displayed.
4. Select the shutdown type.
5. Optional: Select Restart after shutdown completes if you want the logical partition to start
immediately after it shuts down.
6. Click OK to shut down the partition. The View/Modify Partitions page is displayed, and the logical
partition state has a value of shutdown.
PCI hot-plug manager access for AIX:
You might need to service PCI adapters with the system power on in AIX. Use the procedures in this
section to perform this task.
The instructions for servicing PCI adapters with the system power on in AIX refer you to these
procedures when it is appropriate to perform them.
Note: For an adapter to be serviced with the system power on, both the adapter and the system unit
must support hot-plug procedures. To identify adapters that are hot-pluggable in the system you are
servicing, refer to the following placement information: PCI adapter placement.
Accessing hot-plug management functions:
122 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
You can use PCI Hot Plug Manager to service PCI adapters with the system power on in AIX. Use the
procedures in this section to perform this task.
Note: Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug
procedures, require the system administrator to take the PCI adapter offline prior to performing the
operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as
well. This action prevents a service representative or user from causing an unexpected outage for system
users.
To access the hot-plug menus, do the following:
1. Log in as root user.
2. At the command line, type smitty.
3. Select Devices.
4. Select PCI Hot Plug Manager and press Enter.
5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The
following section describes the menu options.
PCI hot-plug manager menu:
You can use PCI Hot Plug Manager to service PCI adapters with the system power on in AIX. The
following options are available from the PCI Hot Plug Manager menu.
Note: For information about the PCI slot LED states, see “Component LEDs” on page 124.
List PCI hot-plug slots
Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates
it holds an “Unknown” device, select the Install/configure Devices added after IPL to configure the
adapter in that slot.
Add a PCI hot-plug adapter
Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You
will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected
PCI slot will go into the Action state and finally into the On state.
Note: The system will indicate the slot holds an “Unknown” device until you perform the
Install/configure devices added after IPL option to configure the adapter.
Replace/remove a PCI hot-plug adapter
Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For
this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option).
You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into
the Action state.
Identify a PCI hot-plug slot
Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See
“Component LEDs” on page 124.
Unconfigure a device
Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 123
This step must be completed successfully before starting any removal or replacement operation. If this
step fails, the customer must take action to release the device.
Configure a defined device
Allows a new PCI adapter to be configured into the system if software support is already available for
the adapter. The selected PCI slot will go into the On state.
Install/configure devices added after IPL
The system attempts to configure any new devices and tries to find and install any required software
from a user-selected source.
The add, remove, and replace functions return information to the user indicating whether the operation
was successful. If additional instructions are provided on the screen, complete the recommended actions.
If the instructions do not resolve the problem, do the following:
v If the adapter is listed as Unknown, perform the Install/configure devices Added After IPL option to
configure the adapter.
v If you receive a warning indicating that needed device packages are not installed, the system
administrator must install the specified packages before you can configure or diagnose the adapter.
v If you receive a failure message indicating a hardware error, the problem might be either the adapter
or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a
different adapter in the slot. If you determine that you have failing hardware, call your service
representative.
v Do not use Install/configure devices added after IPL if your system is set up to run HACMP™
clustering. Consult with your system administrator or software support to determine the correct
method to configure the replacement device.
Component LEDs:
Individual LEDs are located on or near the failing components. Use the information in this section to
interpret the LEDs.
The LEDs are located either on the component itself or on the carrier of the component (for example,
memory card, fan, memory module, or processor). LEDs are either green or amber.
Green LEDs indicate either of the following:
v Electrical power is present.
v Activity is occurring on a link. (The system could be sending or receiving information.)
Amber LEDs indicate a fault or identify condition. If your system or one of the components on your
system has an amber LED turned on or blinking, identify the problem and take the appropriate action to
restore the system to normal.
Resetting the LEDs in AIX:
Individual LEDs are located on or near the failing components. You can use this procedure to reset the
LEDs after you have completed a repair action.
After the repair action is completed, do the following:
1. Log in as root user.
2. At the command line, type diag.
3. Select Task Selection.
4. Select Log Repair Action.
124 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
5. Select the device that was repaired.
6. Press F10 to exit diagnostics.
If the Attention LED remains on after you have completed the repair action and reset the LEDs, call for
service support.
Prerequisites for hot-plugging PCI adapters in Linux:
In the course of installing, removing, or replacing a PCI adapter with the system powered on in aLinux
operating system, you might need to complete some prerequisite tasks. Use the information in this
section to identify those prerequisites.
The Linux system administrator needs to take the PCI adapter offline prior to removing, replacing, or
installing a PCI adapter with the system power on (hot-plugging). Before taking an adapter offline, the
devices attached to the adapter must be taken offline as well. This action prevents a service representative
or user from causing an unexpected outage for system users.
Before hot-plugging adapters for storage devices, ensure that the file systems on those devices are
unmounted. After hot-plugging adapters for storage devices, ensure that the file systems on those devices
are remounted.
Before hot-plugging an adapter, ensure that the server or partition is at the correct level of the Linux
operating system (Linux 2.6 or later).
Install the POWER Linux Service Aids. These service aids enable system serviceability, as well to improve
system management.
If you are using a Linux on POWER distribution with Linux kernel version 2.6 or later, you can install
the Service Aids that gives you access to more capabilities, which can help you diagnose problems on
your system.
This software is available at the Service and productivity tools for Linux on POWER Web site
(http://www14.software.ibm.com/webapp/set2/sas/f/lopdiags/home.html).
Verifying that the hot-plug PCI tools are installed for Linux:
In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux
operating system you might need use the hot-plug PCI tools. Use the procedure in this section to verify
that you have the hot-plug PCI tools installed.
1. Enter the following command to verify that the hot-plug PCI tools are installed:
rpm -aq | grep rpa-pci-hotplug
If the command does not list any rpa-pci-hotplug packages, the PCI Hot Plug tools are not installed.
2. Enter the following command to ensure that the rpaphp driver is loaded:
ls -l /sys/bus/pci/slots/
The directory should contain data. If the directory is empty, the driver is not loaded or the system
does not contain hot-plug PCI slots. The following is an example of the information displayed by this
command:
drwxr-xr-x 15 root root 0 Feb 16 23:31 .
drwxr-xr-x 5 root root 0 Feb 16 23:31 ..
drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.0
drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.2
drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.4
drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.0
drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.2
drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.4
Removal and replacement procedures for the 8233-E8B and 8236-E8C 125
drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.6
drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.0
drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.2
drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.4
drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.6
If the directory does not exist, run the following command to mount the filesystem:
mount -t sysfs sysfs /sys
3. Ensure the following tools are available in the /usr/sbin directory.
v lsslot
v drslot_chrp_pci
4. Return to the procedure that sent you here.
Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.:
If you have exchanged a 2766, 2787, 280E, 576B, or 5774 Fibre Channel Input/Output Adapter (IOA), the
IBM external storage subsystem must be updated to use the worldwide port name (WWPN) of the new
2766, 2787, 280E, 576B, or 5774 IOA. Any SAN hardware using WWPN zoning might also need updating.
For instructions on how to update the external storage subsystem or SAN hardware configurations, see
the documentation for those systems.
The WWPN for the Fibre Channel IOA can be found using the Hardware Service Manager in SST or DST.
Display detail on the 2766, 2787, 280E, 576B, or 5774 IOA Logical Hardware Resource information, and
use the port worldwide name field.
The 16-digit WWPN can also be determined by appending the digits "1000" to the beginning of the
12-digit IEEE address found on the tailstock label of the Fibre Channel IOA.
PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN
571F, 575B):
Learn about the features, specifications, and installation notes for the PCI-X double-wide, quad-channel
Ultra320 SCSI RAID Controller.
The PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller is a high-performance SCSI adapter
combined with an auxiliary-write cache adapter to form a double-wide, adapter pair. The two adapters
are screwed together. This topic contains the following sections:
v “Features”
v “Specifications” on page 127
v “Placement information” on page 128
v “Logical partitioning” on page 128
v “Placing the adapter in a double-wide cassette” on page 128
v “Installing the cassette containing the adapter in a system” on page 128
v “Placing the double-wide adapter in a system that does not use cassettes” on page 129
v “Power LED” on page 129
v “Concurrent maintenance procedure” on page 129
Features
v 3 external U320 SCSI ports
v 1 internal U320 SCSI port dedicated to connecting the controller to the auxiliary-write cache adapter
v Up to 36 Direct Access Storage Devices (DASD) can be externally attached per adapter
v 320 MB data rate per SCSI bus
126 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Supports low voltage differential (LVD) disk devices only
v Supports RAID 5 and 6 (3–18 drive sets)
v 390 MB/1.5 GB compressed write cache
v 415 MB/1.6 GB compressed read cache
v PCI-X DDR support (storage adapter side)
v IOP or IOPless (5739 requires an IOP. 5778 does not.)
(A) Battery cover
(B) SCSI port 2
(C) SCSI port 0
(D) SCSI port 1
Specifications
Item Description
Adapter FRU number 42R6578
Cache battery FRU
number
42R3965
Unit description v 64 bit, 133 MHz, 3.3 V
v PCI-X 2.0 compliant
v Double-wide adapter, requires 2, adjacent, long slots. The SCSI controller side of the
adapter pair requires a 64-bit slot. (The controller side is the side with the external
SCSI connectors.)
v The auxiliary-write cache adapter contains a dual, concurrently maintainable,
cache-battery pack, which maintains cache memory on both adapters in the event of
an abnormal termination.
Operating system or
partition requirements
v IBM i V5R3, V5R3M5, and V5R4 with PTFs.
v AIX 5L™ Version 5.2 with the 5200-10 Technology Level, or later
v AIX 5L Version 5.3 with the 5300-06 Technology Level, or later
v Red Hat Enterprise Linux version 4 U4, or later
v SUSE Linux Enterprise Server 10, or later
Check the IBM Prerequisite Web page for further details and updates. You can find a
link to that Web page at the end of this topic.
Figure 69. Adapter
Removal and replacement procedures for the 8233-E8B and 8236-E8C 127
Item Description
Maximum number See Placement information.
Placement information
For placement information for this adapters, see PCI adapter placement.
Logical partitioning
When used in a logical partition (LPAR) environment, this double-wide adapter must have both slots of
the adapter assigned to the same logical partition. When implementing dynamic logical partitioning
(DLPAR), both slots of the adapter must be managed together.
Placing the adapter in a double-wide cassette
To place this double-wide adapter in a double-wide cassette, first remove the adapter handle (B), and the
two plastic covers from the SCSI jumper cable (A), as shown in the following figure. (The two plastic
covers are only found on adapters made before August 2008.) Then follow the general procedures to
place a double-wide adapter in a double-wide cassette. See Placing an adapter in the PCI adapter
double-wide cassette.
Installing the cassette containing the adapter in a system
To install the cassette containing the adapter into a system unit or expansion unit, following the
procedures in Model 5796 expansion units, PCI adapters and cassettes.
Figure 70. Preparing the adapter for placement in a cassette
128 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Placing the double-wide adapter in a system that does not use cassettes
If you are installing the double-wide adapter in the internal PCI slots, remove the lower, right standoff
shown in Figure 71.
Power LED
For a double-wide adapter, there is only one power LED visible for both slots.
Concurrent maintenance procedure
Concurrent maintenance of this double-wide adapter is not supported through the Hardware
Management Console (HMC). Concurrent maintenance must be done from the Hardware Service
Manager (HSM) of the system or owning partition. The HSM automatically powers off and on both PCI
slots when either slot is selected. In AIX or Linux, you must manually power off and on each slot
separately.
Important:
v Both PCI slots must be powered off when installing or removing this adapter with the system power
on.
v If this adapter is the load source IOA, or under the load source IOP, or any other storage IOA/IOP
with critical DASD attached for the system, have a qualified service provider perform the concurrent
maintenance procedure. At step 12 on page 130, the HSM directs you to use control panel functions 68
and 69 to power off the domain.
The following steps are a general concurrent maintenance procedure for installing, removing, or replacing
an adapter in IBM i operating system. If you were sent here by another procedure, additional specific
instructions for removing and replacing a PCI adapter can be found in that procedure.
1. Start an IBM i session for the system or partition containing the adapter and sign on to the system or
partition.
2. Type strsst on the command line of the Main menu and then press Enter.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display and press Enter.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tools display and press Enter.
Figure 71. Removing the standoff
Removal and replacement procedures for the 8233-E8B and 8236-E8C 129
6. Select Logical hardware resources (system, frames, cards) from the Hardware Service Manager
display and press Enter.
7. Select System bus resources and press Enter. The display changes to the Logical Hardware
Resources on System Bus display.
8. Page down and locate the IOP that controls the IOA to be serviced.
If you are installing a new IOPless adapter in an empty slot, the IOP is not shown.
9. Enter 9 in the Opt field for the IOP to be serviced.
10. Locate the Storage IOA resource for the adapter to be serviced and enter 8 in the Opt field for that
resource.
11. Enter 3 for Concurrent maintenance in the Opt field for the Storage IOA resource to be serviced.
The display changes to the Hardware Resource Concurrent Maintenance display.
12. Press F9 to power off the domain.
In some cases the card slots are already powered off if empty.
13. Press Enter to start power off. The display changes to the Hardware Resource Concurrent
Maintenance Status display, which shows the status of the power off.
When the power off is complete, the display returns to the Hardware Resource Concurrent
Maintenance display. The display shows that the Power Status is off for Storage IOA. Leave this
display on the HSM while you install, remove, or replace the adapter.
14. Check the slot power LED state for the slot in which the adapter resides to verify that the adapter is
powered off.
For a double-wide adapter, there is only one power LED visible for both slots.
15. Install, remove, or replace the adapter. See Model 5796 expansion units, PCI adapters and cassettes.
Links to procedures for other systems can be found at Installing PCI adapters.
16. After the replacement is complete, return to the Hardware Resource Concurrent Maintenance display
on the HSM.
17. If you were previously instructed to use operator panel function 69 to power on the adapter, do so
now. Then go to step Installing PCI adapters, otherwise continue with next step.
18. Enter F10 to power the adapter on.
19. Press Enter to begin power on.
20. If the Work with Controlling Resource display appears, enter 7 in the Opt field for the IOP to assign
the IOA to.
The Hardware Resource Concurrent Maintenance Status display shows the status of the power on.
21. When the power on is complete, the display returns to the Hardware Resource Concurrent
Maintenance display. The display shows that the Power Status is on for the Storage IOA.
22. Check the LED status to verify that the adapter is powered on.
For a double-wide adapter, there is only one power LED visible for both slots.
23. Return to the procedure that sent you here.
PCI-X DDR 1.5 GB cache SAS RAID Adapter (FC 5904 and 5908; CCIN 572F and 575C):
Learn about the specifications and operating system requirements for the 5904 and 5908 adapter.
Overview
The PCI-X DDR 1.5 GB cache SAS RAID Adapter is a SAS disk controller with a maximum of 1.5 GB
compressed write cache and a maximum 1.6 GB compressed read cache. Auxiliary write cache and
concurrent battery maintenance are provided. The controller is implemented using two physical adapters
that are securely connected to form a double-wide adapter. The double-wide adapter requires two
adjacent Peripheral Component Interconnect-X (PCI-X) slots. The auxiliary write cache side of the
130 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
double-wide adapter contains a dual, concurrently maintainable cache battery pack which maintains
cache memory on both adapters in the event of an abnormal termination.
When used in a logical partition (LPAR) environment, this double-wide adapter must have both slots of
the adapter assigned to the same logical partition. When implementing dynamic LPAR (DLPAR), both
slots of the adapter must be managed together.
Feature 5904 and 5908 are all feature codes representing the same PCI-X DDR 1.5 GB cache SAS RAID
Adapter. Different feature codes indicate if a blind swap cassette is used and its type:
v Feature 5904 indicates no blind swap cassette. The feature is used in enclosures that do not use blind
swap cassettes.
v Feature 5908 indicates a gen-3 blind swap cassette. This feature is used in the 5790 and 5796.
572F is the CCIN number on the RAID adapter side of the double-wide adapter pair. 575C is the CCIN
number on the write-cache adapter.
The adapter provides RAID 0, RAID 5, RAID 6, and RAID 10 for the AIX and Linux operating systems.
Under the IBM i operating system, mirroring and data spreading are provided by the operating system,
and RAID 5 and RAID 6 are provided by the adapter. 1
The adapter provides three mini-SAS 4x connectors for the attachment of SAS drives located in 5886 EXP
12S Expansion Drawers. The adapter supports a maximum of five 5886 EXP 12S Expansion Drawers.1
With proper cabling and configuration, multiple wide ports are used to provide redundant paths to each
dual port SAS drive. The adapter manages SAS path redundancy and path switching should a SAS
failure occur.
This adapter supports the multi-initiator and high availability configurations. With the optional pairing of
adapter sets, an even higher level of protection is provided by using a dual controller I/O configuration1
to protect against the failure of an entire adapter set. In such a high availability I/O configuration, SAS X
cables are used to attach 5586 EXP 12S Expansion Drawers, and the fourth (top) mini-SAS connector on
each card set is used to directly connect the card sets by using a SAS AA cable. See Serial attached SCSI
cable planning for more information.
Important: See the SAS RAID controller for AIX or SAS RAID controller for Linux topics for more
information and important considerations for multi-initiator and high availability configurations. See also
PCI-X DDR 1.5 GB cache SAS RAID Adapter (FC 5904 and 5908; CCIN 572F and 575C).
The following figure shows the adapter.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 131
Note: Port T3 does not support any device attachment. T3 is only used in dual controller I/O
configurations for adapter to adapter communication.
Specifications
Item Description
Adapter FRU number
44V8622 (Adapter only) (Designed to comply with RoHS requirement.)
44V7627 (Adapter in a gen-2.5 blind swap cassette)
Battery FRU number
42R3965 or 74Y5665 (Designed to comply with RoHS requirement.)
I/O bus architecture
PCI-X
Slot requirement
Two, long, adjacent PCI-X slots.
Cables
SAS device attachment requires specific cables that are provided with the subsystem or device
features being attached. Special cabling is required for multi-initiator and high availability
configurations. See Serial attached SCSI cable planning.
Voltage
3.3V
Form factor
Long
Maximum number
See PCI adapter placement.
Attributes
v SAS speed: 3 Gbps
v SAS, SAS Serial SCSI Protocol (SSP) and Serial Management Protocol (SMP) supported
v Single controller supported with SAS y cables (3692, 3693, 3694)
v Dual controller supported with SAS x cables (3661, 3662, 3663) and SAS AA cable (3681, 3682)
v Single controller supports mirrored write cache data with auxiliary cache
v Dual controller supports mirrored-write cache data and mirrored-RAID parity footprints
between card sets
Figure 72. PCI-X DDR 1.5GB cache SAS RAID Adapter
132 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Solid-state drives (SSDs) supported
v Removable media devices are not supported
Operating system or partition requirements
This adapter is supported for the following operating systems:
v AIX:
– AIX 5L Version 5.3 with the 5300-07 Technology Level and Service Pack 8, or later
– AIX 5L Version 5.3 with the 5300-08 Technology Level and Service Pack 6, or later
– AIX 5L Version 5.3 with the 5300-09 Technology Level and Service Pack 2, or later
– AIX 5L Version 5.3 with the 5300-10 Technology Level, or later
– AIX 6.1 and Service Pack 8, or later
– AIX Version 6.1 with the 6100-01 Technology Level and Service Pack 4, or later
– AIX Version 6.1 with the 6100-02 Technology Level and Service Pack 3, or later
– AIX Version 6.1 with the 6100-03 Technology Level, or later
v Linux:
– Red Hat Enterprise Linux version 4, with update 7, or later
– Red Hat Enterprise Linux version 5, with update 2, or later
– SUSE Linux Enterprise Server 10, with service pack 2, or later
v IBM i
– IBM i V5R4M5 and Resave F, or later
– IBM i 6.1 and Resave F, or later
– IBM i 6.1.1 and Resave A, or later, is required for Dual Storage IOA support.
This adapter requires the following drivers:
v AIX: devices.pci.1410bd02 device driver package
v Linux:
– iprutils version 2.4.1 and ipr driver version 2.0.11.6 (or later) for RHEL4 kernels
– iprutils version 2.4.1 and ipr driver version 2.2.0.2 (or later) for RHEL5 kernels
– iprutils version 2.4.1 and ipr driver version 2.2.0.2 (or later) for SLES10 kernels
If you are installing a new feature, ensure that you have the software required to support the new feature
and that you determine if there are any existing prerequisites for this feature and attaching devices. To
check for the prerequisites, use the IBM Prerequisite website at
http://www-912.ibm.com/e_dir/eServerPrereq.nsf .
Concurrent maintenance procedure
Concurrent maintenance of this double-wide adapter is not supported through the Hardware
Management Console (HMC). Concurrent maintenance must be done from within the partition operating
system. In IBM i, the Hardware Service Manager (HSM) of the system or owning partition automatically
turns off the power or turns on the power to both PCI slots when either slot is selected. In AIX or Linux,
you must manually turn off the power and turn on the power to each slot separately.
Important:
v Both PCI slots must be powered off when installing or removing this adapter with the system power
turned on.
v If this adapter is the load source IOA, or any other storage IOA with critical DASD attached for the
system, this concurrent maintenance procedure should be done by a qualified service provider.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 133
Power supply
Learn about removing and replacing a power supply in a server. Learn about removing and replacing a
power supply, fan, or input power distribution assembly in a 5802 or a 5877 expansion unit.
Removing the power supply with the system power off
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a
power supply. For instructions, see Removing a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove a power supply from the server. For instructions, see Removing a part by using the Systems
Director Management Console.
If your system is not managed by an HMC or SDMC, complete the following steps to remove a power
supply from a system while the system power is turned off:
1. Perform the prerequisite tasks, as described in Before you begin.
2. Identify the failing part, as described in Identifying a failing part.
3. If the system is running, stop the system as described in Stopping the system or logical partition.
4. Disconnect the power source from the system by unplugging the system, as described in
Disconnecting the power cords.
Note: Before continuing with this procedure, ensure that all power sources to the system have been
disconnected.
(L003)
or
134 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
5. Open the front rack door.
6. Remove the front cover, as described in Removing the front cover from an 8233-E8B or 8236-E8C
system.
7. Disconnect the power cable (A) from the front of the power supply as shown in Figure 73.
8. Press the locking lever (B) down into the open position.
9. Pull the power supply out of the system.
Figure 73. Removing the power supply from a rack-mounted model
Removal and replacement procedures for the 8233-E8B and 8236-E8C 135
If you want to replace the power supply that you removed, see Replacing the 8233-E8B or 8236-E8C
power supply with the power turned off.
Removing the power supply with the system power on
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a
power supply. For instructions, see Removing a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove a power supply from a server. For instructions, see Removing a part by using the Systems
Director Management Console.
If your system is not managed by an HMC or SDMC, complete the following steps to remove a power
supply from a system while the system power is turned on:
1. Perform the prerequisite tasks, as described in Before you begin.
2. Open the front rack door.
3. Remove the front cover, as described in Removing the front cover from an 8233-E8B or 8236-E8C
system.
4. Identify the failing part, as described in Identifying a failing part.
5. Disconnect the power cable (A) from the front of the power supply as shown in Figure 74 on page
137.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Press down on the locking latch (B) as shown in Figure 74 on page 137. Using the locking handle, pull
the power supply out of the system.
136 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
If you need to replace the power supply that you removed, see Replacing the 8233-E8B or 8236-E8C
power supply with the power turned on.
Replacing the power supply with the system power off
If your system is managed by the Hardware Management Console (HMC), use the HMC to replace a
power supply. For instructions, see Exchanging a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace a power supply in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
If your system is not managed by an HMC or SDMC, complete the following steps to replace a power
supply in a system while the system power is turned off:
1. To remove the power supply, see Removing the 8233-E8B or 8236-E8C power supply with the power
turned off.
2. Open the rear rack door.
3. Push the power supply into the system until the latch (A) locks in place as shown in Figure 75 on
page 138.
Figure 74. Removing the power supply
Removal and replacement procedures for the 8233-E8B and 8236-E8C 137
4. Reconnect the power cable (B) to the front of the power supply, as shown in Figure 75.
5. Reconnect the power cable to the back of the system.
6. Start the system, as described in Starting the system or logical partition.
7. Note the state of the light-emitting diodes (LEDs) on the power supply. For more information, see
Power supply LEDs . If the LEDs indicate the power supply is operating normal, that is both of the
green LEDs are on solid (not flashing), continue to the next step. If not, remove the power supply
from the system and repeat the procedure. If, after repeating the procedure, the power supply is not
operating normally, contact your service provider.
8. Replace the front cover, as described in Installing the front cover on an 8233-E8B or 8236-E8C system.
9. Close the rear rack door.
Figure 75. Replacing a power supply
138 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Replacing the power supply with the system power on
If your system is managed by the Hardware Management Console (HMC), use the HMC to replace a
power supply. For instructions, see Exchanging a part by using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace a power supply in the server. For instructions, see Replacing a part by using the Systems Director
Management Console.
If your system is not managed by an HMC or SDMC, complete the following steps to replace a power
supply in a system while the system power is turned on.
Important: You must have a functioning power supply in the system to use the following steps.
1. If you are replacing a redundant power supply because of a failure, remove the failing power supply.
For instructions, see Removing the 8233-E8B or 8236-E8C power supply with the power turned on.
2. Perform the prerequisite tasks, as described in Before you begin.
3. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
4. Push the power supply into place until the locking latch (A) snaps into place as shown in Figure 76
on page 140.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 139
5. Reconnect the power cable (B) to the front of the power supply.
6. Replace the front cover, as described in Installing the front cover on an 8233-E8B or 8236-E8C system.
7. Close the front rack door.
RAID enablement card
Use this procedure to service the RAID enablement card.
Preparing the system
To prepare the system for the removal of a RAID enablement card, perform the following steps:
1. Perform prerequisite tasks as described in “Before you begin” on page 115.
2. Identify the auxiliary cache card that you need to remove. For instructions, see Identifying a part.
3. Stop the system. For instructions, see Stopping a system or logical partition.
4. Disconnect the power source from the system by unplugging the system.
Attention: You must disconnect the power source from the system by disconnecting all power cords
to prevent system damage during this procedure.
Note: This system might be equipped with a second power supply. Before continuing with this
procedure, ensure that all power sources to the system have been completely disconnected.
(L003)
Figure 76. Replacing a power supply
140 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
or
5. If you have a rack-mounted system, place it in the service position. See Placing a 8233-E8B or
8236-E8C in the service position.
6. Remove the access cover. See Removing the service access cover from an 8233-E8B or 8236-E8C
system.
7. Attach the wrist strap
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
Removing the RAID enablement card
To remove a RAID enablement card, perform the following steps:
1. Locate the RAID enablement card you want to remove.
2. Press the tab (B) and slide the card out of the system as shown in the following figures. The RAID
enablement card is attached to a metal plate (A). You will slide the metal plate out of the system to
remove the RAID enablement card.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 141
3. Replace the RAID enablement card. For instructions, see Replacing the model 8233-E8B Redundant
Array of Independent Disks enablement card.
Replacing the Redundant Array of Independent Disks enablement card
You must have already completed the procedure Removing the model 8233-E8B Redundant Array of
Independent Disks enablement card.
To replace a RAID enablement card, complete the following steps:
1. Locate the slot for the RAID enablement card
Figure 77. RAID enablement card removal from a rack-mounted system unit
142 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
2. Align the RAID enablement card with the slot in the system.
3. Slide the RAID enablement card fully into the system until the tab (A) locks into place.
Figure 78. Slot location for the RAID enablement card installation
Figure 79. RAID enablement card replacement in a rack-mounted system unit
Removal and replacement procedures for the 8233-E8B and 8236-E8C 143
4. Replace the service access cover. See Installing the service access cover on an 8233-E8B or 8236-E8C
system
5. Place the system in the operating position. See Placing the 8233-E8B or 8236-E8C in the operating
position.
6. Start the system. For instructions, see Starting the system or logical partition.
7. If you replaced the RAID enablement card as part of another procedure, return to that procedure now.
If you replaced the RAID enablement card because it was not operational, verify that the new
resource is functional. For instructions, see Verifying the installed part.
System backplane
Use this procedure to service the system backplane.
Note: Before you begin this procedure, make sure that you have authority to log in as an authorized
service provider. The different authority levels are described at ASMI authority levels.
Removing the system backplane
1. Access the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
2. Check and record the server firmware and service processor settings
1. Check and record the admin user ID's password.
2. Check and record the server firmware level prior to replacing this field replaceable unit (FRU). You might need to
update the system to the latest server firmware code level after you replace this FRU.
3. Check the service processor settings that you previously set using Advanced System Management (ASM). This
can be done by using a management console or a PC to access ASM. You need to reset these service processor
settings after you replace this FRU or the default settings will be used.
4. Record any service processor settings you might have set by using operating system commands:
v System Name setting
v System Power Control settings
v System Service Aids settings
v System Configuration settings
v Network Services settings
v Performance Option settings
v Login Profile settings
v Boot List settings
v Host Ethernet Adapter settings
3. Power off the unit
If the unit that contains the backplane is powered on, power it off now.
4. Disconnect the power cords
144 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
To disconnect the power cords, complete the
following steps:
1. Open the rear rack door.
2. Disconnect all power cords from the unit
you are servicing.
5. Remove the front cover
1. Open the front rack door.
2. Remove the two thumbscrews (A) that
secure the front cover to the system unit.
3. Push the cover release latches (C) in the
direction of the arrow to release the cover
from the system unit.
4. Pull the cover out and away from the
system unit.
6. Disconnect the external cables
Label and disconnect the external cables connected to the FRU you are servicing.
7. Place the unit into the service position
Removal and replacement procedures for the 8233-E8B and 8236-E8C 145
Attention: When placing the model rack into
the service position, it is essential that all
stability plates are firmly in position to
prevent the rack from toppling. Ensure that
only one system unit is in the service position
at a time.
Remove two screws (A) securing the system
unit (B) to the rack.
8. Place the unit into the service position (continued)
Notes:
1. Ensure that the cables at the rear of the
system unit do not catch or bind as you
pull the unit forward in the rack.
2. When the system unit rails are fully
extended, the rail safety latches lock into
place. This action prevents the system
from being pulled out too far.
While holding the system-unit release latches
(A) down on both the left and right sides, pull
the system unit (B) out from the rack until the
rails are fully extended and locked.
9. Remove the service access cover
146 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Loosen the two thumbscrews (A) located
at the rear of the cover.
2. Slide the cover (B) toward the rear of the
system unit. When the front of the service
access cover has cleared the upper frame
ledge, lift the cover up and off the system
unit.
Attention: For proper cooling and airflow,
replace the cover before turning on the
system. Operating the system without the
cover for more than 30 minutes could damage
the system components.
10. Remove both power supplies
1. Disconnect the power cable (A) from the
power supply you are removing.
2. While pushing the lever (B) down, pull
the power supply straight out of the
system.
3. Repeat these steps to remove the other
power supply.
11. Remove the front fan
Removal and replacement procedures for the 8233-E8B and 8236-E8C 147
1. While squeezing the latch (A) on the front
right fan, grasp the fan firmly and pull the
fan out of the fan cage.
2. Repeat these steps until all system fans are
removed from the fan cage.
12. Remove the air baffle
Grasp the right side air baffle and pull it out
of the fan cage.
13. Remove the fan cage
148 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Loosen the two retaining thumbscrews (A)
that secure the fan cage to the system
board.
2. Lift the fan cage (B) up and out of the
enclosure.
14. Remove the disk unit cage
1. Disconnect the cables from the rear of the
disk unit backplane.
2. Rotate the handles outward in the
direction shown to release the disk unit
cage.
3. Slide the disk unit cage out of the system
unit.
15. Remove the vital product data (VPD) card
Removal and replacement procedures for the 8233-E8B and 8236-E8C 149
Remove the VPD card (B) from the VPD card
guide (A) on the planar by pulling the blue
handle.
16. Remove the thermal power management device (TPMD) card, if present
1. Push the tab (A) in the direction shown to
release the card (B).
2. Gently pull the card straight out of its slot
on the system backplane.
17. Remove the daughter RAID card, if present
150 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Push the latch (A) in the direction shown
to release the card.
2. Gently pull the card (B) straight out of the
enclosure.
18. Remove the base RAID card, if present
1. Push the tab (B) in the direction shown to
release the card.
2. Gently pull the card (A) straight out of the
enclosure.
19. Remove all PCI adapters
Removal and replacement procedures for the 8233-E8B and 8236-E8C 151
1. Record the slot number and location of
each adapter being removed.
2. Slide the adapter latch (A) in the direction
shown to release the adapter (B).
3. Carefully grasp the PCI adapter by its top
edge or upper corners, and pull the
adapter straight out of its connector on the
system backplane.
4. Store the adapter in a safe place.
5. Repeat these steps until all the PCI
adapters are removed.
20. Remove the GX adapter, if present
1. Slide the adapter latch (A) in the direction
shown to release the adapter.
2. Squeeze the latches (B) to release them
from the adapter and rotate them in the
direction shown.
3. Gently pull the adapter straight out of the
system.
4. Repeat these steps to remove a second GX
adapter, if present.
21. Remove the Ethernet card
152 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Slide the adapter latch (A) in the direction
shown to release the Ethernet card (B).
2. Carefully grasp the Ethernet card by its
top edge or upper corners, and pull the
card straight out of its connector on the
system backplane.
22. Remove the system processor assemblies and fillers
1. Move the top of the system processor
assembly handles (A) in the direction
shown to release the handles.
2. Rotate the system processor assembly
handles to the open position (B).
3. Pull the system processor assembly (C) out
by the handles.
4. Place the system processor assembly on an
electrostatic discharge (ESD) surface.
5. Repeat these steps to remove all of the
system processor assemblies and fillers
from the enclosure.
23. Remove the PCI adapter dividers
Removal and replacement procedures for the 8233-E8B and 8236-E8C 153
1. Flex the front edge (A) of the PCI adapter
divider out of the bracket and toward the
rest of the divider.
2. Pull the back edge of the divider away
from the retention notches (B) in the
system chassis.
3. Repeat these steps for the other PCI
adapter dividers that need to be removed.
24. Remove the middle system divider
1. Remove the screw (A) to unfasten the
middle system divider from the system
backplane.
2. Grasp the middle system divider (B) and
lift it out of the system.
25. Remove the line cord assembly
154 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Remove the screw (A) located on the back
of the system chassis.
2. Pull down the terra cota cable retainers
that hold the power supply cables.
3. Slide the line cord assembly (B) toward
the front of the system to unlock it.
4. Grasp the line cord assembly and pull it
out of the system.
26. Remove the front processor support
1. Remove the five screws (A) to unfasten
the front processor support.
2. Grasp the front processor support (B) and
lift it out of the system.
27. Remove the system backplane
Removal and replacement procedures for the 8233-E8B and 8236-E8C 155
1. Label and disconnect all Ethernet cables
that are present.
2. Remove the six screws (A) as shown in the
illustration.
3. Push the system backplane (B) toward the
front of the system.
4. Lift the system backplane up and out of
the system chassis.
Note: Use care when removing the
system backplane. Do not lift the system
backplane by any of the attached modules.
Replacing the system backplane
1. Install the system backplane
Note: Use care when you replace the system
backplane.
1. Carefully grasp the replacement system
backplane (A) along two edges.
Note: Ensure the pin holes on the system
backplane are lined up correctly with the
pins (B) on the system chassis.
2. Lower the system backplane at an angle,
with the rear of the backplane connecting
with the system chassis first.
3. Slide the system backplane toward the
rear of the system chassis.
4. Tighten the six screws (C), securing the
system backplane to the system chassis.
5. Reconnect all Ethernet cables that were
previously removed.
2. Install the front processor support
156 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Align the pin hole (A) on the front
processor support with the chassis guide
pin on the system backplane.
2. Insert and tighten the five screws (B) to
secure the front processor support.
3. Install the line cord assembly
1. Place the line cord assembly (A) into
position, ensuring that the slots (B) on the
line cord assembly align with the pins on
the system chassis.
2. Slide the line cord assembly toward the
rear of the system to lock it into place.
3. Ensure that the terra cota cable retainers
that hold the power supply cables are
connected.
4. Insert and tighten the screw (C) to secure
the line cord assembly.
4. Install the middle divider
Removal and replacement procedures for the 8233-E8B and 8236-E8C 157
1. Align the middle system divider (A) in
between the front and rear processor
supports.
2. Slide the middle system divider down and
into place.
3. Insert and tighten the screw (B) to secure
the middle system divider.
5. Install the PCI adapter dividers
1. Carefully grasp the PCI adapter divider by
its top edge and align the rear edge of the
divider with the retention notches (A) in
the system chassis.
2. Insert the front edge of the divider (B) into
the slots, and then press the divider into
place.
3. Repeat these steps for the other PCI
adapter dividers that need to be installed.
6. Install the system processor assemblies and fillers
158 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Rotate the system processor assembly
handles (A) to the open position.
2. Align the system processor assembly (B)
with the slot in the system frame.
3. Carefully push on the system processor
assembly until you feel it lock into place.
4. Rotate the handles to the locked position
(C).
5. Repeat these steps to install all of the
system processor assemblies and fillers.
7. Install the Ethernet card
1. Carefully grasp the Ethernet card (A) by
its top edge, and align the card with the
expansion slot and its connector on the
system backplane.
2. Press the Ethernet card firmly into its
connector.
3. Slide the adapter latch (B) back into place
in the direction shown to secure the
Ethernet card.
8. Install the GX adapter card, if previously removed
Removal and replacement procedures for the 8233-E8B and 8236-E8C 159
1. Align the adapter with the expansion slot
and its connector on the system backplane.
2. With the latches (A) in the open position,
gently push the adapter straight into the
system.
3. Rotate the latches in the direction shown
until they latch.
4. Slide the adapter latch (B) back into place
in the direction shown to secure the
adapter.
5. Repeat these steps to install a second GX
adapter, if previously removed.
9. Install all PCI adapters
1. Carefully grasp the adapter (A) by its top
edge, and align the adapter with the
expansion slot and its connector on the
system backplane.
2. Press the adapter firmly into its connector.
3. Slide the adapter latch (B) in the direction
shown to secure the adapter.
4. Repeat these steps until all PCI adapters
are installed.
10. Install the base RAID card, if previously removed
160 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Gently push the card (B) in the direction
shown straight into the enclosure.
2. Push the tab (A) in the direction shown to
fit the card into its position.
11. Install the daugther RAID card, if previously removed
1. Gently push the card (A) in the direction
shown straight into the enclosure.
2. Push the latch (B) in the direction shown
to tighten the card into its position.
12. Install the TPMD card, if previously removed
Removal and replacement procedures for the 8233-E8B and 8236-E8C 161
1. Gently push the card (A) straight into its
slot on the system backplane.
2. Push the tab (B) in the direction shown to
fit the card into its position.
13. Install the VPD card
1. Insert the VPD card (B) into the VPD card
guide (A).
2. Push until the VPD card is fully seated.
14. Install the disk unit cage
162 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Slide the replacement disk unit cage into
the system until you feel it lock.
2. Rotate the handles in the direction shown
to secure the disk unit cage.
3. Reconnect the cables on the rear of the
disk unit backplane.
15. Install the fan cage
1. Lower the fan cage (A) into its location on
the system board, aligning the locator pins
through the holes.
2. Tighten the two retaining thumbscrews (B)
securing the fan cage to the system board.
16. Install the air baffle
Removal and replacement procedures for the 8233-E8B and 8236-E8C 163
1. Lower the air baffle into its location in the
fan cage, ensuring that the arrows on the
air baffle point toward the front of the
system.
2. Push down on the air baffle until it locks
into place.
17. Install the front fan
1. Lower the fan into its location in the fan
cage.
2. Push down on the fan until it locks into
place.
18. Install both power supplies
164 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. While pushing the lever (A) down,
carefully slide the power supply into its
location in the system.
2. Connect the power cable (B) to the power
supply.
3. Repeat these steps to install the other
power supply.
19. Install the service access cover
1. Place the service access cover (A) on the
top of the system, about 25 mm (1 in.)
from the front of the system.
2. Hold the service access cover against the
system unit and slide it toward the front
of the system. The tabs of the service
access cover slide beneath the upper
chassis ledge.
3. Align the two thumbscrews (B) located on
the rear of the service access cover with
the two holes on the rear of the system
chassis.
4. Tighten the thumbscrews to secure the
service access cover.
20. Place the unit in the operating position
Removal and replacement procedures for the 8233-E8B and 8236-E8C 165
1. To unlock the rail safety latches (B), slide
the latches toward the front of the system.
2. Push the system unit (A) back into the
rack until both system-unit release latches
lock into position.
Note: Ensure that the cables at the rear of
the system do not catch or bind as you
push the system unit back into the rack.
21. Place the unit in the operating position (continued)
Replace and tighten the two screws (B) that
secure the system unit (A) to the rack.
22. Install the front cover
166 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Position the cover on the front of the
system unit so that the two thumbscrews
(C) align with the screw holes on the front
of the system unit (A).
2. Push the cover release latches (B) in the
direction of the arrows to attach the cover
to the front of the system unit as shown in
the figure.
3. Tighten the thumbscrews.
4. Close the front rack door.
23. Reconnect the external cables
Reconnect the external cables, which you previously removed, to the FRU you are servicing.
24. Connect the power cords
If you removed the power cords, connect all
power cords to the unit you are servicing.
25. Power on the system
Removal and replacement procedures for the 8233-E8B and 8236-E8C 167
26. Restore network connectivity
The installed service processor must connect to the network to be accessible from the management console.
How will the IP address of the service processor be managed?
Obtain an IP address
automatically from a
Dynamic Host
Configuration Protocol
(DHCP) server
Configure a static IP
address
? Go to step 28.
27. Restore service processor settings
Note: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address,
and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to
enable network access to the service processor.
To enable network access to the service processor, complete the following steps.
1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is
connected to the HMC port on the service processor assembly.
2. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
Yes No
Go to step 30 on page
169.
Go to step 33 on page
169.
28. Restore service processor settings
If the network connection uses static IP address assignments, complete the following steps:
1. Connect a client with a Web browser directly to the service processor network port using one of the following
URLs:
v https://169.254.2.147
v https://169.254.3.147
Note: For additional information on IP address assignments, see Accessing the ASMI using a PC or notebook
and Web browser.
2. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
3. Log in to Advanced System Management (ASM) with the user ID admin and the default password admin.
4. Change the admin user ID's password and the general user ID's password when prompted.
5. Configure network access using the static IP address.
29. Restore service processor settings
With the network connection now configured to use static IP address assignments, access the service network.
168 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Can you access the service network?
Yes No
? Go to step 34
30. Access the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
31. Set System Enclosure Type.
The enclosure serial number is a seven-digit number printed on a bar-coded label on the system
unit. When the system backplane is replaced, the seven-digit enclosure serial number and feature
code number needs to be manually inputted and stored in the new system backplane using the
Advanced System Management Interface (ASMI).
32.
Go to step 36.
33. Restore service processor settings
If the network connection uses DHCP to establish an IP address, but does not use a management console as the
DHCP server, do the following steps:
1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address
to the service processor.
2. Ensure that the service processor is connected to the existing service network by verifying that the network cable
is connected to the network port on the service processor assembly.
3. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
No Yes
? Go to step 30.
34. Restore service processor settings
You have indicated that you are still not able to access the service network.
You will need to reset the service processor network interfaces (and passwords) by moving the reset jumpers.
35.
Go to step 26 on page 168.
36. Restore service processor settings
Removal and replacement procedures for the 8233-E8B and 8236-E8C 169
Change the admin user ID's password when prompted by following these steps:
1. In the navigation area of ASM, expand Login Profile.
2. Select Change Password.
3. Specify the required information, and click Continue.
Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the
administrator ID is admin. After your initial login to ASM, the administrator passwords must be changed.
As a security measure, you are required to enter the password for your user ID in the Current password for current
user field.
37. Restore service processor settings
Set the system name. The system name can be changed to any valid ASCII string. It does not have to follow the
initialized machine type, model number, and serial number format.
To change the system name, complete the followings steps:
1. In the navigation area of ASM, expand System Configuration.
2. Select System Name.
3. Enter the desired system name using the previous naming convention.
4. Click Save settings to update the system name to the new value.
The new system name is displayed in the status frame, the area where the logout button is located. If another
method, such as the management console, is used to change the system name, the status frame does not reflect the
change.
38. Restore service processor settings
Set the time of day by completing these steps:
1. In the navigation area of ASM, expand System Configuration.
2. Select Time of Day. If the system is powered off, the right pane displays a form that shows the current date
(month, day, and year) and time (hours, minutes, and seconds).
3. Change the date value, the time value, or both, and click Update Time Of Day.
39. Restore service processor settings.
Verify if the server firmware level is same as the value recorded prior to replacing the system
backplane. If the value is different then it must be fixed. For information on how to fix the firmware,
see Getting firmware fixes.
40. Restore service processor settings
Reenter any of the following settings that you previously changed through ASM, unless you want to use the default
settings:
1. System Power settings
2. ASMI Service Aids settings
3. System Configuration settings
4. Network Services settings
5. Login Profile settings
41. Restore service processor settings
170 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Reset any service processor settings that you might have set by using the operating system commands. You
recorded these settings at the beginning of this procedure.
2. If you reset the management console access password, use one of the following methods:
For HMC:
a. From the HMC GUI (preferred method):
1) Expand the Systems Management folder in the navigation tree pane.
2) Double click the Servers folder.
3) Select a server in the central panel.
4) Under the Operations folder, select Change Password.
5) Provide the desired information and click OK.
b. From the HMC command line, type chsyspwd -m managed system -t access --passwd --newpasswd
newpassword
where:
v The value for managed systemis the new service processor's managed system name.
v No value for --passwd is entered thereby allowing authentication.
v The value for newpassword is the new password value.
For SDMC:
a. Go to the Resources tab and click Hosts.
b. Select a server in the right pane.
c. On the Actions menu, click Operations > Change Password.
d. Provide the desired information and click OK.
42. Recover partition data on the service processor
The profile data that is stored in the managed server has been cleared or corrupted.
To recover profile data by using the HMC, complete the following steps:
1. Expand the Systems Management folder in the navigation tree pane.
2. Double click the Servers folder.
3. Use the check box to select a server, in the central panel.
4. In the Configuration folder if you have the Manage Partition Data folder, select Restore.
To recover profile data using the SDMC, complete the following steps:
1. Go to the Resources tab and click Hosts.
2. Select a server in the right pane.
3. On the Actions menu, click System Configuration > Manage Partition Data > Restore.
4. Select the desired restore option and click OK.
43. Update vital product data
Removal and replacement procedures for the 8233-E8B and 8236-E8C 171
If your system is running IBM i and is not managed by the management console , you must update the load source
after replacing the system backplane.
To update the load source without using the management console, do the following steps:
1. Load the I_BASE_01 Licensed Internal Code optical media into the device that is defined for the system and
access it to perform an initial program load (IPL).
Note: This IPL can take several minutes.
2. When the Select a Language Group display is shown, press Enter to select the default language feature 2924
(English).
Note: To change the language feature, type the language feature that appears on the media that contains the
operating system, and then press Enter. Confirm the language feature by pressing Enter.
3. From the Install Licensed Internal Code display, select option 2 (Work with Dedicated Service Tools (DST)) and
press Enter.
Use the Dedicated Service Tools to make sure the disk units, including the load source, are reporting correctly and
that there are no errors that might prevent the IPL from disk. When all errors have been resolved and the load
source disk is reporting in correctly, continue with the next step.
4. From the Work with Dedicated Service Tools (DST) display, select option 2 (Work with disk units) and press
Enter.
5. Select option 8 (Update system vital product data) to identify the location of the load source disk unit. Press
Enter. The Update System Vital Product Data display appears.
6. Press Enter to confirm that you want the vital product data to be written.
7. Press F12 (Previous) twice.
8. Press 1 and press Enter to exit DST. Use the system control panel to delay turning off the system. Change the IPL
mode to B, and power on the system to perform an IPL from disk.
44. Restore boot list and Host Ethernet Adapter settings
The boot list is not present on a new system backplane, therefore; it is necessary to input the boot devices by booting
into the system management services (SMS) menus. See Selecting boot options for information on setting the boot
device list.
After the operating system is booted, the bootlist command can be used to verify, or make additional changes to, the
list of boot devices for a system or partition.
To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet Adapter.
45. Verify the repair
Go to Verifying a repair.
System processor assembly
Use this procedure to service the control panel or signal cable.
Removing the system processor assembly
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the system processor assembly from the server. For instructions, see Removing a part using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the
system processor assembly from the server. For information about using the HMC to remove the
processor voltage regulator module, see Removing a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove either the system
processor assembly or the filler for the system processor assembly:
172 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Perform the prerequisite tasks listed in “Before you begin” on page 115.
2. If necessary, identify the system processor assembly that you need to replace. See Identifying a part.
3. Stop the system or logical partition. See Stopping a system or logical partition.
4. Place the system in the service position. See Placing a 8233-E8B or 8236-E8C in the service position.
5. Remove the service access cover. See Removing the service access cover from an 8233-E8B or
8236-E8C system.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Disconnect the power cables from the primary power supply and secondary power supply if present.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been completely disconnected.
(L003)
or
8. As shown in Figure 80 on page 174, unlock the latches (A) on the system processor assembly. Using
both hands, grasp the unlocked latches (B) to pull the system processor assembly (C) through the
guide rails away from the system backplane assembly.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 173
9. Place the system processor assembly on an acceptable ESD surface.
10. If necessary, remove the memory modules to reinstall them in the new system processor assembly.
See Memory modules.
Replacing the system processor assembly
To complete this procedure, you must have already completed the removal procedure. For the removal
procedure instructions, see Removing the 8233-E8B or 8236-E8C system processor assembly.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the system processor assembly. For instructions, see Replacing a part using the Systems Director
Management Console.
If your system is managed by the Hardware Management Console (HMC), you can use the HMC to
replace the system processor assembly. For information about using the HMC to replace voltage regulator
modules, see Exchanging a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace the system processor
assembly:
Attention:
Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge
from damaging your hardware.
When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on electrical
equipment.
If you do not have a wrist strap prior to removing the product from ESD packaging and installing or
replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.
1. If necessary, remove the system processor assembly from its antistatic package.
Figure 80. Removing system processor assembly from the backplane assembly in a rack-mounted system
174 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
2. Move the existing memory modules or memory fillers in to the new system processor assembly. Use
the same configuration that you had before you removed the memory modules. See the memory
module information you printed for this procedure. If you did not print the memory module topic,
see Memory modules.
3. With the latches (A) in the unlocked position, correctly align the system processor assembly (B) and
slide it through the guide rails into the system backplane assembly.
4. Move the latches (A) in the direction of the arrow to lock the system processor assembly into
position.
5. Replace the service access cover. For instructions, see Installing the service access cover on an
8233-E8B or 8236-E8C system.
6. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in
the operating position.
7. Restore power to the system by reconnecting the system power cables.
8. Start the system or logical partition. For instructions, see Starting the system or logical partition.
9. Verify that the system processor assembly is functional. For instructions, see Verifying the installed
part.
10. Do one of the following actions:
v If you received new activation codes for your feature, see Activate Capacity on Demand or
Advanced POWER Virtualization for more information about entering the new code.
v If you did not receive a new activation code but one is required, see Order Capacity on Demand
to enter the new code order. After you receive your new activation code, see Activate Capacity on
Demand or Advanced POWER Virtualization for more information about entering the new code.
To search for an activation code for a specific system, see Capacity on Demand activation codes.
Figure 81. Installing the system processor assembly into the backplane assembly of a rack-mounted model
Removal and replacement procedures for the 8233-E8B and 8236-E8C 175
System VPD card
Use this procedure to service the system vital product data (VPD) card.
Attention: The system VPD card contains data that is vital to system operation:
v Machine type, model, and serial number
v System brand
v System unique ID
v Activation codes for Capacity on Demand (CoD) and Virtualization Engine technologies, if applicable.
If the VPD card is replaced, this data must be programmed into the replacement VPD card. If the system
is still functional prior to the replacement of the VPD card, view and record the system unique ID.
New, replacement activation codes for CoD and Virtualization Engine technologies must be generated.
The original activation codes for the system cannot be reused. Contact your next level of support for new
activation codes.
Note: If you plan to replace the VPD card and have the replacement card available, you can contact the
following e-mail address from 8 a.m. to 5 p.m. U.S. Central Time, Monday through Friday, to obtain
replacement activation codes prior to repair. You must supply machine type and serial number, and the
part number and serial number of the replacement VPD card. For System p systems contact
pcod@us.ibm.com, and for System i® systems contact icod@us.ibm.com.
To exchange the VPD card, perform the following procedure:
1. Access the ASMI
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
2. Save the system identifiers
1. On the ASM Welcome pane, specify your user ID and password, and click Log in.
Note: To perform this operation, your authority level must be administrator or authorized service provider.
2. In the navigation area, expand System Configuration ? Program Vital Product Data.
3. Select System Brand.
4. In the right pane, manually record the value for the System Brand.
5. Under Program Vital Product Data, Select System Keywords.
6. In the right pane, manually record the values for the machine type and model, system serial number, system
unique ID, and (if applicable) activation codes for Capacity on Demand (COD) and Virtualization Engine
technologies.
3. Power off the unit
If the unit that contains the VPD card is powered on, power it off now.
4. Disconnect the power cords
176 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
To disconnect the power cords, complete the
following steps:
1. Open the rear rack door.
2. Disconnect all power cords from the unit
you are servicing.
5. Remove the front cover
1. Open the front rack door.
2. Remove the two thumbscrews (A) that
secure the front cover to the system unit.
3. Push the cover release latches (C) in the
direction of the arrow to release the cover
from the system unit.
4. Pull the cover out and away from the
system unit.
6. Place the unit into the service position
Removal and replacement procedures for the 8233-E8B and 8236-E8C 177
Attention: When placing the model rack into
the service position, it is essential that all
stability plates are firmly in position to
prevent the rack from toppling. Ensure that
only one system unit is in the service position
at a time.
Remove two screws (A) securing the system
unit (B) to the rack.
7. Place the unit into the service position (continued)
Notes:
1. Ensure that the cables at the rear of the
system unit do not catch or bind as you
pull the unit forward in the rack.
2. When the system unit rails are fully
extended, the rail safety latches lock into
place. This action prevents the system
from being pulled out too far.
While holding the system-unit release latches
(A) down on both the left and right sides, pull
the system unit (B) out from the rack until the
rails are fully extended and locked.
8. Remove the service access cover
178 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Loosen the two thumbscrews (A) located
at the rear of the cover.
2. Slide the cover (B) toward the rear of the
system unit. When the front of the service
access cover has cleared the upper frame
ledge, lift the cover up and off the system
unit.
Attention: For proper cooling and airflow,
replace the cover before turning on the
system. Operating the system without the
cover for more than 30 minutes could damage
the system components.
9. Remove the vital product data (VPD) card
Remove the VPD card (B) from the VPD card
guide (A) on the planar by pulling the blue
handle.
10. Install the VPD card
Removal and replacement procedures for the 8233-E8B and 8236-E8C 179
1. Insert the VPD card (B) into the VPD card
guide (A).
2. Push until the VPD card is fully seated.
11. Install the service access cover
1. Place the service access cover (A) on the
top of the system, about 25 mm (1 in.)
from the front of the system.
2. Hold the service access cover against the
system unit and slide it toward the front
of the system. The tabs of the service
access cover slide beneath the upper
chassis ledge.
3. Align the two thumbscrews (B) located on
the rear of the service access cover with
the two holes on the rear of the system
chassis.
4. Tighten the thumbscrews to secure the
service access cover.
12. Place the unit in the operating position
180 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. To unlock the rail safety latches (B), slide
the latches toward the front of the system.
2. Push the system unit (A) back into the
rack until both system-unit release latches
lock into position.
Note: Ensure that the cables at the rear of
the system do not catch or bind as you
push the system unit back into the rack.
13. Place the unit in the operating position (continued)
Replace and tighten the two screws (B) that
secure the system unit (A) to the rack.
14. Install the front cover
Removal and replacement procedures for the 8233-E8B and 8236-E8C 181
1. Position the cover on the front of the
system unit so that the two thumbscrews
(C) align with the screw holes on the front
of the system unit (A).
2. Push the cover release latches (B) in the
direction of the arrows to attach the cover
to the front of the system unit as shown in
the figure.
3. Tighten the thumbscrews.
4. Close the front rack door.
15. Connect the power cords
If you removed the power cords, connect all
power cords to the unit you are servicing.
16. Power on the system
17. Access the ASMI
182 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to
perform general and administrator-level service tasks, such as reading service processor error logs, reading vital
product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to
as the service processor menus.
To access the ASMI, see Accessing the Advanced System Management Interface.
When you have accessed the ASMI, continue with the next step.
18. Set the system identifiers
1. On the ASM Welcome pane, if you have not already logged in, specify your user ID and password, and click Log
In.
Note: To perform this operation, your authority level must be administrator or authorized service provider.
2. In the navigation area, expand System Configuration ? Program Vital Product Data.
3. Click System Brand.
4. If the backed up system brand is correct, click Accept backed up setting and save. Otherwise, enter the correct
system brand and click Continue.
Note: If you do not know the correct system brand, contact your next level of support.
5. Click Save setting to update the system brand to the VPD.
6. Under Program Vital Product Data, click System Keywords.
7. If the backed-up system keywords are correct, click Accept backed up setting and save. Otherwise, enter the
values for the machine type and model, system serial number, system unique ID, and (if applicable) activation
codes for Capacity on Demand (COD) and Virtualization Engine technologies. Set the Reserved field to blank
spaces unless directed otherwise by Level 4 technical support.
Note: If you do not know the system unique ID, contact your next level of support. You cannot boot the system
until valid values are entered for all fields. You can change these entries only once.
8. Click Save settings to update the system keywords and to save them to the VPD.
19. Verify the repair.
Go to Verifying a repair.
Thermal power management device
Use this procedure to service the thermal power management device (TPMD).
Removing the TPMD card
For information about using the HMC to remove the TPMD card, see Removing a part by using the
Hardware Management Console
To remove the TPMD card from a system that is not managed by the HMC, complete the following steps:
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. Stop the system, as described in Stopping a system or logical partition.
3. Disconnect all power cords from the unit you are servicing.
Note: This system has a second power supply. Before continuing with this procedure, ensure that the
power source to the system has been disconnected.
(L003)
Removal and replacement procedures for the 8233-E8B and 8236-E8C 183
or
4. If you have a rack-mounted system, place it in the service position, as described in Placing a 8233-E8B
or 8236-E8C in the service position.
5. Remove the service access cover, as described in Removing the service access cover from an 8233-E8B
or 8236-E8C system.
6. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
7. Locate the TPMD card that is indicated by the amber fault LED being on.
8. Remove the TPMD card as follows:
a. Push the tab (A) in the direction shown to release the card (B).
b. Gently pull the card straight out of its slot on the system backplane.
184 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Note: The location code of the TPMD card is P1-C12 or you can identify the card with the amber
fault LED on.
If you removed the TPMD card as part of another service action, return to that procedure. If you need to
replace the TPMD card that you removed, see Replacing the 8233-E8B or 8236-E8C TPMD card.
Replacing the TPMD card
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following Web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to complete the steps for replacing the TPMD card
in the server. For information about using the HMC to replace the TPMD card, see Replacing a part by
using the Hardware Management Console.
To replace a TPMD card from a system that is not managed by an HMC, complete the following steps:
1. Complete the prerequisite tasks described in “Before you begin” on page 115.
2. Attach the wrist strap.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 185
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
3. If necessary, remove the TPMD card from its antistatic package.
4. Install the TPMD card as follows:
a. Gently push the card (A) straight into its slot on the system backplane.
b. Push the tab (B) in the direction shown to fit the card into its position.
5. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or
8236-E8C system.
6. Place the system in the operating position, as described in Placing the 8233-E8B or 8236-E8C in the
operating position.
7. If you removed the power cords, connect all power cords to the unit you are servicing.
8. Verify the installed part, as described in Verifying the installed part.
Voltage regulators
Use this procedure to service the voltage regulator.
Removing the processor voltage regulator
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the processor voltage regulator module from the server. For instructions, see Removing a feature
using the Systems Director Management Console.
186 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the
processor voltage regulator modules from the server. For information about using the HMC to remove
the processor voltage regulator module, see Removing a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove a memory voltage
regulator module:
Note: Ensure that you keep a screwdriver handy before you begin with this procedure.
1. Perform the prerequisite tasks as described in “Before you begin” on page 115.
2. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or
8236-E8C system
3. Open the back rack door.
4. Stop the system. For instructions, see Stopping a system or logical partition.
5. Disconnect the power source from the system by unplugging the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been completely disconnected.
(L003)
or
6. Place the system in the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the
service position.
7. Remove the access cover. For instructions, see Removing the service access cover from an 8233-E8B
or 8236-E8C system.
8. Attach the wrist strap.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 187
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. Remove the system processor assembly. (Skip the steps you have already completed.) For
instructions, see Removing the 8233-E8B or 8236-E8C system processor assembly.
10. Remove the airflow cover by lifting the clear shield from the system.
a. Loosen the screws (A).
b. Slide the cover (B) from under the flange (C) and remove from the system processor assembly.
11. Locate the processor voltage regulator module to be removed from the system processor assembly.
Figure 82. Processor voltage regulator module location (C1)
188 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
12. To remove the processor voltage regulator module from the system processor assembly, follow these
steps:
a. Grasp the latch end of both of the locked mechanical arms, and pull the locking latch (A) back to
unlock, as shown in the following figure.
b. Lift both of the mechanical voltage regulator arms (A) upward at the same time until they are in
the vertical position, as shown in Figure 84 on page 190.
c. Carefully pull up on both of the arms (A), at the same time, to lift the processor voltage regulator
module from the system processor assembly.
Figure 83. Unlocking the processor voltage regulator module arms
Removal and replacement procedures for the 8233-E8B and 8236-E8C 189
13. Place the voltage regulator module on an electrostatic discharge (ESD) surface.
Replacing the processor voltage regulator
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the processor voltage regulator module. For instructions, see Replacing a feature using the
Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), you can use the HMC to
replace the processor voltage regulator module. For information about using the HMC to replace voltage
regulator modules, see Exchanging a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace a processor voltage
regulator module:
1. Remove the voltage regulator module from its antistatic package. Locate the connectors in the
system processor assembly to place the module.
Figure 84. Removing the processor voltage regulator module from the system
190 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
2. Open both of the locking arms on the processor voltage regulator module, at the same time, until
they extend straight up from the voltage regulator module.
3. To replace the voltage regulator module, complete the following steps:
Note: When reinstalling processor voltage regulator module, ensure that the DIMM latches are fully
engaged and latched to the DIMMs or blanks to avoid possible damage to the DIMM latches.
a. Lift the voltage regulator module by both of the arms (A). Hold the voltage regulator module
above the system processor assembly, aligning the connectors of the voltage regulator module
with the connectors on the system processor assembly.
b. Align the holes in the processor voltage regulator module (B) with the large mechanical pins (C)
on the system processor assembly. Slide the voltage regulator module down onto the system
processor assembly until the arms touch the top of the pins.
Figure 85. Processor voltage regulator module shown above the location in the system processor assembly
Removal and replacement procedures for the 8233-E8B and 8236-E8C 191
c. Use both hands to apply downward force on all four corners of the processor voltage regulator
module, as shown by the arrows. When the arms are in the half open position, the voltage
regulator module is in place.
d. Using one finger on each hand, apply pressure downward on both processor voltage regulator
arms at the same time, to move to the close position. Press down at the end of each arm A until
they reach the locked position.
192 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
4. Install the airflow cover as shown in the following figure.
a. To install the airflow cover, slide the cover (A) under the flange (B).
b. Tighten the screws (C).
The memory voltage regulator is locked into place.
5. Replace the system processor assembly. (Skip the steps you have already completed.) For
instructions, see Replacing the 8233-E8B or 8236-E8C system processor assembly
6. Replace the service access cover: For instructions, see Installing the service access cover on an
8233-E8B or 8236-E8C system.
7. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in
the operating position.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 193
8. Replace the front cover. For instructions, see Installing the front cover on an 8233-E8B or 8236-E8C
system.
9. Reconnect all power and signal cables to their respective connectors.
10. Close the back rack door or the back system door.
11. Start the system. For instructions, see Starting the system or logical partition.
12. Verify that the new resource is working correctly. For instructions, see Verifying the installed part.
Removing the memory voltage regulator
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the memory voltage regulator module from the server. For instructions, see Removing a feature
using the Systems Director Management Console.
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the
memory voltage regulator modules from the server. For information about using the HMC to remove the
memory voltage regulator module, see Removing a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove a memory voltage
regulator module:
Note: Ensure that you keep a screwdriver handy before you continue with this procedure.
1. Perform the prerequisite tasks as described in “Before you begin” on page 115.
2. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or
8236-E8C system.
3. On a rack-mounted system unit, open the back rack door.
4. Stop the system. For instructions, see Stopping a system or logical partition.
5. Disconnect the power source from the system by unplugging the system.
Note: This system has a second power supply. Before continuing with this procedure, ensure that
the power source to the system has been completely disconnected.
(L003)
or
194 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
6. Place the system in the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the
service position.
7. Remove the access cover. For instructions, see Removing the service access cover from an 8233-E8B
or 8236-E8C system.
8. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. Remove the system processor assembly. (Skip the steps you have already completed.) For more
information, see Removing the 8233-E8B or 8236-E8C system processor assembly.
10. Remove the airflow cover by lifting the clear shield from the system.
a. Loosen the screws (A).
b. Slide the cover (B) from under the flange (C) and remove from the system processor assembly.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 195
11. Locate the memory voltage regulator module to be removed from the system processor assembly.
Figure 86 shows the memory voltage regulator (C10) in relation to the location on the system
processor assembly.
12. Remove the memory voltage regulator module (A) from the connectors, as shown in the following
figure:
a. Grasp the top or back edge (A) of the memory voltage regulator module to loosen from the
connectors.
b. Place your thumb under bottom edge (B)the voltage regulator module and lift to remove the
module from the connectors.
Figure 86. Memory voltage regulator module location (C10)
196 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
13. Place the voltage regulator module on an electrostatic discharge (ESD) surface.
Replacing the memory voltage regulator
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the memory voltage regulator module. For instructions, see Replacing a feature using the Systems
Director Management Console.
If your system is managed by the Hardware Management Console (HMC), you can use the HMC to
replace the memory voltage regulator module. For information about using the HMC to replace voltage
regulator modules, see Exchanging a part using the Hardware Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace a memory voltage
regulator module:
1. Remove the voltage regulator module from its antistatic package. Locate the connectors on the
system processor assembly to connect the module.
Figure 87. Removing the memory voltage regulator module
Removal and replacement procedures for the 8233-E8B and 8236-E8C 197
2. Install the voltage regulator module by completing the following steps:
a. Carefully grasp the voltage regulator module along two edges and align it with the connectors.
b. Insert the voltage regulator module (A) into the connectors and firmly push down into position.
3. Install the airflow cover as shown in the following figure.
a. To install the airflow cover, slide the cover (A) under the flange (B).
b. Tighten the screws (C).
The memory voltage regulator is locked into place.
Figure 88. Memory voltage regulator module (A) shown above the voltage regulator connectors on the system
processor assembly
Figure 89. Installing the memory voltage regulator module
198 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
4. Replace the system processor assembly. (Skip the steps you have already completed.) For
instructions, see Replacing the 8233-E8B or 8236-E8C system processor assembly
5. Replace the service access cover. For instructions, see Installing the service access cover on an
8233-E8B or 8236-E8C system
6. Place the system in the operating position. See Placing the 8233-E8B or 8236-E8C in the operating
position.
7. Replace the front cover. For a rack-mounted system unit, see Installing the front cover on an
8233-E8B or 8236-E8C system.
8. Reconnect all power and signal cables to their respective connectors.
9. Close the back rack door or the back system door.
10. Start the system. See Starting the system or logical partition.
11. Verify that the new resource is working correctly. For instructions, see Verifying the installed part.
Verifying a repair
Use these procedures to verify hardware operation after making repairs to the system.
Choose from the following options:
v To verify the repair of a system that is currently powered off, go to step 1.
v To verify the repair of a system that is currently powered on with no operating system loaded, go to
step 3 on page 200.
v To verify the repair of a system that is currently powered on and that has an operating system loaded,
go to step 5 on page 200.
1. Power on the server and all attached I/O enclosures.
Did all the enclosures power on?
Yes: Go to step 3 on page 200.? Go to step 3 on page 200.
No: Continue with the next step.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 199
2. Choose from the following options:
v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate
and replace the next field-replaceable unit (FRU).
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
3. Load the operating system.
Did the operating system load successfully?
Yes: Go to step 5.
No: Continue with the next step.
4. Choose from the following options:
v If the original problem was a failing disk drive that contained the operating system software, go to step 5.
v If the original problem was that the operating system would not load and you have another FRU to replace, go to
your FRU locations section to locate the next FRU.
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that the operating system would not load and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that the operating system would not load and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
5. Choose from the following options:
v “Verifying the repair in AIX”
v “Verifying the repair in Linux” on page 203
v “Verifying a repair using an IBM i system or logical partition” on page 204
Verifying the repair in AIX
You can use this procedure to verify that a repair is complete using the AIX operating system.
Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed.
1. Did you replace a disk drive in the root volume group?
No Go to step 3 on page 201.
Yes Continue with the next step.
2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM)
server.
200 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Did you encounter any problems?
No Reinstall the operating system and continue with step 5.
Yes If the original problem still exists, replace the
field-replaceable unit (FRU) or perform the isolation
procedure that is next in the FRU list. If you have reached
the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
3. Did you replace a FRU with the power turned on and concurrently with system operations?
No Go to step 5.
Yes Continue with the next step.
4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
Yes Go to step 6.
No Go to step 7.
Note: The AIX diagnostic service aid was
used if a resource was removed using the Hot
Plug task.
5.
Note: If any FRUs have been removed that should be reinstalled, reinstall them now.
Perform the following steps:
1. If the system is not powered on, power it on now.
2. Perform a slow boot.
3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or
display apparently has stopped.
4. Did you encounter any problems?
No Continue at step 6.
Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU
list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis.
6. If the Resource Repair Action menu is already displayed, go to step 9 on page 202; otherwise,
perform the following steps:
1. Log into the operating system either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an SRN displays, suspect a
loose card or connection. If no instructions display, no resources
were detected as missing. Continue with the next step.
7. Perform the following steps:
Removal and replacement procedures for the 8233-E8B and 8236-E8C 201
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select System verification.
5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged,
and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual
FRU.
Did the Resource Repair Action menu (801015) display?
No Continue with the next step.
Yes Go to step 9.
8. Did the Testing Complete, no trouble was found menu (801010) display?
Yes Use the Log Repair Action option, if not previously logged,
in the TASK SELECTION menu to update the AIX error log.
If the repair action was reseating a cable or adapter, select
the resource associated with that repair action.
If the resource associated with your action is not displayed
on the resource list, select sysplanar0.
Note: If the system attention indicator is on, this action will
set it back to the normal state.
Go to step 11 on page 203.
No If the original problem still exists, replace the FRU or
perform the isolation procedure that is next in the FRU list.
If you have reached the end of the FRU list, contact your
next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on that resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
1. Select the resource that has been replaced from the Resource
Repair Action menu. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If
the resource associated with your action is not displayed on the
resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
No If the No Trouble Found menu displays, go to step 11 on
page 203.
Yes Continue with the next step.
10. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
202 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on the resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
1. From the RESOURCE REPAIR ACTION menu, select the parent
or child of the resource that has been replaced. If the repair action
was reseating a cable or adapter, select the resource associated
with that repair action. If the resource associated with your action
is not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
3. If the No Trouble Found menu displays, continue with the next
step.
11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter
cache card or changing the configuration?
Note: This information does not apply to the PCI-X RAID adapter or cache.
No Go to the close of call procedure.
Yes Continue with the next step.
12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the
following steps:
1. On the PCI SCSI Disk Array Manager display, select Recovery
options.
2. If a previous configuration exists on the replacement adapter, this
must be cleared. Select Clear PCI SCSI Adapter Configuration
and press F3.
3. On the Recovery Options screen, select Resolve PCI SCSI RAID
Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration screen,
select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the
adapter that you changed.
6. On the next screen, press Enter.
7. When you see the Are You Sure selection menu, press Enter to
continue.
8. If you see a Failed status message, verify that you selected the
correct adapter, and then repeat this procedure. When the
recovery is complete, exit the operating system.
9. Go to the close of call procedure.
Verifying the repair in Linux
You can use this procedure to verify that a repair is complete using the Linux operating system.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 203
1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM)
server. See Running the stand-alone diagnostics from CD-ROM.
Did you encounter any problems?
No Reboot the operating system and continue with the close of call procedure.
Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure
that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis and repair the new problem.
Verifying a repair using an IBM i system or logical partition
Use this procedure to verify a repair using the IBM i operating system.
1. Was the system powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 3.
2. Perform the following tasks:
a. Verify that the power cable is plugged into the power outlet.
b. Verify that power is available at the customer's power outlet.
3. Was the partition powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 6.
4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type
mode and speed options in the Service functions).
5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the
system complete the IPL?
Yes: Continue with the next step.
No: This might be a new problem. Go to the Start of call procedure. This ends the procedure.
6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O
adapter, or storage device replaced?
Yes: Continue with step 10.
No: Continue with the next step.
7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look
for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any
reference codes that are related to this IPL?
Yes: Continue with the next step.
No: If the problem was related to removable media or communications, perform the verification
procedures in the Service functions to verify that the problem is corrected. Then return the
system to the customer and have the customer verify the system date and time. This ends the
procedure.
8. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure. This ends the
procedure.
9. Are there any other failing items that remain to be replaced?
Yes: Replace the next failing item listed for this reference code. This ends the procedure.
No: Contact your next level of support for assistance. This ends the procedure.
10. Was concurrent maintenance performed on an optical storage unit?
204 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Yes: The product activity log and service action log, in most cases, contain a reference code for
the optical storage unit when concurrent maintenance is performed. You can ignore this reference
code. Perform the following:
- Perform the verification procedures in the Service functions topic to verify that the problem is
corrected.
- Return the system to the customer and have the customer verify the system date and time. This
ends the procedure.
No: Continue with the next step.
11. Use the service action log to look for any new reference codes (see Using the Service Action Log).
Are there any new reference codes?
Yes: Continue with the next step.
No: Go to step 14.
12. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure to determine the
cause of the problem. This ends the procedure.
13. Are there any other failing items that need to be replaced?
Yes: Replace the next failing item listed for the reference code. This ends the procedure..
No: Contact your next level of support for assistance. This ends the procedure.
14. Are you working with a tape device?
Yes: Perform the verification procedures in the Service functions to verify that the problem is
corrected. After the verification test is complete, the tape device description will be set to the
failed state because a resource change was detected. Perform the following tasks:
- Vary the tape device description off and then on.
- Return the system to the customer and have the customer verify the system date and time.
Then go to Verifying the repair from the HMC. This ends the procedure.
No: Continue with the next step.
15. Are you working with an IOP or an IOA?
Yes: Use the display hardware configuration service function to check for any missing or failed
hardware:
- On the command line, enter the STRSST (Start System Service Tools command). If you cannot get
to SST, select DST. Do not IPL the system or partition to get to DST.
- On the Start Service Tools Sign On display, enter the user ID with the service authority and
password.
- Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System
bus resources.
- Select the function key for Include nonreporting resources.
- If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has
not been fixed. Continue to the next failing item in the failing item list. This ends the procedure.
No: Perform the verification procedures in the Service functions topics to verify that the problem
is corrected. Resources that usually vary on automatically during an IPL, or that were previously
varied on manually, might need to be varied on again after the verification procedures are
complete. Return the system to the customer and have the customer verify the system date and
time. This ends the procedure.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 205
Closing a service call
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other serviceable
events might have been opened. Close all serviceable events that were opened as a result of your
service activity.
v Ensure that server verification has been performed and no problems require additional service actions.
v If the repair was done using the management console online repair procedures, ensure that the original
serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference. Is the server managed by a management console?
v Yes: Continue with the next step.
v No: Do one of the following steps:
– If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using
Integrated Virtualization Manager” on page 210.
– If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call
using AIX or Linux” on page 214.
2. On the HMC, open Manage Serviceable Events and examine the service action event log for any
open service action events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 218. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32
on page 209 for each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code that is associated with this service action event the same as recorded in Step 1?
206 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 206?
v Yes: Go to step 11.
v No: Continue with the next step.
10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 206 in the list of
FRUs for this service action event?
v Yes: Continue with the next step.
v No: Go to step 32 on page 209.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16.
15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from
processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of
partitions obtained in step 11 in future steps, the list is empty. Go to step 17.
17. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 207
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK. The
following steps will add or update FRU information.
20. Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU
information. Go to step 23.
22. Select the No FRU Replaced for this Serviceable Event option.
23. Click OK to close the service action event.
24. Is the list of all partitions you recorded in step 11 on page 207 empty?
v Yes: Go to step 32 on page 209.
v No: Continue with the next step.
25. Does the list of all partitions you recorded in step 11 on page 207 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 32 on page 209.
26. Is the error class recorded in step 25 AIX?
v Yes: Continue with the next step.
v No: Go to step 32 on page 209.
27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
207, except the partition you were using to debug the original problem.
28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the
AIX command prompt.
29. When the diagnostic operating instructions are displayed, do the following steps:
208 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
30. Exit from diagnostics in this partition and return to the AIX prompt.
31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 207 been
processed?
v Yes: Continue with the next step.
v No: Go to step 24 on page 208 to process the next partition in the list you recorded in step 11 on page 207.
32. Have all the serviceable events recorded in step 4 on page 206 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 206 and process the next service action event in the list of serviceable events recorded in
step 4 on page 206.
33. While processing all service action events, were you directed to step 14 on page 207?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 218. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
34. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 207, except the partition you were using to debug the original problem.
35. From the management console virtual terminal window of a partition in the list of Axx partitions,
type diag at the AIX command prompt.
36. When the diagnostic operating instructions are displayed, do the following steps:
Removal and replacement procedures for the 8233-E8B and 8236-E8C 209
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
37. Exit from diagnostics in this partition and return to the AIX prompt.
38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 207 been
processed?
v Yes: Continue with the next step.
v No: Go to step 34 on page 209 to process the next partition in the list you recorded in step 14 on page 207.
39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 218. This completes the repair. Return the system to the customer.
Note: If during the processing of the list of open service action events, some service action events
remained open, further service actions might be required to complete the repair.
Closing a service call using Integrated Virtualization Manager
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and there are no problems that require additional
service actions.
v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures,
ensure that the original serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference.
2. On the IVM, open Manage Serviceable Events and look at existing serviceable events.
210 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 218. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 213 for
each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code associated with this service action event the same as
recorded in step 1 on page 210?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 210?
v Yes: Go to step 11.
v No: Continue with the next step.
10. Is the FRU you replaced and recorded in step 1 on page 210 in the list of FRUs for this service action
event?
v Yes: Continue with the next step.
v No: Go to step 30 on page 213.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
Removal and replacement procedures for the 8233-E8B and 8236-E8C 211
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 211.
Go to step 16.
15. Add the partition list obtained in step 11 on page 211 to the existing list of Axx partitions obtained
from processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11 on page 211. If you are referred
to the list of partitions obtained in step 11 on page 211 in future steps, the list is empty. Go to step
17.
17. Select and highlight the service action event from the Manage Serviceable Events window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK.
20. Add or update FRU information:
Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. Click OK to close the service action event.
22. Is the list of all partitions you recorded in step 11 on page 211 empty?
v Yes: Go to step 30 on page 213.
v No: Continue with the next step.
23. Does the list of all partitions you recorded in step 11 on page 211 contain more than one entry?
212 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Yes: Continue with the next step.
v No: Go to step 30.
24. Is the error class recorded in step 23 on page 212?
v Yes: Continue with the next step.
v No: Go to step 30.
25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
211, except the partition you were using to debug the original problem.
26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX
command prompt.
27. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
28. Exit from diagnostics in this partition and return to the AIX prompt.
29. Have all the partitions in the list of all partitions you recorded in step 11 on page 211 been
processed?
v Yes: Continue with the next step.
v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 211.
30. Have all the serviceable events recorded in step 4 on page 211 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 211 and process the next service action event in the list of serviceable events recorded in
step 4 on page 211.
31. While processing all service action events, were you directed to step 14 on page 212?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 218. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 213
32. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 212, except the partition you were using to debug the original problem.
33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the
AIX command prompt.
34. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
35. Exit from diagnostics in this partition and return to the AIX prompt.
36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 212 been
processed?
v Yes: Continue with the next step.
v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 212.
37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 218. This completes the repair. Return the system to the customer.
Note: If, during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
Closing a service call using AIX or Linux
If the server is not connected to an management console and not using Integrated Virtualization Manager
(IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the
server to return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
214 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Ensure that server verification has been performed and that no problems require additional service
actions.
v If the repair was done using the IVM online repair procedures, ensure that the original serviceable
event is now closed.
1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
v Yes: Go to step 4
v No: Continue with the next step.
2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that
were removed during problem analysis that you want to put back into the system?
Note: If the system planar or battery has been replaced and you are loading diagnostics from a
server over a network, it might be necessary for the customer to set the network boot information
for this system before diagnostics can be loaded. The system time and date information should also
be set when the repair is completed.
v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3
v No: Continue with the next step.
3. Is the system or logical partition that you are performing a repair action on running the AIX
operating system?
v Yes: Continue with the next step.
v No: Go to step 5.
4. Does the system or logical partition you are performing a repair action on have AIX installed?
Note: Answer no to this question if you have just replaced a hard disk in the root volume group.
v Yes: Go to step 7 on page 216.
v No: Continue with the next step.
5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a
Network Installation Management (NIM) server.
Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to
Running the stand-alone diagnostics from CD on a server without an HMC attached.
For instructions on running stand-alone diagnostics from a NIM server, go to Running the
stand-alone diagnostics from a Network Installation Management server.
Did you encounter any problems?
v Yes: Go to problem analysis.
v No: Continue with the next step.
6. The system hardware is functioning correctly.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 215
If the system attention LED is still on, turn off the LED as described
in “Activating and deactivating LEDs” on page 218.
This completes the repair.
Note: If, during the processing of the list of open service action
events, some service action events remained open, further service
actions might be required to complete the repair.
Return the server to the state that the customer normally uses, such
as IPL type, IPL mode, and the way the system is configured or
partitioned. This might require you to reboot the operating system.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
7. Complete the following steps:
1. If the system supports slow boot (see Performing a slow boot), do
a slow boot on the system. If the system does not support slow
boot, do a normal boot.
2. Power on the system.
3. Wait until the AIX operating system login prompt displays or
until system activity on the operator panel or display apparently
has stopped.
Did the AIX Login Prompt display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
8. If the Resource Repair Action menu is already displayed, go to 12 on page 217; otherwise, do the
following steps:
1. Log into the operating system, either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an system request number
(SRN) displays, suspect a loose card or connection. If no
instructions display, no resources were detected as missing.
Continue with 9.
9. Complete the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select
Problem determination.
5. When the Advanced Diagnostic Selection menu displays, select
the All Resources option, or test the FRUs you exchanged, and
any devices that are attached to the FRUs you exchanged, by
selecting the diagnostics for the individual FRU.
Did the Resource Repair Action menu (801015) display?
216 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
v Yes: Go to step 13.
v No: Continue with the next step.
10. Did the TESTING COMPLETE, no trouble was found menu (801010) display?
v Yes: Continue with the next step.
v No: There is still a problem. Go to problem analysis.
11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to
update the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action.
If the resource associated with your action is not displayed on the
resource list, select sysplanar0.
Note: If the system attention indicator is on, this will set it back to
the normal state. Go to step 14 on page 218.
12. When a test is run on a resource in system verification mode, and that resource has an entry in the
AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This
updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Do the following steps:
1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was
reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with
your action is not displayed on the resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
v Yes: Continue with the next step.
v No: If the No Trouble Found menu displays, go to step 14 on page 218.
13. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, and
that resource has an entry in the AIX error log, if the test on the
resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU
from the Resource Repair Action menu. This updates the AIX error
log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it
back to the normal state.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 217
Do the following steps:
1. From the Resource Repair Action menu, select the parent or child
of the resource that has been replaced. If the repair action was
reseating a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action is
not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
Did the No Trouble Found menu display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing
the configuration?
Note: This does not refer to the PCI-X RAID adapter or cache.
v Yes: Continue with the next step.
v No: Go to step 16.
15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following
steps:
1. On the PCI SCSI Disk Array Manager dialog, select Recovery options.
2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI
Adapter Configuration and press F3.
3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed.
6. On the next dialog, press Enter.
7. When you see the Are You Sure selection menu, press Enter to continue.
8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that
you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating
system.
9. Go to 16.
16. The system hardware is functioning correctly. Return the server to the state that the customer
normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
Activating and deactivating LEDs
You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the
management console or the Advanced System Management Interface (ASMI).
218 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
For IBM Power Systems™ servers that contain the POWER7® processor, the LEDs can be used to identify
or verify a part that you are servicing. The error and identify function (amber) LED indicates an error
and corresponds to the location code in the system reference code (SRC). The LED is activated and
deactivated automatically.
Additionally, the following procedures can also be used to activate and deactivate LEDs.
v “Deactivating a system attention LED or partition LED using the management console”
v “Activating or deactivating an identify LED using the management console”
v “Deactivating a system attention LED or logical partition LED using the Advanced System
Management Interface” on page 220
v “Activating or deactivating an identify LED using the Advanced System Management Interface” on
page 221
Deactivating a system attention LED or partition LED using the management
console
You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not
a high priority and you decide to repair the problem at a later time. You can do this task from the
Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
If you want to be alerted if another problem occurs, you must deactivate the system attention LED so
that it can be activated again if another problem occurs.
To deactivate a system attention LED by using HMC, complete the following steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on by checking the box next to its name.
3. Open Operations.
4. Open LED Status.
5. Select View System Attention. The system attention LED window opens. The selected system and its
LED state are displayed in the upper part of the window. The logical partition and its LED state are
displayed in the lower part of the window. From the system attention LED window, you can
deactivate both the system attention LED and the logical partition LED.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
To deactivate a system attention LED by using SDMC, complete the following steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> System Attention LED.
3. Click OK.
Activating or deactivating an identify LED using the management console
You can activate or deactivate an identify LED for components attached to the system from the Hardware
Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
Removal and replacement procedures for the 8233-E8B and 8236-E8C 219
The system provides several LEDs that help identify various components, such as enclosures or
field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on.
3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens.
4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select
Selected ? List FRUs.
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these
steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> Identify LED.
3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click
List FRUs.
5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Deactivating a system attention LED or logical partition LED using the Advanced
System Management Interface
You can deactivate a system attention LED or a logical partition LED using the Advanced System
Management Interface (ASMI).
The system attention indicator provides a visual signal that the system as a whole requires attention or
service. Each system has a single system attention indicator. When an event occurs that either needs your
intervention or that of service and support, the system attention indicator lights continuously. The system
attention indicator is turned on when an entry is made in the service processor error log. The error entry
is transmitted to the system error log and the operating system error log.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To turn off the system attention indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
220 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention
Indicator.
3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error
message is displayed.
Activating or deactivating an identify LED using the Advanced System
Management Interface
You can activate or deactivate an identify LED using the Advanced System Management Interface
(ASMI).
You can specify the location code of any indicator to view or modify its current state. If you provide the
wrong location code, the advanced system manager attempts to go to the next higher level of the location
code.
The next level is the base-level location code for that field replaceable unit (FRU). For example, a user
types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If
the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt
to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no
other level is available.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To change the current state of an indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location
code.
3. In the right pane, enter the location code of the FRU and click Continue.
4. Select the preferred state from the list.
5. Click Save settings.
Removal and replacement procedures for the 8233-E8B and 8236-E8C 221
222 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer's representative for information on the products and services
currently available in your area. Any reference to the manufacturer's product, program, or service is not
intended to state or imply that only that product, program, or service may be used. Any functionally
equivalent product, program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not grant you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
The manufacturer may make improvements and/or changes in the product(s) and/or the program(s)
described in this publication at any time without notice.
Any references in this information to websites not owned by the manufacturer are provided for
convenience only and do not in any manner serve as an endorsement of those websites. The materials at
those websites are not part of the materials for this product and use of those websites is at your own risk.
The manufacturer may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Any performance data contained herein was determined in a controlled environment. Therefore, the
results obtained in other operating environments may vary significantly. Some measurements may have
been made on development-level systems and there is no guarantee that these measurements will be the
same on generally available systems. Furthermore, some measurements may have been estimated through
extrapolation. Actual results may vary. Users of this document should verify the applicable data for their
specific environment.
Information concerning products not produced by this manufacturer was obtained from the suppliers of
those products, their published announcements or other publicly available sources. This manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products not produced by this manufacturer. Questions on the capabilities of products
not produced by this manufacturer should be addressed to the suppliers of those products.
All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal
without notice, and represent goals and objectives only.
The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are
subject to change without notice. Dealer prices may vary.
© Copyright IBM Corp. 2010 223
This information is for planning purposes only. The information herein is subject to change before the
products described become available.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
If you are viewing this information in softcopy, the photographs and color illustrations may not appear.
The drawings and specifications contained herein shall not be reproduced in whole or in part without the
written permission of the manufacturer.
The manufacturer has prepared this information for use with the specific machines indicated. The
manufacturer makes no representations that it is suitable for any other purpose.
The manufacturer's computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience
unplanned outages, system failures, power fluctuations or outages, or component failures must verify the
accuracy of operations performed and data saved or transmitted by the system at or near the time of the
outage or failure. In addition, users must establish procedures to ensure that there is independent data
verification before relying on such data in sensitive or critical operations. Users should periodically check
the manufacturer's support websites for updated information and fixes applicable to the system and
related software.
Ethernet connection usage restriction
This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces
of public telecommunications networks.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Red Hat, the Red Hat "Shadow Man" logo, and all Red Hat-based trademarks and logos are trademarks
or registered trademarks of Red Hat, Inc., in the United States and other countries.
Other company, product, or service names may be trademarks or service marks of others.
Electronic emission notices
When attaching a monitor to the equipment, you must use the designated monitor cable and any
interference suppression devices supplied with the monitor.
Class A Notices
The following Class A statements apply to the IBM servers that contain the POWER7 processor and its
features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
224 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Federal Communications Commission (FCC) statement
Note: This equipment has been tested and found to comply with the limits for a Class A digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference when the equipment is operated in a commercial environment. This equipment
generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, may cause harmful interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the user will be
required to correct the interference at his own expense.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. IBM is not responsible for any radio or television interference caused by using other than
recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class A digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for
commercial and industrial environments to provide reasonable protection against interference with
licensed communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Warning: This is a Class A product. In a domestic environment, this product may cause radio
interference, in which case the user may be required to take adequate measures.
Notices 225
VCCI Statement - Japan
The following is a summary of the VCCI Japanese statement in the box above:
This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a
domestic environment, radio interference may occur, in which case, the user may be required to take
corrective actions.
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Electromagnetic Interference (EMI) Statement - People's Republic of China
Declaration: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may need to perform practical action.
226 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Electromagnetic Interference (EMI) Statement - Taiwan
The following is a summary of the EMI Taiwan statement above.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
IBM Taiwan Contact Information:
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Notices 227
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Electromagnetic Interference (EMI) Statement - Russia
Terms and conditions
Permissions for the use of these publications is granted subject to the following terms and conditions.
Personal Use: You may reproduce these publications for your personal, noncommercial use provided that
all proprietary notices are preserved. You may not distribute, display or make derivative works of these
publications, or any portion thereof, without the express consent of the manufacturer.
Commercial Use: You may reproduce, distribute and display these publications solely within your
enterprise provided that all proprietary notices are preserved. You may not make derivative works of
these publications, or reproduce, distribute or display these publications or any portion thereof outside
your enterprise, without the express consent of the manufacturer.
228 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either
express or implied, to the publications or any information, data, software or other intellectual property
contained therein.
The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its
discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer,
the above instructions are not being properly followed.
You may not download, export or re-export this information except in full compliance with all applicable
laws and regulations, including all United States export laws and regulations.
THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF
ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED
WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR
PURPOSE.
Notices 229
230 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures
Red Hat Cluster Suite
Conguring and Managing a
Cluster
86 A2 35ER 00
Red Hat Cluster Suite: Conguring and Managing a Cluster
Copyright © 2000-2004 by Red Hat, Inc.Mission Critical Linux, Inc.K.M. Sorenson
Red Hat, Inc.
1801 Varsity Drive
Raleigh NC 27606-2072 USA
Phone: +1 919 754 3700
Phone: 888 733 4281
Fax: +1 919 754 3701
PO Box 13588
Research Triangle Park NC 27709 USA
rh-cs(EN)-3-Print-RHI (2004-06-04T17:42)
For Part I Using the Red Hat Cluster Manager and Part III Appendixes, permission is granted to copy, distribute and/or modify
this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free
Software Foundation. A copy of the license is available at http://www.gnu.org/licenses/fdl.html. The content described in this
paragraph is copyrighted by © Mission Critical Linux, Inc. (2000), K.M. Sorenson (2000), and Red Hat, Inc. (2000-2003).
This material in Part II Conguring a Linux Virtual Server Cluster may be distributed only subject to the terms and conditions
set forth in the Open Publication License, V1.0 or later (the latest version is presently available at
http://www.opencontent.org/openpub/). Distribution of substantively modied versions of this material is prohibited without
the explicit permission of the copyright holder. Distribution of the work or derivative of the work in any standard (paper) book
form for commercial purposes is prohibited unless prior permission is obtained from the copyright holder. The content
described in this paragraph is copyrighted by © Red Hat, Inc. (2000-2003).
Red Hat, Red Hat Network, the Red Hat "Shadow Man" logo, RPM, Maximum RPM, the RPM logo, Linux Library,
PowerTools, Linux Undercover, RHmember, RHmember More, Rough Cuts, Rawhide and all Red Hat-based trademarks and
logos are trademarks or registered trademarks of Red Hat, Inc. in the United States and other countries.
Linux is a registered trademark of Linus Torvalds.
Motif and UNIX are registered trademarks of The Open Group.
XFree86 is a trademark of The XFree86 Project, Inc, and is pending registration.
Intel and Pentium are registered trademarks of Intel Corporation. Itanium and Celeron are trademarks of Intel Corporation.
AMD, Opteron, Athlon, Duron, and K6 are registered trademarks of Advanced Micro Devices, Inc.
Netscape is a registered trademark of Netscape Communications Corporation in the United States and other countries.
Java and Swing are trademarks or registered trademarks of Sun Microsystems, Inc. in the U.S. or other countries.
Oracle is a registered trademark, and Oracle8i, Oracle9i, and interMedia are trademarks or registered trademarks of Oracle
Corporation.
Microsoft andWindows are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or
other countries.
SSH and Secure Shell are trademarks of SSH Communications Security, Inc.
FireWire is a trademark of Apple Computer Corporation.
IBM, AS/400, OS/400, RS/6000, S/390, and zSeries are registered trademarks of International Business Machines
Corporation. eServer, iSeries, and pSeries are trademarks of International Business Machines Corporation.
All other trademarks and copyrights referred to are the property of their respective owners.
The GPG ngerprint of the security@redhat.com key is:
CA 20 86 86 2B D6 9D FC 65 F6 EC C4 21 91 80 CD DB 42 A6 0E
Table of Contents
Acknowledgments ................................................................................................................................ i
Introduction........................................................................................................................................iii
1. How To Use This Manual ....................................................................................................iii
2. Document Conventions........................................................................................................ iv
3. More to Come ...................................................................................................................... vi
3.1. Send in Your Feedback ......................................................................................... vi
4. Sign Up for Support............................................................................................................ vii
I. Using the Red Hat Cluster Manager ............................................................................................ ix
1. Red Hat Cluster Manager Overview..................................................................................... 1
1.1. Red Hat Cluster Manager Features ........................................................................ 2
2. Hardware Installation and Operating System Conguration ................................................ 5
2.1. Choosing a Hardware Conguration ..................................................................... 5
2.2. Setting Up the Members ...................................................................................... 15
2.3. Installing and Conguring Red Hat Enterprise Linux ......................................... 17
2.4. Setting Up and Connecting the Cluster Hardware............................................... 21
3. Cluster Conguration.......................................................................................................... 35
3.1. Installing the Red Hat Cluster Suite Packages..................................................... 35
3.2. Installation Notes for Red Hat Enterprise Linux 2.1 Users ................................. 37
3.3. The Cluster Conguration Tool ........................................................................ 37
3.4. Conguring the Cluster Software ........................................................................ 40
3.5. Editing the rawdevices File .............................................................................. 41
3.6. Conguring Cluster Daemons.............................................................................. 42
3.7. Adding and Deleting Members............................................................................ 46
3.8. Conguring a Power Controller Connection ....................................................... 47
3.9. Conguring a Failover Domain ........................................................................... 49
3.10. Adding a Service to the Cluster ......................................................................... 51
3.11. Checking the Cluster Conguration................................................................... 53
3.12. Conguring syslogd Event Logging ............................................................... 55
4. Service Administration ....................................................................................................... 59
4.1. Conguring a Service .......................................................................................... 59
4.2. Displaying a Service Conguration..................................................................... 62
4.3. Disabling a Service .............................................................................................. 63
4.4. Enabling a Service ............................................................................................... 63
4.5. Modifying a Service............................................................................................. 63
4.6. Relocating a Service ............................................................................................ 64
4.7. Deleting a Service ................................................................................................ 64
4.8. Handling Failed Services ..................................................................................... 64
5. Database Services ............................................................................................................... 67
5.1. Setting Up an Oracle Service............................................................................... 67
5.2. Tuning Oracle Service ......................................................................................... 72
5.3. Setting Up a MySQL Service .............................................................................. 73
6. Network File Sharing Services ........................................................................................... 77
6.1. Setting Up an NFS Service .................................................................................. 77
6.2. Using the NFS Druid .......................................................................................... 77
6.3. NFS Caveats......................................................................................................... 82
6.4. Importing the Contents of an NFS Exports File .................................................. 82
6.5. NFS Conguration: Active-Active Example ....................................................... 83
6.6. Setting Up a Samba Service................................................................................. 84
6.7. Using the Samba Druid........................................................................................ 86
6.8. Fields in the smb.conf.sharename File........................................................... 90
7. Setting Up Apache HTTP Server........................................................................................ 93
7.1. Apache HTTP Server Setup Overview ................................................................ 93
7.2. Conguring Shared Storage................................................................................. 93
7.3. Installing and Conguring the Apache HTTP Server.......................................... 94
8. Cluster Administration........................................................................................................ 97
8.1. Overview of the Cluster Status Tool.................................................................. 97
8.2. Displaying Cluster and Service Status................................................................. 97
8.3. Starting and Stopping the Cluster Software......................................................... 99
8.4. Modifying the Cluster Conguration................................................................. 100
8.5. Backing Up and Restoring the Cluster Database............................................... 100
8.6. Modifying Cluster Event Logging ..................................................................... 101
8.7. Updating the Cluster Software........................................................................... 101
8.8. Changing the Cluster Name............................................................................... 102
8.9. Disabling the Cluster Software .......................................................................... 102
8.10. Diagnosing and Correcting Problems in a Cluster........................................... 102
II. Conguring a Linux Virtual Server Cluster ........................................................................... 107
9. Introduction to Linux Virtual Server................................................................................. 109
9.1. Technology Overview........................................................................................ 109
9.2. Basic Congurations.......................................................................................... 109
10. Linux Virtual Server Overview....................................................................................... 111
10.1. A Basic LVS Conguration ............................................................................. 111
10.2. A Three Tiered LVS Conguration.................................................................. 113
10.3. LVS Scheduling Overview............................................................................... 113
10.4. Routing Methods.............................................................................................. 115
10.5. Persistence and Firewall Marks ....................................................................... 117
10.6. LVS Cluster ? A Block Diagram ................................................................... 117
11. Initial LVS Conguration................................................................................................ 121
11.1. Conguring Services on the LVS Routers ....................................................... 121
11.2. Setting a Password for the Piranha Conguration Tool ............................... 122
11.3. Starting the Piranha Conguration Tool Service.......................................... 122
11.4. Limiting Access To the Piranha Conguration Tool .................................... 123
11.5. Turning on Packet Forwarding......................................................................... 124
11.6. Conguring Services on the Real Servers ....................................................... 124
12. Setting Up a Red Hat Enterprise Linux LVS Cluster...................................................... 125
12.1. The NAT LVS Cluster ...................................................................................... 125
12.2. Putting the Cluster Together ............................................................................ 127
12.3. Multi-port Services and LVS Clustering.......................................................... 128
12.4. FTP In an LVS Cluster..................................................................................... 130
12.5. Saving Network Packet Filter Settings ............................................................ 132
13. Conguring the LVS Routers with Piranha Conguration Tool ................................. 133
13.1. Necessary Software.......................................................................................... 133
13.2. Logging Into the Piranha Conguration Tool .............................................. 133
13.3. CONTROL/MONITORING......................................................................... 134
13.4. GLOBAL SETTINGS ................................................................................... 135
13.5. REDUNDANCY............................................................................................. 137
13.6. VIRTUAL SERVERS .................................................................................... 139
13.7. Synchronizing Conguration Files .................................................................. 147
13.8. Starting the Cluster .......................................................................................... 148
III. Appendixes................................................................................................................................ 149
A. Using Red Hat Cluster Manager with Piranha................................................................. 151
B. Using Red Hat GFS with Red Hat Cluster Suite.............................................................. 153
B.1. Terminology ...................................................................................................... 153
B.2. Changes to Red Hat Cluster .............................................................................. 154
B.3. Installation Scenarios ........................................................................................ 154
C. The GFS Setup Druid....................................................................................................... 157
C.1. Cluster Name..................................................................................................... 157
C.2. LOCK_GULM parameters .................................................................................. 157
C.3. Choose Location for CCS Files......................................................................... 158
C.4. Cluster Members ............................................................................................... 159
C.5. Saving Your Conguration and Next Steps....................................................... 161
D. Supplementary Hardware Information............................................................................. 163
D.1. Setting Up Power Controllers ........................................................................... 163
D.2. SCSI Bus Conguration Requirements ............................................................ 165
D.3. SCSI Bus Termination ...................................................................................... 166
D.4. SCSI Bus Length............................................................................................... 166
D.5. SCSI Identication Numbers ............................................................................ 167
E. Supplementary Software Information .............................................................................. 169
E.1. Cluster Communication Mechanisms................................................................ 169
E.2. Failover and Recovery Scenarios ...................................................................... 170
E.3. Common Cluster Behaviors: General................................................................ 170
E.4. Common Behaviors: Two Member Cluster with Disk-based Tie-breaker ........ 172
E.5. Common Behaviors: 2-4 Member Cluster with IP-based Tie-Breaker ............. 173
E.6. Common Behaviors: 3-5 Member Cluster ........................................................ 173
E.7. Common Behaviors: Cluster Service Daemons ................................................ 174
E.8. Common Behaviors: Miscellaneous.................................................................. 175
E.9. The cluster.xml File..................................................................................... 175
F. Cluster Command-line Utilities........................................................................................ 179
F.1. Using redhat-config-cluster-cmd........................................................... 179
F.2. Using the shutil Utility................................................................................... 180
F.3. Using the clusvcadm Utility ............................................................................ 180
F.4. Using the clufence Utility .............................................................................. 181
Index................................................................................................................................................. 183
Colophon.......................................................................................................................................... 191
Acknowledgments
The Red Hat Cluster Manager software was originally based on the open source Kimberlite
(http://oss.missioncriticallinux.com/kimberlite/) cluster project, which was developed by Mission
Critical Linux, Inc.
Subsequent to its inception based on Kimberlite, developers at Red Hat have made a large number
of enhancements and modications. The following is a non-comprehensive list highlighting some of
these enhancements.
? Packaging and integration into the Red Hat installation paradigm to simplify the end user's experience.
? Addition of support for multiple cluster members.
? Addition of support for high availability NFS services.
? Addition of support for high availability Samba services.
? Addition of the Cluster Conguration Tool, a graphical conguration tool.
? Addition of the Cluster Status Tool, a graphical monitoring and administration tool.
? Addition of support for failover domains.
? Addition of support for Red Hat GFS, including the GFS Setup Druid and the LOCK_GULM
fencing driver.
? Addition of support for using watchdog timers as a data integrity provision.
? Addition of service monitoring which automatically restart a failed application.
? Rewrite of the service manager to facilitate additional cluster-wide operations.
? A set of miscellaneous bug xes.
The Red Hat Cluster Manager software incorporates STONITH compliant power switch modules
from the Linux-HA project; refer to http://www.linux-ha.org/stonith/.
ii Acknowledgments
Introduction
The Red Hat Cluster Suite is a collection of technologies working together to provide data integrity
and the ability to maintain application availability in the event of a failure. Administrators can deploy
enterprise cluster solutions using a combination of hardware redundancy along with the failover and
load-balancing technologies in Red Hat Cluster Suite.
Red Hat Cluster Manager is a high-availability cluster solution specically suited for database applications,
network le servers, and World Wide Web (Web) servers with dynamic content. An Red Hat
Cluster Manager system features data integrity and application availability using redundant hardware,
shared disk storage, power management, and robust cluster communication and application failover
mechanisms.
Administrators can also deploy highly available applications services using Piranha, a load-balancing
and advanced routing cluster solution based on Linux Virtual Server (LVS) technology. Using Piranha,
administrators can build highly available e-commerce sites that feature complete data integrity and
service availability, in addition to load balancing capabilities. Refer to Part II Conguring a Linux
Virtual Server Cluster for more information.
This guide assumes that the user has an advanced working knowledge of Red Hat Enterprise Linux and
understands the concepts of server computing. For more information about using Red Hat Enterprise
Linux, refer to the following resources:
? Red Hat Enterprise Linux Installation Guide for information regarding installation.
? Red Hat Enterprise Linux Introduction to System Administration for introductory information for
new Red Hat Enterprise Linux system administrators.
? Red Hat Enterprise Linux System Administration Guide for more detailed information about con-
guring Red Hat Enterprise Linux to suit your particular needs as a user.
? Red Hat Enterprise Linux Reference Guide provides detailed information suited for more experienced
users to refer to when needed, as opposed to step-by-step instructions.
? Red Hat Enterprise Linux Security Guide details the planning and the tools involved in creating a
secured computing environment for the data center, workplace, and home.
HTML, PDF, and RPM versions of the manuals are available on the Red Hat Enterprise Linux Documentation
CD and online at:
http://www.redhat.com/docs/
1. How To Use This Manual
This manual contains information about setting up a Red Hat Cluster Manager system. These tasks
are described in Chapter 2 Hardware Installation and Operating System Conguration and Chapter 3
Cluster Conguration.
For information about conguring Red Hat Cluster Manager cluster services, refer to Chapter 4 Service
Administration through Chapter 7 Setting Up Apache HTTP Server.
Part II Conguring a Linux Virtual Server Cluster describes how to achieve load balancing in an Red
Hat Enterprise Linux cluster by using the Linux Virtual Server.
For information about deploying a Red Hat Cluster Manager cluster in conjunction with Red Hat GFS
using the GFS Setup Druid, refer to Appendix C The GFS Setup Druid.
Appendix D Supplementary Hardware Information contains detailed conguration information on
specic hardware devices and shared storage congurations. Appendix E Supplementary Software
Information contains background information on the cluster software and other related information.
iv Introduction
Appendix F Cluster Command-line Utilities provides usage and reference information on the
command-line utilities included with Red Hat Cluster Suite.
This guide assumes you have a thorough understanding of Red Hat Enterprise Linux system administration
concepts and tasks. For detailed information on Red Hat Enterprise Linux system administration,
refer to the Red Hat Enterprise Linux System Administration Guide. For reference information
on Red Hat Enterprise Linux, refer to the Red Hat Enterprise Linux Reference Guide.
2. Document Conventions
When you read this manual, certain words are represented in different fonts, typefaces, sizes, and
weights. This highlighting is systematic; different words are represented in the same style to indicate
their inclusion in a specic category. The types of words that are represented this way include the
following:
command
Linux commands (and other operating system commands, when used) are represented this way.
This style should indicate to you that you can type the word or phrase on the command line
and press [Enter] to invoke a command. Sometimes a command contains words that would be
displayed in a different style on their own (such as le names). In these cases, they are considered
to be part of the command, so the entire phrase is displayed as a command. For example:
Use the cat testfile command to view the contents of a le, named testfile, in the current
working directory.
file name
File names, directory names, paths, and RPM package names are represented this way. This style
should indicate that a particular le or directory exists by that name on your system. Examples:
The .bashrc le in your home directory contains bash shell denitions and aliases for your own
use.
The /etc/fstab le contains information about different system devices and le systems.
Install the webalizer RPM if you want to use a Web server log le analysis program.
application
This style indicates that the program is an end-user application (as opposed to system software).
For example:
Use Mozilla to browse the Web.
[key]
A key on the keyboard is shown in this style. For example:
To use [Tab] completion, type in a character and then press the [Tab] key. Your terminal displays
the list of les in the directory that start with that letter.
[key]-[combination]
A combination of keystrokes is represented in this way. For example:
The [Ctrl]-[Alt]-[Backspace] key combination exits your graphical session and return you to the
graphical login screen or the console.
Introduction v
text found on a GUI interface
A title, word, or phrase found on a GUI interface screen or window is shown in this style. Text
shown in this style is being used to identify a particular GUI screen or an element on a GUI
screen (such as text associated with a checkbox or eld). Example:
Select the Require Password checkbox if you would like your screensaver to require a password
before stopping.
top level of a menu on a GUI screen or window
A word in this style indicates that the word is the top level of a pulldown menu. If you click on
the word on the GUI screen, the rest of the menu should appear. For example:
Under File on a GNOME terminal, the New Tab option allows you to open multiple shell
prompts in the same window.
If you need to type in a sequence of commands from a GUI menu, they are shown like the
following example:
Go to Main Menu Button (on the Panel) => Programming => Emacs to start the Emacs text
editor.
button on a GUI screen or window
This style indicates that the text can be found on a clickable button on a GUI screen. For example:
Click on the Back button to return to the webpage you last viewed.
computer output
Text in this style indicates text displayed to a shell prompt such as error messages and responses
to commands. For example:
The ls command displays the contents of a directory. For example:
Desktop about.html logs paulwesterberg.png
Mail backupfiles mail reports
The output returned in response to the command (in this case, the contents of the directory) is
shown in this style.
prompt
A prompt, which is a computer's way of signifying that it is ready for you to input something, is
shown in this style. Examples:
$
#
[stephen@maturin stephen]$
leopard login:
user input
Text that the user has to type, either on the command line, or into a text box on a GUI screen, is
displayed in this style. In the following example, text is displayed in this style:
To boot your system into the text based installation program, you must type in the text command
at the boot: prompt.
replaceable
Text used for examples which is meant to be replaced with data provided by the user is displayed
in this style. In the following example, ?? version-number
is displayed in this style:
vi Introduction
The directory for the kernel source is /usr/src/
version-number
/, where
version-number
is the version of the kernel installed on this system.
Additionally, we use several different strategies to draw your attention to certain pieces of information.
In order of how critical the information is to your system, these items are marked as note, tip,
important, caution, or a warning. For example:
Note
Remember that Linux is case sensitive. In other words, a rose is not a ROSE is not a rOsE.
Tip
The directory /usr/share/doc/ contains additional documentation for packages installed on your
system.
Important
If you modify the DHCP conguration le, the changes will not take effect until you restart the DHCP
daemon.
Caution
Do not perform routine tasks as root ? use a regular user account unless you need to use the root
account for system administration tasks.
Warning
Be careful to remove only the necessary Red Hat Enterprise Linux partitions. Removing other partitions
could result in data loss or a corrupted system environment.
3. More to Come
This manual is part of Red Hat's growing commitment to provide useful and timely support to Red
Hat Enterprise Linux users.
Introduction vii
3.1. Send in Your Feedback
If you spot a typo, or if you have thought of a way to make this manual better, we would love to
hear from you. Please submit a report in Bugzilla (http://bugzilla.redhat.com/bugzilla/) against the
component rh-cs.
Be sure to mention the manual's identier:
rh-cs(EN)-3-Print-RHI (2004-06-04T17:42)
By mentioning this manual's identier, we know exactly which version of the guide you have.
If you have a suggestion for improving the documentation, try to be as specic as possible. If you
have found an error, please include the section number and some of the surrounding text so we can
nd it easily.
4. Sign Up for Support
If you have a variant of Red Hat Enterprise Linux 3, please remember to sign up for the benets you
are entitled to as a Red Hat customer.
Registration enables access to the Red Hat Services you have purchased, such as technical support
and Red Hat Network. To register your product, go to:
http://www.redhat.com/apps/activate/
Note
You must activate your product before attempting to connect to Red Hat Network. If your product
has not been activated, Red Hat Network rejects registration to channels to which the system is not
entitled.
Good luck, and thank you for choosing Red Hat Enterprise Linux!
The Red Hat Documentation Team
viii Introduction
I. Using the Red Hat Cluster Manager
Clustered systems provide reliability, scalability, and availability to critical production services. Using
the Red Hat Cluster Manager, administrators can create high availability clusters for lesharing, Web
servers, databases, and more. This part discusses the installation and conguration of cluster systems
using the recommended hardware and Red Hat Enterprise Linux.
This section is licensed under the GNU Free Documentation License. For details refer to the Copyright
page.
Table of Contents
1. Red Hat Cluster Manager Overview............................................................................................. 1
2. Hardware Installation and Operating System Conguration .................................................... 5
3. Cluster Conguration................................................................................................................... 35
4. Service Administration................................................................................................................. 59
5. Database Services.......................................................................................................................... 67
6. Network File Sharing Services..................................................................................................... 77
7. Setting Up Apache HTTP Server ................................................................................................ 93
8. Cluster Administration................................................................................................................. 97
Chapter 1.
Red Hat Cluster Manager Overview
Red Hat Cluster Manager allows administrators to connect separate systems (called members or
nodes) together to create failover clusters that ensure application availability and data integrity under
several failure conditions. Administrators can use Red Hat Cluster Manager with database applications,
le sharing services, web servers, and more.
To set up a failover cluster, you must connect the member systems (often referred to simply as members
or nodes) to the cluster hardware, and congure the members into the cluster environment. The
foundation of a cluster is an advanced host membership algorithm. This algorithm ensures that the
cluster maintains complete data integrity at all times by using the following methods of inter-member
communication:
? Network connections between the cluster systems for heartbeat
? Shared state on shared disk storage to hold cluster status
To make an application and data highly available in a cluster, you must congure a service (such
as an application and shared disk storage) as a discrete, named group of properties and resources to
which you can assign an IP address to provide transparent client access. For example, you can set up
a service that provides clients with access to highly-available database application data.
You can associate a service with a failover domain, a subset of cluster members that are eligible to
run the service. In general, any eligible member can run the service and access the service data on
shared disk storage. However, each service can run on only one cluster member at a time, in order to
maintain data integrity. You can specify whether or not the members in a failover domain are ordered
by preference. You can also specify whether or not a service is restricted to run only on members of
its associated failover domain. (When associated with an unrestricted failover domain, a service can
be started on any cluster member in the event no member of the failover domain is available.)
You can set up an active-active conguration in which the members run different services, or a hotstandby
conguration in which a primary member runs all the services, and a backup cluster system
takes over only if the primary system fails.
Figure 1-1 shows an example of a cluster in an active-active conguration.
Figure 1-1. Example Cluster in Active-Active Conguration
2 Chapter 1. Red Hat Cluster Manager Overview
If a hardware or software failure occurs, the cluster automatically restarts the failed member's services
on the functional member. This service failover capability ensures that no data is lost, and there is little
disruption to users. When the failed member recovers, the cluster can re-balance the services across
the members.
In addition, you can cleanly stop the services running on a cluster system and then restart them on
another system. This service relocation capability allows you to maintain application and data availability
when a cluster member requires maintenance.
1.1. Red Hat Cluster Manager Features
Cluster systems deployed with Red Hat Cluster Manager include the following features:
? No-single-point-of-failure hardware conguration
Clusters can include a dual-controller RAID array, multiple network channels, and redundant uninterruptible
power supply (UPS) systems to ensure that no single failure results in application down
time or loss of data.
Alternately, a low-cost cluster can be set up to provide less availability than a no-single-point-offailure
cluster. For example, you can set up a cluster with a single-controller RAID array and only
a single Ethernet channel.
Certain low-cost alternatives, such as software RAID and multi-initiator parallel SCSI, are not
compatible or appropriate for use on the shared cluster storage. Refer to Section 2.1 Choosing a
Hardware Conguration, for more information.
? Service conguration framework
Clusters allow you to easily congure individual services to make data and applications highly
available. To create a service, you specify the resources used in the service and properties for the
service, including the service name, application start, stop, and status script, disk partitions, mount
points, and the cluster members on which you prefer the service to run. After you add a service,
the cluster management software stores the information in a cluster conguration le on shared
storage, where the conguration data can be accessed by all cluster members.
The cluster provides an easy-to-use framework for database applications. For example, a database
service serves highly-available data to a database application. The application running on a cluster
member provides network access to database client systems, such as Web servers. If the service
fails over to another member, the application can still access the shared database data. A networkaccessible
database service is usually assigned an IP address, which is failed over along with the
service to maintain transparent access for clients.
The cluster service framework can be easily extended to other applications, as well.
? Failover domains
By assigning a service to a restricted failover domain, you can limit the members that are eligible
to run a service in the event of a failover. (A service that is assigned to a restricted failover domain
cannot be started on a cluster member that is not included in that failover domain.) You can order
the members in a failover domain by preference to ensure that a particular member runs the service
(as long as that member is active). If a service is assigned to an unrestricted failover domain, the
service starts on any available cluster member (if none of the members of the failover domain are
available).
? Data integrity assurance
To ensure data integrity, only one member can run a service and access service data at one time.
The use of power switches in the cluster hardware conguration enables a member to power-cycle
another member before restarting that member's services during the failover process. This prevents
any two systems from simultaneously accessing the same data and corrupting it. Although not
required, it is recommended that power switches are used to guarantee data integrity under all failure
Chapter 1. Red Hat Cluster Manager Overview 3
conditions. Watchdog timers are an optional variety of power control to ensure correct operation of
service failover.
? Cluster administration user interface
The cluster administration interface facilitiates management tasks such as: creating, starting, and
stopping services; relocating services from one member to another; modifying the cluster conguration
(to add or remove services or resources); and monitoring the cluster members and services.
? Ethernet channel bonding
To monitor the health of the other members, each member monitors the health of the remote power
switch, if any, and issues heartbeat pings over network channels. With Ethernet channel bonding,
multiple Ethernet interfaces are congured to behave as one, reducing the risk of a single-point-offailure
in the typical switched Ethernet connection between systems.
? Shared storage for quorum information
Shared state information includes whether the member is active. Service state information includes
whether the service is running and which member is running the service. Each member checks to
ensure that the status of the other members is up to date.
In a two-member cluster, each member periodically writes a timestamp and cluster state information
to two shared cluster partitions located on shared disk storage. To ensure correct cluster operation,
if a member is unable to write to both the primary and shadow shared cluster partitions at startup
time, it is not allowed to join the cluster. In addition, if a member is not updating its timestamp, and
if heartbeats to the system fail, the member is removed from the cluster.
Figure 1-2 shows how members communicate in a cluster conguration. Note that the terminal
server used to access system consoles via serial ports is not a required cluster component.
Figure 1-2. Cluster Communication Mechanisms
4 Chapter 1. Red Hat Cluster Manager Overview
? Service failover capability
If a hardware or software failure occurs, the cluster takes the appropriate action to maintain application
availability and data integrity. For example, if a member completely fails, another member
(in the associated failover domain, if used, or in the cluster) restarts its services. Services already
running on this member are not disrupted.
When the failed member reboots and is able to write to the shared cluster partitions, it can rejoin the
cluster and run services. Depending on how the services are congured, the cluster can re-balance
the services among the members.
? Manual service relocation capability
In addition to automatic service failover, a cluster allows you to cleanly stop services on one member
and restart them on another member. You can perform planned maintenance on a member
system while continuing to provide application and data availability.
? Event logging facility
To ensure that problems are detected and resolved before they affect service availability, the cluster
daemons log messages by using the conventional Linux syslog subsystem. You can customize the
severity level of the logged messages.
? Application monitoring
The infrastructure in a cluster can optionally monitor the state and health of an application. In this
manner, should an application-specic failure occur, the cluster automatically restarts the application.
In response to the application failure, the application attempts to be restarted on the member it
was initially running on; failing that, it restarts on another cluster member. You can specify which
members are eligible to run a service by assigning a failover domain to the service.
Chapter 2.
Hardware Installation and Operating System
Conguration
To set up the hardware conguration and install Red Hat Enterprise Linux, follow these steps:
? Choose a cluster hardware conguration that meets the needs of applications and users; refer to
Section 2.1 Choosing a Hardware Conguration.
? Set up and connect the members and the optional console switch and network switch or hub; refer
to Section 2.2 Setting Up the Members.
? Install and congure Red Hat Enterprise Linux on the cluster members; refer to Section 2.3 Installing
and Conguring Red Hat Enterprise Linux.
? Set up the remaining cluster hardware components and connect them to the members; refer to
Section 2.4 Setting Up and Connecting the Cluster Hardware.
After setting up the hardware conguration and installing Red Hat Enterprise Linux, install the cluster
software.
Tip
Refer to the Red Hat Hardware Compatibility List available at http://hardware.redhat.com/hcl/ for a
list of compatible hardware. Perform a Quick Search for the term cluster to nd results for power
switch and shared storage hardware certied for or compatible with Red Hat Cluster Manager. For
general system hardware compatibility searches, use manufacturer, brand, and/or model keywords
to check for compatibility with Red Hat Enterprise Linux.
2.1. Choosing a Hardware Conguration
The Red Hat Cluster Manager allows administrators to use commodity hardware to set up a cluster
conguration that meets the performance, availability, and data integrity needs of applications and
users. Cluster hardware ranges from low-cost minimum congurations that include only the components
required for cluster operation, to high-end congurations that include redundant Ethernet
channels, hardware RAID, and power switches.
Regardless of conguration, the use of high-quality hardware in a cluster is recommended, as hardware
malfunction is a primary cause of system down time.
Although all cluster congurations provide availability, some congurations protect against every
single point of failure. In addition, all cluster congurations provide data integrity, but some congurations
protect data under every failure condition. Therefore, administrators must fully understand the
needs of their computing environment and also the availability and data integrity features of different
hardware congurations to choose the cluster hardware that meets the proper requirements.
When choosing a cluster hardware conguration, consider the following:
Performance requirements of applications and users
Choose a hardware conguration that provides adequate memory, CPU, and I/O resources. Be
sure that the conguration chosen can handle any future increases in workload as well.
6 Chapter 2. Hardware Installation and Operating System Conguration
Cost restrictions
The hardware conguration chosen must meet budget requirements. For example, systems with
multiple I/O ports usually cost more than low-end systems with fewer expansion capabilities.
Availability requirements
If a computing environment requires the highest degree of availability, such as a production environment,
then a cluster hardware conguration that protects against all single points of failure,
including disk, storage interconnect, Ethernet channel, and power failures is recommended. Environments
that can tolerate an interruption in availability, such as development environments,
may not require as much protection. Refer to Section 2.4.3 Conguring UPS Systems and Section
2.4.4 Conguring Shared Disk Storagefor more information about using redundant hardware for
high availability.
Data integrity under all failure conditions requirement
Using power switches in a cluster conguration guarantees that service data is protected under
every failure condition. These devices enable a member to power cycle another member before
restarting its services during failover. Power switches protect against data corruption if an unresponsive
(or hanging) member becomes responsive after its services have failed over and then
issues I/O to a disk that is also receiving I/O from the other member.
In addition, if a quorum daemon fails on a member, the member is no longer able to monitor the
shared cluster partitions. If you are not using power switches in the cluster, this error condition
may result in services being run on more than one member, which can cause data corruption.
Refer to Section 2.4.2 Conguring Power Switches for more information about the benets of
using power switches in a cluster. It is recommended that production environments use power
switches or watchdog timers in the cluster conguration.
2.1.1. Shared Storage Requirements
The operation of the cluster depends on reliable, coordinated access to shared storage. In the event
of hardware failure, it is desirable to be able to disconnect one member from the shared storage for
repair without disrupting the other members. Shared storage is truly vital to the cluster conguration.
Testing has shown that it is difcult, if not impossible, to congure reliable multi-initiator parallel
SCSI congurations at data rates above 80MB/sec using standard SCSI adapters. Further tests have
shown that these congurations cannot support online repair because the bus does not work reliably
when the HBA terminators are disabled, and external terminators are used. For these reasons, multiinitiator
SCSI congurations using standard adapters are not supported. Either single-initiator SCSI
bus adapters (connected to multi-ported storage) or Fibre Channel adapters are required.
The Red Hat Cluster Manager requires that all cluster members have simultaneous access to the shared
storage. Certain host RAID adapters are capable of providing this type of access to shared RAID units.
These products require extensive testing to ensure reliable operation, especially if the shared RAID
units are based on parallel SCSI buses. These products typically do not allow for online repair of
a failed member. Only host RAID adapters listed in the Red Hat Hardware Compatibility List are
supported.
The use of software RAID, or software Logical Volume Management (LVM), is not supported on
shared storage. This is because these products do not coordinate access from multiple hosts to shared
storage. Software RAID or LVMmay be used on non-shared storage on cluster members (for example,
boot and system partitions, and other le systems which are not associated with any cluster services).
2.1.2. Minimum Hardware Requirements
A minimum hardware conguration includes only the hardware components that are required for
cluster operation, as follows:
Chapter 2. Hardware Installation and Operating System Conguration 7
? Two servers to run cluster services
? Ethernet connection for sending heartbeat pings and for client network access
? Shared disk storage for the shared cluster partitions and service data
The hardware components described in Table 2-1 can be used to set up a minimum cluster conguration.
This conguration does not guarantee data integrity under all failure conditions, because it does
not include power switches. Note that this is a sample conguration; it is possible to set up a minimum
conguration using other hardware.
Warning
The minimum cluster conguration is not a supported solution and should not be used in a production
environment, as it does not guarantee data integrity under all failure conditions.
Hardware Description
Two servers Each member includes a network interface for client access and
for Ethernet connections and a SCSI adapter (termination
disabled) for the shared storage connection
Two network cables with RJ45
connectors
Network cables connect an Ethernet network interface on each
member to the network for client access and heartbeat pings.
RAID storage enclosure The RAID storage enclosure contains one controller with at least
two host ports.
Two HD68 SCSI cables Each cable connects one HBA to one port on the RAID
controller, creating two single-initiator SCSI buses.
Table 2-1. Example of Minimum Cluster Conguration
The minimum hardware conguration is the most cost-effective cluster conguration; however, it
includes multiple points of failure. For example, if the RAID controller fails, then all cluster services
become unavailable. When deploying the minimal hardware conguration, software watchdog timers
should be congured as a data integrity provision. Refer to Section D.1.2.3 Conguring a Hardware
Watchdog Timer for details.
To improve availability, protect against component failure, and guarantee data integrity under all failure
conditions, the minimum conguration can be expanded, as described in Table 2-2.
Problem Solution
Disk failure Hardware RAID to replicate data across multiple disks
RAID controller failure Dual RAID controllers to provide redundant access to
disk data
Heartbeat failure Ethernet channel bonding and failover
Power source failure Redundant uninterruptible power supply (UPS) systems
Data corruption under all failure
conditions
Power switches or hardware-based watchdog timers
Table 2-2. Improving Availability and Guaranteeing Data Integrity
8 Chapter 2. Hardware Installation and Operating System Conguration
A no single point of failure hardware conguration that guarantees data integrity under all failure
conditions can include the following components:
? At least two servers to run cluster services
? Ethernet connection between each member for heartbeat pings and for client network access
? Dual-controller RAID array to replicate shared partitions and service data
? Power switches to enable each member to power-cycle the other members during the failover process
? Ethernet interfaces congured to use channel bonding
? At least two UPS systems for a highly-available source of power
The components described in Table 2-3 can be used to set up a no single point of failure cluster con-
guration that includes two single-initiator SCSI buses and power switches to guarantee data integrity
under all failure conditions. Note that this is a sample conguration; it is possible to set up a no single
point of failure conguration using other hardware.
Hardware Description
Two servers (up to 8 supported) Each member includes the following hardware:
Two network interfaces for:
Point-to-point Ethernet connections
Client network access and Ethernet heartbeat pings
Three serial ports for:
Remote power switch connection
Connection to the terminal server
One Adaptec 29160 adapter (termination enabled) for the shared
disk storage connection.
One network switch A network switch enables the connection of multiple members to
a network.
One Cyclades terminal server A terminal server allows for management of remote members
from a central location. (A terminal server is not required for
cluster operation.)
Four network cables Network cables connect the terminal server and a network
interface on each member to the network switch.
Two RJ45 to DB9 crossover
cables
RJ45 to DB9 crossover cables connect a serial port on each
member to the Cyclades terminal server.
Two serial-attached power
switches
Power switches enable each member to power-cycle the other
member before restarting its services. The power cable for each
member is connected to its own power switch. Note that
serial-attach power switches are supported in two-member
clusters only.
Two null modem cables Null modem cables connect a serial port on each member to the
power switch that provides power to the other member. This
connection enables each member to power-cycle the other
member.
FlashDisk RAID Disk Array
with dual controllers
Dual RAID controllers protect against disk and controller failure.
The RAID controllers provide simultaneous access to all the
logical units on the host ports.
Chapter 2. Hardware Installation and Operating System Conguration 9
Hardware Description
Two HD68 SCSI cables HD68 cables connect each host bus adapter to a RAID enclosure
"in" port, creating two single-initiator SCSI buses.
Two terminators Terminators connected to each "out" port on the RAID enclosure
terminate both single-initiator SCSI buses.
Redundant UPS Systems UPS systems provide a highly-available source of power. The
power cables for the power switches and the RAID enclosure are
connected to two UPS systems.
Table 2-3. Example of a No Single Point of Failure Conguration
Figure 2-1 shows an example of a no single point of failure hardware conguration that includes the
previously-described hardware, two single-initiator SCSI buses, and power switches to guarantee data
integrity under all error conditions. A "T" enclosed in a circle represents a SCSI terminator.
Figure 2-1. No Single Point of Failure Conguration Example
Cluster hardware congurations can also include other optional hardware components that are common
in a computing environment. For example, a cluster can include a network switch or network
hub, which enables the connection of the members to a network. A cluster may also include a console
10 Chapter 2. Hardware Installation and Operating System Conguration
switch, which facilitates the management of multiple members and eliminates the need for separate
monitors, mouses, and keyboards for each member.
One type of console switch is a terminal server, which enables connection to serial consoles and
management of many members from one remote location. As a low-cost alternative, you can use a
KVM (keyboard, video, and mouse) switch, which enables multiple members to share one keyboard,
monitor, and mouse. A KVM is suitable for congurations in which access to a graphical user interface
(GUI) to perform system management tasks is preferred.
When choosing a system, be sure that it provides the required PCI slots, network slots, and serial
ports. For example, a no single point of failure conguration requires multiple bonded Ethernet ports.
Refer to Section 2.2.1 Installing the Basic Cluster Hardware for more information.
2.1.3. Choosing the Type of Power Controller
The Red Hat Cluster Manager implementation consists of a generic power management layer and a set
of device-specic modules which accommodate a range of power management types. When selecting
the appropriate type of power controller to deploy in the cluster, it is important to recognize the
implications of specic device types. The following describes the types of supported power switches
followed by a summary table. For a more detailed description of the role a power switch plays to
ensure data integrity, refer to Section 2.4.2 Conguring Power Switches.
Serial-attached and network-attached power switches are separate devices which enable one cluster
member to power cycle another member. They resemble a power plug strip on which individual outlets
can be turned on and off under software control through either a serial or network cable. Networkattached
power switches differ from serial-attached in that they connect to cluster members via an
Ethernet hub or switch, rather than direct connection to cluster members. A network-attached power
switch can not be directly attached to a cluster member using a crossover cable, as the power switch
would be unable to power cycle the other members.
Watchdog timers provide a means for failed members to remove themselves from the cluster prior
to another member taking over its services, rather than allowing one cluster member to power cycle
another. The normal operational mode for watchdog timers is that the cluster software must periodically
reset a timer prior to its expiration. If the cluster software fails to reset the timer, the watchdog
triggers under the assumption that the member may have hung or otherwise failed. The healthy cluster
member allows a window of time to pass prior to concluding that another cluster member has failed
(by default, this window is 12 seconds). The watchdog timer interval must be less than the duration of
time for one cluster member to conclude that another has failed. In this manner, a healthy member can
assume, prior to taking over services, that the failed cluster member has safely removed itself from
the cluster (by rebooting) and is no longer a risk to data integrity. The underlying watchdog support
is included in the core Linux kernel. Red Hat Cluster Manager utilizes these watchdog features via its
standard APIs and conguration mechanism.
There are two types of watchdog timers: hardware-based and software-based. Hardware-based watchdog
timers typically consist of system board components such as the Intel® i810 TCO chipset. This
circuitry has a high degree of independence from the main system CPU. This independence is bene-
cial in failure scenarios of a true system hang, as in this case it pulls down the system's reset lead
resulting in a system reboot. Some PCI expansion cards provide watchdog features.
Software-based watchdog timers do not have any dedicated hardware. The implementation is a kernel
thread which is periodically run; if the timer duration has expired, the thread initiates a system reboot.
The vulnerability of the software watchdog timer is that under certain failure scenarios, such as system
hangs while interrupts are blocked, the kernel thread is not called. As a result, in such conditions it
cannot be denitively depended on for data integrity. This can cause the healthy cluster member to
take over services for a hung member which could cause data corruption under certain scenarios.
Finally, administrators can choose not to employ a power controller at all. When a power controller
is not in use, no provision exists for a cluster member to power cycle a failed member. Similarly, the
failed member cannot be guaranteed to reboot itself under all failure conditions.
Chapter 2. Hardware Installation and Operating System Conguration 11
Important
Use of a power controller is strongly recommended as part of a production cluster environment.
Conguration of a cluster without a power controller is not supported.
Ultimately, the right type of power controller deployed in a cluster environment depends on the data
integrity requirements weighed against the cost and availability of external power switches.
Table 2-4 summarizes the types of supported power management modules and discusses their advantages
and disadvantages individually.
Type Notes Pros Cons
Serial-attached
power switches
(supported for
two-member
clusters only)
Two serial attached
power controllers are
used in a cluster (one per
member)
Affords strong data
integrity guarantees ?
the power controller
itself is not a single
point of failure as there
are two in a cluster
Requires purchase of
power controller
hardware and cables;
consumes serial ports;
can only be used in
two-member cluster
Network-attached
power switches
A single network
attached power
controller is required per
cluster (depending on
the number of
members); however, up
to three are supported
for each cluster member
Affords strong data
integrity guarantees and
can be used in clusters
with more than two
members
Requires purchase of
power controller
hardware ? the power
controller itself can
become a single point of
failure (although they
are typically very
reliable devices)
Hardware
Watchdog Timer
Affords strong data
integrity guarantees
Obviates the need to
purchase external power
controller hardware
Not all systems include
supported watchdog
hardware
Software
Watchdog Timer
Offers acceptable data
integrity provisions
Obviates the need to
purchase external power
controller hardware;
works on any system
Under some failure
scenarios, the software
watchdog is not
operational, opening a
small vulnerability
window
No power
controller
No power controller
function is in use
Obviates the need to
purchase external power
controller hardware;
works on any system
Vulnerable to data
corruption under certain
failure scenarios
Table 2-4. Power Switches
2.1.4. Cluster Hardware Components
Use the following tables to identify the hardware components required for the cluster conguration.
Table 2-5 includes the hardware required for the cluster members.
12 Chapter 2. Hardware Installation and Operating System Conguration
Hardware Quantity Description Required
Cluster
members
eight
(maximum
supported)
Each member must provide enough PCI slots,
network slots, and serial ports for the cluster
hardware conguration. Because disk devices
must have the same name on each member, it is
recommended that the members have symmetric
I/O subsystems. It is also recommended that the
processor speed and amount of system memory
be adequate for the processes run on the cluster
members. Consult the Red Hat Enterprise Linux
3 Release Notes for specics. Refer to Section
2.2.1 Installing the Basic Cluster Hardware for
more information.
Yes
Table 2-5. Cluster Member Hardware
Table 2-6 includes several different types of power switches.
A single cluster requires only one type of power switch.
Hardware Quantity Description Required
Serial power
switches
Two In a two-member cluster, use serial power
switches to enable each cluster member to
power-cycle the other member. Refer to Section
2.4.2 Conguring Power Switches for more
information. Note, cluster members are
congured with either serial power switches
(supported for two-member clusters only) or
network-attached power switches, but not both.
Strongly
recommended
for data
integrity under
all failure
conditions
Null modem
cable
Two Null modem cables connect a serial port on a
cluster member to a serial power switch. This
enables each member to power-cycle the other
member. Some power switches may require
different cables.
Only if using
serial power
switches
Mounting
bracket
One Some power switches support rack mount
congurations and require a separate mounting
bracket.
Only for rack
mounting
power
switches
Network
power switch
One (depends
on member
count)
Network-attached power switches enable each
cluster member to power cycle all others. Refer
to Section 2.4.2 Conguring Power Switches for
more information.
Strongly
recommended
for data
integrity under
all failure
conditions
Watchdog
Timer
One per
member
Watchdog timers cause a failed cluster member
to remove itself from a cluster prior to a healthy
member taking over its services. Refer to
Section 2.4.2 Conguring Power Switches for
more information.
Recommended
for data
integrity on
systems which
provide
integrated
watchdog
hardware
Chapter 2. Hardware Installation and Operating System Conguration 13
Table 2-6. Power Switch Hardware Table
Table 2-8 through Table 2-10 show a variety of hardware components for an administrator to choose
from. An individual cluster does not require all of the components listed in these tables.
Hardware Quantity Description Required
Network
interface
One for each
network
connection
Each network connection requires a network
interface installed in a member.
Yes
Network
switch or hub
One A network switch or hub allows connection of
multiple members to a network.
No
Network cable One for each
network
interface
A conventional network cable, such as a cable
with an RJ45 connector, connects each network
interface to a network switch or a network hub.
Yes
Table 2-7. Network Hardware Table
Hardware Quantity Description Required
Host bus
adapter
One per
member
To connect to shared disk storage, install either
a parallel SCSI or a Fibre Channel host bus
adapter in a PCI slot in each cluster member.
For parallel SCSI, use a low voltage
differential (LVD) host bus adapter. Adapters
have either HD68 or VHDCI connectors.
Host-bus adapter based RAID cards are only
supported if they correctly support multi-host
operation. At the time of publication, there were
no fully tested host-bus adapter based RAID
cards.
Yes
External disk
storage
enclosure
At least one Use Fibre Channel or single-initiator parallel
SCSI to connect the cluster members to a
single or dual-controller RAID array. To use
single-initiator buses, a RAID controller must
have multiple host ports and provide
simultaneous access to all the logical units on
the host ports. To use a dual-controller RAID
array, a logical unit must fail over from one
controller to the other in a way that is
transparent to the OS.
SCSI RAID arrays that provide simultaneous
access to all logical units on the host ports are
recommended.
To ensure symmetry of device IDs and LUNs,
many RAID arrays with dual redundant
controllers must be congured in an
active/passive mode.
Refer to Section 2.4.4 Conguring Shared Disk
Storage for more information.
Yes
14 Chapter 2. Hardware Installation and Operating System Conguration
Hardware Quantity Description Required
SCSI cable One per
member
SCSI cables with 68 pins connect each host bus
adapter to a storage enclosure port. Cables have
either HD68 or VHDCI connectors. Cables vary
based on adapter type.
Only for
parallel SCSI
congurations
SCSI
terminator
As required by
hardware
conguration
For a RAID storage enclosure that uses "out"
ports (such as FlashDisk RAID Disk Array) and
is connected to single-initiator SCSI buses,
connect terminators to the "out" ports to
terminate the buses.
Only for
parallel SCSI
congurations
and only as
necessary for
termination
Fibre Channel
hub or switch
One or two A Fibre Channel hub or switch may be required. Only for some
Fibre Channel
congurations
Fibre Channel
cable
As required by
hardware
conguration
A Fibre Channel cable connects a host bus
adapter to a storage enclosure port, a Fibre
Channel hub, or a Fibre Channel switch. If a hub
or switch is used, additional cables are needed to
connect the hub or switch to the storage adapter
ports.
Only for Fibre
Channel
congurations
Table 2-8. Shared Disk Storage Hardware Table
Hardware Quantity Description Required
Network
interface
Two for each
member
Each Ethernet connection requires
a network interface card installed
on all cluster members.
No
Network
crossover cable
One for each
channel
A network crossover cable
connects a network interface on
one member to a network interface
on other cluster members, creating
an Ethernet connection for
communicating heartbeat.
Only for a redundant
Ethernet connection (use
of channel-bonded
Ethernet connection is
preferred)
Table 2-9. Point-To-Point Ethernet Connection Hardware Table
Hardware Quantity Description Required
UPS system One or more Uninterruptible power supply (UPS) systems
protect against downtime if a power outage
occurs. UPS systems are highly recommended
for cluster operation. Connect the power cables
for the shared storage enclosure and both power
switches to redundant UPS systems. Note, a
UPS system must be able to provide voltage for
an adequate period of time, and should be
connected to its own power circuit.
Strongly
recommended
for availability
Table 2-10. UPS System Hardware Table
Chapter 2. Hardware Installation and Operating System Conguration 15
Hardware Quantity Description Required
Terminal
server
One A terminal server enables you to manage many
members from one remote location.
No
KVM One A KVM enables multiple members to share one
keyboard, monitor, and mouse. Cables for
connecting members to the switch depend on the
type of KVM.
No
Table 2-11. Console Switch Hardware Table
2.2. Setting Up the Members
After identifying the cluster hardware components described in Section 2.1 Choosing a Hardware
Conguration, set up the basic cluster hardware and connect the members to the optional console
switch and network switch or hub. Follow these steps:
1. In all members, install the required network adapters and host bus adapters. Refer to Section
2.2.1 Installing the Basic Cluster Hardware for more information about performing this task.
2. Set up the optional console switch and connect it to each member. Refer to Section 2.2.2 Setting
Up a Console Switch for more information about performing this task.
If a console switch is not used, then connect each member to a console terminal.
3. Set up the optional network switch or hub and use conventional network cables to connect it to
the members and the terminal server (if applicable). Refer to Section 2.2.3 Setting Up a Network
Switch or Hub for more information about performing this task.
If a network switch or hub is not used, then conventional network cables should be used to
connect each member and the terminal server (if applicable) to a network.
After performing the previous tasks, install Red Hat Enterprise Linux as described in Section 2.3
Installing and Conguring Red Hat Enterprise Linux.
2.2.1. Installing the Basic Cluster Hardware
Members must provide the CPU processing power and memory required by applications.
In addition, members must be able to accommodate the SCSI or Fibre Channel adapters, network
interfaces, and serial ports that the hardware conguration requires. Systems have a limited number of
pre-installed serial and network ports and PCI expansion slots. Table 2-12 helps determine how much
capacity the member systems employed require.
Cluster Hardware Component Serial
Ports
Network
Slots
PCI
Slots
SCSI or Fibre Channel adapter to shared disk storage One for
each bus
adapter
16 Chapter 2. Hardware Installation and Operating System Conguration
Cluster Hardware Component Serial
Ports
Network
Slots
PCI
Slots
Network connection for client access and Ethernet heartbeat
pings
One for
each
network
connection
Point-to-point Ethernet connection for 2-node clusters
(optional)
One for
each connection
Terminal server connection (optional) One
Table 2-12. Installing the Basic Cluster Hardware
Most systems come with at least one serial port. If a system has graphics display capability, it is
possible to use the serial console port for a power switch connection. To expand your serial port
capacity, use multi-port serial PCI cards. For multiple-member clusters, use a network power switch.
Also, ensure that local system disks are not on the same SCSI bus as the shared disks. For example,
use two-channel SCSI adapters, such as the Adaptec 39160-series cards, and put the internal devices
on one channel and the shared disks on the other channel. Using multiple SCSI cards is also possible.
Refer to the system documentation supplied by the vendor for detailed installation information. Refer
to Appendix D Supplementary Hardware Information for hardware-specic information about using
host bus adapters in a cluster.
Figure 2-2 shows the bulkhead of a sample member and the external cable connections for a typical
cluster conguration.
Figure 2-2. Typical External Cabling for a Cluster Member
Chapter 2. Hardware Installation and Operating System Conguration 17
2.2.2. Setting Up a Console Switch
Although a console switch is not required for cluster operation, it can be used to facilitate member
management and eliminate the need for separate monitors, mouses, and keyboards for each cluster
member. There are several types of console switches.
For example, a terminal server enables connection to serial consoles and management of many members
from a remote location. For a low-cost alternative, use a KVM (keyboard, video, and mouse)
switch, which enables multiple members to share one keyboard, monitor, and mouse. A KVM switch
is suitable for congurations in which GUI access to perform system management tasks is preferred.
Set up the console switch according to the documentation provided by the vendor.
After the console switch has been set up, connect it to each cluster member. The cables used depend
on the type of console switch. For example, a Cyclades terminal server uses RJ45 to DB9 crossover
cables to connect a serial port on each member to the terminal server.
2.2.3. Setting Up a Network Switch or Hub
A network switch or hub, although not required for operating a two-node cluster, can be used to
facilitate cluster and client system network operations. Clusters of more than two members require a
switch or hub.
Set up a network switch or hub according to the documentation provided by the vendor.
After setting up the network switch or hub, connect it to each member by using conventional network
cables. A terminal server, if used, is connected to the network switch or hub through a network cable.
2.3. Installing and Conguring Red Hat Enterprise Linux
After the setup of basic cluster hardware, proceed with installation of Red Hat Enterprise Linux on
each member and ensure that all systems recognize the connected devices. Follow these steps:
1. Install Red Hat Enterprise Linux on all cluster members. Refer to Red Hat Enterprise Linux
Installation Guide for instructions.
In addition, when installing Red Hat Enterprise Linux, it is strongly recommended to do the
following:
? Gather the IP addresses for the members and for the bonded Ethernet ports, before installing
Red Hat Enterprise Linux. Note that the IP addresses for the bonded Ethernet ports can be
private IP addresses, (for example, 10.x.x.x).
? Do not place local le systems (such as /, /etc, /tmp, and /var) on shared disks or on the
same SCSI bus as shared disks. This helps prevent the other cluster members from accidentally
mounting these le systems, and also reserves the limited number of SCSI identication
numbers on a bus for cluster disks.
? Place /tmp and /var on different le systems. This may improve member performance.
? When a member boots, be sure that the member detects the disk devices in the same order in
which they were detected during the Red Hat Enterprise Linux installation. If the devices are
not detected in the same order, the member may not boot.
? When using RAID storage congured with Logical Unit Numbers (LUNs) greater than zero,
it is necessary to enable LUN support by adding the following to /etc/modules.conf:
options scsi_mod max_scsi_luns=255
18 Chapter 2. Hardware Installation and Operating System Conguration
After modifying modules.conf, it is necessary to rebuild the initial ram disk using
mkinitrd. Refer to the Red Hat Enterprise Linux System Administration Guide for more
information about creating ramdisks using mkinitrd.
2. Reboot the members.
3. When using a terminal server, congure Red Hat Enterprise Linux to send console messages to
the console port.
4. Edit the /etc/hosts le on each cluster member and include the IP addresses used in the
cluster or ensure that the addresses are in DNS. Refer to Section 2.3.1 Editing the /etc/hosts
File for more information about performing this task.
5. Decrease the alternate kernel boot timeout limit to reduce boot time for members. Refer to
Section 2.3.2 Decreasing the Kernel Boot Timeout Limit for more information about performing
this task.
6. Ensure that no login (or getty) programs are associated with the serial ports that are being
used for the remote power switch connection (if applicable). To perform this task, edit the
/etc/inittab le and use a hash symbol (#) to comment out the entries that correspond
to the serial ports used for the remote power switch. Then, invoke the init q command.
7. Verify that all systems detect all the installed hardware:
? Use the dmesg command to display the console startup messages. Refer to Section 2.3.3
Displaying Console Startup Messages for more information about performing this task.
? Use the cat /proc/devices command to display the devices congured in the kernel.
Refer to Section 2.3.4 Displaying Devices Congured in the Kernel for more information
about performing this task.
8. Verify that the members can communicate over all the network interfaces by using the ping
command to send test packets from one member to another.
9. If intending to congure Samba services, verify that the required RPM packages for Samba
services are installed.
2.3.1. Editing the /etc/hosts File
The /etc/hosts le contains the IP address-to-hostname translation table. The /etc/hosts le on
each member must contain entries for the following:
? IP addresses and associated hostnames for all cluster members
? IP addresses and associated hostnames for the point-to-point Ethernet heartbeat connections (these
can be private IP addresses)
As an alternative to the /etc/hosts le, naming services such as DNS or NIS can be used to de-
ne the host names used by a cluster. However, to limit the number of dependencies and optimize
availability, it is strongly recommended to use the /etc/hosts le to dene IP addresses for cluster
network interfaces.
The following is an example of an /etc/hosts le on a member:
127.0.0.1 localhost.localdomain localhost
193.186.1.81 cluster2.example.com cluster2
10.0.0.1 ecluster2.example.com ecluster2
193.186.1.82 cluster3.example.com cluster3
10.0.0.2 ecluster3.example.com ecluster3
Chapter 2. Hardware Installation and Operating System Conguration 19
The previous example shows the IP addresses and hostnames for two members (cluster2 and cluster3),
and the private IP addresses and hostnames for the Ethernet interface (ecluster2 and ecluster3) used
for the point-to-point heartbeat connection on each member.
Verify correct formatting of the local host entry in the /etc/hosts le to ensure that it does not
include non-local systems in the entry for the local host. An example of an incorrect local host entry
that includes a non-local system (server1) is shown next:
127.0.0.1 localhost.localdomain localhost server1
An Ethernet connection may not operate properly if the format of the /etc/hosts le is not correct.
Check the /etc/hosts le and correct the le format by removing non-local systems from the local
host entry, if necessary.
Note that each network adapter must be congured with the appropriate IP address and netmask.
The following example shows a portion of the output from the /sbin/ifconfig command on a
cluster member:
eth0 Link encap:Ethernet HWaddr 00:00:BC:11:76:93
inet addr:192.186.1.81 Bcast:192.186.1.245 Mask:255.255.255.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:65508254 errors:225 dropped:0 overruns:2 frame:0
TX packets:40364135 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
Interrupt:19 Base address:0xfce0
eth1 Link encap:Ethernet HWaddr 00:00:BC:11:76:92
inet addr:10.0.0.1 Bcast:10.0.0.245 Mask:255.255.255.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:100
Interrupt:18 Base address:0xfcc0
The previous example shows two network interfaces on a cluster member: eth0 (the network interface
for the member) and eth1 (the network interface for the point-to-point Ethernet connection).
You may also add the IP addresses for the cluster members to your DNS server. Refer to the Red
Hat Enterprise Linux Reference Guide for information on conguring DNS, or consult your network
administrator.
2.3.2. Decreasing the Kernel Boot Timeout Limit
It is possible to reduce the boot time for a member by decreasing the kernel boot timeout limit. During
the Red Hat Enterprise Linux boot sequence, the boot loader allows for specifying an alternate kernel
to boot. The default timeout limit for specifying a kernel is ten seconds.
To modify the kernel boot timeout limit for a member, edit the appropriate les as follows:
When using the GRUB boot loader, the timeout parameter in /boot/grub/grub.conf should be
modied to specify the appropriate number of seconds for the timeout parameter. To set this interval
to 3 seconds, edit the parameter to the following:
timeout = 3
When using the LILO or ELILO boot loaders, edit the /etc/lilo.conf le (on x86 systems) or the
elilo.conf le (on Itanium systems) and specify the desired value (in tenths of a second) for the
timeout parameter. The following example sets the timeout limit to three seconds:
20 Chapter 2. Hardware Installation and Operating System Conguration
timeout = 30
To apply any changes made to the /etc/lilo.conf le, invoke the /sbin/lilo command.
On an Itanium system, to apply any changes made to the /boot/efi/efi/redhat/elilo.conf
le, invoke the /sbin/elilo command.
2.3.3. Displaying Console Startup Messages
Use the dmesg command to display the console startup messages. Refer to the dmesg(8) man page
for more information.
The following example of output from the dmesg command shows that two external SCSI buses and
nine disks were detected on the member. (Lines with backslashes display as one line on most screens):
May 22 14:02:10 storage3 kernel: scsi0 : Adaptec AHA274x/284x/294x \
(EISA/VLB/PCI-Fast SCSI) 5.1.28/3.2.4
May 22 14:02:10 storage3 kernel:
May 22 14:02:10 storage3 kernel: scsi1 : Adaptec AHA274x/284x/294x \
(EISA/VLB/PCI-Fast SCSI) 5.1.28/3.2.4
May 22 14:02:10 storage3 kernel:
May 22 14:02:10 storage3 kernel: scsi : 2 hosts.
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST39236LW Rev: 0004
May 22 14:02:11 storage3 kernel: Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdb at scsi1, channel 0, id 0, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdc at scsi1, channel 0, id 1, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdd at scsi1, channel 0, id 2, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sde at scsi1, channel 0, id 3, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdf at scsi1, channel 0, id 8, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdg at scsi1, channel 0, id 9, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdh at scsi1, channel 0, id 10, lun 0
May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001
May 22 14:02:11 storage3 kernel: Detected scsi disk sdi at scsi1, channel 0, id 11, lun 0
May 22 14:02:11 storage3 kernel: Vendor: Dell Model: 8 BAY U2W CU Rev: 0205
May 22 14:02:11 storage3 kernel: Type: Processor \
ANSI SCSI revision: 03
May 22 14:02:11 storage3 kernel: scsi1 : channel 0 target 15 lun 1 request sense \
failed, performing reset.
May 22 14:02:11 storage3 kernel: SCSI bus is being reset for host 1 channel 0.
May 22 14:02:11 storage3 kernel: scsi : detected 9 SCSI disks total.
The following example of the dmesg command output shows that a quad Ethernet card was detected
on the member:
May 22 14:02:11 storage3 kernel: 3c59x.c:v0.99H 11/17/98 Donald Becker
May 22 14:02:11 storage3 kernel: tulip.c:v0.91g-ppc 7/16/99 becker@cesdis.gsfc.nasa.gov
May 22 14:02:11 storage3 kernel: eth0: Digital DS21140 Tulip rev 34 at 0x9800, \
00:00:BC:11:76:93, IRQ 5.
May 22 14:02:12 storage3 kernel: eth1: Digital DS21140 Tulip rev 34 at 0x9400, \
00:00:BC:11:76:92, IRQ 9.
May 22 14:02:12 storage3 kernel: eth2: Digital DS21140 Tulip rev 34 at 0x9000, \
00:00:BC:11:76:91, IRQ 11.
May 22 14:02:12 storage3 kernel: eth3: Digital DS21140 Tulip rev 34 at 0x8800, \
00:00:BC:11:76:90, IRQ 10.
Chapter 2. Hardware Installation and Operating System Conguration 21
2.3.4. Displaying Devices Congured in the Kernel
To be sure that the installed devices, including serial and network interfaces, are congured in the kernel,
use the cat /proc/devices command on each member. Use this command to also determine
if there is raw device support installed on the member. For example:
Character devices:
1 mem
2 pty
3 ttyp
4 ttyS
5 cua
7 vcs
10 misc
19 ttyC
20 cub
128 ptm
136 pts
162 raw
Block devices:
2 fd
3 ide0
8 sd
65 sd
The previous example shows:
? Onboard serial ports (ttyS)
? Serial expansion card (ttyC)
? Raw devices (raw)
? SCSI devices (sd)
2.4. Setting Up and Connecting the Cluster Hardware
After installing Red Hat Enterprise Linux, set up the cluster hardware components and verify the
installation to ensure that the members recognize all the connected devices. Note that the exact steps
for setting up the hardware depend on the type of conguration. Refer to Section 2.1 Choosing a
Hardware Conguration for more information about cluster congurations.
To set up the cluster hardware, follow these steps:
1. Shut down the members and disconnect them from their power source.
2. Set up the bonded Ethernet channels, if applicable. Refer to Section 2.4.1 Conguring Ethernet
Channel Bonding for more information.
3. When using power switches, set up the switches and connect each member to a power switch.
Refer to Section 2.4.2 Conguring Power Switches for more information.
In addition, it is recommended to connect each power switch (or each member's power cord if
not using power switches) to a different UPS system. Refer to Section 2.4.3 Conguring UPS
Systems for information about using optional UPS systems.
4. Set up the shared disk storage according to the vendor instructions and connect the members to
the external storage enclosure. Refer to Section 2.4.4 Conguring Shared Disk Storage for more
information about performing this task.
22 Chapter 2. Hardware Installation and Operating System Conguration
In addition, it is recommended to connect the storage enclosure to redundant UPS systems.
Refer to Section 2.4.3 Conguring UPS Systems for more information about using optional
UPS systems.
5. Turn on power to the hardware, and boot each cluster member. During the boot-up process, enter
the BIOS utility to modify the member setup, as follows:
? Ensure that the SCSI identication number used by the HBA is unique for the SCSI bus it
is attached to. Refer to Section D.5 SCSI Identication Numbers for more information about
performing this task.
? Enable or disable the onboard termination for each host bus adapter, as required by the storage
conguration. Refer to Section 2.4.4 Conguring Shared Disk Storage and Section D.3 SCSI
Bus Termination for more information about performing this task.
? Enable the member to automatically boot when it is powered on.
6. Exit from the BIOS utility, and continue to boot each member. Examine the startup messages
to verify that the Red Hat Enterprise Linux kernel has been congured and can recognize the
full set of shared disks. Use the dmesg command to display console startup messages. Refer
to Section 2.3.3 Displaying Console Startup Messages for more information about using the
dmesg command.
7. Verify that the members can communicate over each point-to-point Ethernet connection by using
the ping command to send packets over each network interface.
8. Set up the shared cluster partitions on the shared disk storage. Refer to Section 2.4.4.3 Conguring
Shared Cluster Partitions for more information about performing this task.
2.4.1. Conguring Ethernet Channel Bonding
Ethernet channel bonding in a no-single-point-of-failure cluster system allows for a fault tolerant
network connection by combining two Ethernet devices into one virtual device. The resulting channel
bonded interface ensures that in the event that one Ethernet device fails, the other device will become
active. This type of channel bonding, called an active-backup policy allows connection of both bonded
devices to one switch or can allow each Ethernet device to be connected to separate hubs or switches,
which eliminates the single point of failure in the network hub/switch.
Channel bonding requires each cluster member to have two Ethernet devices installed. When it is
loaded, the bonding module uses the MAC address of the rst enslaved network device and assigns
that MAC address to the other network device if the rst device fails link detection.
To congure two network devices for channel bonding, perform the following:
1. Create a bonding devices in /etc/modules.conf. For example:
alias bond0 bonding
options bonding miimon=100 mode=1
This loads the bonding device with the bond0 interface name, as well as passes options to
the bonding driver to congure it as an active-backup master device for the enslaved network
interfaces.
2. Edit the /etc/sysconfig/network-scripts/ifcfg-ethX conguration le for both eth0
and eth1 so that the les show identical contents. For example:
DEVICE=ethX
USERCTL=no
ONBOOT=yes
MASTER=bond0
SLAVE=yes
BOOTPROTO=none
Chapter 2. Hardware Installation and Operating System Conguration 23
This will enslave ethX (replace X with the assigned number of the Ethernet devices) to the
bond0 master device.
3. Create a network script for the bonding device (for example,
/etc/sysconfig/network-scripts/ifcfg-bond0), which would appear like the
following example:
DEVICE=bond0
USERCTL=no
ONBOOT=yes
BROADCAST=192.168.1.255
NETWORK=192.168.1.0
NETMASK=255.255.255.0
GATEWAY=192.168.1.1
IPADDR=192.168.1.10
4. Reboot the system for the changes to take effect. Alternatively, manually load the bonding device
and restart the network. For example:
/sbin/insmod /lib/modules/`uname -r`/kernel/drivers/net/bonding/bonding.o \
miimon=100 mode=1
/sbin/service network restart
For more information about channel bonding, refer to the high-availability section of the Linux Ethernet
Bonding Driver Mini-Howto, available in:
/usr/src/linux-2.4/Documentation/networking/bonding.txt
Note
You must have the kernel-source package installed in order to view the Linux Ethernet Bonding
Driver Mini-Howto.
2.4.2. Conguring Power Switches
Power switches enable a member to power-cycle another member before restarting its services as part
of the failover process. The ability to remotely disable a member ensures data integrity is maintained
under any failure condition. It is recommended that production environments use power switches
or watchdog timers in the cluster conguration. Only development (test) environments should use a
conguration without power switches. Refer to Section 2.1.3 Choosing the Type of Power Controller
for a description of the various types of power switches. Note that within this section, the general term
"power switch" also includes watchdog timers.
In a cluster conguration that uses physical power switches, each member's power cable is connected
to a power switch through either a serial or network connection (depending on switch type). When
failover occurs, a member can use this connection to power-cycle another member before restarting
its services.
Power switches protect against data corruption if an unresponsive (or hanging) member becomes
responsive after its services have failed over, and issues I/O to a disk that is also receiving I/O from
another member. In addition, if a quorum daemon fails on a member, the member is no longer able to
monitor the shared cluster partitions. If power switches or watchdog timers are not used in the cluster,
then this error condition may result in services being run on more than one member, which can cause
data corruption and possibly system crashes.
It is strongly recommended to use power switches in a cluster. However, administrators who are aware
of the risks may choose to set up a cluster without power switches.
24 Chapter 2. Hardware Installation and Operating System Conguration
A member may hang for a few seconds if it is swapping or has a high system workload. For this reason,
adequate time is allowed prior to concluding another member has failed (typically 15 seconds).
If a member determines that a hung member is down, and power switches are used in the cluster,
that member power-cycles the hung member before restarting its services. Clusters congured to use
watchdog timers self-reboot under most system hangs. This causes the hung member to reboot in a
clean state and prevent it from issuing I/O and corrupting service data.
Hung members reboot themselves either due to a watchdog ring, failure to send heartbeat packets,
or ? in the case a member has no physical power switch ? loss of quorum status.
Hung members may be rebooted by other members if they are attached to a power switch. If the hung
member never becomes responsive and no power switches are in use, then a manual reboot is required.
When used, power switches must be set up according to the vendor instructions. However, some
cluster-specic tasks may be required to use a power switch in the cluster. Refer to Section D.1 Setting
Up Power Controllers for detailed information on power switches (including information about watchdog
timers). Be sure to take note of any caveats or functional attributes of specic power switches.
Note that the cluster-specic information provided in this manual supersedes the vendor information.
When cabling power switches, take special care to ensure that each cable is plugged into the appropriate
outlet. This is crucial because there is no independent means for the software to verify correct
cabling. Failure to cable correctly can lead to an incorrect member being power cycled, or for one
member to inappropriately conclude that it has successfully power cycled another cluster member.
After setting up the power switches, perform these tasks to connect them to the members:
1. Connect the power cable for each member to a power switch.
2. Connect each member to the power switch. The cable used for the connection depends on the
type of power switch. Serial-attached power switches use null modem cables, while a networkattached
power switches require an Ethernet patch cable.
3. Connect the power cable for each power switch to a power source. It is recommended to connect
each power switch to a different UPS system. Refer to Section 2.4.3 Conguring UPS Systems
for more information.
After the installation of the cluster software, test the power switches to ensure that each member can
power-cycle the other member before starting the cluster. Refer to Section 3.11.2 Testing the Power
Switches for information.
2.4.3. Conguring UPS Systems
Uninterruptible power supplies (UPS) provide a highly-available source of power. Ideally, a redundant
solution should be used that incorporates multiple UPS systems (one per server). For maximal faulttolerance,
it is possible to incorporate two UPS systems per server as well as APC Automatic Transfer
Switches to manage the power and shutdown management of the server. Both solutions are solely
dependent on the level of availability desired.
It is not recommended to use a single UPS infrastructure as the sole source of power for the cluster. A
UPS solution dedicated to the cluster is more exible in terms of manageability and availability.
A complete UPS system must be able to provide adequate voltage and current for a prolonged period
of time. While there is no single UPS to t every power requirement, a solution can be tailored to t
a particular conguration.
If the cluster disk storage subsystem has two power supplies with separate power cords, set up
two UPS systems, and connect one power switch (or one member's power cord if not using power
switches) and one of the storage subsystem's power cords to each UPS system. A redundant UPS
system conguration is shown in Figure 2-3.
Chapter 2. Hardware Installation and Operating System Conguration 25
Figure 2-3. Redundant UPS System Conguration
An alternative redundant power conguration is to connect the power switches (or the members'
power cords) and the disk storage subsystem to the same UPS system. This is the most cost-effective
conguration, and provides some protection against power failure. However, if a power outage occurs,
the single UPS system becomes a possible single point of failure. In addition, one UPS system may
not be able to provide enough power to all the attached devices for an adequate amount of time. A
single UPS system conguration is shown in Figure 2-4.
Figure 2-4. Single UPS System Conguration
Many vendor-supplied UPS systems include Red Hat Enterprise Linux applications that monitor the
operational status of the UPS system through a serial port connection. If the battery power is
low, the monitoring software initiates a clean system shutdown. As this occurs, the cluster
software is properly stopped, because it is controlled by a SysV runlevel script (for example,
/etc/rc.d/init.d/clumanager).
Refer to the UPS documentation supplied by the vendor for detailed installation information.
2.4.4. Conguring Shared Disk Storage
In a cluster, shared disk storage is used to hold service data and two partitions (primary and shadow)
that store cluster state information. Because this storage must be available to all members, it cannot be
26 Chapter 2. Hardware Installation and Operating System Conguration
located on disks that depend on the availability of any one member. Refer to the vendor documentation
for detailed product and installation information.
There are some factors to consider when setting up shared disk storage in a cluster:
? External RAID
It is strongly recommended to use use RAID 1 (mirroring) to make service data and the shared cluster
partitions highly available. Optionally, parity RAID can also be employed for high-availability.
Do not use RAID 0 (striping) alone for shared partitions because this reduces storage availability.
? Multi-initiator SCSI congurations
Multi-initiator SCSI congurations are not supported due to the difculty in obtaining proper bus
termination.
? The Red Hat Enterprise Linux device name for each shared storage device must be the same on
each member. For example, a device named /dev/sdc on one member must be named /dev/sdc
on the other cluster members. Using identical hardware for all members usually ensures that these
devices are named the same.
? A disk partition can be used by only one cluster service.
? Do not include any le systems used in a cluster service in the member's local /etc/fstab les,
because the cluster software must control the mounting and unmounting of service le systems.
? For optimal performance of shared le systems, make sure to specify a 4 KB block size with the
-b option to mke2fs. A smaller block size can cause long fsck times. Refer to Section 2.4.4.6
Creating File Systems.
The following list details parallel SCSI requirements, and must be adhered to when parallel SCSI
buses are employed in a cluster environment:
? SCSI buses must be terminated at each end, and must adhere to length and hot plugging restrictions.
? Devices (disks, host bus adapters, and RAID controllers) on a SCSI bus must have a unique SCSI
identication number.
Refer to Section D.2 SCSI Bus Conguration Requirements for more information.
It is strongly recommended to connect the storage enclosure to redundant UPS systems for a highlyavailable
source of power. Refer to Section 2.4.3 Conguring UPS Systems for more information.
Refer to Section 2.4.4.1 Setting Up a Single-initiator SCSI Bus and Section 2.4.4.2 Setting Up a Fibre
Channel Interconnect for more information about conguring shared storage.
After setting up the shared disk storage hardware, partition the disks and then either create le systems
or raw devices on the partitions. Two raw devices must be created for the primary and the
shadow shared cluster partitions. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions, Section
2.4.4.4 Partitioning Disks, Section 2.4.4.5 Creating Raw Devices, and Section 2.4.4.6 Creating
File Systems for more information.
2.4.4.1. Setting Up a Single-initiator SCSI Bus
A single-initiator SCSI bus has only one member connected to it, and provides host isolation and better
performance than a multi-initiator bus. Single-initiator buses ensure that each member is protected
from disruptions due to the workload, initialization, or repair of the other members.
When using a single- or dual-controller RAID array that has multiple host ports and provides simultaneous
access to all the shared logical units from the host ports on the storage enclosure, the setup
of the single-initiator SCSI buses to connect each cluster member to the RAID array is possible. If
a logical unit can fail over from one controller to the other, the process must be transparent to the
Chapter 2. Hardware Installation and Operating System Conguration 27
operating system. Note that some RAID controllers restrict a set of disks to a specic controller or
port. In this case, single-initiator bus setups are not possible.
A single-initiator bus must adhere to the requirements described in Section D.2 SCSI Bus Conguration
Requirements.
To set up a single-initiator SCSI bus conguration, the following is required:
? Enable the on-board termination for each host bus adapter.
? Enable the termination for each RAID controller.
? Use the appropriate SCSI cable to connect each host bus adapter to the storage enclosure.
Setting host bus adapter termination is usually done in the adapter BIOS utility during member boot.
To set RAID controller termination, refer to the vendor documentation. Figure 2-5 shows a conguration
that uses two single-initiator SCSI buses.
Figure 2-5. Single-initiator SCSI Bus Conguration
Figure 2-6 shows the termination in a single-controller RAID array connected to two single-initiator
SCSI buses.
Figure 2-6. Single-controller RAID Array Connected to Single-initiator SCSI Buses
Figure 2-7 shows the termination in a dual-controller RAID array connected to two single-initiator
SCSI buses.
28 Chapter 2. Hardware Installation and Operating System Conguration
Figure 2-7. Dual-controller RAID Array Connected to Single-initiator SCSI Buses
2.4.4.2. Setting Up a Fibre Channel Interconnect
Fibre Channel can be used in either single-initiator or multi-initiator congurations.
A single-initiator Fibre Channel interconnect has only one member connected to it. This may provide
better host isolation and better performance than a multi-initiator bus. Single-initiator interconnects
ensure that each member is protected from disruptions due to the workload, initialization, or repair of
the other member.
If employing a RAID array that has multiple host ports, and the RAID array provides simultaneous
access to all the shared logical units from the host ports on the storage enclosure, set up single-initiator
Fibre Channel interconnects to connect each member to the RAID array. If a logical unit can fail over
from one controller to the other, the process must be transparent to the operating system.
Figure 2-8 shows a single-controller RAID array with two host ports and the host bus adapters connected
directly to the RAID controller, without using Fibre Channel hubs or switches. When using
this type of single-initiator Fibre Channel connection, your RAID controller must have a separate host
port for each cluster member.
Chapter 2. Hardware Installation and Operating System Conguration 29
Figure 2-8. Single-controller RAID Array Connected to Single-initiator Fibre Channel Interconnects
The external RAID array must have a separate SCSI channel for each cluster member. In clusters with
more than two members, connect each member to a different SCSI channel on the RAID array, using
a single-initiator SCSI bus as shown in Figure 2-8.
To connect multiple cluster members to the same host port on the RAID array, use an FC hub or
switch. In this case, each HBA is connected to the hub or switch, and the hub or switch is connected
to a host port on the RAID controller.
A Fibre Channel hub or switch is also required with a dual-controller RAID array with two host ports
on each controller. This conguration is shown in Figure 2-9. Additional cluster members may be
connected to either Fibre Channel hub or switch shown in the diagram. Some RAID arrays include a
built-in hub so that each host port is already connected to each of the internal RAID controllers. In
this case, an additional external hub or switch may not be needed.
30 Chapter 2. Hardware Installation and Operating System Conguration
Figure 2-9. Dual-controller RAID Array Connected to Single-initiator Fibre Channel Interconnects
2.4.4.3. Conguring Shared Cluster Partitions
Two raw devices on shared disk storage must be created for the primary shared partition and the
shadow shared partition. Each shared partition must have a minimum size of 10 MB. The amount of
data in a shared partition is constant; it does not increase or decrease over time.
The shared partitions are used to hold cluster state information, including the following:
? Cluster lock states
? Service states
? Conguration information
Periodically, each member writes the state of its services to shared storage. In addition, the shared
partitions contain a version of the cluster conguration le. This ensures that each member has a
common view of the cluster conguration.
If the primary shared partition is corrupted, the cluster members read the information from the shadow
(or backup) shared partition and simultaneously repair the primary partition. Data consistency is maintained
through checksums, and any inconsistencies between the partitions are automatically corrected.
If a member is unable to write to both shared partitions at startup time, it is not allowed to join
the cluster. In addition, if an active member can no longer write to both shared partitions, the member
removes itself from the cluster by rebooting (and may be remotely power cycled by a healthy member).
The following are shared partition requirements:
? Both partitions must have a minimum size of 10 MB.
? Shared partitions must be raw devices. They cannot contain le systems.
? Shared partitions can be used only for cluster state and conguration information.
The following are recommended guidelines for conguring the shared partitions:
Chapter 2. Hardware Installation and Operating System Conguration 31
? It is strongly recommended to set up a RAID subsystem for shared storage, and use RAID 1 (mirroring)
to make the logical unit that contains the shared partitions highly available. Optionally, parity
RAID can be used for high availability. Do not use RAID 0 (striping) alone for shared partitions.
? Place both shared partitions on the same RAID set, or on the same disk if RAID is not employed,
because both shared partitions must be available for the cluster to run.
? Do not put the shared partitions on a disk that contains heavily-accessed service data. If possible,
locate the shared partitions on disks that contain service data that is rarely accessed.
Refer to Section 2.4.4.4 Partitioning Disks and Section 2.4.4.5 Creating Raw Devices for more information
about setting up the shared partitions.
Refer to Section 3.5 Editing the rawdevices File for information about editing the rawdevices le
to bind the raw character devices to the block devices each time the members boot.
2.4.4.4. Partitioning Disks
After shared disk storage hardware has been set up, partition the disks so they can be used in the
cluster. Then, create le systems or raw devices on the partitions. For example, two raw devices must
be created for the shared partitions using the guidelines described in Section 2.4.4.3 Conguring
Shared Cluster Partitions.
Use parted to modify a disk partition table and divide the disk into partitions. While in parted, use
the p to display the partition table and the mkpart command to create new partitions. The following
example shows how to use parted to create a partition on disk:
? Invoke parted from the shell using the command parted and specifying an available shared disk
device. At the (parted) prompt, use the p to display the current partition table. The output should
be similar to the following:
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
? Decide on how large of a partition is required. Create a partition of this size using the mkpart
command in parted. Although the mkpart does not create a le system, it normally requires a le
system type at partition creation time. parted uses a range on the disk to determine partition size;
the size is the space between the end and the beginning of the given range. The following example
shows how to create two partitions of 20 MB each on an empty disk.
(parted) mkpart primary ext3 0 20
(parted) mkpart primary ext3 20 40
(parted) p
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
1 0.030 21.342 primary
2 21.343 38.417 primary
? When more than four partitions are required on a single disk, it is necessary to create an extended
partition. If an extended partition is required, the mkpart also performs this task. In this case, it is
not necessary to specify a le system type.
Note
Only one extended partition may be created, and the extended partition must be one of the four
primary partitions.
(parted) mkpart extended 40 2000
(parted) p
32 Chapter 2. Hardware Installation and Operating System Conguration
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
1 0.030 21.342 primary
2 21.343 38.417 primary
3 38.417 2001.952 extended
? An extended partition allows the creation of logical partitionsinside of it. The following example
shows the division of the extended partition into two logical partitions.
(parted) mkpart logical ext3 40 1000
(parted) p
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
1 0.030 21.342 primary
2 21.343 38.417 primary
3 38.417 2001.952 extended
5 38.447 998.841 logical
(parted) mkpart logical ext3 1000 2000
(parted) p
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
1 0.030 21.342 primary
2 21.343 38.417 primary
3 38.417 2001.952 extended
5 38.447 998.841 logical
6 998.872 2001.952 logical
? A partition may be removed using parted's rm command. For example:
(parted) rm 1
(parted) p
Disk geometry for /dev/sda: 0.000-4340.294 megabytes
Disk label type: msdos
Minor Start End Type Filesystem Flags
2 21.343 38.417 primary
3 38.417 2001.952 extended
5 38.447 998.841 logical
6 998.872 2001.952 logical
? After all required partitions have been created, exit parted using the quit command. If a partition
was added, removed, or changed while both members are powered on and connected to the shared
storage, reboot the other member for it to recognize the modications. After partitioning a disk,
format the partition for use in the cluster. For example, create the le systems or raw devices for
shared partitions. Refer to Section 2.4.4.5 Creating Raw Devices and Section 2.4.4.6 Creating File
Systems for more information.
For basic information on partitioning hard disks at installation time, refer to the Red Hat Enterprise
Linux Installation Guide.
2.4.4.5. Creating Raw Devices
After partitioning the shared storage disks, create raw devices on the partitions. File systems are block
devices (for example, /dev/sda1) that cache recently-used data in memory to improve performance.
Raw devices do not utilize system memory for caching. Refer to Section 2.4.4.6 Creating File Systems
for more information.
Red Hat Enterprise Linux supports raw character devices that are not hard-coded against specic
block devices. Instead, Red Hat Enterprise Linux uses a character major number (currently 162) to
implement a series of unbound raw devices in the /dev/raw/ directory. Any block device can have a
character raw device front-end, even if the block device is loaded later at run time.
Chapter 2. Hardware Installation and Operating System Conguration 33
To create a raw device, edit the /etc/sysconfig/rawdevices le to bind a raw character device
to the appropriate block device to enable the raw device to be opened, read, and written.
Shared partitions and some database applications require raw devices, because these applications perform
their own buffer caching for performance purposes. Shared partitions cannot contain le systems
because if state data was cached in system memory, the members would not have a consistent view of
the state data.
Raw character devices must be bound to block devices each time a member boots. To ensure that
this occurs, edit the /etc/sysconfig/rawdevices le and specify the shared partition bindings. If
using a raw device in a cluster service, use this le to bind the devices at boot time. Refer to Section
3.5 Editing the rawdevices File for more information.
After editing /etc/sysconfig/rawdevices, the changes take effect either by rebooting or by execute
the following command:
service rawdevices restart
Query all the raw devices by using the command raw -aq. The output should be similar to the
following:
/dev/raw/raw1 bound to major 8, minor 17
/dev/raw/raw2 bound to major 8, minor 18
Note that, for raw devices, no cache coherency exists between the raw device and the block device. In
addition, requests must be 512-byte aligned both in memory and on disk. For example, the standard
dd command cannot be used with raw devices because the memory buffer that the command passes
to the write system call is not aligned on a 512-byte boundary.
For more information on using the raw command, refer to the raw(8) man page.
Note
The same raw device names (for example, /dev/raw/raw1 and /dev/raw/raw2) must be used on
all cluster members.
2.4.4.6. Creating File Systems
Use the mkfs command to create an ext3 le system. For example:
mke2fs -j -b 4096 /dev/sde3
For optimal performance of shared le systems, make sure to specify a 4 KB block size with the -b
option to mke2fs. A smaller block size can cause long fsck times.
34 Chapter 2. Hardware Installation and Operating System Conguration
Chapter 3.
Cluster Conguration
After installing and conguring the cluster hardware, the cluster system software and cluster conguration
software can be installed.
3.1. Installing the Red Hat Cluster Suite Packages
The clumanager and redhat-config-cluster packages are required to congure the Red Hat
Cluster Manager. Perform the following instructions to install the Red Hat Cluster Suite on your Red
Hat Enterprise Linux system.
3.1.1. Installation with the Package Management Tool
1. Insert the Red Hat Cluster Suite CD in your CD-ROM drive. If you are using a graphical desktop,
the CD will automatically run the Package Management Tool. Click Forward to continue.
Figure 3-1. Package Management Tool
2. Check the box for the Red Hat Cluster Suite, and click the Details link to the package descriptions.
36 Chapter 3. Cluster Conguration
Figure 3-2. Choosing Package Group
3. While viewing the package group details, check the box next to the packages to install. Click
Close when nished.
Figure 3-3. Choosing Packages for Installation
4. The Package Management Tool shows an overview of the packages to be installed. Click
Forward to install the packages.
5. When the installation is complete, click Finish to exit the Package Management Tool.
3.1.2. Installation with rpm
If you are not using a graphical desktop environment, you can install the packages manually using the
rpm utility at a shell prompt.
Chapter 3. Cluster Conguration 37
Insert the Red Hat Cluster Suite CD into the CD-ROM drive. Log into a shell prompt, change to
the RedHat/RPMS/ directory on the CD, and type the following commands as root (replacing
version
and arch
with the version and architecture of the packages to install):
rpm --Uvh clumanager-
version
.
arch
.rpm
rpm --Uvh redhat-config-cluster-
version
.noarch.rpm
3.2. Installation Notes for Red Hat Enterprise Linux 2.1 Users
Warning
Make sure the cluster service is not running on any members before performing the following procedure.
If you are currently running cluster services on Red Hat Enterprise Linux 2.1 and you want to install
Red Hat Enterprise Linux 3 on your system while preserving your cluster conguration, perform the
following steps on the Red Hat Enterprise Linux 2.1 system to backup the cluster conguration before
installing Red Hat Enterprise Linux 3:
1. Stop the cluster service by invoking the command service cluster stop on all cluster
members.
2. Remove the cluster database le (/etc/cluster.conf) from all members, saving a copy with
a different name (for example, /etc/cluster.conf.sav) to a separate running system or
portable storage such as a diskette.
3. Reinstall the Red Hat Enterprise Linux 2.1 system with Red Hat Enterprise Linux 3 on all
members.
4. Install the Red Hat Cluster Suite using the method described in Section 3.1 Installing the Red
Hat Cluster Suite Packages.
5. On all members, install the applications to be managed as services by the cluster software.
6. Restore the cluster database le you saved in step 2 to one member, naming the le
/etc/cluster.conf.
7. Invoke the /usr/sbin/cluster-convert command to convert the /etc/cluster.conf to
/etc/cluster.xml.
8. Congure shared storage on the cluster member by running /usr/sbin/shutil -i which
initializes the shared storage. Then run /usr/sbin/shutil -s /etc/cluster.xml to read
the shared storage information from the le and write it to the shared state.
9. Start the cluster service on this member using the command /sbin/service clumanager
start.
10. Copy the /etc/cluster.xml le to the other member using the scp command.
11. Start the cluster service on the other members.
38 Chapter 3. Cluster Conguration
3.3. The Cluster Conguration Tool
Red Hat Cluster Manager consists of the following RPM packages:
? clumanager ? This package consists of the software that is responsible for cluster operation
(including the cluster daemons).
? redhat-config-cluster ? This package contains the Cluster Conguration Tool and the
Cluster Status Tool, which allow for the conguration of the cluster and the display of the current
status of the cluster and its members and services.
You can use either of the following methods to access the Cluster Conguration Tool:
? Select Main Menu => System Settings => Server Settings => Cluster.
? At a shell prompt, type the redhat-config-cluster command.
The rst time that the application is started, the Cluster Conguration Tool is displayed. After you
complete the cluster conguration, the command starts the Cluster Status Tool by default. To access
the Cluster Conguration Tool from the Cluster Status Tool, select Cluster => Congure.
Figure 3-4. The Cluster Conguration Tool
The following tabbed sections are available within the Cluster Conguration Tool:
? Members ? Use this section to add members to the cluster and optionally congure a power
controller connection for any given member.
? Failover Domains ? Use this section to establish one or more subsets of the cluster members for
specifying which members are eligible to run a service in the event of a system failure. (Note that
the use of failover domains is optional.)
? Services ?Use this section to congure one or more services to be managed by the cluster. As you
specify an application service, the relationship between the service and its IP address, device special
le, mount point, and NFS exports is represented by a hierarchical structure. The parent-child
relationships in the Cluster Conguration Tool reect the organization of the service information
in the /etc/cluster.xml le.
Chapter 3. Cluster Conguration 39
Warning
Do not manually edit the contents of the /etc/cluster.xml le.
Do not simultaneously run the Cluster Conguration Tool on multiple members. (It is permissible
to run the Cluster Status Tool on more than one member at a time.)
The Cluster Conguration Tool stores information about the cluster service and daemons, cluster
members, and cluster services in the /etc/cluster.xml conguration le. The cluster conguration
le is created the rst time the Cluster Conguration Tool is started.
Save the conguration at any point (using File => Save) while running the Cluster Conguration
Tool. When File => Quit is selected, it prompts you to save changes if any unsaved changes to the
conguration are detected.
Note
When you save the cluster conguration for the rst time using the Cluster Conguration Tool and
exit the application, the next time you the cluster (either by choosing Main Menu => System Settings
=> Server Settings => Cluster or by running redhat-config-cluster from a shell prompt)
the Cluster Status Tool will display by default. The Cluster Status Tool displays the status of the
cluster service, cluster members, and application services, and shows statistics concerning service
operation. If you need to further congure the cluster system, choose Cluster => Congure from the
Cluster Status Tool menu.
The Cluster Conguration Tool) is used to congure cluster members, services, and cluster daemons.
The Cluster Status Tool is used to monitor, start and stop the cluster members and services
on particular members, and move an application service to another member.
The Cluster Conguration Tool uses a hierarchical tree structure to show relationships between
components in the cluster conguration. A triangular icon to the left of a component name indicates
that the component has children. To expand or collapse the portion of the tree below a component,
click the triangle icon.
Figure 3-5. Cluster Conguration Tool
40 Chapter 3. Cluster Conguration
To display the properties of a component, select the component and click Properties. As a shortcut,
you can display the properties of a component by double-clicking on the component name.
To view or modify the properties of the cluster daemons, choose Cluster => Daemon Properties.
3.4. Conguring the Cluster Software
Before describing the steps to congure the cluster software, the hierarchical structure of cluster members
and services needs to be considered. Cluster members and services can be thought of in terms of
a parent/child tree structure, as shown in Figure 3-6.
Figure 3-6. Cluster Conguration Structure
Each member can have power controller children. Each service can have IP address children. Each
Service can also have device children. Each device can have NFS export directory children. Each NFS
export directory can have client children. This structure is reected in the /etc/cluster.xml le
syntax.
The steps to congure the cluster software consist of the following:
1. Edit the /etc/sysconfig/rawdevicesle on all cluster members and specify the raw device
special les and character devices for the primary and backup shared partitions as described in
Section 2.4.4.3 Conguring Shared Cluster Partitions and Section 3.5 Editing the rawdevices
File.
2. Run the Cluster Conguration Tool on one cluster member.
3. Enter a name for the cluster in the Cluster Name eld. The name should be descriptive enough
to distinguish it from other clusters and systems on your network (for example, nfs_cluster
or httpd_cluster).
4. Choose Cluster => Shared State, and conrm that the Raw Primary and Raw Shadow settings
match the settings specied in step 1. The default for Raw Primary is /dev/raw/raw1.
The default for Raw Shadow is /dev/raw/raw2.
5. Choose Cluster => Daemon Properties to edit the daemon properties. Each daemon has its
own tab. Update the daemon options to suit your preferences and operating environment, and
click OK when done. Refer to Section 3.6 Conguring Cluster Daemons for more details.
Chapter 3. Cluster Conguration 41
6. Add member systems to the cluster by selecting the Members tab and clicking the New button.
Refer to Section 3.7 Adding and Deleting Members for details.
If the hardware conguration includes power controllers (power switches), then for each member
connected to a power controller, congure that member's connection to the power controller
by selecting the member and clicking Add Child. Refer to Section 3.8 Conguring a Power
Controller Connection for more information.
7. Set up one or more failover domains, if needed, to restrict the members on which a service can
run or restrict the order of members followed when a service fails over from one failover domain
member to another (or both). Refer to Section 3.9 Conguring a Failover Domain for details.
8. Congure the application services to be managed by the cluster, specifying IP addresses, failover
domain (if applicable) and user script that manages the service. Refer to Section 3.10 Adding a
Service to the Cluster for more information.
9. Save cluster conguration changes by selecting File => Save. When you save the cluster con-
guration, the command service clumanager reload command is executed to cause the
cluster software to load the changed conguration le.
10. Quit the application by selecting File => Quit.
Running the Cluster Conguration Tool for the rst time causes the cluster conguration le
/etc/cluster.xml to be created automatically. When quiting after running it for the rst
time, you are prompted to Press 'OK' to initialize Shared Storage, which uses the shared
partitions to pass quorum and service state information between cluster members. Click OK to
initialize the shared storage.
Note
Shared storage must be initialized before starting the cluster service for the rst time, or the
cluster will not run properly.
11. After completing the cluster software conguration on one member, copy the conguration le
/etc/cluster.xml to all other cluster members. The scp can be used to copy les from one
host to another.
3.5. Editing the rawdevices File
The /etc/sysconfig/rawdevices le is used to map the raw devices for the shared partitions
each time a cluster member boots. As part of the cluster software installation procedure, edit the
rawdevices le on each member and specify the raw character devices and block devices for the
primary and backup shared partitions. This must be done prior to running the Cluster Conguration
Tool.
If raw devices are employed in a cluster service, the rawdevices le is also used to bind the devices
at boot time. Edit the le and specify the raw character devices and block devices that you want to bind
each time the member boots. To make the changes to the rawdevices le take effect without requiring
a reboot, perform the following command:
/sbin/service rawdevices restart
The following is an example rawdevices le that designates two shared partitions:
# raw device bindings
# format:
rawdev
major
minor
#
rawdev
blockdev
# example: /dev/raw/raw1 /dev/s da1
42 Chapter 3. Cluster Conguration
# /dev/raw/raw2 8 5
/dev/raw/raw1 /dev/hda5
/dev/raw/raw2 /dev/hda6
Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information about setting up
the shared partitions. Refer to Section 2.4.4.5 Creating Raw Devices for more information on using
the raw command to bind raw character devices to block devices.
Note
The rawdevices conguration must be performed on all cluster members, and all members must use
the same raw devices (from the previous example, /dev/raw/raw1 and/dev/raw/raw2).
To check raw device conguration on the current cluster member, choose Cluster => Shared State
in the Cluster Conguration Tool. The Shared State dialog is displayed, as shown in Figure 3-7.
Figure 3-7. Verifying Shared State Conguration
3.6. Conguring Cluster Daemons
The Red Hat Cluster Manager provides the following daemons to monitor cluster operation:
? cluquorumd? Quorum daemon
? clusvcmgrd? Service manager daemon
? clurmtabd?Synchronizes NFS mount entries in /var/lib/nfs/rmtab with a private copy on
a service's mount point
? clulockd? Global lock manager (the only client of this daemon is clusvcmgrd)
? clumembd? Membership daemon
Each of these daemons can be individually congured using the Cluster Conguration Tool. To
access the Cluster Daemon Properties dialog box, choose Cluster => Daemon Properties.
The following sections explain how to congure cluster daemon properties. However, note that the
default values are applicable to most congurations and do not need to be changed.
3.6.1. Conguring clumembd
On each cluster system, the clumembd daemon issues heartbeats (pings) across the point-to-point
Ethernet lines to which the cluster members are connected.
Chapter 3. Cluster Conguration 43
Figure 3-8. Conguring clumembd
Note
You can enable both broadcast heartbeating and multicast heartbeating, but at least one of these
features must be used.
Multicast heartbeating over a channel-bonded Ethernet interface provides good fault tolerance and
is recommended for availability.
You can specify the following properties for the clumembd daemon:
? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default
/var/log/messages). Choose the appropriate logging level from the menu. Refer to Section
3.12 Conguring syslogd Event Logging for more information.
? Failover Speed?Determines the number of seconds that the cluster service waits before shutting
down a non-responding member (that is, a member from which no heartbeat is detected). To set the
failover speed, drag the slider bar. The default failover speed is 10 seconds.
Note
Setting a faster failover speed increases the likelihood of false shutdowns.
? Heartbeating ? Enable Broadcast Heartbeating or Enable Multicast Heartbeating by clicking
the corresponding radio button. Broadcast heartbeating species that the broadcast IP address is to
be used by the clumembd daemon when emitting heartbeats.
By default, the clumembd is congured to emit heartbeats via multicast. Multicast uses the network
interface associated with the member's hostname for transmission of heartbeats.
? Multicast IP Address ? Species the IP address to be used by the clumembd daemon over the
multicast channel. This eld is not editable if Enable Broadcast Heartbeating is checked. The
default multicast IP address used by the cluster is 225.0.0.11.
3.6.2. Conguring cluquorumd
In a two-member cluster without a specied tiebreaker IP address, the cluquorumd daemon periodically
writes a time-stamp and system status to a specic area on the primary and shadow shared
44 Chapter 3. Cluster Conguration
partitions. The daemon also reads the other member's timestamp and system status information from
the primary shared partition or, if the primary partition is corrupted, from the shadow partition.
Figure 3-9. Conguring cluquorumd
You can specify the following properties for the cluquorumd daemon:
? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default
/var/log/messages). Choose the appropriate logging level from the menu. Refer to Section
3.12 Conguring syslogd Event Logging for more information.
? Ping Interval or Tiebreaker IP ? Ping Interval is used for a disk-based heartbeat; species the
number of seconds between the quorum daemon's updates to its on-disk status.
Tiebreaker IP is a network-based heartbeat used to determine quorum, which is the ability to run
services. The tiebreaker IP address is only checked when the cluster has an even split in a twoor
four-member cluster. This IP address should be associated with a router that, during normal
operation, can be reached by all members over the Ethernet interface used by the cluster software.
3.6.3. Conguring clurmtabd
The clurmtabd daemon synchronizes NFS mount entries in /var/lib/nfs/rmtab with a private
copy on a service's mount point. The clurmtabd daemon runs only when a service with NFS exports
is running.
Figure 3-10. Conguring clurmtabd
You can specify the following properties for the clurmtabd daemon:
Chapter 3. Cluster Conguration 45
? Log Level ? Determines the level of event messages that get logged to the cluster log le (by
default, /var/log/messages). Choose the appropriate logging level from the menu. See Section
3.12 Conguring syslogd Event Logging for more information.
? Poll Interval?Species the number of seconds between polling cycles to synchronize the local
NFS rmtab to the cluster rmtab on shared storage.
3.6.4. Conguring the clusvcmgrd daemon
On each cluster system, the clusvcmgrd service manager daemon responds to changes in cluster
membership by stopping and starting services. You might notice, at times, that more than one
clusvcmgrd process is running; separate processes are spawned for start, stop, and monitoring
operations.
Figure 3-11. Conguring clusvcmgrd
You can specify the following properties for the clusvcmgrd daemon:
? Log Level ? Determines the level of event messages that get logged to the cluster log le (by
default, /var/log/messages). Choose the appropriate logging level from the menu. See Section
3.12 Conguring syslogd Event Logging for more information.
3.6.5. Conguring clulockd
The clulockd daemon manages the locks on les being accessed by cluster members.
Figure 3-12. Conguring clulockd
46 Chapter 3. Cluster Conguration
You can specify the following properties for the clulockd daemon:
? Log Level ? Determines the level of event messages that get logged to the cluster log le (by
default, /var/log/messages). Choose the appropriate logging level from the menu. See Section
3.12 Conguring syslogd Event Logging for more information.
3.7. Adding and Deleting Members
The procedure to add a member to a cluster varies slightly, depending on whether the cluster is already
running or is a newly-congured cluster.
3.7.1. Adding a Member to a New Cluster
To add a member to a new cluster, follow these steps:
Figure 3-13. Adding a Member to a New Cluster
1. Ensure that the Members tab is selected and click New. It prompts for a member name.
2. Enter the name or address of a system on the cluster subnet. Note that each member must be on
the same subnet as the system on which you are running the Cluster Conguration Tool and
must be dened either in DNS or in each cluster system's /etc/hosts le
The system on which you are running the Cluster Conguration Tool must be explicitly added
as a cluster member; the system is not automatically added to the cluster conguration as a
result of running the Cluster Conguration Tool.
3. Leave Enable SWWatchdog checked. (A software watchdog timer enables a member to reboot
itself.)
4. Click OK.
5. Choose File => Save to save the changes to the cluster conguration.
3.7.2. Adding a Member to a Running Cluster
To add a member to an existing cluster that is currently in operation, follow these steps:
1. Ensure that the cluster service is not running on the new member by invoking the
/sbin/service clumanager status command. Invoke the /sbin/service
clumanager stop command to stop the cluster service.
2. Ensure that the cluster service is running on all active cluster members. Run /sbin/service
clumanager start to start the service on the existing cluster members.
Chapter 3. Cluster Conguration 47
3. On one of the running members, invoke the Cluster Conguration Tool to add the new member.
Ensure that the Members tab is selected and click New.
4. It prompts for a member name. Enter the name of the new member. Note that each member must
be on the same subnet as the system on which you are running the Cluster Conguration Tool
and must be dened in DNS or in each cluster member's /etc/hosts le..
5. Leave Enable SWWatchdog checked. (A software watchdog timer enables a member to reboot
itself.)
6. Click OK.
7. Choose File => Save to save the changes to the cluster conguration.
8. Copy the updated /etc/cluster.xml le (containing the newly-added member) to the new
member.
9. Invoke the service clumanager start command to start the cluster service on the new
member.
3.7.3. Deleting a Member from a Running Cluster
To delete a member from an existing cluster that is currently in operation, follow these steps:
1. Ensure that the cluster service is not running on the member to be deleted. Invoke the service
clumanager stop command to stop the cluster service.
2. On one of the running members, invoke the Cluster Conguration Tool to delete the new
member, as follows:
a. Remove the member from any failover domains of which it is a member (see Section 3.9
Conguring a Failover Domain).
b. Select the Members tab.
c. Select the member to be deleted and click Delete.
d. Click Yes to conrm deletion.
Figure 3-14. Deleting a Member
3. Choose File => Save to save the changes to the cluster conguration.
3.8. Conguring a Power Controller Connection
To ensure data integrity, only one member can run a service and access service data at one time. The
use of power controllers (also called power switches) in the cluster hardware conguration enables a
member to power-cycle another member before restarting that member's services during the failover
process. This prevents more than one system from simultaneously accessing the same data and corrupting
it. Although not required, it is recommended that you use power controllers to guarantee data
48 Chapter 3. Cluster Conguration
integrity under all failure conditions. Watchdog timers are an optional variety of power control to
ensure correct operation of service failover.
If the hardware conguration for the cluster includes one or more power controllers, congure the
connection to a power controller for any given member as follows:
1. Select the member for which you want to congure a power controller connection and click
Add Child. The Power Controller dialog box is displayed as shown in Figure 3-15.
Figure 3-15. Conguring Power Controllers
2. Specify the following information depending on whether the controller is connected to the
member through a serial port or through the network. Note that serial connections to power
controllers are valid only in a two-member cluster.
Field Description
Type Type of serial power controller (only the rps10 is supported in Red Hat
Enterprise Linux 3).
Device Physical device on the power controller's parent member that the power
controller is plugged into; must be one of /dev/ttyS0 through
/dev/ttyS9.
Port Port number on the power controller itself that is attached to the specied
device.
Owner The member that controls, or owns, the device. The owner is the member that
can power off this device. Automatically defaults to the name of the other
member. This eld cannot be edited.
Table 3-1. Conguring a Serial Power Controller
Field Description
Type Type of network power controller; choose from the menu containing all
supported types.
Chapter 3. Cluster Conguration 49
Field Description
IP Address IP of the network power controller itself.
Port Specic port on the power controller that this member is attached to.
User Username to be used when logging in to the power controller.
Password Password to be used when logging in to the power controller.
Table 3-2. Conguring a Network Power Controller
Users of Red Hat GFS can choose the Grand Unied Lock Manager (GULM) driver as their
power controller by clicking the GULM STONITH radio button. The GULM driver is used
in conjunction with the Red Hat Cluster Manager failover subsystem and features performance
gains in the detection and fencing initialization of hung or failed cluster members.
For more information about Red Hat GFS refer to the Red Hat GFS Administrator's Guide.
3. Click OK.
4. Choose File => Save to save the changes to the cluster conguration.
3.9. Conguring a Failover Domain
A failover domain is a named subset of cluster members that are eligible to run a service in the event
of a system failure. A failover domain can have the following characteristics:
? Unrestricted ? Allows you to specify that a subset of members are preferred, but that a service
assigned to this domain can run on any available member.
? Restricted ? Allows you to restrict the members that can run a particular service. If none of the
members in a restricted failover domain are available, the service cannot be started (either manually
or by the cluster software).
? Unordered ? When a service is assigned to an unordered failover domain, the member on which
the service runs is chosen from the available failover domain members with no priority ordering.
? Ordered ?Allows you to specify a preference order among the members of a failover domain. The
member at the top of the list is the most preferred, followed by the second member in the list, and
so on.
By default, failover domains are unrestricted and unordered.
In a cluster with several members, using a restricted failover domain can minimize the work to set up
the cluster to run a service (such as httpd, which requires you to set up the conguration identically
on all members that run the service). Instead of setting up the entire cluster to run the service, you
must set up only the members in the restricted failover domain that you associate with the service.
Tip
To implement the concept of a preferred member, create an unrestricted failover domain comprised
of only one cluster member. By doing this, a service runs on the preferred member; in the event of a
failure, the service fails over to any of the other members.
To add a failover domain to the cluster software conguration, follow these steps:
1. Select the Failover Domains tab and click New. The Failover Domain dialog box is displayed
as shown in Figure 3-16.
50 Chapter 3. Cluster Conguration
Figure 3-16. Conguring a Failover Domain
2. Enter a name for the domain in the Domain Name eld. The name should be descriptive enough
to distinguish its purpose relative to other names used on your network.
3. Check Restrict failover to only these members to prevent any member other than those listed
from taking over a service assigned to this domain.
4. Check Ordered Failover if you want members to assume control of a failed service in a particular
sequence; preference is indicated by the member's position in the list of members in the
domain, with the most preferred member at the top.
5. Click Add Members to select the members for this failover domain. The Failover Domain
Member dialog box is displayed.
Figure 3-17. Failover Domain Member
You can choose multiple members from the list by pressing either the [Shift] key while clicking
the start and end of a range of members, or pressing the [Ctrl] key while clicking on noncontiguous
members.
Chapter 3. Cluster Conguration 51
6. When nished selecting members from the list, click OK. The selected members are displayed
on the Failover Domain list.
7. When Ordered Failover is checked, you can rearrange the order of the members in the domain
by dragging the member name in the list box to the desired position. A thin, black line is
displayed to indicate the new row position (when you release the mouse button).
8. When nished, click OK.
9. Choose File => Save to save the changes to the cluster conguration.
To remove a member from a failover domain, follow these steps:
1. On the Failover Domains tab, double-click the name of the domain you want to modify (or
select the domain and click Properties).
2. In the Failover Domain dialog box, click the name of the member you want to remove from the
domain and click Delete Member. (Members must be deleted one at a time.) You are prompted
to conrm the deletion.
3. When nished, click OK.
4. Choose File => Save to save the changes to the cluster conguration.
3.10. Adding a Service to the Cluster
To add a service to the cluster, follow these steps:
1. Select the Services tab and click New. The Service dialog is displayed as shown in Figure 3-18.
Figure 3-18. Adding a Service
2. Give the service a descriptive Service Name to distinguish its functionality relative to other
services that may run on the cluster.
3. If you want to restrict the members on which this service is able to run, choose a failover
domain from the Failover Domain list. (Refer to Section 3.9 Conguring a Failover Domain
for instructions on how to congure a failover domain.)
4. Adjust the quantity in the Check Interval eld, which sets the interval (in seconds) that the
cluster infrastructure checks the status of a service. This eld is only applicable if the service
script is written to check the status of a service.
5. Specify a User Script that contains settings for starting, stopping, and checking the status of a
service.
6. Specify service properties, including an available oating IP address (an address that can be
transferred transparently from a failed member to a running member in the event of failover)
52 Chapter 3. Cluster Conguration
and devices (which are congured as children of the service). For instructions, refer to Section
3.10.1 Adding a Service IP Address and Section 3.10.2 Adding a Service Device.
7. When nished, click OK.
8. Choose File => Save to save the changes to the cluster conguration.
The following sections describe service conguration in more detail.
3.10.1. Adding a Service IP Address
To specify a service IP address, follow these steps:
1. On the Services tab of the Cluster Conguration Tool, select the service you want to congure
and click Add Child.
2. Select Add Service IP Address and click OK.
3. Specify an IP address (which must be resolvable by DNS but cannot be the IP address of a
running service).
4. Optionally specify a netmask and broadcast IP address.
5. Choose File => Save to save the change to the /etc/cluster.xml conguration le.
3.10.2. Adding a Service Device
To specify a device for a service, follow these steps:
1. On the Services tab of the Cluster Conguration Tool, select the service you want to congure
and click Add Child.
2. Select Add Device and click OK.
3. Specify a Device Special File (for example, /dev/hda7) and a mount point (for example,
/mnt/share). Each device must have a unique device special le and a unique mount point
within the cluster.
4. Specify a Samba Share Name for the device if it is intended to be a Samba export
directory. If a Samba Share Name has been entered, when the user selects File => Save, a
/etc/samba/smb.conf.sharename le is created (where sharename is the name of the
Samba share), and will be used by Samba when the cluster starts the service. For each Samba
share you create, an /etc/samba/smb.conf.sharename is created. Copy all of these les
to the other cluster members before initializing the cluster service on those members. For more
information about conguring a Samba share, refer to Section 6.6 Setting Up a Samba Service.
5. Specify a directory from which to mount the device in the Mount Point eld. This directory
should not be listed in /etc/fstab as it is automatically mounted by the Red Hat Cluster
Manager when the service is started.
6. Choose a le system type from the FS Type list.
7. Optionally specify Options for the device. If you leave the Options eld blank, the default
mount options (rw,suid,dev,exec,auto,nouser,async) are used. Refer to the mount man
page for a complete description of the options available for mount.
8. Check Force Unmount to force any application that has the specied le system mounted to be
killed prior to disabling or relocating the service (when the application is running on the same
member that is running the disabled or relocated service).
9. When nished, click OK.
10. Choose File => Save to save the change to the /etc/cluster.xml conguration le.
Chapter 3. Cluster Conguration 53
3.11. Checking the Cluster Conguration
To ensure that the cluster software has been correctly congured, use the following tools located in
the /usr/sbin directory:
? Test the shared partitions and ensure that they are accessible.
Invoke the /usr/sbin/shutil utility with the -v option to test the accessibility of the shared
partitions. See Section 3.11.1 Testing the Shared Partitions for more information.
? Test the operation of the power switches.
If power switches are used in the cluster hardware conguration, run the clufence command
on each member to ensure that it can remotely power-cycle the other member. Do not run this
command while the cluster software is running. See Section 3.11.2 Testing the Power Switches for
more information.
? Ensure that all members are running the same software version.
Invoke the rpm -q clumanager command and rpm -q redhat-config-cluster command
on each member to display the revision of the installed cluster software RPMs.
The following section explains the cluster utilities in further detail.
3.11.1. Testing the Shared Partitions
The shared partitions must refer to the same physical device on all members. Invoke the
/usr/sbin/shutil utility with the -v command to test the shared partitions and verify that they
are accessible.
If the command succeeds, run the /usr/sbin/shutil -p /cluster/header command on
all members to display a summary of the header data structure for the shared partitions. If the
output is different on the members, the shared partitions do not point to the same devices on all
members. Check to make sure that the raw devices exist and are correctly specied in the
/etc/sysconfig/rawdevices le. See Section 2.4.4.3 Conguring Shared Cluster Partitions for
more information.
The following example shows that the shared partitions refer to the same physical device on cluster
members clu1.example.com and clu2.example.com via the /usr/sbin/shutil -p
/cluster/header command:
/cluster/header is 140 bytes long
SharedStateHeader {
ss_magic = 0x39119fcd
ss_timestamp = 0x000000003ecbc215 (14:14:45 May 21 2003)
ss_updateHost = clu1.example.com
All elds in the output from the /usr/sbin/shutil -p /cluster/header command should be
the same when run on all cluster members. If the output is not the same on all members, perform the
following:
? Examine the /etc/sysconfig/rawdevices le on each member and ensure that the raw character
devices and block devices for the primary and backup shared partitions have been accurately
specied. If they are not the same, edit the le and correct any mistakes. Then re-run the Cluster
Conguration Tool. See Section 3.5 Editing the rawdevices File for more information.
? Ensure that you have created the raw devices for the shared partitions on each member. See Section
2.4.4.3 Conguring Shared Cluster Partitions for more information.
? To determine the bus conguration on each member, examine the system startup messages by running
dmesg |less to the point where the system probes the SCSI subsystem. Verify that all members
identify the same shared storage devices and assign them the same name.
54 Chapter 3. Cluster Conguration
? Verify that a member is not attempting to mount a le system on the shared partition. For example,
make sure that the actual device (for example, /dev/sdb1) is not included in an /etc/fstab le.
After performing these tasks, re-run the /usr/sbin/shutil utility with the -p option.
3.11.2. Testing the Power Switches
If either network-attached or serial-attached power switches are employed in the cluster hardware con-
guration, install the cluster software and invoke the clufence command to test the power switches.
Invoke the command on each member to ensure that it can remotely power-cycle the other member. If
testing is successful, then the cluster can be started.
The clufence command can accurately test a power switch only if the cluster software is not running.
This is due to the fact that for serial attached switches, only one program at a time can access the serial
port that connects a power switch to a member. When the clufence command is invoked, it checks
the status of the cluster software. If the cluster software is running, the command exits with a message
to stop the cluster software.
The clufence command line options are as follows:
? -d ? Turn on debugging
? -f ? Fence (power off) member
? -u ? Unfence (power on) member
? -r ? Reboot (power cycle) member
? -s ? Check status of all switches controlling member
When testing power switches, the rst step is to ensure that each cluster member can successfully
communicate with its attached power switch. The following output of the clufence command shows
that the cluster member is able to communicate with its power switch:
[27734] info: STONITH: rps10 at /dev/ttyS0, port 0 controls clumember1.example.com
[27734] info: STONITH: rps10 at /dev/ttyS0, port 1 controls clumember2.example.com
In the event of an error in the clufence output, check the following:
? For serial attached power switches:
? Verify that the device special le for the remote power switch connection serial port (for example,
/dev/ttyS0) is specied correctly in the cluster conguration le; in the Cluster Conguration
Tool, display the Power Controller dialog box to check the serial port value. If necessary,
use a terminal emulation package such as minicom to test if the cluster member can access the
serial port.
? Ensure that a non-cluster program (for example, a getty program) is not using the serial port
for the remote power switch connection. You can use the lsof command to perform this task.
? Check that the cable connection to the remote power switch is correct. Verify that the correct
type of cable is used (for example, an RPS-10 power switch requires a null modem cable), and
that all connections are securely fastened.
? Verify that any physical dip switches or rotary switches on the power switch are set properly.
? For network based power switches:
? Verify that the network connection to network-based power switches is operational. Most
switches have a link light that indicates connectivity.
Chapter 3. Cluster Conguration 55
? It should be possible to ping the network power switch; if not, then the switch may not be
properly congured for its network parameters.
? Verify that the correct password and login name (depending on switch type) have been specied
in the cluster conguration le (as established by running the Cluster Conguration Tool and
viewing the properties specied in the Power Controller dialog box). A useful diagnostic approach
is to verify Telnet access to the network switch using the same parameters as specied in
the cluster conguration.
After successfully verifying communication with the switch, attempt to power cycle the other cluster
member. Prior to doing this, we recommend you verify that the other cluster member is not actively
performing any important functions (such as serving cluster services to active clients). Running the
command clufence -f clumember2.example.com displays the following output upon a successful
shutdown and fencing operation (which means that the system does not receive power from
the power switch until the system has been unfenced):
[7397] info: STONITH: rps10 at /dev/ttyS0, port 0 controls clumember1.example.com
[7397] info: STONITH: rps10 at /dev/ttyS0, port 1 controls clumember2.example.com
[7397] notice: STONITH: clumember2.example.com has been fenced!
3.11.3. Displaying the Cluster Software Version
Ensure that all members in the cluster are running the same version of the Red Hat Cluster Manager
software.
To display the version of the Cluster Conguration Tool and the Cluster Status Tool, use either of
the following methods:
? Choose Help => About. The About dialog includes the version numbers.
? Invoke the following commands:
rpm -q redhat-config-cluster
rpm -q clumanager
? The version of the clumanager package can also be determined by invoking the clustat -v
command.
3.12. Conguring syslogd Event Logging
It is possible to edit the /etc/syslog.conf le to enable the cluster to log events to a le that is
different from the /var/log/messages log le. Logging cluster messages to a separate le helps to
diagnose problems more clearly.
The members use the syslogd daemon to log cluster-related events to a le, as specied in the
/etc/syslog.conf le. The log le facilitates diagnosis of problems in the cluster. It is recommended
to set up event logging so that the syslogd daemon logs cluster messages only from the
member on which it is running. Therefore, you need to examine the log les on all members to get a
comprehensive view of the cluster.
The syslogd daemon logs messages from the cluster daemons, which all default to severity level
4 (warning). Refer to Section 3.6 Conguring Cluster Daemons for more information on cluster
daemons.
The importance of an event determines the severity level of the log entry. Important events should be
investigated before they affect cluster availability. The cluster can log messages with the following
severity levels, listed in order of severity level:
56 Chapter 3. Cluster Conguration
? EMERG ?The member is unusable (emergency).
? ALERT ?Action must be taken immediately to address the problem.
? CRIT ?A critical condition has occurred.
? ERR ?An error has occurred.
? WARN ?A signicant event that may require attention has occurred.
? NOTICE ?A signicant, but normal, event has occurred.
? INFO ?An insignicant, but normal, cluster operation has occurred.
? DEBUG ?Diagnostic output detailing cluster operations (typically not of interest to administrators.)
Examples of log le entries are as follows:
Jul 18 20:24:39 clu1 clufence[7397]:
info
STONITH: rps10 at /dev/ttyS0,\
port 0 controls clu1
Jul 18 20:24:39 clu1 clufence[7397]:
info
STONITH: rps10 at /dev/ttyS0,\
port 1 controls clu2
Jul 18 20:24:53 clu1 clufence[7397]: Port 0 being turned off.
Jul 18 20:24:53 clu1 clufence[7397]:
notice
STONITH: clu2 has been fenced!
Jul 18 20:51:03 clu1 clufence[30780]:
info
STONITH: rps10 at/dev/ttyS0,\
port 0 controls clu1
Jul 18 20:51:03 clu1 clufence[30780]:
info
STONITH: rps10 at /dev/ttyS0,\
port 1 controls clu2
Jul 18 20:51:17 clu1 clufence[30780]: Port 0 being turned on.
Jul 18 20:51:17 clu1 clufence[30780]:
notice
STONITH: clu2 is no longer fenced off.
[1] [2] [3] [4] [5]
Each entry in the log le contains the following information:
? [1] Date and time
? [2] Hostname
? [3] Cluster resource or daemon
? [4] Severity
? [5] Message
After conguring the cluster software, optionally edit the /etc/syslog.conf le to enable the cluster
to log events to a le that is different from the default log le, /var/log/messages. The cluster
utilities and daemons log their messages using a syslog tag called local4. Using a cluster-specic
log le facilitates cluster monitoring and problem solving.
To prevent cluster events from being logged to the /var/log/messages le, add local4.none to
the following line in the /etc/syslog.conf le:
# Log anything (except mail) of level info or higher.
# Don't log private authentication messages!
*.info;mail.none;news.none;authpriv.none;local4.none /var/log/messages
To direct the cluster logging facility to log cluster events in the /var/log/cluster le, add lines
similar to the following to the /etc/syslog.conf le:
#
# Cluster messages coming in on local4 go to /var/log/cluster
#
local4.* /var/log/cluster
To apply the previous changes, restart syslogd with the service syslog restart command.
Chapter 3. Cluster Conguration 57
In addition, it is possible to modify the severity level of the events that are logged by the individual
cluster daemons; refer to Section 3.6 Conguring Cluster Daemons and the man page for
syslog.conf for more information.
To rotate the cluster log le according to the frequency specied in the /etc/logrotate.conf le
(the default is weekly), add /var/log/cluster to the rst line of the /etc/logrotate.d/syslog
le. For example:
/var/log/messages /var/log/secure /var/log/maillog /var/log/spooler
/var/log/boot.log /var/log/cron /var/log/cluster {
sharedscripts
postrotate
/bin/kill -HUP `cat /var/run/syslogd.pid 2> /dev/null` 2>
/dev/null || true
endscript
}
58 Chapter 3. Cluster Conguration
Chapter 4.
Service Administration
The following sections describe how to display, enable, disable, modify, relocate, and delete a service,
and how to handle services that fail to start. Examples of setting up specic types of services are
provided.
4.1. Conguring a Service
After setting up disk storage and installing applications to be managed by the cluster, you can congure
services to manage these applications and resources by using the Cluster Conguration Tool.
To congure a service, follow these steps:
1. If applicable, create a script to start, stop, and check the status of the application used in the
service. Refer to Section 4.1.2 Creating Service Scripts for information.
2. Gather information about service resources and properties. Refer to Section 4.1.1 Gathering
Service Information for information.
3. Set up the le systems or raw devices that the service uses. Refer to Section 4.1.3 Conguring
Service Disk Storage for information.
4. Ensure that the application software can run on each member (in either the associated failover
domain, if used, or in the cluster) and that the service script, if any, can start and stop the
service application. Refer to Section 4.1.4 Verifying Application Software and Service Scripts
for upgrade instructions.
5. If upgrading from an existing cluster, back up the /etc/cluster.conf le. Refer to Section
3.2 Installation Notes for Red Hat Enterprise Linux 2.1 Users for upgrade instructions.
6. Start the Cluster Conguration Tool and add services, specifying the information about the
service resources and properties obtained in step 2.
7. Save your conguration. Saving the settings on one cluster member propogates to other cluster
members automatically.
For more information about adding a cluster service, refer to the following:
? Section 5.1 Setting Up an Oracle Service
? Section 5.3 Setting Up a MySQL Service
? Section 6.1 Setting Up an NFS Service
? Section 6.6 Setting Up a Samba Service
? Chapter 7 Setting Up Apache HTTP Server
4.1.1. Gathering Service Information
Before conguring a service, gather all available information about the service resources and properties.
In some cases, it is possible to specify multiple resources for a service (for example, multiple IP
addresses and disk devices).
The service properties and resources that you can specify using the Cluster Conguration Tool are
described in Table 4-1.
60 Chapter 4. Service Administration
Property Description
Service
name
Each service must have a unique name. A service name can consist of one to 63
characters and must consist of a combination of letters (either uppercase or
lowercase), integers, underscores, periods, and dashes (hyphens). A service name
must begin with a letter or an underscore.
Failover
domain
Identify the members on which to run the service by associating the service with an
existing failover domain.
When ordered failover is enabled, if the member on which the service is running
fails, the service is automatically relocated to the next member on the ordered
member list. (Order of preference is established by the sequence of member names
in the failover domain list). Refer to Section 3.9 Conguring a Failover Domain for
more information.
Check
interval
Species the frequency (in seconds) that the member checks the health of the
application associated with the service. For example, when you specify a nonzero
check interval for an NFS or Samba service, Red Hat Cluster Manager veries that
the necessary NFS or Samba daemons are running. For other types of services, Red
Hat Cluster Manager checks the return status after calling the status clause of the
application service script. By default, check interval is 0, indicating that service
monitoring is disabled.
User script If applicable, specify the full path name for the script that is used to start and stop
the service. Refer to Section 4.1.2 Creating Service Scripts for more information.
IP address One or more Internet protocol (IP) addresses may be assigned to a service. This IP
address (sometimes called a "oating" IP address) is different from the IP address
associated with the host name Ethernet interface for a member, because it is
automatically relocated along with the service resources when failover occurs. If
clients use this IP address to access the service, they do not know which member is
running the service, and failover is transparent to the clients.
Note that cluster members must have network interface cards congured in the IP
subnet of each IP address used in a service.
Netmask and broadcast addresses for each IP address can also be specied; if they
are not, then the cluster uses the netmask and broadcast addresses from the network
interconnect in the subnet.
Device
special le
Specify each shared disk partition used in a service.
File system
and sharing
options
If the service uses a le system, specify the type of le system, the mount point,
and any mount options. You can specify any of the standard le system mount
options as described in the mount(8) man page. It is not necessary to provide mount
information for raw devices (if used in a service). ext2 and ext3 le systems are
supported for a cluster.
Specify whether to enable forced unmount for a le system. Forced unmount
allows the cluster service management infrastructure to unmount a le system prior
to relocation or failover, even if the le system is busy. This is accomplished by
terminating any applications that are accessing the le system.
You can also specify whether to export the le system via NFS set access
permissions. Refer to Section 6.1 Setting Up an NFS Service for details.
Specify whether or not to make the le system accessible to SMB clients via Samba
by providing a Samba share name.
Table 4-1. Service Property and Resource Information
Chapter 4. Service Administration 61
4.1.2. Creating Service Scripts
The cluster infrastructure starts and stops specied applications by running service-specic scripts.
For both NFS and Samba services, the associated scripts are built into the cluster services infrastructure.
Consequently, when running the Cluster Conguration Tool to congure NFS and Samba
services, leave the User Script eld blank. For other application types it is necessary to designate
a service script. For example, when conguring a database application, specify the fully-qualied
pathname of the corresponding database start script.
The format of the service scripts conforms to the conventions followed by the System V init scripts.
This convention dictates that the scripts have a start, stop, and status clause. These scripts should
return an exit status of 0 upon successful completion.
When a service fails to start on one cluster member, Red Hat Cluster Manager will attempt to start
the service on other cluster members. If the other cluster members fail to start the service, Red Hat
Cluster Manager attempts to stop the service on all members. If it fails to stop the service for any
reason, the cluster infrastructure will place the service in the Failed state. Administrators must then
start the Cluster Status Tool, highlight the failed service, and click Disable before they can enable
the service.
In addition to performing the stop and start functions, the script is also used for monitoring the status of
an application service. This is performed by calling the status clause of the service script. To enable
service monitoring, specify a nonzero value in the Check Interval eld when specifying service
properties with the Cluster Conguration Tool. If a nonzero exit is returned by a status check request
to the service script, then the cluster infrastructure rst attempts to restart the application on the
member it was previously running on. Status functions do not have to be fully implemented in service
scripts. If no real monitoring is performed by the script, then a stub status clause should be present
which returns success.
The operations performed within the status clause of an application can be tailored to best meet the application's
needs as well as site-specic parameters. For example, a simple status check for a database
would consist of verifying that the database process is still running. A more comprehensive check
would consist of a database table query.
The /usr/share/cluster/doc/services/examples/ directory contains a template that can be
used to create service scripts, in addition to examples of scripts. Refer to Section 5.1 Setting Up an
Oracle Service, Section 5.3 Setting Up a MySQL Service, Chapter 7 Setting Up Apache HTTP Server,
for sample scripts.
4.1.3. Conguring Service Disk Storage
Prior to creating a service, set up the shared le systems and raw devices that the service is to use.
Refer to Section 2.4.4 Conguring Shared Disk Storage for more information.
If employing raw devices in a cluster service, it is possible to use the /etc/sysconfig/rawdevices
le to bind the devices at boot time. Edit the le and specify the raw character devices and block devices
that are to be bound each time the member boots. Refer to Section 3.5 Editing the rawdevices
File for more information.
Note that software RAID and host-based RAID cannot be used for shared disk storage. Only certied
SCSI adapter-based RAID cards can be used for shared disk storage.
You should adhere to the following service disk storage recommendations:
? For optimal performance, use a 4 KB block size when creating le systems. Note that some of the
mkfs le system build utilities default to a 1 KB block size, which can cause long fsck times.
? To facilitate quicker failover times, it is recommended that the ext3 le system be used. Refer to
Section 2.4.4.6 Creating File Systems for more information.
62 Chapter 4. Service Administration
? For large le systems, use the nocheck option to bypass code that checks all the block groups on
the partition. Specifying the nocheck option can signicantly decrease the time required to mount
a large le system.
4.1.4. Verifying Application Software and Service Scripts
Prior to setting up a service, install any applications that are used in the service on each member
in the cluster (or each member in the failover domain, if used). After installing the application on
these members, verify that the application runs and can access shared disk storage. To prevent data
corruption, do not run the application simultaneously on more than one member.
If using a script to start and stop the service application, install and test the script on all cluster members,
and verify that it can be used to start and stop the application. Refer to Section 4.1.2 Creating
Service Scripts for more information.
4.2. Displaying a Service Conguration
In the Cluster Conguration Tool, the following information for a given
service
element is
viewable directly on the Services tab (when the service conguration is fully expanded):
Figure 4-1. The Services Tab
? Service name
? Device special le name
? Service IP address
? NFS export and clients (if specied) for the device
You can display the failover domain, check interval, and user script name for a
service
by accessing
its properties dialog box.
Chapter 4. Service Administration 63
The Samba share name, mount point, le system type, and mount options for a service are displayed
as the properties for a
device
.
You can view the permissions for NFS client access by displaying
client
properties.
To display the properties for an element in the cluster conguration, use any of the following methods:
? Double-click the element name.
? Select the element name and click Properties.
? Select the element name and choose File => Properties.
For instructions on conguring a service using the Cluster Conguration Tool, refer to Section 3.10
Adding a Service to the Cluster.
To display cluster service status, refer to Section 8.2 Displaying Cluster and Service Status.
4.3. Disabling a Service
A running service can be disabled to stop the service and make it unavailable. Once disabled, a service
can then be re-enabled. Refer to Section 4.4 Enabling a Service for information.
There are several situations in which a running service may need to be disabled:
? To modify a service?A running service must be disabled before it can be modied. Refer to Section
4.5 Modifying a Service for more information.
? To temporarily stop a service?A running service can be disabled, making it unavailable to clients
without having to completely delete the service.
To disable a running service, invoke the redhat-config-cluster command to display the Cluster
Status Tool. In the Services tab, select the service you want to disable and click Disable.
4.4. Enabling a Service
When you enable a disabled service, that service is started and becomes available on the specied
cluster member. (You can start the service on another member by dragging the service icon into
the Members tab of the Cluster Status Tool and dropping the icon on the member icon .)
To enable a stopped service, invoke the redhat-config-cluster command to display the Cluster
Conguration Tool. In the Services tab, select the service you want to enable and click Enable.
4.5. Modifying a Service
You can modify the properties for a service any time after creating the service. For example, you can
change the IP address or location of the user script. You can also add more resources (for example,
additional le systems or IP addresses) to an existing service. Refer to Section 4.1.1 Gathering Service
Information for information.
Using the Cluster Status Tool, rst disable, or stop, the service before modifying its properties. If an
administrator attempts to modify a running service, a warning dialog displays prompting to stop the
service. Refer to Section 4.3 Disabling a Service for instructions on disabling a service.
Because a service is unavailable while being modied, be sure to gather all the necessary service
information before disabling it to minimize service down time. In addition, back up the cluster conguration
le (/etc/cluster.xml) before modifying a service; refer to Section 8.5 Backing Up and
Restoring the Cluster Database.
64 Chapter 4. Service Administration
Use the Cluster Conguration Tool to modify the service properties, then save the conguration by
choosing File => Save. After the modications to the service have been saved, you can restart the
service using the Cluster Status Tool to enable the modications.
4.6. Relocating a Service
In addition to providing automatic service failover, a cluster enables you to cleanly stop a service
on one member and restart it on another member. This service relocation functionality allows you to
perform maintenance on a cluster member while maintaining application and data availability.
To relocate a service by using the Cluster Status Tool, drag the service icon from the Services
tab onto the member icon in the Members tab. The Red Hat Cluster Manager stops the service on
the member on which it was running and restarts it on the new member.
4.7. Deleting a Service
A cluster service can be deleted. Note that the cluster conguration le should be backed up before
deleting a service. Refer to Section 8.5 Backing Up and Restoring the Cluster Database for information.
To delete a service from the cluster, follow these steps:
1. If the service is running, disable the service using the Cluster Status Tool (refer to Section 4.3
Disabling a Service).
2. On the Services tab in the Cluster Conguration Tool, select the name of the service you want
to remove and click Delete.
3. You are prompted to conrm the deletion. To remove the service, click OK.
4. Choose File => Save to save the change to the /etc/cluster.xml conguration le.
4.8. Handling Failed Services
The cluster puts a service into the Failed state if it is unable to successfully start the service across all
members and then cannot cleanly stop the service. A Failed state can be caused by various problems,
such as a misconguration as the service is running or a service hang or crash. The Cluster Status
Tool displays the service as being Failed.
Chapter 4. Service Administration 65
Figure 4-2. Service in Failed State
Note
You must disable a Failed service before you can modify or re-enable the service.
Be sure to carefully handle failed services. If service resources are still congured on the owner
member, starting the service on another member may cause signicant problems. For example, if a
le system remains mounted on the owner member, and you start the service on another member,
the le system is mounted on both members, which can cause data corruption. If the enable fails, the
service remains in the Disabled state.
After highlighting the service and clicking Disable, you can attempt to correct the problem that caused
the Failed state. After you modify the service, the cluster software enables the service on the owner
member, if possible; otherwise, the service remains in the Disabled state. The following list details
steps to follow in the event of service failure:
1. Modify cluster event logging to log debugging messages. Viewing the logs can help determine
problem areas. Refer to Section 8.6 Modifying Cluster Event Logging for more information.
2. Use the Cluster Status Tool to attempt to enable or disable the service on one of the cluster or
failover domain members. Refer to Section 4.3 Disabling a Service and Section 4.4 Enabling a
Service for more information.
3. If the service does not start or stop on the member, examine the /var/log/messages and
(if congured to log separately) /var/log/cluster log les, and diagnose and correct the
problem. You may need to modify the service to x incorrect information in the cluster conguration
le (for example, an incorrect start script), or you may need to perform manual tasks on
the owner member (for example, unmounting le systems).
4. Repeat the attempt to enable or disable the service on the member. If repeated attempts fail to
correct the problem and enable or disable the service, reboot the member.
5. If still unable to successfully start the service, verify that the service can be manually restarted
outside of the cluster framework. For example, this may include manually mounting the le
systems and manually running the service start script.
66 Chapter 4. Service Administration
Chapter 5.
Database Services
This chapter contains instructions for conguring Red Hat Enterprise Linux to make database services
highly available.
Note
The following descriptions present example database conguration instructions. Be aware that differences
may exist in newer versions of each database product. Consequently, this information may not
be directly applicable.
5.1. Setting Up an Oracle Service
A database service can serve highly-available data to a database application. The application can then
provide network access to database client systems, such as Web servers. If the service fails over, the
application accesses the shared database data through the new cluster system. A network-accessible
database service is usually assigned an IP address, which is failed over along with the service to
maintain transparent access for clients.
This section provides an example of setting up a cluster service for an Oracle database. Although the
variables used in the service scripts depend on the specic Oracle conguration, the example may aid
in setting up a service for individual environments. Refer to Section 5.2 Tuning Oracle Service for
information about improving service performance.
In the example that follows:
? The service includes one IP address for the Oracle clients to use.
? The service has two mounted le systems, one for the Oracle software (/u01/) and the other for
the Oracle database (/u02/), which are set up before the service is added.
? An Oracle administration account with the name oracle is created on the cluster systems that run
the service before the service are actually added.
? The administration directory is on a shared disk that is used in conjunction with the Oracle service
(for example, /u01/app/oracle/admin/db1).
Create a consistent user/group conguration that can properly access Oracle service for each cluster
system. For example:
mkdir /users
groupadd -g 900 dba
groupadd -g 901 oinstall
useradd -u 901 -g 901 -d /users/oracle -m oracle
usermod -G 900 oracle
The Oracle service example uses three scripts that must be placed in /users/oracle and owned
by the Oracle administration account. The oracle script is used to start and stop the Oracle service.
Specify this script when you add the service. This script calls the other Oracle example scripts.
The startdb and stopdb scripts start and stop the database. Note that there are many ways for an
application to interact with an Oracle database.
68 Chapter 5. Database Services
The following is an example of the oracle script, which is used to start, stop, and check the status of
the Oracle service.
#!/bin/sh
#
# Cluster service script to start, stop, and check status of oracle
#
cd /users/oracle
case $1 in
start)
su - oracle -c ./startdb
;;
stop)
su - oracle -c ./stopdb
;;
status)
status oracle
;;
esac
The following is an example of the startdb script, which is used to start the Oracle Database Server
instance:
#!/bin/sh
#
#
# Script to start the Oracle Database Server instance.
#
########################################################################
#
# ORACLE_RELEASE
#
# Specifies the Oracle product release.
#
########################################################################
ORACLE_RELEASE=9.2.0
########################################################################
#
# ORACLE_SID
#
# Specifies the Oracle system identifier or "sid", which is the name of
# the Oracle Server instance.
#
########################################################################
export ORACLE_SID=TEST
########################################################################
#
# ORACLE_BASE
#
# Specifies the directory at the top of the Oracle software product and
# administrative file structure.
#
########################################################################
export ORACLE_BASE=/u01/app/oracle
Chapter 5. Database Services 69
########################################################################
#
# ORACLE_HOME
#
# Specifies the directory containing the software for a given release.
# The Oracle recommended value is $ORACLE_BASE/product/
release
#
########################################################################
export ORACLE_HOME=/u01/app/oracle/product/${ORACLE_RELEASE}
########################################################################
#
# LD_LIBRARY_PATH
#
# Required when using Oracle products that use shared libraries.
#
########################################################################
export LD_LIBRARY_PATH=${ORACLE_HOME}/lib:$LD_LIBRARY_PATH
########################################################################
#
# PATH
#
# Verify that the users search path includes $ORACLE_HOME/bin
#
########################################################################
export PATH=$PATH:${ORACLE_HOME}/bin
########################################################################
#
# This does the actual work.
#
# Start the Oracle Server instance based on the initSID.ora
# initialization parameters file specified.
#
########################################################################
/u01/app/oracle/product/9.2.0/bin/sqlplus << EOF
sys as sysdba
spool /home/oracle/startdb.log
startup pfile = /u01/app/oracle/product/9.2.0/admin/test/scripts/init.ora open;
spool off
quit;
EOF
exit
The following is an example of the stopdb script, which is used to stop the Oracle Database Server
instance:
#!/bin/sh
#
#
# Script to STOP the Oracle Database Server instance.
#
######################################################################
#
# ORACLE_RELEASE
70 Chapter 5. Database Services
#
# Specifies the Oracle product release.
#
######################################################################
ORACLE_RELEASE=9.2.0
######################################################################
#
# ORACLE_SID
#
# Specifies the Oracle system identifier or "sid", which is the name
# of the Oracle Server instance.
#
######################################################################
export ORACLE_SID=TEST
######################################################################
#
# ORACLE_BASE
#
# Specifies the directory at the top of the Oracle software product
# and administrative file structure.
#
######################################################################
export ORACLE_BASE=/u01/app/oracle
######################################################################
#
# ORACLE_HOME
#
# Specifies the directory containing the software for a given release.
# The Oracle recommended value is $ORACLE_BASE/product/
release
#
######################################################################
export ORACLE_HOME=/u01/app/oracle/product/${ORACLE_RELEASE}
######################################################################
#
# LD_LIBRARY_PATH
#
# Required when using Oracle products that use shared libraries.
#
######################################################################
export LD_LIBRARY_PATH=${ORACLE_HOME}/lib:$LD_LIBRARY_PATH
######################################################################
#
# PATH
#
# Verify that the users search path includes $ORACLE_HOME/bin
#
######################################################################
export PATH=$PATH:${ORACLE_HOME}/bin
######################################################################
#
Chapter 5. Database Services 71
# This does the actual work.
#
# STOP the Oracle Server instance in a tidy fashion.
#
######################################################################
/u01/app/oracle/product/9.2.0/bin/sqlplus << EOF
sys as sysdba
spool /home/oracle/stopdb.log
shutdown abort;
spool off
quit;
EOF
exit
5.1.1. Oracle and the Cluster Conguration Tool
To add an Oracle service using the Cluster Conguration Tool, perform the following:
1. Start the Cluster Conguration Tool by choosing Main Menu => System Settings => Server
Settings => Cluster or by typing redhat-config-cluster at a shell prompt. The Cluster
Status Tool appears by default.
2. Start the Cluster Conguration Tool by selecting Cluster => Congure from the Cluster
Status Tool menus.
3. Click the Services tab.
4. Add the Oracle service.
? Click New. The Service dialog appears.
Figure 5-1. Adding an Oracle Service
? Enter a Service Name for the Oracle service.
? Select a Failover Domain or leave it as None.
? Type a quantity (seconds) to check the health of the Oracle service through the status
function of the init script.
? Enter a User Script, such as /home/oracle/oracle.
? Click OK
5. Add an IP address for the Oracle service.
? Select the Oracle service and click Add Child.
72 Chapter 5. Database Services
? Select Add Service IP Address and click OK. The Service IP Address dialog appears.
? Enter an IP Address.
? Enter a Netmask, or leave it None.
? Enter a Broadcast Address, or leave it None.
? Click OK.
6. Add a device for the Oracle service and administrative les.
? Select the Oracle service and click Add Child.
? Select Add Device and click OK. The Device dialog appears.
? Enter the Device Special File (for example, /dev/sdb5).
? In the Mount Point eld, enter /u01.
? Select the le system type in FS Type or leave it blank.
? Enter any mount point Options, including rw (read-write).
? Check or uncheck Force Unmount.
? Click OK.
7. Add a device for the Oracle database les.
? Select the Oracle service and click Add Child.
? Select Add Device and click OK. The Device dialog appears.
? Enter the Device Special File (for example, /dev/sdb6).
? In the Mount Point eld, enter /u02.
? Select the le system type in FS Type or leave it blank.
? Enter any mount point Options, including rw (read-write).
? Check or uncheck Force Unmount.
? Click OK.
8. Choose File => Save to save the Oracle service.
5.2. Tuning Oracle Service
The Oracle database recovery time after a failover is directly proportional to the number of outstanding
transactions and the size of the database. The following parameters control database recovery time:
? LOG_CHECKPOINT_TIMEOUT
? LOG_CHECKPOINT_INTERVAL
? FAST_START_IO_TARGET
? REDO_LOG_FILE_SIZES
To minimize recovery time, set the previous parameters to relatively low values. Note that excessively
low values adversely impact performance. Try different values to nd the optimal value.
Chapter 5. Database Services 73
Oracle provides additional tuning parameters that control the number of database transaction retries
and the retry delay time. Be sure that these values are large enough to accommodate the failover
time in the cluster environment. This ensures that failover is transparent to database client application
programs and does not require programs to reconnect.
5.3. Setting Up a MySQL Service
A database service can serve highly-available data to a MySQL database application. The application
can then provide network access to database client systems, such as Web servers. If the service fails
over, the application accesses the shared database data through the new cluster system. A networkaccessible
database service is usually assigned one IP address, which is failed over along with the
service to maintain transparent access for clients.
An example of a conguring a MySQL database service is as follows:
? The MySQL server packages are installed on each cluster system that will run the service. The
MySQL database directory resides on a le system that is located on a disk partition on shared
storage. This allows the database data to be accessed by all cluster members. In the example, the
le system is mounted as /var/lib/mysql, using the shared disk partition /dev/sda1.
? An IP address is associated with the MySQL service to accommodate network access by clients of
the database service. This IP address is automatically migrated among the cluster members as the
service fails over. In the example below, the IP address is 10.1.16.12.
? The script that is used to start and stop the MySQL database is the standard init script mysqld.
If general logging of connections and queries is needed, edit the mysqld script to add the option
--log=/var/log/mysqld.log as the last option to the safe_mysqld command. The resulting
line should appear similar to the following (Note: the forward slash (\) denotes the continuation of
one line):
/usr/bin/safe_mysqld --defaults-file=/etc/my.cnf --log=/var/log/mysqld.log \
>/dev/null 2>&1 &
If the --log option is added to the mysqld script, the new mysqld script should be copied to the
other cluster members that can run the MySQL service, so that they can log connections and queries
if the MySQL service fails over to those members.
? by default, a client connection to a MySQL server times out after eight hours of inactivity. This
connection limit can be modied by setting the wait_timeout variable in the /etc/my.cnf le.
For example, to set timeouts to four hours, add the following line to the [mysqld] section of
/etc/my.cnf:
set-variable = wait_timeout=14400
Restart the MySQL service. Note that after this change is made, the new /etc/my.cnf le should
be copied to all the other cluster members that can run the MySQL service.
To check if a MySQL server has timed out, invoke the mysqladmin command and examine the
uptime. If it has timed out, invoke the query again to automatically reconnect to the server.
Depending on the Linux distribution, one of the following messages may indicate a MySQL server
timeout:
CR_SERVER_GONE_ERROR
CR_SERVER_LOST
5.3.1. MySQL and the Cluster Conguration Tool
To add a MySQL service using the Cluster Conguration Tool, perform the following:
74 Chapter 5. Database Services
1. Start the Cluster Conguration Tool by choosing Main Menu => System Settings => Server
Settings => Cluster or by typing redhat-config-cluster at a shell prompt. The Cluster
Status Tool appears by default.
2. Start the Cluster Conguration Tool by selecting Cluster => Congure from the Cluster
Status Tool menus.
3. Click the Services tab.
4. Add the MySQL service.
? Click New. The Service dialog appears.
Figure 5-2. Adding a Service
? Enter a Service Name for the MySQL service.
? Select a Failover Domain or leave it as None.
? Type a quantity (seconds) in the Check Interval box if you want to check the health of the
MySQL service through the status directive of the mysqld init script.
? Enter a User Script, such as /etc/init.d/mysqld.
? Click OK.
5. Add an IP address for the MySQL service.
? Select the MySQL service and click Add Child.
? Select Add Service IP Address and click OK. The Service IP Address dialog appears.
? Enter an IP Address.
? Enter a Netmask, or leave it None.
? Enter a Broadcast Address, or leave it None.
? Click OK.
6. Add a device for the MySQL service.
? Select the MySQL service and click Add Child.
? Select Add Device and click OK. The Device dialog appears.
? Enter the Device Special File (for example, /dev/sdc3).
? In the Mount Point eld, enter /var/lib/mysql.
? Select the le system type in FS Type or leave it blank.
? Enter any mount point Options, including rw (read-write).
? Check or uncheck Force Unmount.
Chapter 5. Database Services 75
? Click OK.
7. Choose File => Save to save the MySQL service.
76 Chapter 5. Database Services
Chapter 6.
Network File Sharing Services
This chapter contains instructions for conguring Red Hat Enterprise Linux to make network le
sharing services through NFS and Samba highly available.
6.1. Setting Up an NFS Service
A highly-available network le system (NFS) is one of the key strengths of the clustering infrastructure.
Advantages of clustered NFS services include:
? Ensures that NFS clients maintain uninterrupted access to key data in the event of server failure.
? Facilitates planned maintenance by allowing transparent relocation of NFS services to one cluster
member, allowing you to x or upgrade the other cluster member.
? Allows setup of an active-active conguration to maximize equipment utilization. Refer to Section
6.5 NFS Conguration: Active-Active Example for more information.
6.1.1. NFS Server Requirements
To create highly available NFS services, there are a few requirements which must be met by each
cluster member. (Note: these requirements do not pertain to NFS client systems.) These requirements
are as follows:
? The NFS daemon must be running on all cluster servers. Check the status of the servers by running
the following:
/sbin/service nfs status
NFS services will not start unless the following NFS daemons are running: nfsd, rpc.mountd,
and rpc.statd. If the service is not running, start it with the following commands:
/sbin/service portmap start
/sbin/service nfs start
To make NFS start upon reboot and when changing runlevels, run the following command:
/sbin/chkconfig --level 345 nfs on
? The RPC portmap daemon must also be enabled with the following command:
/sbin/chkconfig --level 345 portmap on
? File system mounts and their associated exports for clustered NFS services should not be included
in /etc/fstab or /etc/exports. Rather, for clustered NFS services, the parameters describing
mounts and exports are entered via the Cluster Conguration Tool. For your convenience, the
tool provides a Bulk Load NFS feature to import entries from an existing le into the cluster
conguration le.
? NFS cannot be congured to run over TCP with Red Hat Cluster Manager. For proper failover
capabilities, NFS must run over the default UDP.
For detailed information about setting up an NFS server, refer to the Red Hat Enterprise Linux System
Administration Guide.
78 Chapter 6. Network File Sharing Services
6.2. Using the NFS Druid
This section describes how to use the NFS Druid to quickly congure an NFS share for client access.
1. Start the Cluster Status Tool. Verify that the cluster daemons are running; if not, choose Cluster
=> Start Cluster Service to start the cluster daemons.
2. In the Cluster Status Tool, choose Cluster => Congure to display the Cluster Conguration
Tool.
3. Start the NFS Druid by choosing Add Exports => NFS... and click Forward to continue.
Figure 6-1. NFS Druid
4. Enter the Export Directory ? Specied as a child of a device, the export directory can be
the same as the mount point. In this case, the entire le system is accessible through NFS.
Alternatively, you can specify a portion (subdirectory) of a mounted le system to be mounted
(instead of the entire le system). By exporting subdirectories of a mountpoint, different access
rights can be allocated to different sets of NFS clients.
Enter the Client Name?Specied as a child of an export directory, the NFS client specication
identies which systems will be allowed to access the le system as NFS clients. You can specify
individual systems (for example, fred) or groups of systems by using wildcards (for example,
*.example.com). Entering an asterisk (*) in the Client Name eld allows any client to
mount the le system.
Enter any Client Options in the provided elds ? Specied as part of the NFS Export Client
information, this eld denes the access rights afforded to the corresponding client(s). Examples
include ro (read only), and rw (read write). Unless explicitly specied otherwise, the default
export options are ro,async,wdelay, root_squash. Refer to the exports(5) manpage
for more options.
Chapter 6. Network File Sharing Services 79
Figure 6-2. Export and Client Options
5. If an existing service contains the device and mountpoint conguration for the directory you
want to NFS export, then select that existing service. Otherwise, enter a new Service Name and
Service IP Address for the NFS export directory.
Service Name ? A name used to uniquely identify this service within the cluster (such as
nfs_cluster or marketing.)
Service IP Address?NFS clients access le systems from an NFS server which is designated
by its IP address (or associated hostname). To keep NFS clients from knowing which specic
cluster member is the acting NFS server, the client systems should not use the cluster member's
hostname as the IP address from which a service is started. Rather, clustered NFS services are
assigned oating IP addresses which are distinct from the cluster server's IP addresses. This
oating IP address is then congured on whichever cluster member is actively serving the NFS
export. Following this approach, the NFS clients are only aware of the oating IP address and
are unaware of the fact that clustered NFS server has been deployed.
Figure 6-3. Select Service for Export
6. For non-clustered le systems, the mount information is typically placed in /etc/fstab. However,
clustered le systems must not be placed in /etc/fstab. This is necessary to ensure that
80 Chapter 6. Network File Sharing Services
only one cluster member at a time has the le system mounted. Failure to do so will likely result
in le system corruption and system crashes.
If you selected an existing service, then devices for that service will be listed under Existing
Device and Mountpoint. If the device and mount point for your NFS export is listed, then
select it.
Otherwise, select New Device and use the elds to edit the following settings.
Device Special File ? Designates the disk or partition on shared storage.
Device Mountpoint ? Species the directory on which the le system will be mounted. An
NFS service can include more than one le system mount. In this manner, the le systems will
be grouped together as a single failover unit.
Figure 6-4. Select Device for Export
7. At the end of the NFS Druid, click Apply to create the service. Save the conguration by
choosing File => Save from the Cluster Conguration Tool.
To modify your NFS service conguration, click the Services tab in the Cluster Conguration Tool
and click the triangular icon next to the NFS service to display the full child tree for the service.
Double-click each child to modify options.
1. Highlight the
service
and click Properties to congure the the following options:
Chapter 6. Network File Sharing Services 81
Figure 6-5. Services in the Cluster Conguration Tool
? Service Name ? A name used to uniquely identify this service within the cluster (such as
nfs_cluster or marketing.
? Failover Domain?An optional property that species a subset (or ordered subset) of cluster
members which are eligible to run the service in the event of a failover. You must create the
failover domain before you can reference it in an NFS service conguration; see Section 3.9
Conguring a Failover Domain for more information.
? Check Interval ? An optional property that species whether or not to check the status of
the NFS daemons at a regular interval (in seconds). The default value is 0 seconds, meaning
the daemon status is not checked.
If the service returns an error or does not respond to the status check, the cluster attempts to
cleanly shut down the service and start it on another member. If at any point it fails to cleanly
shut down the NFS service, the cluster will place the service in a Failed state, requiring the
administrator to disable the service rst before attempting to restart it.
? For the User Script, leave the eld as None, as the cluster infrastructure handles NFS service
control and status checking.
2. Choose the
service ip address
child to change the Service IP Address and to enter a Netmask
and Broadcast address, which are both set as None by default. If these elds are left as
None, then the cluster infrastructure will use the netmask and broadcast IP address congured
on the network device of the member running the service.
3. Choose the
device
child to modify the Device Special File, Device Mount Point, FS Type,
and Mount Options. You can also check or uncheck the Force Unmount. When Forced Unmount
is enabled, any applications that have the specied le system mounted will be killed
prior to disabling or relocating the NFS service (assuming the application is running on the same
member that is running the NFS service)
4. Choose the
nfsexport
child to specify a directory name for clients to mount the exported
share.
82 Chapter 6. Network File Sharing Services
5. Choose the
client
child to enter Client Name, any hosts, groups, and domains that are
allowed to mount the exported shares (default is * which allows any client to mount the share)
and Options for allowed client mount options (such as rw for read-write or ro for read-only).
6.2.1. NFS Client Access
The NFS usage model for clients is completely unchanged from its normal approach. For example,
to mount the NFS share from clu1.example.com to the client's /mnt/users/ directory, run the
following command:
/bin/mount -t nfs clu1.example.com:/share /mnt/users
To simplify the mounting of the NFS share for clients, place the following in the client's /etc/fstab
le:
clu1.example.com:/share /mnt/users nfs rw,rsize=8192,wsize=8192 0 0
For additional NFS mount options, refer to the Red Hat Enterprise Linux System Administration
Guide.
6.3. NFS Caveats
The following points should be taken into consideration when clustered NFS services are congured.
Avoid using exportfs -r
File systems being NFS exported by cluster members do not get specied in the conventional
/etc/exports le. Rather, the NFS exports associated with cluster services are specied in the
cluster conguration le (as established by the Cluster Conguration Tool).
The command exportfs -r removes any exports which are not explicitly specied in the
/etc/exports le. Running this command causes the clustered NFS services to become unavailable
until the service is restarted. For this reason, it is recommended to avoid using the
exportfs -r command on a cluster on which highly available NFS services are congured. To
recover from unintended usage of exportfs -r, the NFS cluster service must be stopped and
then restarted.
NFS File Locking
NFS le locks are not preserved across a failover or service relocation. This is due to the fact
that the Linux NFS implementation stores le locking information in system les. These system
les representing NFS locking state are not replicated across the cluster. The implication is that
locks may be regranted subsequent to the failover operation.
6.4. Importing the Contents of an NFS Exports File
The Bulk Load NFS feature allows you to import all the entries from an /etc/exports le without
having to create the entries individually as children of a device. This can be convenient for administrators
who are transitioning from traditional NFS server systems to a high-availability cluster. To use
the Bulk Load NFS feature, follow these steps:
1. Stop all non-clustered NFS exports (for example, /sbin/service nfs stop).
Chapter 6. Network File Sharing Services 83
2. Unmount the le systems on which the exports reside (for example, /bin/umount
/mnt/nfs/, where /mnt/nfs is the directory on which the NFS exported partitions are
mounted).
3. Start the Cluster Conguration Tool.
4. On the Services tab, select the device(s) to be loaded by the /etc/exports le.
5. Choose File => Bulk Load NFS.
6. Select the exports le that you want to load into the selected device (by default, the Bulk Load
NFS dialog attempts to load /etc/exports), and click OK.
All entries in the le specied in the Bulk Load NFS dialog box are subjected to the same
validation as when an NFS export directory is manually specied. Any entries in the imported
le that fail validation are displayed in a message box. Only entries that pass validation are
loaded into the selected device.
7. If the bulk load successfully completed, you must then remove all exports from the
/etc/exports le so that it does not affect the cluster NFS services.
8. Choose File => Save to save the change to the /etc/cluster.xml conguration le.
6.5. NFS Conguration: Active-Active Example
Section 6.2 Using the NFS Druid described how to congure a simple NFS service using the NFS
Druid. This section shows how to congure a second NFS service on another running cluster member.
The second service has its own separate IP address and failover domain. This cluster conguration,
called an active-active conguration, allows multiple cluster members to simultaneously export le
systems. This most effectively utilizes the capacity of cluster. In the event of a failure (or planned
maintenance) on any cluster member running NFS services, those services will failover to the active
cluster member.
For this example, individual subdirectories of the mounted le system will be made accessible on a
read-write (rw) basis by three members of a department. The names of the systems used by these
three team members are ferris, denham, and brown. To make this example more illustrative, notice
that each team member will only be able to NFS mount their specic subdirectory, which has already
been created for them and over which they have user and group permissions.
Use the Cluster Conguration Tool as follows to congure this example NFS service:
1. Verify that the cluster daemons are running in the Cluster Status Tool; if not, choose Cluster
=> Start Local Cluster Daemons.
2. Choose Cluster => Congure to display the Cluster Conguration Tool.
3. Choose the Services tab and, if services have already been dened, select one and click New.
(If no services are dened, just click New.)
a. Specify nfs_engineering in the Service Name eld. This name was chosen as a
reminder of the service's intended function to provide exports to the members of the
engineering department.
b. In this example, assume a failover domain named clu3_domain was previously created
using the Cluster Conguration Tool, consisting only of member clu3 with both
Restricted failover to only these members and Ordered Failover unchecked. In
this way, clu3 is designated as the preferred member for this service. (Note that the
nfs_accounting service is assigned to failover domain clu4_domain.) Choose
clu3_domain from the Failover Domain list. (For more information on failover
domains, refer to Section 3.9 Conguring a Failover Domain.)
84 Chapter 6. Network File Sharing Services
c. Specify a value of 30 in the Check Interval eld to specify that the status of the NFS
daemons should be checked every 30 seconds.
d. The cluster infrastructure includes support for NFS services. Consequently, there is no
need to create or specify a value for User Script when conguring an NFS service. Accept
the default value of None.
e. Click OK to complete this portion of the service conguration.
4. In the Cluster Conguration Tool, select the service you just created, and click Add Child.
On the Add Device or IP Address dialog box, choose Add Service IP Address and click OK.
a. In the Service IP Address eld, enter 10.0.0.11. This example assumes a hostname
of clunfseng is associated with this IP address, by which NFS clients mount the le
system. Note that this IP address must be distinct from that of any cluster member.
b. The default netmask address will be used, so accept the default of None.
c. The default broadcast address will be used, so accept the default of None.
d. Click OK to complete the service IP address conguration.
5. In the Cluster Conguration Tool, select the nfs_engineering service and click Add Child.
On the Add Device or IP Address dialog box, choose Add Device and click OK.
a. In the Device Special File eld, enter /dev/sdb11 which refers to the partition on the
shared storage RAID box on which the le system will be physically stored.
Leave the Samba Share Name eld blank.
b. In the Mount Point eld, enter /mnt/users/engineering.
c. From the FS Type menu, choose ext3.
d. In the Options eld, enter rw,nosuid,sync.
e. Leave the Force Unmount checkbox checked.
f. Click OK to complete this portion of the device conguration.
6. In the Cluster Conguration Tool, select the device you just created, and click Add Child.
Enter /mnt/users/engineering/ferrisin the NFS Export Directory Name eld and
click OK.
Repeat this step twice, adding NFS export directories named
/mnt/users/engineering/denham and /mnt/users/engineering/brown.
7. In the Cluster Conguration Tool, select the NFS export for ferris and click Add Child.
The NFS Export Client dialog box is displayed.
In the Client Name eld, type ferris.
In the Options eld, type rw.
Click OK.
8. Repeat step 7 twice, specifying clients named denham and brown, respectively, each with the
same permissions options (rw).
9. Save the service by selecting File => Save in the Cluster Conguration Tool.
10. Start the service from the Cluster Status Tool by highlighting the service and clicking Start.
Chapter 6. Network File Sharing Services 85
6.6. Setting Up a Samba Service
Highly available network le services are one of the key strengths of the clustering infrastructure.
Advantages of high availibility Samba services include:
? Heterogeneous le serving capabilities to Microsoft® Windows? clients using the CIFS/SMB
protocol.
? Allows the same set of le systems to be simultaneously network served to both NFS and Windows
based clients.
? Ensures that Windows-based clients maintain access to key data, and can quickly reestablish connection
in the event of server failure.
? Facilitates planned maintenance by allowing the transparent relocation of Samba services to one
cluster member, enabling you to x or upgrade the other cluster member.
? Allows the setup of an active-active conguration to maximize equipment utilization.
Note
A complete explanation of Samba conguration is beyond the scope of this document. Rather, this
documentation highlights aspects which are crucial for clustered operation. Refer to Red Hat Enterprise
Linux System Administration Guide for more details on Samba conguration.
6.6.1. Samba Server Requirements
If you intend to create highly available Samba services, each cluster member on which the services
will run must meet the following requirements:
? The Samba RPM packages must be installed. Note that there have been no modications to the
Samba RPMs to support high availability.
? The Samba daemons will be started and stopped by the cluster infrastructure on a per-service basis.
Consequently, the Samba conguration information should not be specied in the conventional
/etc/samba/smb.conf le. The Cluster Conguration Tool writes a smb.conf.sharename
le to the /etc/samba/ directory for each Samba share (where sharename is the name you
specied for the Samba share).
? The automated system startup of the Samba daemons smbd and nmbd should be disabled in init.d
runlevels. For example: chkconfig --del smb.
? Since the cluster infrastructure stops the cluster-related Samba daemons appropriately, do not manually
run the conventional Samba stop script (service smb stop) as this will terminate all clusterrelated
samba daemons.
? File system mounts for clustered Samba services should not be included in /etc/fstab. Rather,
for clustered services, the parameters describing mounts are entered via the Cluster Conguration
Tool.
? Failover of Samba printer shares is not currently supported.
86 Chapter 6. Network File Sharing Services
6.6.2. Samba Operating Model
This section provides background information describing the implementation model in support of
Samba high availability services. Knowledge of this information will provide the context for understanding
the conguration requirements of clustered Samba services.
The conventional, non-clustered Samba conguration model consists of editing the
/etc/samba/smb.conf le to designate which le systems are to be made network accessible to
the specied clients. It also designates access permissions and other mapping capabilities. In the
single system model, a single instance of each of the smbd and nmbd daemons are automatically
started up by the /etc/rc.d/init.d/smb runlevel script.
To implement high availibility Samba services, rather than having a single /etc/samba/smb.conf
le, each service has its own Samba conguration le. These les are named
/etc/samba/smb.conf.sharename, where sharename is the specic name of the individual
conguration le associated with a Samba service. For example, if you created a share called mktg,
the corresponding Samba conguration le would be /etc/samba/smb.conf.mktg.
Note
A Samba share must be in a service with at least one IP address.
The format of the smb.conf.sharename le is identical to the conventional smb.conf format.
No additional elds have been created for clustered operation. There are several elds within the
smb.conf.sharename le which are required for correct cluster operation; these elds will be described
in Section 6.8 Fields in the smb.conf.sharename File. When a new Samba service is created
using the Cluster Conguration Tool, the corresponding smb.conf.sharename le is created
based on the service-specic parameters, including appropriate client systems, specic directories to
share, and read-write permissions.
Copy the /etc/samba/smb.conf.sharename les onto all members in the cluster (or all
members in an unrestricted failover domain, if used); refer to Section 3.9 Conguring a Failover
Domain for more information. After the initial conguration, should any changes be made to any
smb.conf.sharename le, you must also copy the updated version to the other members.
To facilitate high-availability Samba functionality, each individual Samba service congured within
the cluster (through the Cluster Conguration Tool) will have its own individual pair of smbd and
nmbd daemons. Consequently, if there are more than one Samba services congured with the cluster,
you may see multiple instances of these daemon pairs running on an individual cluster server.
These Samba daemons smbd and nmbd are not initiated via the conventional init.d run level scripts;
rather they are initiated by the cluster infrastructure based on whichever member is the active service
provider.
To allow a single system to run multiple instances of the Samba daemons, every pair of daemons
is required to have both its own locking directory and its own process ID (pid) directory. Consequently,
there will be a separate per-service Samba daemon locking and running process directory.
These directories are given the name /var/cache/samba/sharename/ for lock les and
/var/run/samba/sharename/ for pid les (where sharename is replaced by the Samba share
name specied within the service conguration information set using the Cluster Conguration
Tool). Continuing the prior example, the corresponding directories for our mktg share would be
/var/cache/samba/mktg/ and /var/run/samba/mktg/.
Chapter 6. Network File Sharing Services 87
6.7. Using the Samba Druid
This section describes how to use the Samba Druid to quickly congure an Samba share for client
access.
1. Start the Cluster Status Tool. Verify that the cluster daemons are running; if not, choose Cluster
=> Start Cluster Service to start the cluster daemons.
2. In the Cluster Status Tool, choose Cluster => Congure to display the Cluster Conguration
Tool.
3. Start the Samba Druid by choosing Add Exports => Samba... and click Forward to continue.
Figure 6-6. Samba Druid
4. Choose to create a new service with a new oating IP address.
Enter a Service Name ? A name used to uniquely identify this service within the cluster.
Enter a Service IP Address ? Clients access le shares from a server as designated by its
IP address (or associated hostname). To abstract clients from knowing which specic cluster
member is the acting Samba server, the client systems should not use the cluster member's
hostname as the IP address by which a service is accessed. Rather, clustered Samba services
are assigned oating IP addresses which are distinct from the cluster server's IP addresses. This
oating IP address is then congured on which ever cluster member is actively serving the share.
Following this approach, the clients are only aware of the oating IP address and are unaware
of the fact that clustered Samba services have been deployed.
88 Chapter 6. Network File Sharing Services
Figure 6-7. Select Service for Export
5. Enter the device special lename and mount point for the service.
Mount information ? For non-clustered le systems, the mount information is typically
placed in /etc/fstab. In contrast, clustered le systems must not be placed in /etc/fstab.
This is necessary to ensure that only one cluster member at a time has the le system mounted.
Failure to do so will result in le system corruption and potential system crashes.
? Device Special File ? The mount information designates the disk's device special le and
the directory on which the le system will be mounted.
? Device Mount point ? A Samba service can include more than one le system mount. In
this manner, the le systems will be grouped together as a single failover unit.
Figure 6-8. Select Device for Export
6. Enter a Share Name ? Species the name by which clients refer to the mount point. Based
on the name you specify, a corresponding /etc/samba/smb.conf.sharename le and lock
directory /var/cache/samba/sharename will be created. By convention the actual Windows
share name specied within the smb.conf.sharename will be set in accordance with
this parameter. In practice, you can designate more than one Samba share within an individual
smb.conf.sharename le. There can be at most one samba conguration specied per
Chapter 6. Network File Sharing Services 89
service, which must be specied with the rst device. For example, if you have multiple disk
devices (and corresponding le system mounts) within a single service, then specify a single
sharename for the service. Then within the /etc/samba/smb.conf.sharename le,
designate multiple individual Samba shares to share directories from the multiple devices. To
disable Samba sharing of a service, the share name should be set to None.
Figure 6-9. Samba Share Name
7. Click Apply to save the congurationg le (/etc/samba/smb.conf.sharename) to the cluster
member.
Figure 6-10. Samba Share Completion
8. Save the conguration for the Samba service by choosing File => Save from the Cluster Con-
guration Tool.
9. Copy /etc/samba/smb.conf.sharename over to the other cluster members.
To modify your Samba service conguration, click the Services tab in the Cluster Conguration
Tool and click the triangular icon next to the Samba service to display the full child tree for the
service. Double-click each child to modify options.
? Highlight the
service
and click Properties to congure the following options:
90 Chapter 6. Network File Sharing Services
Service Name ? A name used to uniquely identify this service within the cluster.
Failover Domain ? Denes which systems are eligible to be the Samba server for this service
when more than one cluster member is operational.
Check Interval ? Species how often (in seconds) the cluster subsystem should verify that the
Samba daemons (smbd and nmbd) associated with this service are running. In the event that either
of these daemons have unexpectedly exited, they will be automatically restarted to resume services.
If a value of 0 is specied, then no monitoring will be performed. For example, designating an
interval of 90 seconds will result in monitoring at that interval.
For the User Script, leave the eld as None, as the cluster infrastructure handles NFS service
control and status checking.
? Choose the
!
service ip address
"
child to change the Service IP Address and to enter a Netmask
and Broadcast address, which are both set as None by default. If these elds are left as None, then
the cluster infrastructure will use the netmask and broadcast IP address congured on the network
device of the member running the service.
? Choose the
!
device
"
child to modify the Device Special File and Samba Share Name, Mount
Point, FS Type, and Mount Options.
You can also check or uncheck the Force Unmount button. As part of the mount information, you
can specify whether forced unmount should be enabled or not. When forced unmount is enabled,
if any applications running on the cluster server have the designated le system mounted when the
service is being disabled or relocated, then that application will be killed to allow the unmount to
proceed.
6.7.1. Samba Considerations
When running the Cluster Conguration Tool to congure Samba services:
? Correctly enter the service parameters, as the validation logic associated with Samba parameters is
not robust.
? After conguring a Samba service via the Cluster Conguration Tool, remember to tune the
/etc/samba/smb.conf.sharename le for each service in accordance with the clients and authorization
scheme you desire.
? Remember to copy the smb.conf.sharename le over to other cluster members.
? Note that the cluster infrastructure creates Samba lock directories when it starts the service.
? If you delete a Samba service, the Cluster Conguration Tool automatically deletes this le to
preserve your site-specic conguration parameters for possible later use.
6.8. Fields in the smb.conf.sharename File
This section describes the elds within the smb.conf.sharename le which are most relevant to
the correct operation of highly available Samba services. It is beyond the scope of this document to
completely describe all of the elds within a Samba conguration le. There have been no additional
eld names added in support of clustering, and the le format follows the normal Samba conventions.
The following is an example smb.conf.sharename le which is automatically generated by the
Cluster Conguration Tool. The example shows a share named mktg. The name of the le is
/etc/samba/smb.conf.mktg.
# Template samba service configuration file - please modify
# to specify subdirectories and client access permissions.
# Remember to copy this file over to *ALL* other cluster
Chapter 6. Network File Sharing Services 91
# members.
#
# From a cluster perspective, the key fields are:
# lock directory - must be unique per samba service.
# bind interfaces only - must be present set to yes.
# interfaces - must be set to service floating IP address.
# path - must be the service mountpoint or subdirectory thereof.
# Refer to the cluster documentation for details.
[global]
workgroup = RHCLUSTER
pid directory = /var/run/samba/mktg
lock directory = /var/cache/samba/mktg
log file = /var/log/samba/%m.log
encrypt passwords = yes
bind interfaces only = yes
interfaces = 192.168.26.11
[mktg]
comment = High Availability Samba Service
browsable = yes
writable = no
public = yes
path = /share
The following are descriptions of the most relevant elds, from a clustering perspective,
in the /etc/samba/smb.conf.sharename le. In this example, the le is named
/etc/samba/smb.conf.mktg in accordance with the share name that was specied (mktg) while
running the Cluster Conguration Tool. Only the cluster-specic elds are described below. The
remaining elds follow standard Samba syntax.
Global Parameters
These parameters pertain to all shares which are specied in the smb.conf.sharename le. It is
possible to designate more than one share within this le, provided that the directories described
within it are within the service's le system mounts.
lock directory
Dictates the name of the directory in which the Samba daemons (smbd and nmbd) will
place their locking les. This must be set to /var/cache/samba/sharename, where
sharename varies based on the parameter specied in the Cluster Conguration Tool.
Specication of a lock directory is required to allow a separate per-service instance of smbd
and nmbd.
pid directory
Dictates the name of the directory in which the Samba daemons (smbd and nmbd) will place
their processor ID (pid) les. This must be set to /var/run/samba/sharename/, where
sharename varies based on the parameter specied in the Cluster Conguration Tool
tool. Specication of a pid directory is required to allow a separate per-service instance of
smbd and nmbd.
bind interfaces only
This parameter must be set to yes to allow each smbd and nmbd pair to bind to the oating
IP address associated with this clustered Samba service.
92 Chapter 6. Network File Sharing Services
interfaces
Species the IP address associated with the Samba service. If a netmask is specied
within the service, this eld appears like the following example: interfaces =
10.0.0.10/255.255.254.0
Share-specic parameters
These parameters pertain to a specic Samba share.
writable
By default, the share access permissions are conservatively set as non-writable. Tune this
parameter according to your site-specic preferences.
path
Defaults to the rst le system mount point specied within the service conguration. This
should be adjusted to match the specic directory or subdirectory intended to be available
as a share to clients.
Chapter 7.
Setting Up Apache HTTP Server
This chapter contains instructions for conguring Red Hat Enterprise Linux to make the Apache
HTTP Server highly available.
It provides an example of setting up a cluster service that fails over an Apache HTTP Server. Although
the actual variables used in the service depend on the specic conguration, the example may assist
in setting up a service for a particular environment.
7.1. Apache HTTP Server Setup Overview
First, congure Apache HTTP Server on all members in the cluster. Consider assigning the service to
a failover domain to reduce the number of systems that must be congured to run this service. Refer
to Section 3.9 Conguring a Failover Domain for instructions. The cluster software ensures that only
one cluster system runs the Apache HTTP Server at one time. The conguration consists of installing
the httpd RPM package on all cluster members (or on members in the failover domain, if used) and
conguring a shared le system to house the website's content.
When installing the Apache HTTP Server on the cluster systems, do not congure the cluster systems
so that the service automatically starts when the system boots by performing the following command:
chkconfig --del httpd
Rather than having the system startup scripts spawn httpd, the cluster infrastructure does that on the
active cluster server. This ensures that the corresponding IP address and le system mounts are active
on only one cluster member at a time.
When adding an httpd service, a oating IP address must be assigned to the service such that the IP
address will transfer from one cluster member to another in the event of failover or service relocation.
The cluster infrastructure binds this IP address to the network interface on the cluster system that is
currently running the Apache HTTP Server. This IP address ensures that the cluster system running
the httpd service is transparent to the HTTP clients accessing the Apache HTTP Server.
The le systems that contain the Web content must not be automatically mounted on shared disk
storage when the cluster systems boot. Instead, the cluster software must mount and unmount the le
systems as the httpd service is started and stopped on the cluster systems. This prevents the cluster
systems from accessing the same data simultaneously, which may result in data corruption. Therefore,
do not include the le systems in the /etc/fstab le.
7.2. Conguring Shared Storage
To set up the shared le system, perform the following tasks as root on one cluster system:
1. On a shared disk, use the interactive parted utility to create a partition to use for the document
root directory. Note that it is possible to create multiple document root directories on different
disk partitions. Refer to Section 2.4.4.4 Partitioning Disks for more information.
2. Use the mkfs command to create an ext3 le system on the partition you created in the previous
step. Specify the drive letter and the partition number. For example:
mkfs -t ext3 /dev/sde3
3. Mount the le system that contains the document root directory. For example:
mount /dev/sde3 /var/www/html
94 Chapter 7. Setting Up Apache HTTP Server
Do not add this mount information to the /etc/fstab le because only the cluster software
can mount and unmount le systems used in a service.
4. Copy all the required les to the document root directory.
5. If you have CGI les or other les that must be in different directories or in separate partitions,
repeat these steps, as needed.
7.3. Installing and Conguring the Apache HTTP Server
The Apache HTTP Server must be installed and congured on all members in the assigned failover
domain, if used, or in the cluster. The basic server conguration must be the same on all members
on which it runs for the service to fail over correctly. The following example shows a basic Apache
HTTP Server installation that includes no third-party modules or performance tuning.
On all member systems in the cluster (or members in the failover domain, if used), install the httpd
RPM package. For example:
rpm -Uvh httpd-
#
version
$
.
#
arch
$
.rpm
On one system, perform the following tasks:
1. Edit the /etc/httpd/conf/httpd.conf conguration le and customize the le according
to your conguration. For example:
? Specify the directory that contains the HTML les. Also specify this mount point when
adding the service to the cluster conguration. It is only required to change this eld if the
mountpoint for the website's content differs from the default setting of /var/www/html/.
For example:
DocumentRoot "/mnt/httpdservice/html"
? If the script directory resides in a non-standard location, specify the directory that contains
the CGI programs. For example:
ScriptAlias /cgi-bin/ "/mnt/httpdservice/cgi-bin/"
? Specify the path that was used in the previous step, and set the access permissions to default
to that directory. For example:
#
Directory /mnt/httpdservice/cgi-bin"
AllowOverride None $
Options None
Order allow,deny
Allow from all
#
/Directory
$
Additional changes may need to be made to tune the Apache HTTP Server or add module
functionality. For information on setting up other options, refer to the Red Hat Enterprise
Linux System Administration Guide and the Red Hat Enterprise Linux Reference Guide.
2. The standard Apache HTTP Server start script, /etc/rc.d/init.d/httpd is also used within
the cluster framework to start and stop the Apache HTTP Server on the active cluster member.
Accordingly, when conguring the service, specify this script in the User Script eld of the
Service dialog box.
3. Copy the conguration le over to the other members of the cluster (or members of the failover
domain, if congured).
Before the service is added to the cluster conguration, ensure that the Apache HTTP Server directories
are not mounted. Then, on one member, invoke the Cluster Conguration Tool to add the
service, as follows. This example assumes a failover domain named httpd-domain was created for
Chapter 7. Setting Up Apache HTTP Server 95
this service. Figure 7-1 shows the
%
device
&
settings for conguring the shared storage and mount
point where HTML les and CGI scripts will be stored.
Figure 7-1. Conguring Apache HTTP Server
1. Select the Services tab and click New. The Service properties dialog box is displayed.
a. Give the service a name (for example, httpd).
b. Choose httpd-domain from the Failover Domain list.
c. Specify a value in the Check Interval eld.
d. Specify /etc/rc.d/init.d/httpd in the User Script eld.
e. Click OK.
2. Select the httpd service on the Services tab and click Add Child. The Add Device or Service
IP Address dialog box is displayed.
a. Choose Add Device and click OK. The Device properties dialog box is displayed.
b. Enter the device special le name in the Device Special File eld (for example,
/dev/hda7).
c. Enter the mount point in the Mount Point eld (for example, /var/www/html/).
d. Choose ext3 from the FS Type list.
e. Enter rw in the Options eld.
f. Ensure that Force Unmount is checked, and click OK.
3. Ensure that the httpd service is still selected in the Services tab and click Add Child. The Add
Device or Service IP Address dialog box is displayed.
a. Choose Add Service IP Address and click OK. The Service IP Address properties dialog
box is displayed.
b. In the IP Address eld, specify an IP address, which the cluster infrastructure binds to
the network interface on the cluster system that runs the httpd service (for example,
192.168.26.10).
c. Specify a netmask of None in the Netmask eld.
d. In the Broadcast eld, specify an IP address of None for broadcasting on the cluster
subnet.
e. Click OK.
96 Chapter 7. Setting Up Apache HTTP Server
4. Choose File => Save to save your changes.
5. To start the Apache HTTP Server within the Cluster Status Tool, highlight the service and click
Enable.
Chapter 8.
Cluster Administration
This chapter describes the various administrative tasks involved in maintaining a cluster after it has
been installed and congured.
8.1. Overview of the Cluster Status Tool
The Cluster Status Tool displays the status of the cluster service, cluster members, and application
services, and shows statistics concerning service operation.
The cluster conguration le (maintained by the Cluster Conguration Tool) is used to determine
how to manage the members, services, and cluster daemons.
Important
Do not manually edit the cluster conguration le.
Use the Cluster Status Tool to start and stop the cluster service on that particular member, restart
application services, or move an application service to another member.
The Cluster Conguration Tool can be displayed by choosing Cluster => Congure within the
Cluster Status Tool.
Figure 8-1. Cluster Status Tool
8.2. Displaying Cluster and Service Status
Monitoring cluster and application service status can help identify and resolve problems in the cluster
environment. The following tools assist in displaying cluster status information:
? The clustat command
98 Chapter 8. Cluster Administration
? The clufence command
? Log le messages
? The cluster monitoring GUI
Note that status is representative of the cluster system on which an administrator is running a particular
tool. To obtain comprehensive cluster status on all member, run the tools on all cluster systems.
Important
Members that are not running the cluster software cannot determine or report the status of other
members of the cluster.
Cluster and service status includes the following information:
? Cluster member system status
? Heartbeat channel status
? Service status and which cluster system is running the service or owns the service
? Service monitoring status of the cluster system
The following tables describe how to analyze the status information shown by the Cluster Status
Tool and clustat command.
Member Status Description
Active The member system is communicating with other member systems and
accessing the quorum partitions.
Inactive The member system is unable to communicate with the other member
system.
Table 8-1. Member Status
Service Status Description
Running The service resources are congured and available on the cluster system
that owns the service.
Pending The service has failed on a member and is pending start on another
member.
Disabled The service has been disabled, and does not have an assigned owner.
Stopped The service not running; waiting for a member capable of starting service.
Failed The service has failed to start on and the cluster cannot successfully stop
the service. Refer to Section 4.8 Handling Failed Services for more
information on failed services.
Table 8-2. Service Status
To display a snapshot of the current cluster status from a shell prompt, invoke the clustat utility.
Example output is as follows:
Chapter 8. Cluster Administration 99
Cluster Status - clustertest 22:15:32
Quorum: Yes, view 36
Shared State: Shared Raw Device Driver v1.0 [Min. Size=1176064]
Member Status
------------------ ----------
clu1 Active
clu2 Active <-- You are here
Service Status Owner (Last) Last Transition Chk Restarts
-------------- -------- ---------------- --------------- --- --------
nfs_clutest started clu1 22:11:28 Jul 21 0 0
smb_clutest started clu2 22:13:06 Jul 21 0 0
To monitor the cluster and display status at specic time intervals from a shell prompt, invoke
clustat with the -i time option, where time species the number of seconds between status
snapshots. For example:
clustat -i 10
8.3. Starting and Stopping the Cluster Software
To start the cluster software on a member, run the Cluster Status Tool and choose Cluster => Start
Local Cluster Daemons. The cluster member's Status eld changes from Disabled to Active.
Figure 8-2. Cluster Member Started
You can also start the cluster software at a shell prompt by typing the following:
/sbin/service clumanager start
To stop a cluster member using the Cluster Status Tool, choose Cluster => Stop Local Cluster
Daemons. The cluster member's Status eld changes from Cluster is running to Cluster is not
running.
100 Chapter 8. Cluster Administration
Figure 8-3. Cluster Member Stopped
To stop the cluster software on a cluster system at a shell prompt, run the following command:
/sbin/service clumanager stop
Stopping the cluster services on a member causes its services to failover to an active member.
8.4. Modifying the Cluster Conguration
It may be necessary to modify the cluster conguration. For example, it may be necessary to correct
heartbeat channel or quorum partition entries in the cluster database, a copy of which is located in the
/etc/cluster.xml le.
Important
Use only the Cluster Conguration Tool to modify the cluster conguration. Do not directly edit the
/etc/cluster.xml le with a text editor.
To modify the cluster conguration, stop the cluster software on all cluster systems, as described
in Section 8.3 Starting and Stopping the Cluster Software. Start the Cluster Conguration Tool
and modify the elds that you need to change. Choose File => Save to save your changes. Settings
automatically transfer via shared storage to active cluster members.
8.5. Backing Up and Restoring the Cluster Database
The Cluster Conguration Tool automatically saves a back-up copy of your /etc/cluster.xml
le whenever you save your conguration. This is useful if the cluster does not function correctly due
to misconguration and you need to return to a previous working conguration.
The back-up conguration is stored in a le called /etc/cluster.xml.bak. If a cluster member
becomes inoperable due to misconguration, stop cluster services on all members:
/sbin/service cluster stop
Next, overwrite the erroneous conguration with a previously saved conguration:
Chapter 8. Cluster Administration 101
cp /etc/cluster.xml.bak /etc/cluster.xml
Finally, re-initialize the shared storage:
shutil -s /etc/cluster.xml
8.6. Modifying Cluster Event Logging
It is possible to modify the severity level of the events that are logged by the clupowerd,
cluquorumd, cluhbd, and clusvcmgrd daemons. This is done so that the daemons on the cluster
systems log messages at a user-dened severity level.
Figure 8-4. Cluster Logging Severity Levels
To change a cluster daemon's logging level on all cluster members, use the Cluster Conguration
Tool and choose Cluster => Daemon Properties. Specify the daemon using the tabs, and the severity
level for each daemon using the pull down menu. The following list denes each severity level:
? EMERG - The system is unusable
? ALERT - Action must be taken immediately
? CRIT - Critical conditions
? ERR - Error conditions
? WARN - Warning conditions
? NOTICE - Normal but signicant condition
? INFO - Informational
? DEBUG - Debug-level messages
Note that the cluster logs messages with the designated severity level and also messages of a higher
severity. For example, if the severity level for quorum daemon messages is CRIT, then the cluster
logs messages for CRIT, ALERT, and EMERG severity levels. Also note that setting the logging
level to a low severity level (such as DEBUG) results in large log les over time.
8.7. Updating the Cluster Software
To update the cluster software while minimizing service downtime, follow these steps:
102 Chapter 8. Cluster Administration
1. Stop the cluster software on the rst member to be upgraded by clicking Cluster => Stop Local
Cluster Daemons from the Cluster Status Tool. Alternatively, run the following command at
a shell prompt:
/sbin/service clumanager stop
2. Make a copy of the cluster conguration le. For example, to save the le to the /root directory,
run the following command:
cp /etc/cluster.xml /root/
3. Upgrade the cluster packages using the steps outlined in Section 3.1 Installing the Red Hat
Cluster Suite Packages.
4. Copy the conguration le you saved back to the /etc/ directory.
cp /root/cluster.xml /etc/
5. Start the cluster software on the updated cluster system by clicking Cluster => Start Local
Cluster Daemons from the Cluster Status Tool. Alternatively, run the following command at
a shell prompt:
/sbin/service clumanager start
6. Repeat steps 1 - 5 for each cluster member to be updated.
8.8. Changing the Cluster Name
Start the Cluster Conguration Tool and type a new name into the Cluster Name eld. The
cluster name is used to distinctively separate multiple clusters by function or department (such as
clu_marketing or test_cluster).
8.9. Disabling the Cluster Software
It may become necessary to temporarily disable the cluster software on a member system. For example,
if a cluster system experiences a hardware failure, an administrator may want to reboot the
system, but prevent it from rejoining the cluster to perform maintenance on the system.
Use the /sbin/chkconfig command to stop the member from joining the cluster at boot-up.
/sbin/chkconfig --del clumanager
Once the issues with the disabled cluster member has been resolved, use the following command to
allow it to rejoin the cluster:
/sbin/chkconfig --add clumanager
You can then reboot the system for the changes to take effect or run the following command to restart
cluster services:
/sbin/service clumanager start
8.10. Diagnosing and Correcting Problems in a Cluster
To ensure the proper diagnosis of any problems in a cluster, event logging must be enabled. In addition,
if problems arise in a cluster, be sure to set the severity level to DEBUG for the cluster daemons. This
logs descriptive messages that may help solve problems.
Chapter 8. Cluster Administration 103
Note
Once any issues have been resolved, reset the debug level back down to its default value of WARN
to avoid excessively large log message les from being generated. Refer to Section 8.6 Modifying
Cluster Event Logging for more information.
Use Table 8-3 to troubleshoot issues in a cluster.
Problem Symptom Solution
SCSI bus not terminated SCSI errors appear in the
log le
Each SCSI bus must be terminated
only at the beginning and end of the
bus. Depending on the bus
conguration, it might be necessary
to enable or disable termination in
host bus adapters, RAID controllers,
and storage enclosures. To support
hot plugging, external termination is
required to terminate a SCSI bus.
In addition, be sure that no devices
are connected to a SCSI bus using a
stub that is longer than 0.1 meter.
Refer to Section 2.4.4 Conguring
Shared Disk Storage and Section D.3
SCSI Bus Termination for information
about terminating different types of
SCSI buses.
SCSI bus length greater
than maximum limit
SCSI errors appear in the
log le
Each type of SCSI bus must adhere
to restrictions on length, as described
in Section D.4 SCSI Bus Length.
In addition, ensure that no
single-ended devices are connected to
the LVD SCSI bus, because this
causes the entire bus to revert to a
single-ended bus, which has more
severe length restrictions than a
differential bus.
SCSI identication numbers
not unique
SCSI errors appear in the
log le
Each device on a SCSI bus must have
a unique identication number. Refer
to Section D.5 SCSI Identication
Numbers for more information.
104 Chapter 8. Cluster Administration
Problem Symptom Solution
SCSI commands timing out
before completion
SCSI errors appear in the
log le
The prioritized arbitration scheme on
a SCSI bus can result in low-priority
devices being locked out for some
period of time. This may cause
commands to time out, if a
low-priority storage device, such as a
disk, is unable to win arbitration and
complete a command that a host has
queued to it. For some workloads,
this problem can be avoided by
assigning low-priority SCSI
identication numbers to the host
bus adapters.
Refer to Section D.5 SCSI
Identication Numbers for more
information.
Mounted quorum partition Messages indicating
checksum errors on a
quorum partition appear in
the log le
Be sure that the quorum partition raw
devices are used only for cluster state
information. They cannot be used for
cluster services or for non-cluster
purposes, and cannot contain a le
system. Refer to Section 2.4.4.3
Conguring Shared Cluster
Partitions for more information.
These messages could also indicate
that the underlying block device
special le for the quorum partition
has been erroneously used for
non-cluster purposes.
Service le system is
unclean
A disabled service cannot
be enabled
Manually run a checking program
such as fsck. Then, enable the
service.
Note that the cluster infrastructure
does by default run fsck with the -p
option to automatically repair le
system inconsistencies. For
particularly egregious error types, you
may be required to manually initiate
le system repair options.
Chapter 8. Cluster Administration 105
Problem Symptom Solution
Quorum partitions not set
up correctly
Messages indicating that a
quorum partition cannot be
accessed appear in the log
le
Run the /sbin/shutil -t
command to check that the quorum
partitions are accessible. If the
command succeeds, run the shutil
-p command on both cluster systems.
If the output is different on the
systems, the quorum partitions do not
point to the same devices on both
systems. Check to make sure that the
raw devices exist and are correctly
specied in the
/etc/sysconfig/rawdevices le.
Refer to Section 2.4.4.3 Conguring
Shared Cluster Partitions for more
information.
Cluster service operation
fails
Messages indicating the
operation failed to appear
on the console or in the log
le
There are many different reasons for
the failure of a service operation (for
example, a service stop or start). To
help identify the cause of the problem,
set the severity level for the cluster
daemons to DEBUG to log
descriptive messages. Then, retry the
operation and examine the log le.
Refer to Section 8.6 Modifying
Cluster Event Logging for more
information.
Cluster service stop fails
because a le system cannot
be unmounted
Messages indicating the
operation failed appear on
the console or in the log le
Use the fuser and ps commands to
identify the processes that are
accessing the le system. Use the
kill command to stop the
processes. Use the lsof -t
file_system command to display
the identication numbers for the
processes that are accessing the
specied le system. If needed, pipe
the output to the kill command.
To avoid this problem, be sure that
only cluster-related processes can
access shared storage data. In
addition, modify the service and
enable forced unmount for the le
system. This enables the cluster
service to unmount a le system even
if it is being accessed by an
application or user.
Incorrect entry in the cluster
database
Cluster operation is
impaired
The Cluster Status Tool can be used
to examine and modify service
conguration. The Cluster
Conguration Tool is used to modify
cluster parameters.
106 Chapter 8. Cluster Administration
Problem Symptom Solution
Incorrect Ethernet heartbeat
entry in the cluster database
or /etc/hosts le
Cluster status indicates that
a Ethernet heartbeat channel
is OFFLINE even though
the interface is valid
Examine and modify the cluster
conguration by running the Cluster
Conguration Tool, as specied in
Section 8.4 Modifying the Cluster
Conguration, and correct the
problem.
In addition, be sure to use the ping
command to send a packet to all
network interfaces used in the cluster.
Loose cable connection to
power switch
Power switch status using
clufence returns an error
or hangs
Check the serial cable connection.
Power switch serial port
incorrectly specied in the
cluster database
Power switch status using
clufence indicates a
problem
Examine the current settings and
modify the cluster conguration by
running the Cluster Conguration
Tool, as specied in Section 8.4
Modifying the Cluster Conguration,
and correct the problem.
Table 8-3. Diagnosing and Correcting Problems in a Cluster
II. Conguring a Linux Virtual Server Cluster
Building a Linux Virtual Server (LVS) system offers highly-available and scalable solution for production
services using specialized routing and load-balancing techniques congured through the Piranha
Conguration Tool. This part discusses the conguration of high-performance systems and services
with Red Hat Enterprise Linux and LVS.
This section is licensed under the Open Publication License, V1.0 or later. For details refer to the
Copyright page.
Table of Contents
9. Introduction to Linux Virtual Server........................................................................................ 109
10. Linux Virtual Server Overview ............................................................................................... 111
11. Initial LVS Conguration......................................................................................................... 121
12. Setting Up a Red Hat Enterprise Linux LVS Cluster............................................................ 125
13. Conguring the LVS Routers with Piranha Conguration Tool.......................................... 133
Chapter 9.
Introduction to Linux Virtual Server
Using Red Hat Enterprise Linux, it is possible to create highly available server clustering solutions
able to withstand many common hardware and software failures with little or no interruption of critical
services. By allowing multiple computers to work together in offering these critical services, system
administrators can plan and execute system maintenance and upgrades without service interruption.
The chapters in this part guide you through the following steps in understanding and deploying a
clustering solution based on the Red Hat Enterprise Linux Linux Virtual Server (LVS) technology:
? Explains the Linux Virtual Server technology used by Red Hat Enterprise Linux to create a loadbalancing
cluster
? Explains how to congure a Red Hat Enterprise Linux LVS cluster
? Guides you through the Piranha Conguration Tool, a graphical interface used for conguring
and monitoring an LVS cluster
9.1. Technology Overview
Red Hat Enterprise Linux implements highly available server solutions via clustering. It is important
to note that cluster computing consists of three distinct branches:
? Compute clustering (such as Beowulf) uses multiple machines to provide greater computing power
for computationally intensive tasks. This type of clustering is not addressed by Red Hat Enterprise
Linux.
? High-availability (HA) clustering uses multiple machines to add an extra level of reliability for a
service or group of services.
? Load-balance clustering uses specialized routing techniques to dispatch trafc to a pool of servers.
Red Hat Enterprise Linux addresses the latter two types of clustering technology. Using a collection
of programs to monitor the health of the systems and services in the cluster.
Note
The clustering technology included in Red Hat Enterprise Linux is not synonymous with fault tolerance.
Fault tolerant systems use highly specialized and often very expensive hardware to implement
a fully redundant environment in which services can run uninterrupted by hardware failures.
However, fault tolerant systems do not account for operator and software errors which Red Hat Enterprise
Linux can address through service redundancy. Also, since Red Hat Enterprise Linux is
designed to run on commodity hardware, it creates an environment with a high level of system availability
at a fraction of the cost of fault tolerant hardware.
9.2. Basic Congurations
While Red Hat Enterprise Linux can be congured in a variety of different ways, the congurations
can be broken into two major categories:
110 Chapter 9. Introduction to Linux Virtual Server
? High-availability clusters using Red Hat Cluster Manager
? Load-balancing clusters using Linux Virtual Servers
This part explains what a load-balancing cluster system is and how to congure a load-balancing
system using Linux Virtual Servers on Red Hat Enterprise Linux.
9.2.1. Load-Balancing Clusters Using Linux Virtual Servers
To an outside user accessing a hosted service (such as a website or database application), a Linux
Virtual Server (LVS) cluster appears as one server. In reality, however, the user is actually accessing
a cluster of two or more servers behind a pair of redundant LVS routers that distribute client requests
evenly throughout the cluster system. Load-balanced clustered services allow administrators to use
commodity hardware and Red Hat Enterprise Linux to create continuous and consistent access to all
hosted services while also addressing availability requirements.
An LVS cluster consists of at least two layers. The rst layer is composed of a pair of similarly con-
gured Linux machines or cluster members. One of these machine acts as the LVS routers, congured
to direct requests from the Internet to the cluster. The second layer consists of a cluster of machines
called real servers. The real servers provide the critical services to the end-user while the LVS router
balances the load on these servers.
For a detailed overview of LVS clustering, refer to Chapter 10 Linux Virtual Server Overview.
Chapter 10.
Linux Virtual Server Overview
Red Hat Enterprise Linux LVS clustering uses a Linux machine called the active router to send requests
from the Internet to a pool of servers. To accomplish this, LVS clusters consist of two basic
machine classications ? the LVS routers (one active and one backup) and a pool of real servers
which provide the critical services.
The active router serves two roles in the cluster:
? To balance the load on the real servers.
? To check the integrity of the services on each of the real servers.
The backup router's job is to monitor the active router and assume its role in the event of failure.
10.1. A Basic LVS Conguration
Figure 10-1 shows a simple LVS cluster consisting of two layers. On the rst layer are two LVS routers
? one active and one backup. Each of the LVS routers has two network interfaces, one interface
on the Internet and one on the private network, enabling them to regulate trafc between the two
networks. For this example the active router is using Network Address Translation or NAT to direct
trafc from the Internet to a variable number of real servers on the second layer, which in turn provide
the necessary services. Therefore, the real servers in this example are connected to a dedicated private
network segment and pass all public trafc back and forth through the active LVS router. To the outside
world, the server cluster appears as one entity.
Figure 10-1. A Basic LVS Conguration
112 Chapter 10. Linux Virtual Server Overview
Service requests arriving at the LVS cluster are addressed to a virtual IP address or VIP. This is a
publicly-routable address the administrator of the site associates with a fully-qualied domain name,
such as www.example.com, and which is assigned to one or more virtual server1. Note that a VIP
address migrates from one LVS router to the other during a failover, thus maintaining a presence at
that IP address, also known as oating IP addresses.
VIP addresses may be aliased to the same device which connects the LVS router to the Internet. For
instance, if eth0 is connected to the Internet, than multiple virtual servers can be aliased to eth0:1.
Alternatively, each virtual server can be associated with a separate device per service. For example,
HTTP trafc can be handled on eth0:1, and FTP trafc can be handled on eth0:2.
Only one LVS router is active at a time. The role of the active router is to redirect service requests
from virtual IP addresses to the real servers. The redirection is based on one of eight supported loadbalancing
algorithms described further in Section 10.3 LVS Scheduling Overview.
The active router also dynamically monitors the overall health of the specic services on the real
servers through simple send/expect scripts. To aid in detecting the health of services that require
dynamic data, such as HTTPS or SSL, the administrator can also call external executables. If a service
on a real server malfunctions, the active router stops sending jobs to that server until it returns to
normal operation.
The backup router performs the role of a standby system. Periodically, the LVS routers exchange
heartbeat messages through the primary external public interface and, in a failover situation, the private
interface. Should the backup node fail to receive a heartbeat message within an expected interval,
it initiates a failover and assumes the role of the active router. During failover, the backup router takes
over the VIP addresses serviced by the failed router using a technique known as ARP spoong ?
where the backup LVS router announces itself as the destination for IP packets addressed to the failed
node. When the failed node returns to active service, the backup node assumes its hot-backup role
again.
The simple, two-layered conguration used in Figure 10-1 is best for clusters serving data which does
not change very frequently ? such as static webpages ? because the individual real servers do not
automatically sync data between each node.
10.1.1. Data Replication and Data Sharing Between Real Servers
Since there is no built-in component in LVS clustering to share the same data between the real servers,
the administrator has two basic options:
? Synchronize the data across the real server pool
? Add a third layer to the topology for shared data access
The rst option is preferred for servers that do not allow large numbers of users to upload or change
data on the real servers. If the cluster allows large numbers of users to modify data, such as an ecommerce
website, adding a third layer is preferable.
10.1.1.1. Conguring Real Servers to Synchronize Data
There are many ways an administrator can choose to synchronize data across the pool of real servers.
For instance, shell scripts can be employed so that if aWeb engineer updates a page, the page is posted
to all of the servers simultaneously. Also, the cluster administrator can use programs such as rsync
to replicate changed data across all nodes at a set interval.
However, this type of data synchronization does not optimally function if the cluster is overloaded
with users constantly uploading les or issuing database transactions. For a cluster with a high load, a
three-tiered topology is the ideal solution.
1. A virtual server is a service congured to listen on a specic virtual IP. Refer to Section 13.6 VIRTUAL
SERVERS for more on conguring a virtual server using the Piranha Conguration Tool.
Chapter 10. Linux Virtual Server Overview 113
10.2. A Three Tiered LVS Conguration
Figure 10-2 shows a typical three tiered LVS cluster topology. In this example, the active LVS router
routes the requests from the Internet to the pool of real servers. Each of the real servers then accesses
a shared data source over the network.
Figure 10-2. A Three Tiered LVS Conguration
This conguration is ideal for busy FTP servers, where accessible data is stored on a central, highly
available server and accessed by each real server via an exported NFS directory or Samba share.
This topography is also recommended for websites that access a central, highly available database
for transactions. Additionally, using an active-active conguration with Red Hat Cluster Manager,
administrators can congure one high-availability cluster to serve both of these roles simultaneously.
The third tier in the above example does not have to use Red Hat Cluster Manager, but failing to use
a highly available solution would introduce a critical single point of failure.
10.3. LVS Scheduling Overview
One of the advantages of using an LVS cluster is its ability to perform exible, IP-level load balancing
on the real server pool. This exibility is due to the variety of scheduling algorithms an administrator
can choose from when conguring a cluster. LVS load balancing is superior to less exible methods,
such as Round-Robin DNS where the hierarchical nature of DNS and the caching by client machines
114 Chapter 10. Linux Virtual Server Overview
can lead to load imbalances. Additionally, the low-level ltering employed by the LVS router has
advantages over application-level request forwarding because balancing loads at the network packet
level causes minimal computational overhead and allows for greater scalability.
Using scheduling, the active router can take into account the real servers' activity and, optionally,
an administrator-assigned weight factor when routing service requests. Using assigned weights gives
arbitrary priorities to individual machines. Using this form of scheduling, it is possible to create a
group of real servers using a variety of hardware and software combinations and the active router can
evenly load each real server.
The scheduling mechanism for an LVS cluster is provided by a collection of kernel patches called IP
Virtual Server or IPVS modules. These modules enable layer 4 (L4) transport layer switching, which
is designed to work well with multiple servers on a single IP address.
To track and route packets to the real servers efciently, IPVS builds an IPVS table in the kernel.
This table is used by the active LVS router to redirect requests from a virtual server address to and
returning from real servers in the pool. The IPVS table is constantly updated by a utility called ipvsadm
? adding and removing cluster members depending on their availability.
10.3.1. Scheduling Algorithms
The structure that the IPVS table takes depends on the scheduling algorithm that the administrator
chooses for any given virtual server. To allow for maximum exibility in the types of services you
can cluster and how these services are scheduled, Red Hat Enterprise Linux provides the following
scheduling algorithms listed below. For instructions on how to assign scheduling algorithms refer to
Section 13.6.1 The VIRTUAL SERVER Subsection.
Round-Robin Scheduling
Distributes each request sequentially around the pool of real servers. Using this algorithm, all
the real servers are treated as equals without regard to capacity or load. This scheduling model
resembles round-robin DNS but is more granular due to the fact that it is network-connection
based and not host-based. LVS round-robin scheduling also does not suffer the imbalances caused
by cached DNS queries.
Weighted Round-Robin Scheduling
Distributes each request sequentially around the pool of real servers but gives more jobs to servers
with greater capacity. Capacity is indicated by a user-assigned weight factor, which is then adjusted
upward or downward by dynamic load information. Refer to Section 10.3.2 Server Weight
and Scheduling for more on weighting real servers.
Weighted round-robin scheduling is a preferred choice if there are signicant differences in the
capacity of real servers in the pool. However, if the request load varies dramatically, the more
heavily weighted server may answer more than its share of requests.
Least-Connection
Distributes more requests to real servers with fewer active connections. Because it keeps track of
live connections to the real servers through the IPVS table, least-connection is a type of dynamic
scheduling algorithm, making it a better choice if there is a high degree of variation in the request
load. It is best suited for a real server pool where each member node has roughly the same
capacity. If a group of servers have different capabilities, weighted least-connection scheduling
is a better choice.
Weighted Least-Connections (default)
Distributes more requests to servers with fewer active connections relative to their capacities.
Capacity is indicated by a user-assigned weight, which is then adjusted upward or downward by
dynamic load information. The addition of weighting makes this algorithm ideal when the real
Chapter 10. Linux Virtual Server Overview 115
server pool contains hardware of varying capacity. Refer to Section 10.3.2 Server Weight and
Scheduling for more on weighting real servers.
Locality-Based Least-Connection Scheduling
Distributes more requests to servers with fewer active connections relative to their destination
IPs. This algorithm is designed for use in a proxy-cache server cluster. It routes the packets for
an IP address to the server for that address unless that server is above its capacity and has a server
in its half load, in which case it assigns the IP address to the least loaded real server.
Locality-Based Least-Connection Scheduling with Replication Scheduling
Distributes more requests to servers with fewer active connections relative to their destination IPs.
This algorithm is also designed for use in a proxy-cache server cluster. It differs from Locality-
Based Least-Connection Scheduling by mapping the target IP address to a subset of real server
nodes. Requests are then routed to the server in this subset with the lowest number of connections.
If all the nodes for the destination IP are above capacity, it replicates a new server for that
destination IP address by adding the real server with the least connections from the overall pool
of real servers to the subset of real servers for that destination IP. The most loaded node is then
dropped from the real server subset to prevent over-replication.
Destination Hash Scheduling
Distributes requests to the pool of real servers by looking up the destination IP in a static hash
table. This algorithm is designed for use in a proxy-cache server cluster.
Source Hash Scheduling
Distributes requests to the pool of real servers by looking up the source IP in a static hash table.
This algorithm is designed for LVS routers with multiple rewalls.
10.3.2. Server Weight and Scheduling
The administrator of an LVS cluster can assign a weight to each node in the real server pool. This
weight is an integer value which is factored into any weight-aware scheduling algorithms (such as
weighted least-connections) and helps the LVS router more evenly load hardware with different capabilities.
Weights work as a ratio relative to one another. For instance, if one real server has a weight of 1 and
the other server has a weight of 5, then the server with a weight of 5 gets 5 connections for every 1
connection the other server gets. The default value for a real server weight is 1.
Although adding weight to varying hardware congurations in a real server pool can help load-balance
the cluster more efciently, it can cause temporary imbalances when a real server is introduced to the
real server pool and the virtual server is scheduled using weighted least-connections. For example,
suppose there are three servers in the real server pool. Servers A and B are weighted at 1 and the third,
server C, is weighted at 2. If server C goes down for any reason, servers A and B evenly distributes
the abandoned load. However, once server C comes back online, the LVS router sees it has zero
connections and oods the server with all incoming requests until it is on par with servers A and B.
To prevent this phenomenon, administrators can make the virtual server a quiesce server ? anytime
a new real server node comes online, the least-connections table is reset to zero and the LVS router
routes requests as if all the real servers were newly added to the cluster.
116 Chapter 10. Linux Virtual Server Overview
10.4. Routing Methods
Red Hat Enterprise Linux uses Network Address Translation or NAT routing for LVS clustering, which
allows the administrator tremendous exibility when utilizing available hardware and integrating the
cluster into an existing network.
10.4.1. NAT Routing
Figure 10-3, illustrates an LVS cluster utilizing NAT routing to move requests between the Internet
and a private network.
Figure 10-3. An LVS Cluster Implemented with NAT Routing
In the example, there are two NICs in the active LVS router. The NIC for the Internet has a real
IP address on eth0 and has a oating IP address aliased to eth0:1. The NIC for the private network
interface has a real IP address on eth1 and has a oating IP address aliased to eth1:1. In the event
of failover, the virtual interface facing the Internet and the private facing virtual interface are takenover
by the backup LVS router simultaneously. All of the cluster's real servers located on the private
network use the oating IP for the NAT router as their default route to communicate with the active
LVS router so that their abilities to respond to requests from the Internet is not impaired.
In this example, the LVS router's public LVS oating IP address and private NAT oating IP address
are aliased to two physical NICs. While it is possible to associate each oating IP address to its own
physical device on the LVS router nodes, having more than two NICs is not a requirement.
Using this topography, the active LVS router receives the request and routes it to the appropriate
server. The real server then processes the request and returns the packets to the LVS router which
uses network address translation to replace the address of the real server in the packets with the LVS
Chapter 10. Linux Virtual Server Overview 117
routers public VIP address. This process is called IP masquerading because the actual IP addresses of
the real servers is hidden from the requesting clients.
Using this NAT routing, the real servers may be any kind of machine running various operating systems.
The main disadvantage is that the LVS router may become a bottleneck in large cluster deployments
because it must process outgoing as well as incoming requests.
10.5. Persistence and Firewall Marks
In certain situations, it may be desirable for a client to reconnect repeatedly to the same real server,
rather than have an LVS load balancing algorithm send that request to the best available server. Examples
of such situations include multi-screen web forms, cookies, SSL, and FTP connections. In these
cases, a client may not work properly unless the transactions are being handled by the same server to
retain context. LVS provides two different features to handle this: persistence and rewall marks.
10.5.1. Persistence
When enabled, persistence acts like a timer. When a client connects to a service, LVS remembers the
last connection for a specied period of time. If that same client IP address connects again within that
period, it is sent to the same server it connected to previously ? bypassing the load-balancing mechanisms.
When a connection occurs outside the time window, it is handled according to the scheduling
rules in place.
Persistence also allows the administrator to specify a subnet mask to apply to the client IP address
test as a tool for controlling what addresses have a higher level of persistence, thereby grouping
connections to that subnet.
Grouping connections destined for different ports can be important for protocols which use more than
one port to communicate, such as FTP. However, persistence is not the most efcient way to deal with
the problem of grouping together connections destined for different ports. For these situations, it is
best to use rewall marks.
10.5.2. Firewall Marks
Firewall marks are an easy and efcient way to a group ports used for a protocol or group of related
protocols. For instance, if an LVS cluster is deployed to run an e-commerce site, rewall marks can
be used to bundle HTTP connections on port 80 and secure, HTTPS connections on port 443. By
assigning the same rewall mark to the virtual server for each protocol, state information for the
transaction can be preserved because the LVS router forwards all requests to the same real server after
a connection is opened.
Because of its efciency and ease-of-use, administrators of LVS clusters should use rewall marks
instead of persistence whenever possible for grouping connections. However, administrators should
still add persistence to the virtual servers in conjunction with rewall marks to ensure the clients are
reconnected to the same server for an adequate period of time.
10.6. LVS Cluster ? A Block Diagram
LVS routers use a collection of programs to monitor cluster members and cluster services. Figure 10-4
illustrates how these various programs on both the active and backup LVS routers work together to
manage the cluster.
118 Chapter 10. Linux Virtual Server Overview
Figure 10-4. Components of a Running LVS Cluster
The pulse daemon runs on both the active and passive LVS routers. On the backup router, pulse
sends a heartbeat to the public interface of the active router to make sure the active router is still
properly functioning. On the active router, pulse starts the lvs daemon and responds to heartbeat
queries from the backup LVS router.
Once started, the lvs daemon calls the ipvsadm utility to congure and maintain the IPVS routing
table in the kernel and starts a nanny process for each congured virtual server on each real server.
Each nanny process checks the state of one congured service on one real server, and tells the lvs
daemon if the service on that real server is malfunctioning. If a malfunction is detected, the lvs
daemon instructs ipvsadm to remove that real server from the IPVS routing table.
If the backup router does not receive a response from the active router, it initiates failover by calling
send_arp to reassign all virtual IP addresses to the NIC hardware addresses (MAC address) of the
backup node, sends a command to the active router via both the public and private network interfaces
to shut down the lvs daemon on the active router, and starts the lvs daemon on the backup node to
accept requests for the congured virtual servers.
10.6.1. Components of an LVS Cluster
Section 10.6.1.1 pulse shows a detailed list of each software component in an LVS router.
10.6.1.1. pulse
This is the controlling process which starts all other daemons related to LVS routers. At boot time,
the daemon is started by the /etc/rc.d/init.d/pulse script. It then reads the conguration le
/etc/sysconfig/ha/lvs.cf. On the active router, pulse starts the LVS daemon. On the backup
router, pulse determines the health of the active router by executing a simple heartbeat at a usercon
gurable interval. If the active router fails to respond after a user-congurable interval, it initiates
failover. During failover, pulse on the backup router instructs the pulse daemon on the active router
to shut down all LVS services, starts the send_arp program to reassign the oating IP addresses to
the backup router's MAC address, and starts the lvs daemon.
Chapter 10. Linux Virtual Server Overview 119
10.6.1.2. lvs
The lvs daemon runs on the active LVS router once called by pulse. It reads the conguration le
/etc/sysconfig/ha/lvs.cf, calls the ipvsadm utility to build and maintain the IPVS routing
table, and assigns a nanny process for each congured LVS service. If nanny reports a real server is
down, lvs instructs the ipvsadm utility to remove the real server from the IPVS routing table.
10.6.1.3. ipvsadm
This service updates the IPVS routing table in the kernel. The lvs daemon sets up and administers an
LVS cluster by calling ipvsadm to add, change, or delete entries in the IPVS routing table.
10.6.1.4. nanny
The nanny monitoring daemon runs on the active LVS router. Through this daemon, the active router
determines the health of each real server and, optionally, monitors its workload. A separate process
runs for each service dened on each real server.
10.6.1.5. /etc/sysconfig/ha/lvs.cf
This is the LVS cluster conguration le. Directly or indirectly, all daemons get their conguration
information from this le.
10.6.1.6. Piranha Conguration Tool
This is the Web-based tool for monitoring, conguring, and administering an LVS cluster. This is the
default tool to maintain the /etc/sysconfig/ha/lvs.cf LVS cluster conguration le.
10.6.1.7. send_arp
This program sends out ARP broadcasts when the oating IP address changes from one node to
another during failover.
Chapter 11 Initial LVS Conguration reviews important post-installation conguration steps you
should take before conguring Red Hat Enterprise Linux to be an LVS router.
120 Chapter 10. Linux Virtual Server Overview
Chapter 11.
Initial LVS Conguration
After installing Red Hat Enterprise Linux, you must take some basic steps to set up both the LVS
routers and the real servers in the LVS cluster. This chapter covers these initial steps in detail.
Note
The LVS router node that becomes the active node once the cluster is started is also referred to as
the primary node. When conguring an LVS cluster, use the Piranha Conguration Tool on the
primary node.
11.1. Conguring Services on the LVS Routers
The Red Hat Enterprise Linux installation program installs all of the components needed to set up an
LVS cluster, but the appropriate services must be activated before conguring the cluster. For both
LVS routers, set the appropriate services to start at boot time. There are three primary tools available
for setting services to activate at boot time under Red Hat Enterprise Linux: the command line program
chkconfig, the ncurses-based program ntsysv, and the graphical Services Conguration Tool. All
of these tools require root access.
Tip
To attain root access, open a shell prompt and type the following command followed by the root
password:
su -
On the LVS routers, there are three services which need to be set to activate at boot time:
? The piranha-gui service (primary node only)
? The pulse service
? The sshd service
If you are clustering multi-port services or using rewall marks, you must also enable the iptables
service.
It is best to set these services to activate in both runlevel 3 and runlevel 5. To accomplish this using
chkconfig, type the following command for each service:
/sbin/chkconfig --level 35 daemon on
In the above command, replace daemon with the name of the service you are activating. To get a
list of services on the system as well as what runlevel they are set to activate on, issue the following
command:
/sbin/chkconfig --list
122 Chapter 11. Initial LVS Conguration
Warning
Turning any of the above services on using chkconfig does not actually start the daemon. To do
this use the /sbin/service command. See Section 11.3 Starting the Piranha Conguration Tool
Service for an example of how to use the /sbin/service command.
For more information on runlevels and conguring services with ntsysv and the Services Conguration
Tool, refer to the chapter titled "Controlling Access to Services" in the Red Hat Enterprise
Linux System Administration Guide.
11.2. Setting a Password for the Piranha Conguration Tool
Before using the Piranha Conguration Tool for the rst time on the primary LVS router, you must
restrict access to it by creating a password. To do this, login as root and issue the following command:
/usr/sbin/piranha-passwd
After entering this command, create the administrative password when prompted.
Warning
For a password to be more secure, it should not contain proper nouns, commonly used acronyms, or
words in a dictionary from any language. Do not leave the password unencrypted anywhere on the
system.
If the password is changed during an active Piranha Conguration Tool session, the administrator
is prompted to provide the new password.
11.3. Starting the Piranha Conguration Tool Service
After you have set the password for the Piranha Conguration Tool, start or restart the
piranha-gui service located in /etc/rc.d/init.d/piranha-gui. To do this, type the
following command as root:
/sbin/service piranha-gui start
or
/sbin/service piranha-gui restart
Issuing this command starts a private session of the Apache HTTP Server by calling the symbolic
link /usr/sbin/piranha_gui -> /usr/sbin/httpd. For security reasons, the piranha-gui
version of httpd runs as the piranha user in a separate process. The fact that piranha-gui leverages
the httpd service means that:
1. The Apache HTTP Server must be installed on the system.
2. Stopping or restarting the Apache HTTP Server via the service command stops the
piranha-gui service.
Chapter 11. Initial LVS Conguration 123
Warning
If the command /sbin/service httpd stop or /sbin/service httpd restart is issued on an
LVS router, you must start the piranha-gui service by issuing the following command:
/sbin/service piranha-gui start
The piranha-gui service is all that is necessary to begin conguring an LVS cluster. However, if
you are conguring the cluster remotely, the sshd service is also required. You do not need to start
the pulse service until conguration using the Piranha Conguration Tool is complete. See Section
13.8 Starting the Cluster for information on starting the pulse service.
11.3.1. Conguring the Piranha Conguration Tool Web Server Port
The Piranha Conguration Tool runs on port 3636 by default. To change this port number,
change the line Listen 3636 in Section 2 of the piranha-gui Web server conguration le
/etc/sysconfig/ha/conf/httpd.conf.
To use the Piranha Conguration Tool you need at minimum a text-only Web browser. If you start a
Web browser on the primary LVS router, open the location http://localhost:3636. You can
reach the Piranha Conguration Tool from anywhere via Web browser by replacing localhost
with the hostname or IP address of the primary LVS router.
When your browser connects to the Piranha Conguration Tool, you must login to access the
cluster conguration services. Enter piranha in the Username eld and the password set with
piranha-passwd in the Password eld.
Now that the Piranha Conguration Tool is running, you may wish to consider limiting who has
access to the tool over the network. The next section reviews ways to accomplish this task.
11.4. Limiting Access To the Piranha Conguration Tool
The Piranha Conguration Tool prompts for a valid username and password combination. However,
because all of the data passed to the Piranha Conguration Tool is in plain text, it is recommended
that you restrict access only to trusted networks or to the local machine.
The easiest way to restrict access is to use the Apache HTTP Server's built in access control mechanisms
by editing /etc/sysconfig/ha/web/secure/.htaccess. After altering the le you do
not have to restart the piranha-gui service because the server checks the .htaccess le each time
it accesses the directory.
By default, the access controls for this directory allow anyone to view the contents of the directory.
Here is what the default access looks like:
Order deny,allow
Allow from all
To limit access of the Piranha Conguration Tool to only the localhost change the .htaccess le
to allow access from only the loopback device (127.0.0.1). For more information on the loopback
device, see the chapter titled Network Scripts in the Red Hat Enterprise Linux Reference Guide.
Order deny,allow
Deny from all
Allow from 127.0.0.1
You can also allow specic hosts or subnets as seen in this example:
124 Chapter 11. Initial LVS Conguration
Order deny,allow
Deny from all
Allow from 192.168.1.100
Allow from 172.16.57
In this example, only Web browsers from the machine with the IP address of 192.168.1.100 and
machines on the 172.16.57/24 network can access the Piranha Conguration Tool.
Caution
Editing the Piranha Conguration Tool .htaccess le limits access to the conguration pages in
the /etc/sysconfig/ha/web/secure/ directory but not to the login and the help pages
in /etc/sysconfig/ha/web/. To limit access to this directory, create a .htaccess le in
the /etc/sysconfig/ha/web/ directory with order, allow, and deny lines identical to
/etc/sysconfig/ha/web/secure/.htaccess.
11.5. Turning on Packet Forwarding
In order for the LVS router to forward network packets properly to the real servers, each LVS router
node must have IP forwarding turned on in the kernel. Log in as root and change the line which reads
net.ipv4.ip_forward = 0 in /etc/sysctl.conf to the following:
net.ipv4.ip_forward = 1
The changes take effect when you reboot the system.
To check if IP forwarding is turned on, issue the following command as root:
/sbin/sysctl net.ipv4.ip_forward
If the above command returns a 1, then IP forwarding is enabled. If it returns a 0, then you can turn it
on manually using the following command:
/sbin/sysctl -w net.ipv4.ip_forward=1
11.6. Conguring Services on the Real Servers
If the real servers in the cluster are Red Hat Enterprise Linux systems, set the appropriate server
daemons to activate at boot time. These daemons can include httpd for Web services or xinetd for
FTP or Telnet services.
It may also be useful to access the real servers remotely, so the sshd daemon should also be installed
and running.
Chapter 12.
Setting Up a Red Hat Enterprise Linux LVS
Cluster
A Red Hat Enterprise Linux LVS cluster consists of two basic groups: the LVS routers and the real
servers. To prevent a single point of failure, each groups should contain at least two member systems.
The LVS router group should consist of two identical or very similar systems running Red Hat Enterprise
Linux. One will act as the active LVS router while the other stays in hot standby mode, so they
need to have as close to the same capabilities as possible.
Before choosing and conguring the hardware for the real server group, you most decide what which
of the three types of LVS topographies to use.
12.1. The NAT LVS Cluster
The NAT topography allows for great latitude in utilizing existing hardware, but it is limited in its
ability to handle large loads due to the fact that all packets going into and coming out of the cluster
pass through the LVS router.
Network Layout
The topography for an LVS cluster utilizing NAT routing is the easiest to congure from a network
layout perspective because the cluster needs only one access point to the public network.
The real servers pass all requests back through the LVS router so they are on their own private
network.
Hardware
The NAT topography is the most exible in regards to cluster hardware because the real servers
do not need to be Linux machines to function correctly in the cluster. In a NAT cluster, each
real server only needs one NIC since it will only be responding to the LVS router. The LVS
routers, on the other hand, need two NICs each to route trafc between the two networks. Because
this topography creates a network bottleneck at the LVS router, gigabit Ethernet NICs can be
employed on each LVS router to increase the bandwidth the LVS routers can handle. If gigabit
Ethernet is employed on the LVS routers, any switch connecting the real servers to the LVS
routers must have at least two gigabit Ethernet ports to handle the load efciently.
Software
Because the NAT topography requires the use of iptables for some congurations, there can
be a fair amount of software conguration outside of Piranha Conguration Tool. In particular,
FTP services and the use of rewall marks requires extra manual conguration of the LVS routers
to route requests properly.
12.1.1. Conguring Network Interfaces for a NAT LVS Cluster
To set up a NAT LVS cluster, the administrator must rst congure the network interfaces for the
public network and the private network on the LVS routers. In this example, the LVS routers' public
interfaces (eth0) will be on the 192.168.26/24 network (I know, I know, this is not a routable IP, but
let us pretend there is a rewall in front of the LVS router for good measure) and the private interfaces
which link to the real servers (eth1) will be on the 10.11.12/24 network.
So on the active or primary LVS router node, the public interface's network script,
/etc/sysconfig/network-scripts/ifcfg-eth0, could look something like this:
126 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster
DEVICE=eth0
BOOTPROTO=static
ONBOOT=yes
IPADDR=192.168.26.9
NETMASK=255.255.255.0
GATEWAY=192.168.26.254
The /etc/sysconfig/network-scripts/ifcfg-eth1 for the private NAT interface on the LVS
router could look something like this:
DEVICE=eth1
BOOTPROTO=static
ONBOOT=yes
IPADDR=10.11.12.9
NETMASK=255.255.255.0
In this example, the VIP for the LVS router's public interface will be 192.168.26.10 and the VIP for
the NAT or private interface will be 10.11.12.10. So, it is essential that the real servers route requests
back to the VIP for the NAT interface.
Important
The sample Ethernet interface conguration settings in this section are for the real IP addresses
of an LVS router and not the oating IP addresses. To congure the public and private oating IP
addresses the administrator should use the Piranha Conguration Tool, as shown in Section 13.4
GLOBAL SETTINGS and Section 13.6.1 The VIRTUAL SERVER Subsection.
After conguring the primary LVS router node's network interfaces, congure the backup LVS
router's real network interfaces ? taking care that none of the IP address conict with any other IP
addresses on the network.
Important
Be sure each interface on the backup node services the same network as the interface on primary
node. For instance, if eth0 connects to the public network on the primary node, it must also connect
to the public network on the backup node as well.
12.1.2. Routing on the Real Servers
The most important thing to remember when conguring the real servers network interfaces in a NAT
cluster is to set the gateway for the NAT oating IP address of the LVS router. In this example, that
address will be 10.11.12.10.
Note
Once the network interfaces are up on the real servers, the machines will be unable to ping or connect
in other ways to the public network. This is normal. You will, however, be able to ping the real IP for
the LVS router's private interface, in this case 10.11.12.8.
Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 127
So the real server's /etc/sysconfig/network-scripts/ifcfg-eth0 le could look similar to
this:
DEVICE=eth0
ONBOOT=yes
BOOTPROTO=static
IPADDR=10.11.12.1
NETMASK=255.255.255.0
GATEWAY=10.11.12.10
Warning
If a real server has more than one network interface congured with a GATEWAY= line, the rst one to
come up will get the gateway. Therefore if both eth0 and eth1 are congured and eth1 is used for
LVS clustering, the real servers may not route requests properly.
It is best to turn off extraneous network interfaces by setting ONBOOT=no in their network scripts within
the /etc/sysconfig/network-scripts/ directory or by making sure the gateway is correctly set in
the interface which comes up rst.
12.1.3. Enabling NAT Routing on the LVS Routers
In a simple NAT LVS cluster where each clustered service uses only one port, like HTTP on port
80, the administrator needs only to enable packet forwarding on the LVS routers for the requests
to be properly routed between the outside world and the real servers. See Section 11.5 Turning on
Packet Forwarding for instructions on turning on packet forwarding. However, more conguration is
necessary when the clustered services require more than one port to go to the same real server during
a user session. For information on creating multi-port services using rewall marks, see Section 12.3
Multi-port Services and LVS Clustering.
Once forwarding is enabled on the LVS routers and the real servers are set up and have the clustered
services running, use the Piranha Conguration Tool to congure the cluster as shown in Chapter
13 Conguring the LVS Routers with Piranha Conguration Tool.
Warning
Do not congure the oating IP for eth0:1 or eth1:1 by manually editing network scripts or using
a network conguration tool. Instead, use the Piranha Conguration Tool as shown in Section
13.4 GLOBAL SETTINGS and Section 13.6.1 The VIRTUAL SERVER Subsection to congure any
cluster-related virtual interfaces.
When nished, start the pulse service as shown in Section 13.8 Starting the Cluster. Once pulse is
up and running, the active LVS router will begin routing requests to the pool of real servers.
12.2. Putting the Cluster Together
After determining which of the above routing methods to use, the hardware for the LVS cluster should
be linked together on the network.
128 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster
Important
The adapter devices on the LVS routers must be congured to access the same networks. For instance
if eth0 connects to public network and eth1 connects to the private network, then these same
devices on the backup LVS router must connect to the same networks.
Also the gateway listed in the rst interface to come up at boot time is added to the routing table and
subsequent gateways listed in other interfaces are ignored. This is especially important to consider
when conguring the real servers.
After physically connecting together the cluster hardware, congure the network interfaces on the primary
and backup LVS routers. This can be done using a graphical application such as redhat-congnetwork
or by editing the network scripts manually. For more information about adding devices using
redhat-cong-network, see the chapter titled Network Conguration in the Red Hat Enterprise Linux
System Administration Guide. For more information on editing network scripts by hand, see the chapter
titled Network Scripts in the Red Hat Enterprise Linux Reference Guide. For the remainder of the
chapter, example alterations to network interfaces are made either manually or through the Piranha
Conguration Tool.
12.2.1. General LVS Networking Tips
Congure the real IP addresses for both the public and private networks on the LVS routers before
attempting to congure the cluster using the Piranha Conguration Tool. The sections on each
topography give example network addresses, but the actual network addresses are needed. Below are
some useful commands for bringing up network interfaces or checking their status.
Bringing Up Real Network Interfaces
The best way to bring up any real network interface is to use the following commands as root
replacing N with the number corresponding to the interface (eth0 and eth1):
/sbin/ifup ethN
Warning
Do not use the ifup scripts to bring up any oating IP addresses you may congure using
Piranha Conguration Tool (eth0:1 or eth1:1). Use the service command to start pulse
instead (see Section 13.8 Starting the Cluster for details).
To bring a network interface down, type the following command:
/sbin/ifdown ethN
Again, replace N in the above command with the number corresponding to the interface you wish
to bring down.
Checking the Status of Network Interfaces
If you need to check which network interfaces are up at any given time, type the following:
/sbin/ifconfig
To view the routing table for a machine, issue the following command:
/sbin/route
12.3. Multi-port Services and LVS Clustering
LVS routers under any topology require extra conguration when creating multi-port LVS services.
Multi-port services can be created articially by using rewall marks to bundle together different, but
related protocols, such as HTTP (port 80) and HTTPS (port 443), or when LVS is used to cluster true
Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 129
multi-port protocols, such as FTP. In either case, the LVS router uses rewall marks to recognize that
packets destined for different ports, but bearing the same rewall mark, should be handled identically.
Also, when combined with persistence, rewall marks ensure connections from the client machine are
routed to the same host, as long as the connections occur within the length of time specied by the
persistence parameter. For more on assigning persistence to a virtual server, see Section 13.6.1 The
VIRTUAL SERVER Subsection.
Unfortunately, the mechanism used to balance the loads on the real servers ?IPVS?can recognize
the rewall marks assigned to a packet, but cannot itself assign rewall marks. The job of assigning
rewall marks must be performed by the network packet lter, iptables, outside of Piranha
Conguration Tool.
12.3.1. Assigning Firewall Marks
To assign rewall marks to a packet destined for a particular port, the administrator must use
iptables.
This section illustrates how to bundle HTTP and HTTPS as an example, however FTP is another
commonly clustered multi-port protocol. If an LVS cluster is used for FTP services, see Section 12.4
FTP In an LVS Cluster for details on how to best congure the cluster.
The basic rule to remember when using rewall marks is that for every protocol using a rewall mark
in Piranha Conguration Tool there must be a commensurate iptables rule to assign marks to the
network packets.
Before creating network packet lter rules, make sure there are no rules already in place. To do this,
open a shell prompt, login as root, and type:
/sbin/service iptables status
If iptables is not running, the prompt will instantly reappear.
If iptables is active, it displays a set of rules. If rules are present, type the following command:
/sbin/service iptables stop
If the rules already in place are important, check the contents of /etc/sysconfig/iptables and
copy any rules worth keeping to a safe place before proceeding.
Below are rules which assign the same rewall mark, 80, to incoming trafc destined for the oating IP
address, n.n.n.n, on ports 80 and 443. For instructions on assigning the VIP to the public network
interface, see Section 13.6.1 The VIRTUAL SERVER Subsection. Also note that you must log in as
root and load the module for iptables before issuing rules for the rst time.
/sbin/modprobe ip_tables
/sbin/iptables -t mangle -A PREROUTING -p tcp \
-d n.n.n.n/32 --dport 80 -j MARK --set-mark 80
/sbin/iptables -t mangle-A PREROUTING -p tcp \
-d n.n.n.n/32 --dport 443 -j MARK --set-mark 80
In the above iptables commands, n.n.n.n should be replaced with the oating IP for your HTTP
and HTTPS virtual servers. These commands have the net effect of assigning any trafc addressed
to the VIP on the appropriate ports a rewall mark of 80, which in turn is recognized by IPVS and
forwarded appropriately.
130 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster
Warning
The commands above will take effect immediately, but do not persist through a reboot of the system.
To ensure network packet lter settings are restored upon reboot, refer to Section 12.5 Saving
Network Packet Filter Settings
12.4. FTP In an LVS Cluster
File Transport Protocol (FTP) is an old and complex multi-port protocol that presents a distinct set
of challenges to a clustered environment. To understand the nature of these challenges, you must rst
understand some key things about how FTP works.
12.4.1. How FTP Works
With most other server client relationships, the client machine opens up a connection to the server on
a particular port and the server then responds to the client on that port. When an FTP client connects
to an FTP server it opens a connection to the FTP control port 21. Then the client tells the FTP server
whether to establish an active or passive connection. The type of connection chosen by the client
determines how the server responds and on what ports transactions will occur.
The two types of data connections are:
Active Connections
When an active connection is established, the server opens a data connection to the client from
port 20 to a high range port on the client machine. All data from the server is then passed over
this connection.
Passive Connections
When a passive connection is established, the client asks the FTP server to establish a passive
connection port, which can be on any port higher than 10,000. The server then binds to this
high-numbered port for this particular session and relays that port number back to the client.
The client then opens the newly bound port for the data connection. Each data request the client
makes results in a separate data connection. Most modern FTP clients attempt to establish a
passive connection when requesting data from servers.
The two important things to note about all of this in regards to clustering is:
1. The client determines the type of connection, not the server. This means, to effectively cluster
FTP, you must congure the LVS routers to handle both active and passive connections.
2. The FTP client/server relationship can potentially open a large number of ports that the Piranha
Conguration Tool and IPVS do not know about.
12.4.2. How This Affects LVS Routing
IPVS packet forwarding only allows connections in and out of the cluster based on it recognizing its
port number or its rewall mark. If a client from outside the cluster attempts to open a port IPVS
is not congured to handle, it drops the connection. Similarly, if the real server attempts to open a
connection back out to the Internet on a port IPVS does not know about, it drops the connection. This
means all connections from FTP clients on the Internet must have the same rewall mark assigned
to them and all connections from the FTP server must be properly forwarded to the Internet using
network packet ltering rules.
Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 131
12.4.3. Creating Network Packet Filter Rules
Before assigning any iptables rules for FTP service, review the information in Section 12.3.1 Assigning
Firewall Marks concerning multi-port services and techniques for checking the existing network
packet ltering rules.
Below are rules which assign the same rewall mark, 21, to FTP trafc. For these rules to work
properly, you must also use the VIRTUAL SERVER subsection of Piranha Conguration Tool to
congure a virtual server for port 21 with a value of 21 in the Firewall Mark eld. See Section 13.6.1
The VIRTUAL SERVER Subsection for details.
12.4.3.1. Rules for Active Connections
The rules for active connections tell the kernel to accept and forward connections coming to the
internal oating IP address on port 20 ? the FTP data port.
iptables
/sbin/iptables -t nat -A POSTROUTING -p tcp \
-s n.n.n.0/24 --sport 20 -j MASQUERADE
In the above iptables commands, n.n.n should be replaced with the rst three values for the oating
IP for the NAT interface's internal network interface dened in the GLOBAL SETTINGS panel
of Piranha Conguration Tool. The command allows the LVS router to accept outgoing connections
from the real servers that IPVS does not know about.
12.4.3.2. Rules for Passive Connections
The rules for passive connections assign the appropriate rewall mark to connections coming in from
the Internet to the oating IP for the service on a wide range of ports ? 10,000 to 20,000.
Warning
If you are limiting the port range for passive connections, you must also congure the VSFTP
server to use a matching port range. This can be accomplished by adding the following lines to
/etc/vsftpd.conf:
pasv_min_port=10000
pasv_max_port=20000
You must also control the address that the server displays to the client for passive FTP connections.
In a NAT routed LVS system, add the following line to /etc/vsftpd.conf to override the real server
IP address to the VIP, which is what the client sees upon connection. For example:
pasv_address=X.X.X.X
Replace X.X.X.X with the VIP address of the LVS system.
For conguration of other FTP servers, consult the respective documentation.
This range should be a wide enough for most situations; however, you can increase this number
to include all available non-secured ports by changing 10000:20000 in the commands below to
1024:65535.
132 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster
iptables
/sbin/iptables -t mangle -A PREROUTING -p tcp \
-d n.n.n.n/32 \
--dport 21 -j MARK --set-mark 21
/sbin/iptables -t mangle -A PREROUTING -p tcp \
-d n.n.n.n/32 \
--dport 10000:20000 -j MARK --set-mark 21
In the above iptables commands, n.n.n.n should be replaced with the oating IP for the FTP
virtual server dened in the VIRTUAL SERVER subsection of Piranha Conguration Tool. These
commands have the net effect of assigning any trafc addressed to the oating IP on the appropriate
ports a rewall mark of 21, which is in turn recognized by IPVS and forwarded appropriately.
Warning
The commands above take effect immediately, but do not persist through a reboot of the system. To
ensure network packet lter settings are restored after a reboot, see Section 12.5 Saving Network
Packet Filter Settings
Finally, you need to be sure that the appropriate service is set to activate on the proper runlevels. For
more on this, refer to Section 11.1 Conguring Services on the LVS Routers.
12.5. Saving Network Packet Filter Settings
After conguring the appropriate network packet lters for your situation, save the settings so they
get restored after a reboot. For iptables, type the following command:
/sbin/service iptables save
This saves the settings in /etc/sysconfig/iptables so they can be recalled at boot time.
Once this le is written, you are able to use the /sbin/service command to start, stop, and check
the status (using the status switch) of iptables. The /sbin/service will automatically load the
appropriate module for you. For an example of how to use the /sbin/service command, see Section
11.3 Starting the Piranha Conguration Tool Service.
Finally, you need to be sure the appropriate service is set to activate on the proper runlevels. For more
on this, see Section 11.1 Conguring Services on the LVS Routers.
The next chapter explains how to use the Piranha Conguration Tool to congure the LVS router
and describe the steps necessary to active an LVS cluster.
Chapter 13.
Conguring the LVS Routers with Piranha
Conguration Tool
The Piranha Conguration Tool provides a structured approach to creating the necessary conguration
le for a Piranha cluster ? /etc/sysconfig/ha/lvs.cf. This chapter describes the basic
operation of the Piranha Conguration Tool and how to activate the cluster once conguration is
complete.
Important
The conguration le for the LVS cluster follows strict formatting rules. Using the Piranha Conguration
Tool is the best way to prevent syntax errors in the lvs.cf and therefore prevent software
failures.
13.1. Necessary Software
The piranha-gui service must be running on the primary LVS router to use the Piranha Conguration
Tool. To congure the cluster, you minimally need a text-only Web browser, such as links.
If you are accessing the LVS router from another machine, you also need an ssh connection to the
primary LVS router as the root user.
While conguring the primary LVS router it is a good idea to keep a concurrent ssh connection
in a terminal window. This connection provides a secure way to restart pulse and other services,
congure network packet lters, and monitor /var/log/messages during trouble shooting.
The next four sections walk through each of the conguration pages of the Piranha Conguration
Tool and give instructions on using it to set up the LVS cluster.
13.2. Logging Into the Piranha Conguration Tool
When conguring an LVS cluster, you should always begin by conguring the primary router with
the Piranha Conguration Tool. To do this,verify that the piranha-gui service is running and an
administrative password has been set, as described in Section 11.2 Setting a Password for the Piranha
Conguration Tool.
If you are accessing the machine locally, you can open http://localhost:3636 in a Web
browser to access the Piranha Conguration Tool. Otherwise, type in the hostname or real IP address
for the server followed by :3636. Once the browser connects, you will see the screen shown in
Figure 13-1.
134 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Figure 13-1. The Welcome Panel
Click on the Login button and enter piranha for the Username and the administrative password
you created in the Password eld.
The Piranha Conguration Tool is made of four main screens or panels. In addition, the Virtual
Servers panel contains four subsections. The CONTROL/MONITORING panel is the rst panel
after the login screen.
13.3. CONTROL/MONITORING
The CONTROL/MONITORING Panel presents the cluster administrator with a limited runtime
status of the cluster. It displays the status of the pulse daemon, the LVS routing table, and the LVSspawned
nanny processes.
Note
The elds for CURRENT LVS ROUTING TABLE and CURRENT LVS PROCESSES remain blank
until you actually start the cluster, as shown in Section 13.8 Starting the Cluster .
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 135
Figure 13-2. The CONTROL/MONITORING Panel
Auto update
The status display on this page can be updated automatically at a user congurable interval. To
enable this feature, click on the Auto update checkbox and set the desired update frequency in
the Update frequency in seconds text box (the default value is 10 seconds).
It is not recommended that you set the automatic update to an interval less than 10 seconds.
Doing so may make it difcult to recongure the Auto update interval because the page will
update too frequently. If you encounter this issue, simply click on another panel and then back
on CONTROL/MONITORING.
The Auto update feature does not work with all browsers, such as Mozilla.
Update information now
You can manually update the status information manually by clicking this button.
CHANGE PASSWORD
Clicking this button takes you to a help screen with information on how to change the administrative
password for the Piranha Conguration Tool.
13.4. GLOBAL SETTINGS
The GLOBAL SETTINGS panel is where the cluster administrator denes the networking details
for the primary LVS router's public and private network interfaces.
136 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Figure 13-3. The GLOBAL SETTINGS Panel
The top half of this panel sets up the primary LVS router's public and private network interfaces.
These are the interfaces already congured in Section 12.1.1 Conguring Network Interfaces for a
NAT LVS Cluster.
Primary server public IP
In this eld, enter the publicly routable real IP address for the primary LVS node.
Primary server private IP
Enter the real IP address for an alternative network interface on the primary LVS node. This
address is used solely as an alternative heartbeat channel for the backup router and does not
have to correlate to the real private IP address assigned in Section 12.1.1 Conguring Network
Interfaces for a NAT LVS Cluster. You may leave this eld blank, but doing so will mean there is
no alternate heartbeat channel for the backup LVS router to use and therefore will create a single
point of failure.
Tip
The primary LVS router's private IP can be congured on any interface that accepts TCP/IP,
whether it be an Ethernet adapter or a serial port.
Use network type
Click the NAT button to select NAT routing.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 137
The next three elds deal specically with the NAT router's virtual network interface connected the
private network with the real servers.
NAT Router IP
Enter the private oating IP in this text eld. This oating IP should be used as the gateway for
the real servers.
NAT Router netmask
If the NAT router's oating IP needs a particular netmask, select it from drop-down list.
NAT Router device
Use this text eld to dene the device name of the network interface for the oating IP address,
such as eth1:1.
Tip
You should alias the NAT oating IP address to the Ethernet interface connected to the private
network. In this example, the private network is on the eth1 interface, so eth1:1 is the oating
IP address.
Warning
After completing this page, click the ACCEPT button to make sure you do not lose any changes when
selecting a new panel.
13.5. REDUNDANCY
The REDUNDANCY panel allows you to congure of the backup LVS router node and set various
heartbeat monitoring options.
Tip
The rst time you visit this screen, it displays an "inactive" Backup status and an ENABLE button.
To congure the backup LVS router, click on the ENABLE button so that the screen matches Figure
13-4.
138 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Figure 13-4. The REDUNDANCY Panel
Redundant server public IP
Enter the public real IP address for the backup LVS router node.
Redundant server private IP
Enter the backup node's private real IP address in this text eld.
If you do not see the eld called Redundant server private IP, go back to the GLOBAL SETTINGS
panel and enter a Primary server private IP address and click ACCEPT.
The rest of the panel is devoted to conguring the heartbeat channel, which is used by the backup
node to monitor the primary node for failure.
Heartbeat Interval (seconds)
This eld sets the number of seconds between heartbeats ? the interval that the backup node
will check the functional status of the primary LVS node.
Assume dead after (seconds)
If the primary LVS node does not respond after this number of seconds, then the backup LVS
router node will initiate failover.
Heartbeat runs on port
This eld sets the port at which the heartbeat communicates with the primary LVS node. The
default is set to 539 if this eld is left blank.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 139
Warning
Remember to click the ACCEPT button after making any changes in this panel to make sure you do
not lose any changes when selecting a new panel.
13.6. VIRTUAL SERVERS
The VIRTUAL SERVERS panel displays information for each currently dened virtual server. Each
table entry shows the status of the virtual server, the server name, the virtual IP assigned to the server,
the netmask of the virtual IP, the port number to which the service communicates, the protocol used,
and the virtual device interface.
Figure 13-5. The VIRTUAL SERVERS Panel
Each server displayed in the VIRTUAL SERVERS panel can be congured on subsequent screens
or subsections.
To add a service, click the ADD button. To remove a service, select it by clicking the radio button next
to the virtual server and click the DELETE button.
To enable or disable a virtual server in the table click its radio button and click the (DE)ACTIVATE
button.
After adding a virtual server, you can congure it by clicking the radio button to its left and clicking
the EDIT button to display the VIRTUAL SERVER subsection.
140 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
13.6.1. The VIRTUAL SERVER Subsection
The VIRTUAL SERVER subsection panel shown in Figure 13-6 allows you to congure an individual
virtual server. Links to subsections related specically to this virtual server are located along the
top of the page. But before conguring any of the subsections related to this virtual server, complete
this page and click on the ACCEPT button.
Figure 13-6. The VIRTUAL SERVERS Subsection
Name
Enter a descriptive name to identify the virtual server. This name is not the hostname for the
machine, so make it descriptive and easily identiable. You can even reference the protocol used
by the virtual server, such as HTTP.
Application port
Enter the port number through which the service application will listen. Since this example is for
HTTP services, port 80 is used.
Protocol
Choose between UDP and TCP in the drop-down menu. Web servers typically communicate via
the TCP protocol, so this is selected in the example above.
Virtual IP Address
Enter the virtual server's oating IP address in this text eld.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 141
Virtual IP Network Mask
Set the netmask for this virtual server with the drop-down menu.
Firewall Mark
Do not enter a rewall mark integer value in this eld unless you are bundling multi-port protocols
or creating a multi-port virtual server for separate, but related protocols. In this example, the
above virtual server has a Firewall Mark of 80 because we are bundling connections to HTTP
on port 80 and to HTTPS on port 443 using the rewall mark value of 80. When combined with
persistence, this technique will ensure users accessing both insecure and secure webpages are
routed to the same real server, preserving state.
Warning
Entering a rewall mark in this eld allows IPVS to recognize that packets bearing this rewall
mark are treated the same, but you must perform further conguration outside of the Piranha
Conguration Tool to actually assign the rewall marks. See Section 12.3 Multi-port Services
and LVS Clustering for instructions on creating multi-port services and Section 12.4 FTP In an
LVS Cluster for creating a highly available FTP virtual server.
Device
Enter the name of the network device to which you want the oating IP address dened the
Virtual IP Address eld to bind.
You should alias the public oating IP address to the Ethernet interface connected to the public
network. In this example, the public network is on the eth0 interface, so eth0:1 should be
entered as the device name.
Re-entry Time
Enter an integer value which denes the length of time, in seconds, before the active LVS router
attempts to bring a real server back into the cluster after a failure.
Service Timeout
Enter an integer value which denes the length of time, in seconds, before a real server is considered
dead and removed from the cluster.
Quiesce server
When the Quiesce server radio button is selected, anytime a new real server node comes online,
the least-connections table is reset to zero so the active LVS router routes requests as if all the real
servers were freshly added to the cluster. This option prevents the a new server from becoming
bogged down with a high number of connections upon entering the cluster.
Load monitoring tool
The LVS router can monitor the load on the various real servers by using either rup or ruptime.
If you select rup from the drop-down menu, each real server must run the rstatd service. If
you select ruptime, each real server must run the rwhod service.
Caution
Load monitoring is not the same as load balancing and can result in hard to predict scheduling
behavior when combined with weighted scheduling algorithms. Also, if you use load monitoring,
the real servers in the cluster must be Linux machines.
142 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Scheduling
Select your preferred scheduling algorithm from the drop-down menu. The default is Weighted
least-connection. For more information on scheduling algorithms, see Section 10.3.1
Scheduling Algorithms.
Persistence
If an administrator needs persistent connections to the virtual server during client transactions,
enter the number of seconds of inactivity allowed to lapse before a connection times out in this
text eld.
Important
If you entered a value in the Firewall Mark eld above, you should enter a value for persistence
as well. Also, be sure that if you use rewall marks and persistence together, that the amount of
persistence is the same for each virtual server with the rewall mark. For more on persistence
and rewall marks, refer to Section 10.5 Persistence and Firewall Marks.
Persistence Network Mask
To limit persistence to particular subnet, select the appropriate network mask from the drop-down
menu.
Note
Before the advent of rewall marks, persistence limited by subnet was a crude way of bundling
connections. Now, it is best to use persistence in relation to rewall marks to achieve the same
result.
Warning
Remember to click the ACCEPT button after making any changes in this panel. To make sure you do
not lose changes when selecting a new panel.
13.6.2. REAL SERVER Subsection
Clicking on the REAL SERVER subsection link at the top of the panel displays the EDIT REAL
SERVER subsection. It displays the status of the physical server hosts for a particular virtual service.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 143
Figure 13-7. The REAL SERVER Subsection
Click the ADD button to add a new server. To delete an existing server, select the radio button beside
it and click the DELETE button. Click the EDIT button to load the EDIT REAL SERVER panel,
as seen in Figure 13-8.
144 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Figure 13-8. The REAL SERVER Conguration Panel
This panel consists of three entry elds:
Name
A descriptive name for the real server.
Tip
This name is not the hostname for the machine, so make it descriptive and easily identiable.
Address
The real server's IP address. Since the listening port is already specied for the associated virtual
server, do not add a port number.
Weight
An integer value indicating this host's capacity relative to that of other hosts in the pool. The
value can be arbitrary, but treat it as a ratio in relation to other real servers in the cluster. For
more on server weight, see Section 10.3.2 Server Weight and Scheduling.
Warning
Remember to click the ACCEPT button after making any changes in this panel. To make sure you do
not lose any changes when selecting a new panel.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 145
13.6.3. EDIT MONITORING SCRIPTS Subsection
Click on the MONITORING SCRIPTS link at the top of the page. The EDIT MONITORING
SCRIPTS subsection allows the administrator to specify a send/expect string sequence to verify that
the service for the virtual server is functional on each real server. It is also the place where the administrator
can specify customized scripts to check services requiring dynamically changing data.
Figure 13-9. The EDIT MONITORING SCRIPTS Subsection
Sending Program
For more advanced service verication, you can use this eld to specify the path to a servicechecking
script. This functionality is especially helpful for services that require dynamically
changing data, such as HTTPS or SSL.
To use this functionality, you must write a script that returns a textual response, set it to be
executable, and type the path to it in the Sending Program eld.
Tip
To ensure that each server in the real server pool is checked, use the special token %h after the
path to the script in the Sending Program eld. This token is replaced with each real server's
IP address as the script is called by the nanny daemon.
146 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
The following is a sample script to use as a guide when composing an external service-checking
script:
#!/bin/sh
TEST=`dig -t soa example.com @$1 | grep -c dns.example.com
if [ $TEST != "1" ]; then
echo "OK
else
echo "FAIL"
fi
Note
If an external program is entered in the Sending Program eld, then the Send eld is ignored.
Send
Enter a string for the nanny daemon to send to each real server in this eld. By default the send
eld is completed for HTTP. You can alter this value depending on your needs. If you leave this
eld blank, the nanny daemon attempts to open the port and assume the service is running if it
succeeds.
Only one send sequence is allowed in this eld, and it can only contain printable, ASCII characters
as well as the following escape characters:
? \n for new line.
? \r for carriage return.
? \t for tab.
? \ to escape the next character which follows it.
Expect
Enter a the textual response the server should return if it is functioning properly. If you wrote
your own sending program, enter the response you told it to send if it was successful.
Tip
To determine what to send for a given service, you can open a telnet connection to the port
on a real server and see what is returned. For instance, FTP reports 220 upon connecting, so
could enter quit in the Send eld and 220 in the Expect eld.
Warning
Remember to click the ACCEPT button after making any changes in this panel. To make sure you do
not lose any changes when selecting a new panel.
Once you have congured virtual servers using the Piranha Conguration Tool, you must copy
specic conguration les to the backup LVS router. See Section 13.7 Synchronizing Conguration
Files for details.
Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 147
13.7. Synchronizing Conguration Files
After conguring the primary LVS router, there are several conguration les that must be copied to
the backup LVS router before you start the cluster.
These les include:
? /etc/sysconfig/ha/lvs.cf? the conguration le for the LVS routers.
? /etc/sysctl?the conguration le that, among other things, turns on packet forwarding in the
kernel.
? /etc/sysconfig/iptables? If you are using rewall marks, you should synchronize one of
these les based on which network packet lter you are using.
Important
The /etc/sysctl.conf and /etc/sysconfig/iptables les do not change when you congure
the cluster using the Piranha Conguration Tool.
13.7.1. Synchronizing lvs.cf
Anytime the LVS conguration le, /etc/sysconfig/ha/lvs.cf, is created or updated, you must
copy it to the backup LVS router node.
Warning
Both the active and backup LVS router nodes must have identical lvs.cf les. Mismatched LVS
conguration les between the LVS router nodes can prevent failover.
The best way to do this is to use the scp command.
Important
To use scp the sshd must be running on the backup router, see Section 11.1 Conguring Services on
the LVS Routers for details on how to properly congure the necessary services on the LVS routers.
Issue the following command as the root user from the primary LVS router to sync the lvs.cf les
between the router nodes:
scp /etc/sysconfig/ha/lvs.cf n.n.n.n:/etc/sysconfig/ha/lvs.cf
In the above command, replace n.n.n.n with the real IP address of the backup LVS router.
13.7.2. Synchronizing sysctl
The sysctl le is only modied once in most situations. This le is read at boot time and tells the
kernel to turn on packet forwarding.
148 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool
Important
If you are not sure whether or not packet forwarding is enabled in the kernel, see Section 11.5 Turning
on Packet Forwarding for instructions on how to check and, if necessary, enable this key functionality.
13.7.3. Synchronizing Network Packet Filtering Rules
If you are using iptables, you will need to synchronize the appropriate conguration le on the
backup LVS router.
If you alter the any network packet lter rules, enter the following command as root from the primary
LVS router:
scp /etc/sysconfig/iptables n.n.n.n:/etc/sysconfig/
In the above command, replace n.n.n.n with the real IP address of the backup LVS router.
Next either open an ssh session to the backup router or log into the machine as root and type the
following command:
/sbin/service iptables restart
Once you have copied these les over to the backup router and started the appropriate services (see
Section 11.1 Conguring Services on the LVS Routers for more on this topic) you are ready to start
the cluster.
13.8. Starting the Cluster
To start the LVS cluster, it is best to have two root terminals open simultaneously or two simultaneous
root open ssh sessions to the primary LVS router.
In one terminal, watch the kernel log messages with the command:
tail -f /var/log/messages
Then start the cluster by typing the following command into the other terminal:
/sbin/service pulse start
Follow the progress of the pulse service's startup in the terminal with the kernel log messages. When
you see the following output, the pulse daemon has started properly:
gratuitous lvs arps finished
To stop watching /var/log/messages, type [Ctrl]-[c].
From this point on, the primary LVS router is also the active LVS router. While you can make requests
to the cluster at this point, you should start the backup LVS router before putting the cluster into
service. To do this, simply repeat the process described above on the backup LVS router node.
After completing this nal step, the cluster will be up and running.
III. Appendixes
This section is licensed under the GNU Free Documentation License. For details refer to the Copyright
page.
Table of Contents
A. Using Red Hat Cluster Manager with Piranha....................................................................... 151
B. Using Red Hat GFS with Red Hat Cluster Suite..................................................................... 153
C. The GFS Setup Druid ................................................................................................................ 157
D. Supplementary Hardware Information ................................................................................... 163
E. Supplementary Software Information...................................................................................... 169
F. Cluster Command-line Utilities................................................................................................. 179
Appendix A.
Using Red Hat Cluster Manager with Piranha
A cluster can be used in conjunction with the Piranha load-balancing features to deploy a highly
available e-commerce site that has complete data integrity and application availability, in addition to
load balancing capabilities.
Figure A-1 shows how you could use Red Hat Cluster Manager with Piranha. The gure shows a
cluster with a three-tier architecture, where the top tier consists of Piranha load-balancing systems to
distribute Web requests. The second tier consists of a set of Web servers to serve the requests. The
third tier consists of a cluster to serve data to the Web servers.
Figure A-1. Cluster in an LVS Environment
In a Piranha conguration, client systems issue requests on theWorldWideWeb. For security reasons,
these requests enter a Web site through a rewall, which can be a Linux system serving in that capacity
or a dedicated rewall device. For redundancy, you can congure rewall devices in a failover
conguration. Behind the rewall are Piranha load-balancing systems, which can be congured in an
active-standby mode. The active load-balancing system forwards the requests to a set of Web servers.
Each Web server can independently process an HTTP request from a client and send the response
back to the client. Piranha enables an administrator to expand a Web site's capacity by adding Web
servers to the load-balancing systems' set of active Web servers. In addition, if a Web server fails, it
can be removed from the set.
This Piranha conguration is particularly suitable if the Web servers serve only static Web content,
which consists of small amounts of infrequently changing data, such as corporate logos, that can be
easily duplicated on the Web servers. However, this conguration is not suitable if the Web servers
serve dynamic content, which consists of information that changes frequently. Dynamic content could
152 Appendix A. Using Red Hat Cluster Manager with Piranha
include a product inventory, purchase orders, or customer database, which must be consistent on all
the Web servers to ensure that customers have access to up-to-date and accurate information.
To serve dynamic Web content in a Piranha conguration, add a cluster behind the Web servers, as
shown in the previous gure. This combination of Piranha and Red Hat Cluster Manager allows for
the conguration of a high-integrity, no-single-point-of-failure e-commerce site. The cluster can run
a highly-available instance of a database or a set of databases that are network-accessible to the Web
servers.
For example, the gure could represent an e-commerce site used for online merchandise ordering
through a URL. Client requests to the URL pass through the rewall to the active Piranha loadbalancing
system, which then forwards the requests to one of the three Web servers. The Red Hat
Cluster Manager systems serve dynamic data to the Web servers, which forward the data to the requesting
client system.
Appendix B.
Using Red Hat GFS with Red Hat Cluster Suite
This appendix provides information about considerations to take when running Red Hat GFS 6.0 with
Red Hat Cluster Suite and consists of the following sections:
? Section B.1 Terminology
? Section B.2 Changes to Red Hat Cluster
? Section B.3 Installation Scenarios
B.1. Terminology
You may have encountered new terms associated with Red Hat Cluster Suite. The following list provides
a brief description of terms used with Red Hat GFS and Red Hat Cluster Suite:
GFS Setup Druid
This application is a Red Hat Cluster GUI for initial conguration of Red Hat GFS. The GUI is
launched separately from the Red Hat Cluster GUI, the Cluster Conguration Tool. The GFS
Setup Druid uses /etc/cluster.xml as input. If /etc/cluster.xml does not exist, the
GFS Setup Druid displays a message and exits.
Note
You must run the Cluster Conguration Tool before running the GFS Setup Druid; the Cluster
Conguration Tool creates /etc/cluster.xml.
To run the GFS Setup Druid, enter the following at the command line:
# redhat-config-gfscluster
gulm-bridge
This is a fence method available for Red Hat Cluster nodes, if and only if the Red Hat GFS RPM
is installed on the node that the Cluster Conguration Tool runs on. The gulm-bridge fence
method has been added to Red Hat Cluster Suite specically for the Red Hat Enterprise Linux
3 Update 3 release. Using this fence method on a Red Hat Cluster Manager member prevents it
from being fenced twice.
Red Hat Cluster
Red Hat Cluster Manager is part of the Red Hat Cluster Suite. It provides cluster administration
functionality for Red Hat Enterprise Linux 3. Red Hat Cluster Manager contains two major
components:
? Red Hat Cluster Manager ? The underlying software (non-GUI) that performs Red Hat
Cluster administrations services.
? Cluster Conguration Tool ? This component is the graphical user interface (GUI) for Red
Hat Cluster Manager. The GUI provides a conguration interface and a status monitor for
members and services in a Red Hat Cluster Manager system. The Cluster Conguration
Tool accepts conguration data from a user and writes it to the /etc/cluster.xml le. The
Red Hat Cluster Manager reads the conguration data from the /etc/cluster.xml le.
154 Appendix B. Using Red Hat GFS with Red Hat Cluster Suite
Also, the Cluster Conguration Tool wraps several command line calls into the Red Hat
Cluster Manager, such as starting and stopping services.
B.2. Changes to Red Hat Cluster
The following changes to Red Hat Cluster enable running it with Red Hat GFS in RHEL-U3:
? The Cluster Conguration Tool has been changed. After entering members in the conguration
section of the application, if a member is highlighted, and you click Add Child, a dialog box
is displayed, offering fence method options. You can select a fence method by clicking a radio
button next to the fence method in the dialog box. Earlier Red Hat Cluster releases provided only
two fence method options (under Power Controller Type): Serial and Network. For Red Hat
Enterprise Linux 3 Update 3, if Red Hat GFS is installed on the node, then a third fence-method
option, GULM-STONITH (the gulm-bridge fence method), is available.
? The Red Hat Cluster Manager now provides support for GULM-STONITH, the gulm-bridge
fence method.
? A druid application, the GFS Setup Druid, provides for conguring an initial instance of Red
Hat GFS by writing the three Red Hat GFS conguration les: cluster.ccs, nodes.ccs, and
fence.ccs. The GFS Setup Druid requires an /etc/cluster.xml le when started.
B.3. Installation Scenarios
When running Red Hat GFS with Red Hat Cluster Manager, you must take into account certain considerations,
according to the following circumstances:
? New installations of Red Hat GFS and Red Hat Cluster Manager
? Adding Red Hat GFS to an existing Red Hat Cluster Manager deployment
? Upgrading Red Hat GFS 5.2.1 to Red Hat GFS 6.0
B.3.1. New Installations of Red Hat GFS and Red Hat Cluster Manager
When installing Red Hat GFS and Red Hat Cluster Manager for the rst time into a cluster, install
and congure Red Hat Cluster Suite before installing and conguring Red Hat GFS.With the Cluster
Conguration Tool, you can congure up to 16 nodes ? the maximum number of nodes allowed in
Red Hat Cluster Manager system.
You can add services and failover domains (and other functions) after initially conguring Red Hat
GFS with the GFS Setup Druid.
Note
The only conguration items in Red Hat Cluster that Red Hat GFS or the GFS Setup Druid depend
on are setting up Red Hat Cluster Manager members and specifying fence devices.
Appendix B. Using Red Hat GFS with Red Hat Cluster Suite 155
B.3.2. Adding Red Hat GFS to an Existing Red Hat Cluster Manager
Deployment
Adding Red Hat GFS to an existing Red Hat Cluster Manager deployment requires running the Red
Hat GFS druid application, GFS Setup Druid (also known as redhat-cong-gfscluster). As with
the scenario in Section B.3.1 New Installations of Red Hat GFS and Red Hat Cluster Manager, while
Red Hat GFS is scalable up to 300 nodes, a Red Hat Cluster Manager limits the total number of nodes
in a cluster to 16. Therefore, in this scenario, Red Hat GFS scalability is limited. If the 16-node limit
is too small for your deployment, you may want to consider using multiple Red Hat Cluster Manager
clusters.
156 Appendix B. Using Red Hat GFS with Red Hat Cluster Suite
Appendix C.
The GFS Setup Druid
The Red Hat Cluster Suite includes a graphical tool for administrators who are new to Red Hat GFS
conguration. The GFS Setup Druid allows administrators to quickly migrate existing Red Hat Cluster
Manager cluster systems to Red Hat GFS. The GFS Setup Druid integrates any previously con-
gured cluster nodes and fencing devices from the existing /etc/cluster.xml conguration le.
Administrators can then congure any new nodes and fence devices, as well as associate fence devices
to nodes for proper power control.
Note
For more detailed information about installation, conguration, and administration of Red Hat GFS,
refer to the Red Hat GFS Administrator's Guide.
The GFS Setup Druid can be started by typing redhat-config-gfscluster as root from a shell
prompt.
C.1. Cluster Name
After reading the introductory screen describing the functionality of the GFS Setup Druid, click
Forward to continue. You are then prompted to type a name for the GFS cluster. You can enter any
name that does not contain spaces. (such as Development1 or Engineering_Cluster). Click
Forward to continue.
Figure C-1. Cluster Name
158 Appendix C. The GFS Setup Druid
C.2. LOCK_GULM parameters
After naming the cluster, you must then congure the GFS LOCK_GULM parameters. The Grand
Unied Lock Manager (GULM) is a centralized server-based locking and cluster state management
for GFS le systems. Multiple lock servers can be congured for failover redundancy in the event of
server failure. Figure C-2 shows the LOCK_GULM conguration screen.
Figure C-2. Conguring LOCK_GULM parameters
Allowed Misses species the number of responses that a node can consecutively miss before the lock
server(s) declare the node expired. The default value is 2.
Heartbeat Rate sets the interval (in seconds) at which heartbeats are checked by the lock server(s).
Two-thirds of this interval is the rate at which nodes send heartbeat messages to the lock server(s).
The default rate is set at 15 seconds.
After conguring the LOCK_GULM parameters, click Forward to continue.
C.3. Choose Location for CCS Files
The next screen prompts you to type or browse for a directory that stores the GFS cluster conguration
les created by using the GFS Setup Druid. By default, the /tmp/ directory stores the three
conguration les: cluster.ccs, nodes.ccs, and fence.ccs. When a directory is chosen, click
Forward to continue.
Appendix C. The GFS Setup Druid 159
Figure C-3. Location to Write GFS Conguration Files
C.4. Cluster Members
The next screen allows you to add, delete, and congure cluster nodes. You can also congure fence
devices, which are hardware- or software-based devices used to perform shutdown or reboot of nodes
that are expired or that need to be serviced. Figure C-4 shows the cluster and fence device conguration
screen.
Figure C-4. Congure Cluster Members
160 Appendix C. The GFS Setup Druid
C.4.1. Adding a New Member
Click the Add a New Member Node to create a new cluster member. A window similar to Figure C-5
appears that prompts you to add the Host Name, IP Address, and Network Device of the member
node. Once these values are entered, click Update to add the member to the system.
Figure C-5. Add a New Cluster Member
The added node appears within the Member Nodes and Fence Instances section. You can congure
any member node by clicking the Edit Properties button. Clicking Delete Entry deletes the highlighted
member node.
C.4.2. Fence Devices
To create an entry for a fence device, click the Add a Fence Device button in the Fence Device
Specications section. A window appears similar to Figure C-6.
Figure C-6. Add a Fence Device
Enter a name (do not use spaces) in the Fence Device Name eld. Then, choose the type of fence
device using the Agent Type drop down menu. Depending on the fence device chosen, enter any
required parameters. Available parameters include:
? IP Address ? The address of the device if it is a network power switch or Fiber Channel switch.
? Login ? The username that is entered to access the device.
? Password ? The password used to authenticate the username that accesses the device.
? Server ? The server name of the device (required when using the Global Network Block Device
(GNBD) as a fencing method.
? Hostname ? The hostname of the device. This name must be addressable to the cluster either by
DNS assignment or by static entry in each nodes /etc/hosts le.
Appendix C. The GFS Setup Druid 161
Note
For more information about the types of fence devices in the Agent Type drop down menu, refer to the
chapter titled "Creating the Cluster Conguration System Files" in the Red Hat GFS Administrator's
Guide.
Once a fence device is added, you can recongure the entry by clicking the Edit Properties button.
To delete a device, highlight its entry and click the Delete Entry button.
Finally, you can associate a fence device to a cluster member by highlighting the entry for the particular
member and clicking the Add a Fence to a Node button in the Member Nodes and Fence
Instances section. A pop up window appears allowing you to choose any congured fence device
from a drop-down menu and associate to the chosen node. Depending on the fence device chosen, you
may have to enter additional fence device specications, such as the Port to which the member node
is connected on the fence device. Once congured, click Update and the GFS Setup Druid shows
the member node with the associated fence device(s) below it.
Note
You cannot delete a fence device listed in the Fence Device Specications section if it is associated
with a cluster node. You must rst delete the fence instance(s) in the Member Nodes and Fence Instances
section for that fence device before you delete the entry in the Fence Device Specications
section.
C.5. Saving Your Conguration and Next Steps
When you are satised with your entries, click Forward to continue to the nal screen, which prompts
you to click Apply and save the changes to the cluster conguration les. A pop up window appears
allowing you to write the les or cancel the save and go back and change the settings.
Important
Once you have written the conguration les, you cannot change them again using the GFS Setup
Druid. You must either restart the conguration from the beginning or edit each cluster conguration
le manually using a text editor such as vi or emacs.
Once the les are saved, you can create the cluster conguration archive (CCA), which is a binary
accessible to the cluster via shared device (also called a pool volume).
Create the CCA by running the ccs_tool at a shell prompt as root. The usage of ccs_tool for
creating the CCA is as follows:
ccs_tool create /directory CCADevice
For example, if your cluster conguration les are located in /tmp, and you created a shared device
named /dev/pool/devel_cca using the GFS Pool Volume Manager (which allows you to congure
pools of physical storage into logical volumes accessible to cluster nodes as if they were local storage),
run the following command:
162 Appendix C. The GFS Setup Druid
ccs_tool create /tmp/ /dev/pool/devel_cca
This chapter briey explains the process of conguring an existing Red Hat Cluster Manager cluster
to leverage Red Hat GFS using the GFS Setup Druid. However, for complete instructions on how
to properly install, setup, run, and congure Red Hat GFS, refer to the Red Hat GFS Administrator's
Guide.
Appendix D.
Supplementary Hardware Information
The following sections provide additional information about conguring the hardware used in a cluster
system.
D.1. Setting Up Power Controllers
This section discusses power controllers. For more information about power controllers and their role
in a cluster environment, refer to Section 2.1.3 Choosing the Type of Power Controller.
D.1.1. Power Switches
For a list of serial-attached and network-attached power switches tested with and/or supported by Red
Hat, Inc. for cluster power control, refer to the Red Hat Hardware Compatibility List located at the
following URL:
http://hardware.redhat.com/hcl/
D.1.2. Setting up Watchdog Power Switches
A description of the usage model for watchdog timers as a cluster data integrity provision appears
in Section 2.1.3 Choosing the Type of Power Controller. As described in that section, there are two
variants of watchdog timers: hardware-based and software-based.
This section details the conguration tasks required to setup watchdog timer usage in a cluster hardware
conguration.
Regardless of which type of watchdog timer is employed, it is necessary to create the device special
le appropriate for the watchdog timer. This can be accomplished with the following commands:
cd /dev
./MAKEDEV watchdog
When using the Cluster Conguration Tool, each new member added to the cluster has software
watchdog functionality enabled by default.
D.1.2.1. Conguring the Software Watchdog Timer
Any cluster system can utilize the software watchdog timer as a data integrity provision, as no dedicated
hardware components are required. The cluster software automatically loads the corresponding
loadable kernel module called softdog.
If the cluster is congured to utilize the software watchdog timer, the cluster membership daemon
(clumembd) periodically resets the timer interval. Should clumembd fail to reset the timer, the failed
cluster member reboots itself.
When using the software watchdog timer, there is a small risk that the system hangs in such a way that
the software watchdog thread is not executed. In this unlikely scenario, the other cluster member may
takeover services of the apparently hung cluster member. Generally, this is a safe operation; but in the
unlikely event that the hung cluster member resumes, data corruption could occur. To further lessen
the chance of this vulnerability occurring when using the software watchdog timer, administrators
should also congure the NMI watchdog timer as well as an external power switch (if available).
164 Appendix D. Supplementary Hardware Information
D.1.2.2. Enabling the NMI Watchdog Timer
If you are using the software watchdog timer as a data integrity provision, it is also recommended to
enable the Non-Maskable Interrupt (NMI) watchdog timer to enhance the data integrity guarantees.
The NMI watchdog timer is a different mechanism for causing the system to reboot in the event of a
hang scenario where interrupts are blocked. This NMI watchdog can be used in conjunction with the
software watchdog timer.
Unlike the software watchdog timer which is reset by the cluster quorum daemon (cluquorumd),
the NMI watchdog timer counts system interrupts. Normally, a healthy system receives hundreds of
device and timer interrupts per second. If there are no interrupts in a 5 second interval, a system hang
has occurred and the NMI watchdog timer expires, initiating a system reboot.
A robust data integrity solution can be implemented by combining the health monitoring of the the
cluster quorum daemon with the software watchdog timer along with the low-level system status
checks of the NMI watchdog.
Correct operation of the NMI watchdog timer mechanism requires that the cluster members contain
an APIC chip on the main system board.
The NMI watchdog is enabled on supported systems by adding nmi_watchdog=1 to the kernel's
command line. Here is an example /etc/grub.conf:
Note
The following GRUB and LILO bootloader congurations only apply to the x86 architecture of Red
Hat Enterprise Linux.
# grub.conf
default=0
timeout=10
splashimage=(hd0,0)/grub/splash.xpm.gz
title HA Test Kernel (2.4.9-10smp)
root (hd0,0)
# This is the kernel's command line.
kernel /vmlinuz-2.4.9-10smp ro root=/dev/hda2 nmi_watchdog=1
# end of grub.conf
On systems using LILO, add "nmi_watchdog=1" to the append section in /etc/lilo.conf.
For example:
# lilo.conf
prompt
timeout=50
default=linux
boot=/dev/hda
map=/boot/map
install=/boot/boot.b
lba32
image=/boot/vmlinuz-2.4.9-10smp
label=linux
read-only
root=/dev/hda2
append="nmi_watchdog=1"
# end of lilo.conf
Appendix D. Supplementary Hardware Information 165
Run /sbin/lilo after editing /etc/lilo.conf for the changes to take effect.
To determine if a server supports the NMI watchdog timer, rst try adding "nmi_watchdog=1" to
the kernel command line as described above. After the system has booted, log in as root and type:
cat /proc/interrupts
The output should appear similar to the following:
CPU0
0: 5623100 XT-PIC timer
1: 13 XT-PIC keyboard
2: 0 XT-PIC cascade
7: 0 XT-PIC usb-ohci
8: 1 XT-PIC rtc
9: 794332 XT-PIC aic7xxx, aic7xxx
10: 569498 XT-PIC eth0
12: 24 XT-PIC PS/2 Mouse
14: 0 XT-PIC ide0
NMI: 5620998
LOC: 5623358
ERR: 0
MIS: 0
The relevant portion of the above output is to verify that the NMI id appears on the left side. If NMI
value has a value larger than zero (0), the server supports the NMI watchdog.
If this approach fails, that is, NMI is zero, try passing nmi_watchdog=2 to the kernel instead of
nmi_watchdog=1 in the manner described previously. Again, check /proc/interrupts after the
system boots. If NMI has a value larger than zero, the NMI watchdog has been congured properly. If
NMI is zero, your system does not support the NMI watchdog timer.
D.1.2.3. Conguring a Hardware Watchdog Timer
The kernel provides driver support for various types of hardware watchdog timers. Some of these
timers are implemented directly on the system board; whereas, others are separate hardware components
such as PCI cards. Hardware-based watchdog timers provide excellent data integrity provisions
in the cluster because they operate independently of the system processor and are therefore fully capable
of rebooting a system in the event of a system hang.
Due to a lack of uniformity among low-level hardware watchdog components, it is difcult to make
generalizations describing how to know if a particular system contains such components. Many lowlevel
hardware watchdog components are not self-identifying.
When conguring any of the supported watchdog timers supported by the kernel, it is necessary to
place a corresponding entry into the /etc/modules.conf le. For example, if an Intel-810 based
TCO watchdog timer is to be used, the following line should be added to /etc/modules.conf:
alias wdt i810-tco
D.2. SCSI Bus Conguration Requirements
SCSI buses must adhere to a number of conguration requirements to operate correctly. Failure to
adhere to these requirements adversely affects cluster operation and resource availability.
The following are SCSI bus conguration requirements:
166 Appendix D. Supplementary Hardware Information
? Buses must be terminated at each end. Refer to Section D.3 SCSI Bus Termination for more information.
? Buses must not extend beyond the maximum length restriction for the bus type. Internal cabling
must be included in the length of the SCSI bus. Refer to Section D.4 SCSI Bus Length for more
information.
? All devices (host bus adapters and disks) on a bus must have unique SCSI identication numbers.
Refer to Section D.5 SCSI Identication Numbers for more information.
? The Linux device name for each shared SCSI device must be the same on each cluster system. For
example, a device named /dev/sdc on one cluster system must be named /dev/sdc on the other
cluster system. One way to ensure that devices are named the same is by using identical hardware
for both cluster systems.
Use the system's conguration utility to set SCSI identication numbers and enable host bus adapter
termination. When the system boots, a message is displayed describing how to start the utility. For
example, the utility prompts the user to press [Ctrl]-[A], and follow the prompts to perform a particular
task. To set storage enclosure and RAID controller termination, refer to the vendor documentation.
Refer to Section D.3 SCSI Bus Termination and Section D.5 SCSI Identication Numbers for more
information.
D.3. SCSI Bus Termination
A SCSI bus is an electrical path between two terminators. A device (host bus adapter, RAID controller,
or disk) attaches to a SCSI bus by a short stub, which is an unterminated bus segment that usually must
be less than 0.1 meter in length.
Buses must have only two terminators located at opposing ends of the bus. Additional terminators,
terminators that are not at the ends of the bus, or long stubs cause the bus to operate incorrectly. Termination
for a SCSI bus can be provided by the devices connected to the bus or by external terminators,
if the internal (onboard) device termination can be disabled.
Testing has shown that external termination on HBAs that run at speeds greater than 80MB/second
does not work reliably.
When disconnecting a device from a single-initiator SCSI bus follow these guidelines:
? Unterminated SCSI cables must not be connected to an operational host bus adapter or storage
device.
? Connector pins must not bend or touch an electrical conductor while the SCSI cable is disconnected.
? To disconnect a host bus adapter from a single-initiator bus, rst disconnect the SCSI cable from the
RAID controller and then from the adapter. This ensures that the RAID controller is not exposed to
any erroneous input.
? Protect connector pins from electrostatic discharge while the SCSI cable is disconnected by wearing
a grounded anti-static wrist guard and physically protecting the cable ends from contact with other
objects.
? Do not remove a device that is currently participating in any SCSI bus transactions.
To enable or disable an adapter's internal termination, use the system BIOS utility. When the system
boots, a message is displayed describing how to start the utility. For example, many utilities prompt
users to press [Ctrl]-[A]. Follow the prompts for setting the termination. At this point, it is also possible
to set the SCSI identication number, as needed, and disable SCSI bus resets. Refer to Section D.5
SCSI Identication Numbers for more information.
To set storage enclosure and RAID controller termination, refer to the vendor documentation.
Appendix D. Supplementary Hardware Information 167
D.4. SCSI Bus Length
A SCSI bus must adhere to length restrictions for the bus type. Buses that do not adhere to these
restrictions do not operate properly. The length of a SCSI bus is calculated from one terminated end
to the other and must include any cabling that exists inside the system or storage enclosures.
A cluster supports LVD (low voltage differential) buses. The maximum length of a single-initiator
LVD bus is 25 meters. The maximum length of a multi-initiator LVD bus is 12 meters. According
to the SCSI standard, a single-initiator LVD bus is a bus that is connected to only two devices, each
within 0.1 meter from a terminator. All other buses are dened as multi-initiator buses.
Do not connect any single-ended devices to an LVD bus; doing so converts the bus single-ended,
which has a much shorter maximum length than a differential bus.
D.5. SCSI Identication Numbers
Each device on a SCSI bus must have a unique SCSI identication number. Devices include host bus
adapters, RAID controllers, and disks.
The number of devices on a SCSI bus depends on the data path for the bus. A cluster supports wide
SCSI buses, which have a 16-bit data path and support a maximum of 16 devices. Therefore, there are
sixteen possible SCSI identication numbers that can be assigned to the devices on a bus.
In addition, SCSI identication numbers are prioritized. Use the following priority order to assign
SCSI identication numbers:
7 - 6 - 5 - 4 - 3 - 2 - 1 - 0 - 15 - 14 - 13 - 12 - 11 - 10 - 9 - 8
The previous order species that 7 is the highest priority, and 8 is the lowest priority. The default
SCSI identication number for a host bus adapter is 7, because adapters are usually assigned the
highest priority. It is possible to assign identication numbers for logical units in a RAID subsystem
by using the RAID management interface.
To modify an adapter's SCSI identication number, use the system BIOS utility. When the system
boots, a message is displayed describing how to start the utility. For example, a user may be prompted
to press [Ctrl]-[A] and follow the prompts for setting the SCSI identication number. At this point,
it is possible to enable or disable the adapter's internal termination, as needed, and disable SCSI bus
resets. Refer to Section D.3 SCSI Bus Termination for more information.
The prioritized arbitration scheme on a SCSI bus can result in low-priority devices being locked out
for some period of time. This may cause commands to time out, if a low-priority storage device, such
as a disk, is unable to win arbitration and complete a command that a host has queued to it. For some
workloads, it is possible to avoid this problem by assigning low-priority SCSI identication numbers
to the host bus adapters.
168 Appendix D. Supplementary Hardware Information
Appendix E.
Supplementary Software Information
The information in the following sections can assist in the management of the cluster software con-
guration.
E.1. Cluster Communication Mechanisms
A cluster uses several intra-cluster communication mechanisms to ensure data integrity and correct
cluster behavior when a failure occurs. The cluster uses these mechanisms to:
? Control when a system can become a cluster member
? Determine the state of the cluster systems
? Control the behavior of the cluster when a failure occurs
The cluster communication mechanisms are as follows:
? Shared (quorum) partitions
Periodically, each cluster system writes a time-stamp and system status to the primary and shadow
shared partitions, which are raw partitions located on shared storage. Each member reads the system
status and time-stamp that were written by the other members and determines if they are up
to date. The members attempt to read the information from the primary shared partition. If this
partition is corrupted, the members read the information from the shadow shared partition and simultaneously
repair the primary partition. Data consistency is maintained through checksums and
any inconsistencies between the partitions are automatically corrected.
If a member reboots but cannot write to both shared partitions, the system is not allowed to join the
cluster. In addition, if an existing member can no longer write to both partitions, it removes itself
from the cluster by shutting down.
Shared partitions are only used as a communication mechanism in two-member clusters that have
network tie-breaker disabled.
? Remote power switch monitoring
Periodically, each member monitors the health of the remote power switch connection, if any. The
member uses this information to help determine the status of the other cluster members. The complete
failure of the power switch communication mechanism does not automatically result in a
failover. If a power switch fails to power-cycle a hung system, no failover is performed as the
cluster infrastructure cannot guarantee the member's present state.
? Ethernet heartbeats
The members are connected together by using point-to-point Ethernet lines. Periodically, each
member issues heartbeats (pings) across these lines. The cluster uses this information to help determine
the status of the members and to ensure correct cluster operation. The complete failure of the
heartbeat communication mechanism does not automatically result in a failover.
If a member determines that a time-stamp from another member is not up-to-date, it checks the heartbeat
status. If heartbeats to the member are still operating, the cluster software takes no action. If a
member does not update its time-stamp after some period of time, and does not respond to heartbeat
pings, it is considered down.
The cluster remains operational as long as one cluster system can write to the shared partitions, even
if all other communication mechanisms fail.
170 Appendix E. Supplementary Software Information
Note that shared partition is only used as a back-up in some two-member congurations. The network
membership algorithm is the primary determining factor as to which cluster members are active and
which are not. A member that is not updating its time-stamp in this conguration never causes a
failover unless clumembd reports that the member is down.
E.2. Failover and Recovery Scenarios
Understanding cluster behavior when signicant events occur can assist in the proper management
of a cluster. Note that cluster behavior depends on whether power switches are employed in the con-
guration. Power switches enable the cluster to maintain complete data integrity under all failure
conditions.
The following sections describe how the system responds to various failure and error scenarios.
E.3. Common Cluster Behaviors: General
Loss of connectivity to a power switch or failure to fence a member
Common Causes: Serial power switch disconnected from controlling member. Network power
switch disconnected from network.
Expected Behavior: Members controlled by the power switch will not be able to be shut down
or restarted. In this case, if the member hangs, services will not fail-over from any member
controlled by the switch in question.
Verication: Run clustat to verify that services are still marked as running on the member,
even though it is inactive according to membership.
Dissolution of the cluster quorum
Common Causes: A majority of cluster members (for example, 3 of 5 members) go ofine
Test Case: In a 3 member cluster, stop the cluster software on two members.
Expected Behavior: All members which do not have controlling power switches reboot immediately.
All services stop immediately and their states are not updated on the shared media (when
running clustat, the service status blocks may still display that the service is running). Service
managers exit. Cluster locks are lost and become unavailable.
Verication: Run clustat on one of the remaining active members.
Member loses participatory status in the cluster quorum but is not hung
Common Causes: Total loss of connectivity to other members.
Test Case: Disconnect all network cables from a cluster member.
Expected Behavior: If the member has no controlling power switches, it reboots immediately.
Otherwise, it attempts to stop services as quickly as possible. If a quorum exists, the set of
members comprising the cluster quorum will fence the member.
clumembd crashes
Test Case: killall -KILL clumembd
Expected Behavior: System reboot.
Appendix E. Supplementary Software Information 171
clumembd hangs, watchdog in use
Test Case: killall -STOP clumembd
Expected Behavior: System reboot may occur if clumembd hangs for a time period greater than
(failover_time - 1) seconds. Triggered externally by watchdog timer.
clumembd hangs, no watchdog in use
Test Case: killall -STOP clumembd
Expected Behavior: System reboot may occur if clumembd hangs for a time period greater than
(failover_time) seconds. Triggered internally by clumembd.
cluquorumd crashes
Test Case: killall -KILL cluquorumd
Expected Behavior: System reboot.
clusvcmgrd crashes
Test Case: killall -KILL clusvcmgrd
Expected Behavior: cluquorumd re-spawns clusvcmgrd, which runs the stop phase of all services.
Services which are stopped are started.
Verication: Consult system logs for a warning message from cluquorumd.
clulockd crashes
Test Case: killall -KILL clulockd
Expected Behavior: cluquorumd re-spawns clulockd. Locks may be unavailable (preventing
service transitions) for a short period of time.
Verication: Consult system logs for a warning message from cluquorumd.
Unexpected system reboot without clean shutdown of cluster services
Common Causes: Any noted scenario which causes a system reboot.
Test Case: reboot -fn; pressing the reset switch.
Expected Behavior: If a power switch controls the rebooting member in question, the system will
also be fenced (generally, power-cycled) if a cluster quorum exists.
Loss of quorum during clean shutdown
Test Case: Stop cluster services (service clumanager stop) on all members.
Expected Behavior: Any remaining services are stopped uncleanly.
Verication: Consult the system logs for warning message.
Successful STONITH fencing operation
Expected Behavior: Services on member which was fenced are started elsewhere in the cluster,
if possible.
Verication: Verify that services are, in fact, started after the member is fenced. This should only
take a few seconds.
Unsuccessful fencing operation on cluster member
Common Causes: Power switch returned error status or is not reachable.
Test Case: Disconnect power switch controlling a member and run reboot -fn on the member.
172 Appendix E. Supplementary Software Information
Expected Behavior: Services on a member which fails to be fenced are not started elsewhere
in the cluster. If the member recovers, services on the cluster are restarted. Because there is no
way to accurately determine the member's state, it is assumed that it is now still running even if
heartbeats have stopped. Thus, all services should be reported as running on the down member.
Verication: Run clustat to verify that services are still marked as running on the member,
even though it is inactive according to membership. Messages will be logged stating that the
member is now in the PANIC state.
Error reading from one of the shared partitions
Test Case: Run dd to write zeros to the shared partition
dd if=/dev/zero of=/dev/raw/raw1 bs=512 count=1
shutil -p /cluster/header
Expected Behavior: Event is logged. The data from the good shared partition is copied to the
partition which returned errors
Verication: A second read pass of the same data should not produce a second error message.
Error reading from both of the shared partitions
Common Causes: Shared media is either unreachable or both partitions have corruption.
Test Case: Unplug SCSI or Fibre Channel cable from a member.
Expected Behavior: The event is logged. Congured action is taken to address loss of access to
shared storage (reboot/halt/stop/ignore). Default action is to reboot
E.4. Common Behaviors: Two Member Cluster with Disk-based
Tie-breaker
Loss of network connectivity to other member, shared media still accessible
Common Causes: Network connectivity lost.
Test Case: Disconnect all network cables from a member.
Expected Behavior: No fail-over unless disk updates are also lost. Services will not be able to be
relocated in most cases, which is due to the fact that the lock server requires network connectivity.
Verication: Run clustat to verify that services are still marked as running on the member, even
though it is inactive according to membership. Messages are logged stating that the member is
now in the PANIC state.
Loss of access to shared media
Common Causes: Shared media loses power, cable connecting a member to the shared media is
disconnected.
Test Case: Unplug SCSI or Fibre Channel cable from a member.
Expected Behavior: No failover occurs unless network is also lost. Congured action is taken to
address loss of access to shared storage (reboot/halt/stop/ignore). Default is reboot. The action
may subsequently cause a failover.)
System hang or crash (panic) on member X
Test Case: Kill the cluquorumd and clumembd daemons.
killall -STOP cluquorumd clumembd
Appendix E. Supplementary Software Information 173
Expected Behavior: Hung cluster member is fenced by other cluster member. Services fail over.
Congured watchdog timers may be triggered.
Start of Cluster Services without network connectivity
Common Causes: Bad switch; one or both members are without network connectivity
Test Case: Stop cluster services on all members. Disconnect all network cables from one member.
Start cluster services on both members.
Verication: Not all services may start, as locks require network connectivity. Because the Cluster
Manager requires a fully connected subnet, this case is handled on a best-effort basis, but is
technically an inoperable cluster.
E.5. Common Behaviors: 2-4 Member Cluster with IP-based
Tie-Breaker
Network Partition
Common Causes: Network switch problem.
Test Case: Connect half of the members to switch A. Connect other half to switch B. Connect
switch A to switch B using up-link or crossover cable. Connect device acting as tie-breaker IP
address to switch A. Start cluster services. Unplug switch A from switch B.
Expected Behavior: All cluster partitions which are comprised of exactly half (1/2 or 2/4) members
send ping packets to the tie-breaker IP address. If a reply is received, the partition forms
a quorum. In the test case, this means the half connected to switch A will form a quorum. Because
the Cluster Manager requires a fully connected subnet, the case where an even-split (or
split-brain) occurs when both partitions can reach the tie-breaker IP is not handled.
Verication: Run clustat on the members plugged in to switch A. There should be a Cluster
Quorum Incarnation number listed at the top of the output.
Loss of access to shared media
Common Causes: Shared media loses power; cable connecting a member to the shared media is
disconnected.
Test Case: Unplug SCSI or Fibre Channel cable from a member.
Expected Behavior: Congured action is taken to address loss of access to shared storage (reboot/
halt/stop/ignore). Default is reboot.
System hang or crash (panic) on cluster member
Test Case: Kill the cluquorumd and clumembd daemons.
killall -STOP cluquorumd clumembd
Expected Behavior: The cluster member is fenced by another member. Services fail over. If a
watchdog is in use, it may be triggered.
E.6. Common Behaviors: 3-5 Member Cluster
Network Partition
Common Causes: Network switch problem
174 Appendix E. Supplementary Software Information
Test Case: Connect a majority of members to switch A. Connect remaining members to switch
B. Connect switch A to switch B using up-link or crossover cable. Start cluster services. Unplug
switch A from switch B.
Expected Behavior: The partition with a majority of members continues operating, and a new
view of the cluster quorum is formed. Members in the minority partition are fenced, and services
which were running in the minority partition are started in the majority partition, if possible. In
the test case, this means that members connected to switch A will fence members connected to
switch B.
Verication: Run clustat on one of the members connected to switch A. There should be a
Cluster Quorum Incarnation number listed near the top of the output.
System hang on cluster member
Test Case: Kill the clumembd daemon.
killall -STOP clumembd
Expected Behavior: The cluster member is fenced by another member. Services fail over. If
watchdog timer is congured, it may be triggered.
Loss of access to shared media
Common Causes: Shared media loses power, cable connecting a member to the shared media is
disconnected.
Test Case: Unplug SCSI or Fibre Channel cable from a member.
Expected Behavior: Congured action is taken to address loss of access to shared storage (reboot/
halt/stop/ignore). Default is reboot.
E.7. Common Behaviors: Cluster Service Daemons
Service status check fails
Common Causes: User script reports error, clurmtabd not running on NFS service, smbd and
nmbd not running for a service with a Samba share.
Test Case: Create a service with an init script that returns a status output of 1.
Expected Behavior: Service restarts on the current owner.
Verication: Consult system logs for a service restart event. The restarts eld of the service's
status information should be incremented.
A member fails to start services
Common Causes: User script returns error due to le system errors.
Test Case: Create a service with a user script which returns 1 for the start phase only on one
member. Attempt to enable the service on this member.
Expected Behavior: Service is stopped and started on another member, provided the services
stops successfully.
Service start fails on all members
Common Causes: User script returns error, le system errors.
Test Case: Create a service with a user script which returns 1 for the start phase on all members.
Expected Behavior: Service is placed into the disabled state.
Verication: Run clustat and verify that the service is in the disabled state.
Appendix E. Supplementary Software Information 175
Service stop fails on a member
Common Causes: User script returns error; le system can not be unmounted.
Test Case: Create a service script which returns 1 for the stop phase.
Expected Behavior: Service is placed into the failed state. At this point, the administrator must
intervene to determine the cause of the failure and the appropriate course of action. The service
must then be disabled before it can be enabled.
Verication: Run clustat and verify that the service has been placed in to the failed state.
E.8. Common Behaviors: Miscellaneous
/etc/cluster.xml no longer exists yet cluster is running
Test Case: rm -f cluster.xml
Expected Behavior: clusvcmgrd recreates /etc/cluster.xml from its cached version.
Verication: Check to see that /etc/cluster.xml is present shortly after testing this action. A
log message appears in the system logs.
clurmtabd crashes
Test Case: Kill the clurmtabd daemon.
killall -KILL clurmtabd
Expected Behavior: Since clurmtabd is only spawned during the start phase of a service with
NFS exports, it is only checked by a service which has a Check Interval and NFS exports
congured. If clurmtabd is not present (and it is congured to be), the check phase of the
service returns an error and the service is restarted.
Verication: Consult system logs for a service restart event. The restarts eld of the service's
status information should be incremented.
E.9. The cluster.xml File
The cluster conguration le, /etc/cluster.xml, contains detailed information about the cluster
members and services. Do not manually edit the conguration le. Instead, use the Cluster Conguration
Tool to modify the cluster conguration.
When you run Cluster Conguration Tool, cluster-specic information is entered in a hierarchical
XML format. The following is a description of each area of conguration, from daemon and shared
storage to cluster members and services. Note that the back slash (\) represents a continuation of a
single line.
' ?xml version="1.0"?
' cluconfig version="3( .0"
' clumembd broadcast="no( " interval="500000" loglevel="4" multicast="yes" \
multicast_ipaddress="225.0.0.11" thread="yes" tko_count="20"/
' cluquorumd loglevel="6" pinginterval="" tiebreaker_ip=""/ (
' clurmtabd loglevel="4" pollinterval="4"/ (
' clusvcmgrd loglevel="4"/ (
' clulockd loglevel="4"/ (
' cluster config_viewnumb( er="18" key="7a497d303feefeef0f8be9b72697aaed" name="Octane"/
(
The above elds contain versioning information and cluster daemon operation parameters such as
logging levels, networking addresses, and more. For more information on conguring cluster daemon
parameters, refer to Section 3.6 Conguring Cluster Daemons.
176 Appendix E. Supplementary Software Information
) cluster config_viewnumber="18" key="7a497d303feefeef0f8be9b72697aaed" \
name="Test_cluster"/
) sharedstate driver="li* bsharedraw.so" rawprimary="/dev/raw/raw1" \
rawshadow="/dev/raw/raw2" type="raw"/
*
The elds above dene the cluster quorum and shared cluster conguration parameters. The driver
and raw partition information about the shared primary and backup partitions are also specied in
these elds. For more information about conguring the shared partitions, refer to Section 3.5 Editing
the rawdevices File.
) members
) member* id="0" name="clu1" watchdog="yes"
) powercontroller id="0" ipaddress="192.1* 68.65.51" password="apc" \
port="1:1" type="apcmaster" user="apc"/
) /member *
) member i* d="1" name="clu2" watchdog="yes"
) powercontroller id="0" ipaddress="192.1* 68.65.52" password="baytech" \
port="1" type="baytech" user="admin"/
) /member *
) member i* d="2" name="clu3" watchdog="yes"
) powercontroller id="0" ipaddress="192.1* 68.65.53" password="baytech" \
port="2" type="baytech" user="admin"/
) /member *
) member i* d="3" name="clu4" watchdog="yes"
) powercontroller id="0" ipaddress="192.1* 68.65.54" password="wti" \
port="blue" type="wti_nps" user=""/
) /member *
) /members*
*
The elds above dene the cluster and its individual members. Each member eld contains identication
and conguration information, including cluster names, addresses, power controllers and types,
and authentication details. For more information on conguring cluster members, refer to Section 3.7
Adding and Deleting Members.
) services
) service* checkinterval="0" failoverdomain="None" id="0" name="test" \
userscript="None"
) service_ipa* ddresses/
) /service *
) service c* heckinterval="0" failoverdomain="foodomain" id="1" name="test2" \
userscript="None"
) service_ipa* ddresses/
) /service *
) /services *
*
The elds above dene the services controlled by the cluster system, such as NFS, Samba, and HTTP.
The parameters in these elds include service names, failover domain names, service status check
intervals, and location of service init scripts (if applicable). For more information about conguring
clustered services, refer to Section 3.10 Adding a Service to the Cluster.
) failoverdomains
) failoverdomain* id="0" name="fonfs" ordered="yes" restricted="yes"
) failoverdomainnode id="0" name="clu2"/ *
) failoverdomainnode id="1" name="clu3"/*
) /failoverdomain *
) failoverdomain i* d="1" name="fosamba" ordered="no" restricted="no"
) failoverdomainnode id="0" name="clu1"/ *
) failoverdomainnode id="1" name="clu3"/*
) /failoverdomain *
) /failoverdomains *
*
Appendix E. Supplementary Software Information 177
The elds above dene the failover domains that control the priority and order in which the cluster
members queue in the event of failover. Parameters in these elds include failover domain name,
restricted and ordered toggling, and node order by member name. For more information about conguring
failover domains for cluster systems, refer to Section 3.9 Conguring a Failover Domain.
178 Appendix E. Supplementary Software Information
Appendix F.
Cluster Command-line Utilities
This appendix provides reference information on the following command-line utilities provided with
Red Hat Cluster Suite:
? redhat-config-cluster-cmd?Provides command-line access to the conguration features of
the Cluster Conguration Tool utility
? shutil?Checks status of the quorum partitions
? clufence?Tests and controls the connections to network and serial-attached power switches
F.1. Using redhat-config-cluster-cmd
This section details an example of the redhat-cong-cluster-cmd utility, which allows you to congure
all aspects of the cluster, and stores the information in the /etc/cluster.xml le.
Usage, options, and examples of using the redhat-config-cluster-cmd command
can be found in its man page. To access the manpage from a shell-prompt, type man
redhat-config-cluster-cmd.
The following describes an example cluster system that is congured using only the
redhat-config-cluster-cmd utility.
Suppose a system administrator wants to create a cluster system that will serve highly available NFS
services to the engineering department of a small organization. The NFS export should only be accessible
to the three members of the department (Bob, Jane, and Tom).
1. Add the service and assign it a descriptive name to distinguish its functionality from other
services that may run on the cluster.
redhat-config-cluster-cmd --add_service --name=nfs_engineers
2. Add a service IP address that will transfer from one member to another in the event of failover:
redhat-config-cluster-cmd --service=nfs_engineers --add_service_ipaddress \
--ipaddress=10.0.0.10
3. Add a device to the service (the disk partition that serves as the NFS export):
redhat-config-cluster-cmd --service=nfs_engineering --add_device --name=/dev/sdc3
4. Add a mount point for the device (note: the mount point cannot be listed in /etc/fstab):
redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 --mount \
--mountpoint=/mnt/nfs/engineering/ --fstype=ext3 \
--options=rw,nosuid,sync --forceunmount=yes
5. Add the mounted directory for NFS exporting:
redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 \
--add_nfs_export --name=/mnt/nfs/engineering
6. Allow Bob to access the clustered NFS export:
redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 \
--nfsexport=/mnt/nfs/engineering --add_client --name=bob \
--options=rw
7. Repeat step 6 for Jane and Tom.
For more information and examples of using redhat-config-cluster-cmd, refer to the man page
by typing the following at a shell prompt:
180 Appendix F. Cluster Command-line Utilities
man redhat-config-cluster-cmd
F.2. Using the shutil Utility
Test the quorum partitions and ensure that they are accessible by invoking the shutil utility with the
-p option.
Running /usr/sbin/shutil -p /cluster/header on all active cluster members should give the
same output on each machine. For example:
/cluster/header is 144 bytes long
SharedStateHeader {
ss_magic = 0x39119fcd
ss_timestamp = 0x000000003f5d3eea (22:46:02 Sep 08 2003)
ss_updateHost = lab.example.com
}
If the output of the shutil utility with the -p option is not the same on all cluster systems, perform
the following:
? Examine the /etc/sysconfig/rawdevices le on each cluster system and ensure that the raw
character devices and block devices for the primary and backup quorum partitions have been accurately
specied. If they are not the same, edit the le and correct any mistakes. Then re-run the
clucong utility. Refer to Section 3.5 Editing the rawdevices File for more information.
? Ensure that you have created the raw devices for the quorum partitions on each cluster system.
Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information.
? On each cluster system, examine the system startup messages at the point where the system probes
the SCSI subsystem to determine the bus conguration. Verify that both cluster systems identify
the same shared storage devices and assign them the same name.
? Verify that a cluster system is not attempting to mount a le system on the quorum partition.
For example, make sure that the actual device (for example, /dev/sdb1) is not included in an
/etc/fstab le.
F.3. Using the clusvcadm Utility
The clusvcadm utility provides a command-line user interface that enables an administrator to monitor
and manage the cluster systems and services. Use the clusvcadm utility to perform the following
tasks:
? Disable and enable services
? Relocate and restart cluster services
? lock and unlock service state
The clusvcadm command line options are as follows:
-d service
Disable a service.
-e service
Enable a service.
Appendix F. Cluster Command-line Utilities 181
-e service -m member
Enable a service on a specic member.
-l
Lock the service states.
-r service -m member
Relocate a service to a specic member.
-q
Operate quietly.
-R
Restart a service.
-s
Stop a service
-u
Unlock the service states.
-v
Displays information about the current version of clusvcadm.
Refer to the clusvcadm(8) man page for more information.
F.4. Using the clufence Utility
If power switches are used in the cluster hardware conguration, run the clufence utility on each
cluster system to ensure that it can remotely power-cycle the other cluster members.
If the command succeeds, run the shutil -p command on both cluster systems to display a summary
of the header data structure for the quorum partitions. If the output is different on the systems, the
quorum partitions do not point to the same devices on both systems. Check to make sure that the
raw devices exist and are correctly specied in the /etc/sysconfig/rawdevices le. See Section
2.4.4.3 Conguring Shared Cluster Partitions for more information.
If either network- or serial-attached power switches are employed in the cluster hardware conguration,
install the cluster software and invoke the clufence command to test the power switches.
Invoke the command on each cluster system to ensure that it can remotely power-cycle the other
cluster members. If testing is successful, then the cluster can be started.
The clufence command can accurately test a power switch. The format of the clufence command
is as follows:
usage: clufence [-d] [-[furs]
+
member
,
]
-d Turn on debugging
-f
+
member
,
Fence (power off)
+
member
-u ,
+
member
,
Unfence (power on)
+
member
-r ,
+
member
,
Reboot (power cycle)
+
member
-s ,
+
member
,
Check status of all switches controlling
+
member
,
When testing power switches, the rst step is to ensure that each cluster member can successfully
communicate with its attached power switch. The following example of the clufence command
output shows that the cluster member is able to communicate with its power switch:
182 Appendix F. Cluster Command-line Utilities
[23750] info: STONITH: baytech at 192.168.1.31, port 1 controls clu2
[23750] info: STONITH: baytech at 192.168.1.31, port 2 controls clu3
[23750] info: STONITH: wti_nps at 192.168.1.29, port clu4 controls clu4
[23750] info: STONITH: wti_nps at 192.168.1.29, port clu5 controls clu5
Any errors in the output could be indicative of the following types of problems:
? For serial attached power switches:
? Verify that the device special le for the remote power switch connection serial port (for example,
/dev/ttyS0) is specied correctly in the cluster database, as established via Cluster
Conguration Tool. If necessary, use a terminal emulation package such as minicom to test if
the cluster system can access the serial port.
? Ensure that a non-cluster program (for example, a getty program) is not using the serial port
for the remote power switch connection. You can use the lsof command to perform this task.
? Check that the cable connection to the remote power switch is correct. Verify that the correct
type of cable is used (for example, an RPS-10 power switch requires a null modem cable), and
that all connections are securely fastened.
? Verify that any physical dip switches or rotary switches on the power switch are set properly.
? For network based power switches:
? Verify that the network connection to network-based switches is operational. Most switches have
a link light that indicates connectivity.
? It should be possible to ping the network switch; if not, then the switch may not be properly
congured for its network parameters.
? Verify that the correct password and login name (depending on switch type) have been specied
in the cluster conguration database (as established by running Cluster Conguration Tool).
A useful diagnostic approach is to verify Telnet access to the network switch using the same
parameters as specied in the cluster conguration.
After successfully verifying communication with the switch, attempt to power cycle the other cluster
member. Prior to doing this, it is recommended to verify that the other cluster member is not actively
performing any important functions (such as serving cluster services to active clients). By executing
the following command :
clufence -r clu3
The following depicts a successful power cycle operation:
Successfully power cycled host clu3.
Index
Symbols
/etc/hosts
editing, 18
/etc/syscong/ha/lvs.cf le, 119
/etc/syscong/rawdevices
editing the le, 41
A
acknowledgments, i
active router
(See LVS clustering)
active-active conguration, 1
Apache HTTP Server
httpd.conf, 94
setting up service, 93
availability and data integrity table, 7
B
backup router
(See LVS clustering)
C
channel bonding
(See Ethernet bonding)
chkcong, 121
cluster
(See cluster types)
administration, 97
checking the conguration, 53, 180
daemons, 42
diagnosing and correcting problems, 102
disabling the cluster software, 102
displaying status, 97
name, changing, 102
using Red Hat Cluster Manager with Piranha, 151
cluster administration, 97
backing up the cluster database, 100
changing the cluster name, 102
diagnosing and correcting problems in a cluster,
102
disabling the cluster software, 102
displaying cluster and service status, 97
modifying cluster event logging, 101
modifying the cluster conguration, 100
restoring the cluster database, 100
starting and stopping the cluster software, 99
updating the cluster software, 101
cluster command-line utilities, 179
cluster communication mechanisms, 169
cluster conguration, 53
minimum
example, 7
modifying, 100
reloading, 41
using the shutil utility, 180
Cluster Conguration Tool
accessing, 38
and Oracle service, 71
cluster daemons, 42
membership daemon, 42
quorum daemon, 43
service manager daemon, 45
cluster database
backing up, 100
restoring, 100
cluster event logging
daemons
cluhbd, 101
clupowerd, 101
cluquorumd, 101
clusvcmgrd, 101
severity levels, 101
cluster features
administration user interface, 2
application monitoring, 2
data integrity assurance, 2
event logging facility, 2
failover domain
restricted, 2
unrestricted, 2
manual service relocation capabilities, 2
multiple cluster communication methods, 2
no-single-point-of-failure hardware conguration,
2
service conguration framework, 2
service failover capabilities, 2
status monitoring agent, 2
cluster hardware
connecting, 21
power controllers, 10
setting up, 21
cluster hardware tables, 11
cluster member hardware table, 11
cluster overview, 1
cluster service, 1
displaying status, 97
cluster services
active-active NFS conguration, 83
administration, 59
Apache HTTP Server, setting up, 93
httpd.conf, 94
conguration, 51, 59
conguring service disk storage, 61
184
deleting a service, 64
disabling a service, 63
displaying a service conguration, 62
enabling a service, 63
gathering service information, 59
handling a service that fails to start, 64
modifying a service, 63
MySQL service, setting up, 73
NFS caveats, 82
NFS client access, 82
NFS Druid, 78
NFS server requirements, 77
NFS service, setting up, 77
Oracle service, setting up, 67
Oracle, tuning, 72
relocating a service, 64
Samba Druid, 87
Samba operating model, 86
Samba server requirements, 85
Samba service, setting up, 85
scripts, creating, 61
smb.conf.sharename le elds, 90
verifying application software and service scripts,
62
cluster software
command-line utilities, 179
disabling, 102
installation and conguration, 35
steps for installing and initializing, 35
with rpm, 36
with the Package Management Tool, 35
starting and stopping, 99
steps for installing and initializing, 35
updating, 101
version display, 55
cluster software installation and conguration, 35
cluster systems, 1
cluster types
compute-clustering
Beowulf, 109
denition of, 109
high-availability clustering, 109
(See Also Red Hat Cluster Manager)
denition of, 109
load-balance clustering, 109
(See Also LVS clustering)
denition of, 109
overview of, 109
clusvcadm
using, 180
common cluster behaviors
2-4 Member cluster, IP tie-breaker, 173
3-5 Member Cluster, 173
miscellaneous, 175
service daemons, 174
two member disk-based tie-breaker, 172
common cluster behaviors table, 170
components
of LVS cluster, 118
compute-clustering
(See cluster types)
conguration
Red Hat Enterprise Linux, 17
conguration le
reloading, 41
conguring a service, 59
console startup messages
displaying, 20
console switch, 9
setting up, 17
console switch hardware table, 15
conventions
document, iv
D
daemons
clulockd
conguring, 42
clumembd
conguring, 42
cluquorumd
conguring, 42
clurmtabd
conguring, 42
clusvcmgrd
conguring, 42
database service, 2
databases
MySQL
setting up service, 73
Oracle
oraclescript example, 68
setting up, 67
startdb script example, 68
stopdb script example, 69
tuning, 72
using Cluster Conguration Tool with, 71
deleting a service, 64
diagnosing and correcting problems in a cluster
table, 103
disabling a service, 63
disk storage
conguring service disk storage, 61
displaying a service conguration, 62
displaying console startup messages, 20
displaying devices congured in the kernel, 21
185
E
enabling a service, 63
Ethernet channel bonding
conguring, 22
event logging
modifying, 101
syslog
conguring, 55
examples
minimum cluster conguration, 7
NFS Druid, 78
no single point of failure conguration, 8
oracle script, 68
Samba Druid, 87
startdb script, 68
stopdb script, 69
using Cluster Conguration Tool to add an Oracle
service, 71
ext3, 33
F
failover and recover scenarios, 170
features, cluster, 2
feedback, vii
le services
NFS
active-active conguration, 83
caveats, 82
client access, 82
NFS Druid, 78
server requirements, 77
setting up service, 77
Samba
operating model, 86
Samba Druid, 87
server requirements, 85
setting up, 85
le systems
creating, 33
FTP, clustering, 130
(See Also LVS clustering)
G
GFS
cluster archive
conguration, 161
conguration
with ccs_tool, 161
GFS Setup Druid, 157
and CCS les, 158
cluster name, 157
conguration, 157
conguring cluster members, 159
fence device conguration, 160
GULM settings, 158
member conguration, 160
post-conguration, 161
saving conguration, 161
Grand Unied Lock Manager
(See GULM)
GULM, 158
conguration with GFS Setup Druid, 158
H
handling a service that fails to start, 64
hardware
installing basic cluster hardware, 15
hardware conguration
availability considerations, 6
choosing a conguration, 5
cost restrictions, 6
data integrity under all failure conditions, 6
minimum, 6
optional hardware, 9
performance considerations, 5
shared storage requirements, 6
hardware information, supplementary, 163
hardware installation
operating system conguration, 5
hardware watchdog timer
conguring, 165
hardware watchdog timers, 165
high-availability clustering
(See cluster types)
hot-standby conguration, 1
how to use this manual, iii
HTTP services
Apache HTTP Server
httpd.conf, 94
setting up, 93
I
installation
Red Hat Enterprise Linux, 17
installing basic cluster hardware, 15
installing the basic cluster hardware, 15
introduction, iii
cluster features, 2
cluster overview, 1
how to use this manual, iii
other Red Hat Enterprise Linux manuals, iii
iptables, 121
ipvsadm program, 119
186
J
job scheduling, LVS, 113
K
kernel
decreasing kernel boot timeout limit, 19
displaying congured devices, 21
Kernel Boot Timeout Limit
decreasing, 19
Kimberlite, i
KVM (keyboard, video, mouse) switch, 10
L
least connections
(See job scheduling, LVS)
Linux Virtual Server
(See LVS clustering)
load-balance clustering
(See cluster types)
low voltage differential (LVD), 167
LVS
/etc/syscong/ha/lvs.cf le, 119
components of, 118
daemon, 119
date replication, real servers, 112
denition of, 109
initial conguration, 121
ipvsadm program, 119
job scheduling, 113
lvs daemon, 119
LVS routers
conguring services, 121
necessary services, 121
primary node, 121
multi-port services, 128
FTP, 130
nanny daemon, 119
NAT routing
enabling, 127
requirements, hardware, 125
requirements, network, 125
requirements, software, 125
overview of, 110, 111
packet forwarding, 124
Piranha Conguration Tool, 119
pulse daemon, 118
real servers, 110
routing methods
NAT, 116
routing prerequisites, 125
scheduling, job, 113
send_arp program, 119
shared data, 112
starting the cluster, 148
synchronizing conguration les, 147
three tiered
Red Hat Cluster Manager, 113
using Red Hat Cluster Manager with Piranha, 151
lvs daemon, 119
M
member status table, 98
member systems
(See cluster systems)
members
setting up, 15
membership daemon, 42
minimum cluster conguration example, 7
minimum hardware conguration, 6
Mission Critical Linux, Inc., i
mkfs, 33
multi-port services, clustering, 128
(See Also LVS clustering)
MySQL
setting up service, 73
N
nanny daemon, 119
NAT
enabling, 127
routing methods, LVS, 116
network address translation
(See NAT)
network hardware table, 13
network hub, 9
network switch, 9
NFS
active-active conguration, 83
caveats, 82
client access, 82
NFS Druid, 78
server requirements, 77
setting up service, 77
NMI watchdog timer
enabling, 164
no single point of failure conguration, 8
Non-Maskable Interrupt (NMI) watchdog timers, 164
187
O
operating system conguration
hardware installation, 5
Oracle
adding an Oracle service, 71
oracle script example, 68
setting up service, 67
startdb script example, 68
stopdb script example, 69
tuning service, 72
overview
introduction, 1
P
packet forwarding, 124
(See Also LVS clustering)
Parallel SCSI
requirements, 26
parted
creating disk partitions, 31
partitioning disks, 31
Piranha
using Red Hat Cluster Manager with, 151
Piranha Conguration Tool, 119
CONTROL/MONITORING, 134
EDIT MONITORING SCRIPTS Subsection, 145
GLOBAL SETTINGS, 135
limiting access to, 123, 123
login panel, 133
necessary software, 133
overview of, 133
REAL SERVER subsection, 142
REDUNDANCY, 137
setting a password, 122
VIRTUAL SERVER subsection
Firewall Mark, 141
Persistence, 142
Scheduling, 141
Virtual IP Address, 140
VIRTUAL SERVER subsection, 139
VIRTUAL SERVERS, 139
piranha-gui service, 121
piranha-passwd, 122
point-to-point Ethernet connection hardware table, 14
power controller connection, conguring, 48
power controllers, 10
network-attached, 10
serial-attached, 10
watchdog timers, 10
hardware-based, 10
software-based, 10
power switch, 48
(See Also power controller)
power switch hardware table, 12
power switches
conguring, 23
hardware watchdog timers
conguring, 165
NMI watchdog timers
enabling, 164
setting up, 163
watchdog, 163
software watchdog timers
conguration, 163
testing, 54, 181
troubleshooting, 54, 182
pulse daemon, 118
pulse service, 121
Q
quorum daemon, 43
quorum partitions
(See shared cluster partitions)
testing, 180
R
raw, 32
raw devices
creating, 32
rawdevices
editing the le, 41
real servers
(See LVS clustering)
conguring services, 124
Red Hat Cluster Manager, 109
and Piranha, 151
command-line utilities, 179
with Red Hat GFS, 157
Red Hat Cluster Suite, 35
installation, 35
with rpm, 36
with the Package Management Tool, 35
Red Hat Enterprise Linux, 109
installation and conguration, 17
overview of, 109
Red Hat GFS
and Red Hat Cluster Suite, 157
Red Hat GFS with Red Hat Cluster Suite
changes to Red Hat Cluster Suite, 154
installation scenarios, 154
terminology, 153
using, 153
using GFS Setup Druid, 157
relocating a service, 64
round robin
(See job scheduling, LVS)
routing
188
prerequisites for LVS, 125
S
Samba
operating model, 86
Samba Druid, 87
server requirements, 85
setting up service, 85
smb.conf.sharename le elds, 90
scheduling, job (LVS), 113
scripts
creating service scripts, 61
oracle script example, 68
startdb script example, 68
stopdb script example, 69
verifying application software and service scripts,
62
SCSI bus conguration requirements, 165
SCSI bus length, 167
SCSI bus termination, 166
SCSI identication numbers, 167
security
Piranha Conguration Tool, 123
send_arp program, 119
service conguration, 59
service failover, 1
service property and resource information table, 59
service relocation, 1
service status table, 98
services, 51
(See Also adding to the cluster conguration)
shared cluster partitions
conguring, 30
requirements, 30
shared disk storage
conguring, 25
shared disk storage hardware table, 13
shared partitions
testing, 53
shared state, 1
shared storage, 26
shared storage requirements, 6
single-initiator bre channel interconnect
setting up, 28
single-initiator SCSI bus
setting up, 26
software information, supplementary, 169
software watchdog timers, 163
sshd service, 121
synchronizing conguration les, 147
syslog, 55
syslog event logging
conguring, 55
syslogd, 55
System V init, 99
T
tables
availability and data integrity, 7
cluster hardware, 11
cluster member hardware, 11
common cluster behaviors, 170
console switch hardware, 15
diagnosing and correcting problems in a cluster,
103
installing the basic cluster hardware, 15
member status, 98
minimum cluster conguration components, 7
network hardware, 13
no single point of failure conguration, 8
point-to-point Ethernet connection hardware, 14
power controller connection, conguring, 48
power switch hardware, 12
service property and resource information, 59
service status, 98
shared disk storage hardware, 13
UPS system hardware, 14
terminal server, 10
testing
power switches, 54, 181
quorum partitions, 180
shared partitions, 53
troubleshooting
diagnosing and correcting problems in a cluster,
102
failover and recover scenarios, 170
power switch testing, 54, 182
table, 103
U
UPS system hardware table, 14
UPS systems
conguring, 24
using Red Hat GFS with Red Hat Cluster Suite, 153
utilities
clufence, 181
clusvcadm, 180
redhat-cong-cluster-cmd, 179
shutil, 180
189
W
watchdog timers
hardware
conguring, 165
hardware-based, 10
NMI
enabling, 164
setting up, 163
software, 163
conguration, 163
software-based, 10
weighted least connections
(See job scheduling, LVS)
weighted round robin
(See job scheduling, LVS)
Unités de disques
REFERENCE
86 F1 14FG 01
ESCALA Power7
Unités de disques
Cette publication concerne les modèles suivants :
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
Les références au modèle Power 755 / 8236-E8C sont sans objet.
Hardware
Septembre 2010
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 F1 14FG 01
L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui
prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits
dérivés.
Copyright Bull SAS 2007-2010
Imprimé en France
Marques déposées
Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs.
Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des
marques et/ou des brevets.
La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit
des marques et/ou des brevets.
Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS
ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce
manuel, ni pour tout dommage pouvant résulter de son application.
Table des matières
Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Unités de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Unités de disques - Nouveautés . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Installation d'une unité de disques ou d'une unité SSD . . . . . . . . . . . . . . . . . . . . . 1
Unités SSD (Solid State Drive) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous
tension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Préparation du système. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Installation d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous
tension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Installation d'une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . . . . . . . 17
Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . 17
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Installation d'une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . . . . . . . 24
Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . 24
Vérification du fonctionnement de la nouvelle unité de disques . . . . . . . . . . . . . . . 29
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Virtual I/O
Server sous tension. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . . 32
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension . 37
Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . . 40
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
Retrait d'une unité de disques ou d'une unité SSD . . . . . . . . . . . . . . . . . . . . . . 45
Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension . . . . . 46
Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Remplacement d'une unité de disque. . . . . . . . . . . . . . . . . . . . . . . . . 51
Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension . . . . . 55
Remplacement d'une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . . . . . . 57
Préparation du retrait de l'unité de disques. . . . . . . . . . . . . . . . . . . . . . 57
Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 59
Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . 63
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
Remplacement d'une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . . . . . . 67
Préparation du retrait de l'unité de disques. . . . . . . . . . . . . . . . . . . . . . 67
Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 69
Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . 71
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Remplacement d'une unité de disques sur un système ou une partition logique hors tension . . . . . . . 73
Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 78
Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Remplacement d'une unité de disques SCSI . . . . . . . . . . . . . . . . . . . . . . . 81
iii
Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 83
Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Retrait ou installation d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . 86
Retrait d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 86
Installation d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . . . 88
Emplacements des unités de disque et voyants de maintenance simultanée . . . . . . . . . . . . . . 90
Emplacements de l'adaptateur PCIe RAID et SSD SAS . . . . . . . . . . . . . . . . . . . . 90
Emplacements des unités de disques et voyants de maintenance simultanée pour les systèmes 02E/4B et
05F/6B. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
Emplacements des unités de disque et voyants de maintenance simultanée pour 31E/2B . . . . . . . . 92
Emplacements des unités de disques et voyants de maintenance simultanée sur les modèles 33E/8B et 36E/8C 94
Emplacements des unités de disques et voyants de maintenance simultanée pour les modèles 17M/MB et
79M/HB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du disque 58/86 . . 96
Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI
31D/24 et 31T/24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Emplacements des unités de disques et voyants de maintenance simultanée sur une unité d'extension 58/02 100
Informations connexes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
Préparation d'une unité de disques dans un système ou une partition logique AIX . . . . . . . . . . 100
Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le système
d'exploitation AIX. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
Régénération des données d'une unité de disques de remplacement sous AIX. . . . . . . . . . . . 101
Régénération des données d'une unité de disques de remplacement sous Linux . . . . . . . . . . . 101
Régénération de données sur une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . 101
Régénération de données sur une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . 102
Partage d'unités de disques internes sur le modèle 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 104
Partage d'unités de disques internes sur un modèle 33E/8B ou 36E/8C . . . . . . . . . . . . . . . 105
Partage d'unités de disques internes sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . 107
Installation du port SAS externe sur un modèle 02E/4B ou 05F/6B . . . . . . . . . . . . . . . . 109
Installation du port SAS externe sur un modèle 31E/2B . . . . . . . . . . . . . . . . . . . . 111
Installation du port SAS externe des modèles 33E/8B ou 36E/8C . . . . . . . . . . . . . . . . . 113
Installation du port SAS externe sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . 121
Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . 123
Retrait d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . . . . 131
Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . 140
Retrait d'une carte d'extension SAS d'une unité d'extension 58/02 hors tension . . . . . . . . . . . . 141
Procédures communes pour les fonctions installables . . . . . . . . . . . . . . 143
Avant de commencer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Identification d'un composant défaillant . . . . . . . . . . . . . . . . . . . . . . . . . 146
Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 146
Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 148
Localisation d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . 148
Activation du voyant associé au composant défaillant. . . . . . . . . . . . . . . . . . . 148
Désactivation du voyant d'un composant défaillant. . . . . . . . . . . . . . . . . . . . 148
Identification d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 149
Localisation d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . 149
Localisation d'un composant défaillant sur un système ou une partition logique Linux. . . . . . . . 149
Activation du voyant associé au composant défaillant. . . . . . . . . . . . . . . . . . . 150
Désactivation du voyant d'un composant défaillant. . . . . . . . . . . . . . . . . . . . 150
Localisation d'un composant défaillant sur un système Virtual I/O Server ou une partition logique . . . . 150
Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . 151
Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 151
Démarrage d'un système non géré par une console HMC . . . . . . . . . . . . . . . . . . 151
Démarrage d'un système ou d'une partition logique via la console HMC (Hardware Management Console) 153
Arrêt d'un système ou d'une partition logique . . . . . . . . . . . . . . . . . . . . . . . 153
Arrêt d'un système non géré par une console HMC . . . . . . . . . . . . . . . . . . . . 153
Arrêt d'un système à l'aide de la console HMC . . . . . . . . . . . . . . . . . . . . . . 155
Retrait et remplacement du volet ou panneau d'unité d'extension . . . . . . . . . . . . . . . . . 156
Retrait du panneau avant sur un modèle 14G/30 . . . . . . . . . . . . . . . . . . . . . 156
iv Unités de disques
Installation du panneau avant sur un modèle 14G/30. . . . . . . . . . . . . . . . . . . . 156
Carters système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
Retrait et remplacement des carters du système 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 157
Retrait du carter avant d'un système monté en armoire 02E/4B ou 05F/6B. . . . . . . . . . . . 157
Retrait du carter avant sur un système autonome 02E/4B ou 05F/6B. . . . . . . . . . . . . . 158
Installation du carter avant sur un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . 159
Installation du carter avant et du volet avant sur un système autonome 02E/4B ou 05F/6B . . . . . . 160
Retrait du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B . . . . . . . . . . . 161
Retrait du capot d'accès d'un système autonome 02E/4B ou 05F/6B . . . . . . . . . . . . . . 162
Installation du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B . . . . . . . . . 164
Installation du capot d'accès d'un système autonome 02E/4B ou 05F/6B . . . . . . . . . . . . 164
Retrait et remplacement des carters du système 31E/2B . . . . . . . . . . . . . . . . . . . 166
Retrait du capot d'accès du système 31E/2B . . . . . . . . . . . . . . . . . . . . . . 166
Installation du capot d'accès du système 31E/2B . . . . . . . . . . . . . . . . . . . . 167
Retrait et installation des carters sur le système 33E/8B ou 36E/8C . . . . . . . . . . . . . . . 168
Retrait du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 168
Installation du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 169
Retrait du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . 170
Installation du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 171
Retrait et remplacement des carters du système 17M/MB ou 79M/HB . . . . . . . . . . . . . . 172
Retrait du capot avant des modèles 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . 172
Installation du panneau avant du modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . 173
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . . 174
Mise en position de maintenance d'un système monté en armoire 31E/2B . . . . . . . . . . . . . 174
Mise en position de fonctionnement d'un système monté en armoire 31E/2B . . . . . . . . . . . . 175
Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . . 176
Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . . 178
Positions de maintenance et de fonctionnement . . . . . . . . . . . . . . . . . . . . . . . 179
Mise en position de fonctionnement du système 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 179
Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . 179
Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B . . . . . . . . 181
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . 182
Mise en position de maintenance d'un système monté en armoire 31E/2B . . . . . . . . . . . . 182
Mise en position de fonctionnement d'un système monté en armoire 31E/2B . . . . . . . . . . . 183
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . 184
Mise en position de maintenance de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . 184
Placement de 33E/8B ou de 36E/8C en position de fonctionnement . . . . . . . . . . . . . . 187
Cordons d'alimentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
Déconnexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B . . . . . . . . . . . . . 189
Connexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B. . . . . . . . . . . . . . 190
Déconnexion des cordons d'alimentation du système 31E/2B . . . . . . . . . . . . . . . . . 191
Connexion des cordons d'alimentation au système 31E/2B . . . . . . . . . . . . . . . . . . 192
Installation d'un dispositif via la console HMC (Hardware Management Console) . . . . . . . . . . . 193
Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 194
Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . 194
Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 195
Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 195
Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 198
Vérification d'un composant installé via les programmes de diagnostic autonomes . . . . . . . . . . 198
Vérification d'un composant installé via la console HMC (Hardware Management Console) . . . . . . . 199
Activation et désactivation de voyants . . . . . . . . . . . . . . . . . . . . . . . . 200
Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 200
Activation ou désactivation de voyants d'identification . . . . . . . . . . . . . . . . . 201
Visualisation d'événements réparables . . . . . . . . . . . . . . . . . . . . . . . . 202
Vérification de la présence d'un dispositif installé ou d'un composant remplacé sur un système ou une
partition logique à l'aide des outils Virtual I/O Server. . . . . . . . . . . . . . . . . . . . 202
Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
Table des matières v
Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212
Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214
vi Unités de disques
Avis aux lecteurs canadiens
Le présent document a été traduit en France. Voici les principales différences et particularités dont vous
devez tenir compte.
Illustrations
Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la
France.
Terminologie
La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au
besoin.
IBM France IBM Canada
ingénieur commercial représentant
agence commerciale succursale
ingénieur technico-commercial informaticien
inspecteur technicien du matériel
Claviers
Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier
français-canadien, de type QWERTY.
OS/2 - Paramètres canadiens
Au Canada, on utilise :
v les pages de codes 850 (multilingue) et 863 (français-canadien),
v le code pays 002,
v le code clavier CF.
Nomenclature
Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit
du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau
pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre
clavier.
vii
Recommandations à l'utilisateur
Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications
radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur
(instructions d'utilisation, manuels de référence et manuels d'entretien).
Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors
tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou
plusieurs des mesures suivantes :
v Réorienter l'antenne réceptrice ;
v Déplacer l'équipement par rapport au récepteur ;
v Éloigner l'équipement du récepteur ;
v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent
sur des circuits distincts ;
v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien
serrés ;
v Vérifier la mise en place des obturateurs sur les connecteurs libres.
Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des
câbles blindés mis à la terre, à travers des filtres si nécessaire.
En cas de besoin, adressez-vous à votre détaillant.
Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des
modifications non autorisées ont été effectuées sur l'équipement.
viii Unités de disques
L'obligation de corriger de telles interférences incombe à l'utilisateur.
Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus
amples renseignements.
Brevets
Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur
certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie
pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos
demandes de renseignements relatives aux permis d'utilisation au directeur général des relations
commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7.
Assistance téléphonique
Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des
publications IBM, contactez IBM direct au 1 800 465-1234.
Avis aux lecteurs canadiens ix
x Unités de disques
Consignes de sécurité
Les consignes de sécurité peuvent être imprimées tout au long de ce guide.
v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles.
v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de
certaines circonstances réunies.
v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une
unité, un système ou des données.
Consignes de sécurité relatives au commerce international
Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du
produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est
inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité
dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication
en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous
familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce
livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez
claires pour vous.
Informations sur les appareils à laser
Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des
lasers ou des diodes électroluminescentes (LED).
Conformité aux normes relatives aux appareils à laser
Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique.
xi
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
xii Unités de disques
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Consignes de sécurité xiii
ATTENTION :
Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du
déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment,
procédez comme suit :
v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si
possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette
configuration, procédez comme suit :
– Retirez toutes les unités de la position 32U et plus.
– Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire.
– Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en
dessous du niveau 32U.
v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier.
v Vérifiez l'itinéraire envisagé pour éliminer tout risque.
v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus
d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre
armoire.
v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm.
v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles
sont bien fixés.
v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute.
v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement.
v N'utilisez pas de rampe inclinée à plus de dix degrés.
v Dès que l'armoire est à son nouvel emplacement, procédez comme suit :
– Abaissez les quatre vérins de mise à niveau.
– Installez des équerres de stabilisation sur l'armoire.
– Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie
inférieure à la partie supérieure de l'armoire.
v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez
l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les
vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez
l'armoire à celle-ci.
(R002)
(L001)
(L002)
xiv Unités de disques
(L003)
ou
Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le
sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont
certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes
sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité.
ATTENTION :
Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM,
DVD-RAM ou module à laser. Notez les informations suivantes :
v Ne retirez pas les carters. En ouvrant le produit à laser, vous vous exposez au rayonnement
dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur.
v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation
des commandes, ainsi que les procédures décrites dans le présent manuel.
(C026)
Consignes de sécurité xv
ATTENTION :
Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux
de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à
l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027)
ATTENTION :
Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028)
ATTENTION :
Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez
connaissance des informations suivantes. Rayonnement laser lorsque le carter est ouvert. Evitez toute
exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide
d'instruments optiques. (C030)
Informations sur l'alimentation électrique et sur le câblage relatives au document
GR-1089-CORE du NEBS (Network Equipment-Building System)
Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document
GR-1089-CORE du NEBS (Network Equipment-Building System) :
Cet équipement peut être installé :
v dans des infrastructures de télécommunications réseau
v aux endroits préconisés dans les directives NEC (National Electrical Code).
Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles
internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux
interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être
exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document
GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de
dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie
métallique au câblage du réseau extérieur.
Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités.
Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif
externe de protection contre les surtensions (SPD).
Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne
de retour de la batterie en courant continu ne doit pas être connectée à la masse.
xvi Unités de disques
Unités de disques
Informations sur l'installation, le retrait et le remplacement d'une unité de disque.
Unités de disques - Nouveautés
Prenez connaissance des nouveautés et des modifications apportées aux rubriques Unités de disques
depuis leur dernière mise à jour.
Installation d'une unité de disques ou d'une unité SSD
Vous pouvez installer une unité de disques SCSI (Small Computer System Interface), SAS (serial-attached
SCSI) ou SSD (Solid State Drive) sur le serveur ou l'unité d'extension. Les procédures de cette rubrique
permettent d'effectuer cette tâche.
Votre système ou partition logique peut être indifféremment hors tension ou sous tension lorsque vous
installez l'unité de disques. Identifiez le système pour effectuer l'installation ou d'une mise à niveau.
Allumez le voyant d'identification système (BLEU).
Unités SSD (Solid State Drive)
Pour installer et configurer des unités SSD, vous devez vérifier les détails relatifs au type de machine, à
la carte et à la configuration pris en charge.
Pourquoi et quand exécuter cette tâche
Les unités SSD, également appelées unités flash, observent la plupart des règles appliquées à une unité
de disque dur (HDD) normal. Par exemple, les unités SSD ressemblent physiquement aux unités de
disque dur, s'installent de la même manière et dans la plupart des mêmes emplacements que ces derniers.
Cependant, certaines restrictions et règles de configuration applicables aux unités SSD diffèrent de celles
des unités de disque dur. Les tableaux suivants peuvent vous aider à déterminer les règles de
configuration du système d'exploitation, de l'adaptateur, du type machine et de l'unité applicables à votre
système.
Après avoir examiné les règles de configuration, vous pouvez installer l'unité de la même manière que
pour tout autre unité. Pour plus d'informations, voir «Installation d'une unité de disques ou d'une unité
SSD».
Restriction : Les unités SSD doivent faire partie intégrante d'une grappe RAID ou d'un miroitage du
système ().
Choisissez dans la liste suivante le système d'exploitation du serveur ou de la partition sur lequel vous
souhaitez installer les disques :
v Règles relatives aux unités SSD pour le système d'exploitation AIX
v Règles relatives aux unités SSD pour le système d'exploitation Linux®
v Règles relatives aux unités SSD pour le système d'exploitation
Avertissement : Avant d'installer une unité SSD sur votre système, vous devez vous assurer de disposer
de toutes les mises à jour préalablement requises pour l'unité SSD et les adaptateurs. .
1
Que faire ensuite
Tableau 1. Règles relatives aux unités SSD pour le système d'exploitation AIX
Modèle du type de machine Carte Règles d'association
58/86 v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5906 et 5908) - Carte SAS RAID
PCI-X DDR 1,5 Go de cache (BSC)
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et les unités de
disque dur ne peuvent pas être
combinées dans ce boîtier d'unité.
v Jusqu'à huit unités sont prises en
charge dans cette configuration.
v Un seul 58/86 peut être raccordé à
la carte, ou paire de cartes, lorsque
des unités SSD sont installées.
v Les mêmes règles s'appliquent à
l'utilisation de cartes SAS RAID à
double canal connectées à 5886.
33E/8B et 36E/8C v (5679) - Activation de SAS RAID :
carte interne
v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
jamais être combinées dans la
même grappe RAID ni le même
groupe de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques du système dès lors que ce
dernier n'est pas divisé.
v Les unités SSD et HDD peuvent
coexister dans un fond de panier
divisé si chaque côté du fond de
panier inclut uniquement l'unité
SSD ou HDD.
Restriction : Les fonds de panier
divisés pour ces modèles ne sont
pas pris en charge avec les cartes
SAS RAID à double canal.
A faire : L'unité SSD doit être située
d'un côté du fond de panier divisé et
l'unité HDD, de l'autre côté. Ces
types d'unité ne peuvent pas être
combinés au sein d'un jeu d'unités à
contrôle de parité intégré. Toutefois,
si vous décidez de le faire,
assurez-vous de disposer d'un
nombre suffisant d'unités de chaque
type pour mener à bien votre
programme de protection.
2 Unités de disques
Tableau 1. Règles relatives aux unités SSD pour le système d'exploitation AIX (suite)
Modèle du type de machine Carte Règles d'association
17M/MB et 79M/HB v (5652) - Fond de panier du
disque/du support de stockage
v (5662) - Carte d'activation Cache
RAID de 175 Mo - Adaptateur
d'E-S double stockage
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans la même
grappe RAID ni le même groupe
de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques système.
A faire : Vous ne devez pas
combiner ces types d'unité au sein
d'un jeu d'unités à contrôle de
parité intégré. Toutefois, si vous
décidez de le faire, assurez-vous de
disposer d'un nombre suffisant
d'unités de chaque type pour
mener à bien votre programme de
protection.
58/02 (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de 380
Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans un même
fond de panier d'unité de disque.
v Les unités SSD et HDD peuvent
coexister dans un fond de panier
divisé si chaque côté du fond de
panier inclut uniquement l'unité
SSD ou HDD. L'unité SSD doit être
située d'un côté du fond de panier
divisé et l'unité HDD, de l'autre
côté.
v Les unités SSD et HDD ne peuvent
pas être associées sur une paire de
cartes 5903.
v Une paire de cartes 5903 peut
prendre en charge un maximum de
9 SSD.
Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux.
Modèle du type de machine Carte Règles d'association
58/86 v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5906 et 5908) - Carte SAS RAID
PCI-X DDR 1,5 Go de cache (BSC)
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et les unités de
disque dur ne peuvent pas être
combinées dans ce boîtier d'unité.
v Jusqu'à huit unités sont prises en
charge dans cette configuration.
v Un seul 58/86 peut être raccordé à
la carte, ou paire de cartes, lorsque
des unités SSD sont installées.
v Les mêmes règles s'appliquent à
l'utilisation de cartes SAS RAID à
double canal connectées à 5886.
Unités de disques 3
Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux. (suite)
Modèle du type de machine Carte Règles d'association
33E/8B et 36E/8C v (5679) - Activation de SAS RAID :
carte interne
v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
jamais être combinées dans la
même grappe RAID ni le même
groupe de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques du système dès lors que ce
dernier n'est pas divisé.
v Les unités SSD et HDD peuvent
coexister dans un fond de panier
divisé si chaque côté du fond de
panier inclut uniquement l'unité
SSD ou HDD.
Restriction : Les fonds de panier
divisés pour ces modèles ne sont
pas pris en charge avec les cartes
SAS RAID à double canal.
A faire : L'unité SSD doit être située
d'un côté du fond de panier divisé et
l'unité HDD, de l'autre côté. Ces
types d'unité ne peuvent pas être
combinés au sein d'un jeu d'unités à
contrôle de parité intégré. Toutefois,
si vous décidez de le faire,
assurez-vous de disposer d'un
nombre suffisant d'unités de chaque
type pour mener à bien votre
programme de protection.
17M/MB et 79M/HB v (5652) - Fond de panier du
disque/du support de stockage
v (5662) - Carte d'activation Cache
RAID de 175 Mo - Adaptateur
d'E-S double stockage
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans la même
grappe RAID ni le même groupe
de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques système.
A faire : Vous ne devez pas
combiner ces types d'unité au sein
d'un jeu d'unités à contrôle de
parité intégré. Toutefois, si vous
décidez de le faire, assurez-vous de
disposer d'un nombre suffisant
d'unités de chaque type pour
mener à bien votre programme de
protection.
4 Unités de disques
Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux. (suite)
Modèle du type de machine Carte Règles d'association
58/02 (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de 380
Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans un même
fond de panier d'unité de disque.
v Les unités SSD et HDD peuvent
coexister dans un fond de panier
divisé si chaque côté du fond de
panier inclut uniquement l'unité
SSD ou HDD. L'unité SSD doit être
située d'un côté du fond de panier
divisé et l'unité HDD, de l'autre
côté.
v Les unités SSD et HDD ne peuvent
pas être associées sur une paire de
cartes 5903.
v Une paire de cartes 5903 peut
prendre en charge un maximum de
9 SSD.
Tableau 3. Règles relatives aux unités SSD pour le système d'exploitation
Modèle du type de machine Carte Règles d'association
58/86 v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5906 et 5908) - Carte SAS RAID
PCI-X DDR 1,5 Go de cache (BSC)
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et les unités de
disque dur ne peuvent pas être
combinées dans ce boîtier d'unité.
v Jusqu'à huit unités sont prises en
charge dans cette configuration.
v Un seul 58/86 peut être raccordé à
la carte, ou paire de cartes, lorsque
des unités SSD sont installées.
v Les mêmes règles s'appliquent à
l'utilisation de cartes SAS RAID à
double canal connectées à 5886.
33E/8B et 36E/8C v (5679) - Activation de SAS RAID :
carte interne
v (5904) - Carte SAS RAID PCI-X
DDR 1,5 Go de cache
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans la même
grappe RAID ni le même groupe
de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques système.
A faire : Vous ne devez pas
combiner ces types d'unité au sein
d'un jeu d'unités à contrôle de
parité intégré. Toutefois, si vous
décidez de le faire, assurez-vous de
disposer d'un nombre suffisant
d'unités de chaque type pour
mener à bien votre programme de
protection.
Unités de disques 5
Tableau 3. Règles relatives aux unités SSD pour le système d'exploitation (suite)
Modèle du type de machine Carte Règles d'association
17M/MB et 79M/HB v (5652) - Fond de panier du
disque/du support de stockage
v (5662) - Carte d'activation Cache
RAID de 175 Mo - Adaptateur
d'E-S double stockage
v (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de
380 Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans la même
grappe RAID ni le même groupe
de mise en miroir.
v Elles peuvent être associées dans le
fond de panier de l'unité de
disques système.
A faire : Vous ne devez pas
combiner ces types d'unité au sein
d'un jeu d'unités à contrôle de
parité intégré. Toutefois, si vous
décidez de le faire, assurez-vous de
disposer d'un nombre suffisant
d'unités de chaque type pour
mener à bien votre programme de
protection.
58/02 (5903) - Carte SAS RAID PCIe à
double canal x4 3 Go de cache de 380
Mo
v Les unités SSD et HDD ne peuvent
pas être combinées dans un même
fond de panier d'unité de disque.
v Les unités SSD et HDD peuvent
coexister dans un fond de panier
divisé si chaque côté du fond de
panier inclut uniquement l'unité
SSD ou HDD. L'unité SSD doit être
située d'un côté du fond de panier
divisé et l'unité HDD, de l'autre
côté.
v Les unités SSD et HDD ne peuvent
pas être associées sur une paire de
cartes 5903.
v Une paire de cartes 5903 peut
prendre en charge un maximum de
9 SSD.
6 Unités de disques
Tâches associées
«Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX
sous tension»
Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension
alors que le système d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de
disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche.
«Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux
sous tension», à la page 15
Il peut être nécessaire d'installer une unité de disques ou une unité SSD sur un système ou une unité
d'extension alors que le système d'exploitation ou la partition logique Linux qui contrôle l'emplacement
de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette
tâche.
«Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors
tension», à la page 37
Découvrez comment installer une unité de disques sur un système ou une partition logique qui contrôle
l'emplacement de l'unité de disques une fois l'alimentation coupée. La procédure décrite dans cette
information permet d'effectuer cette tâche.
Information associée
Câblage du sous-système du disque 5802
Installation d'une unité de disques ou d'une unité SSD sur un système
ou une partition logique AIX sous tension
Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension
alors que le système d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de
disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche.
Avant de commencer
Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de
maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique AIX
sous tension», à la page 46. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau
drive de mémoire.
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez
celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des
informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un
dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas
d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la
partition qui contrôle l'unité de disques.
Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus
d'informations, voir «Unités SSD (Solid State Drive)», à la page 1.
Unités de disques 7
Pourquoi et quand exécuter cette tâche
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui
doivent être exécutées avec précision pour éviter des pertes de données ou un endommagement du
système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en
termes de pertes de données ou d'endommagement du système. Pour plus d'informations, voir
«Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors
tension», à la page 37.
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
8 Unités de disques
Préparation du système
Procédure
1. Effectuez les tâches préalablement requises décrites dans la section «Avant de commencer», à la page
143.
2. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez l'armoire.
3. Le cas échéant, retirez le panneau avant.
4. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension.
Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance
simultanée», à la page 90.
Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de
l'unité centrale. Toutefois, vous pouvez choisir une autre position pour l'unité de disques, en fonction
de votre stratégie de protection des données.
5. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques
peut être placée en P3-D4.
6. Pour installer une unité de disques, procédez comme suit.
Installation d'une unité de disques
Pourquoi et quand exécuter cette tâche
1. Connectez-vous en tant que superutilisateur.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Appuyez sur Entrée pour continuer.
4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches.
5. Sur l'écran de sélection des tâches, sélectionnez Remplacement à chaud.
6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
7. Sélectionnez Connexion d'une unité à une unité SES. La liste des emplacements vides du boîtier de
connexion à chaud SCSI s'affiche.
8. Sélectionnez l'emplacement dans lequel vous allez installer l'unité de disques, puis appuyez sur
Entrée.
L'emplacement de l'unité de disques affiche l'état Identification. Le voyant de maintenance
simultanée de l'emplacement commence à clignoter. Pour afficher l'emplacement des voyants de
maintenance simultanée, choisissez le modèle sur lequel vous souhaitez installer une unité de
disques. Pour une liste des modèles, voir «Emplacements des unités de disque et voyants de
maintenance simultanée», à la page 90.
9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci
pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion de l'unité de
disque.
Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques.
Important : .
Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 1, à la
page 10).
Unités de disques 9
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
10. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit.
Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir
figure suivante).
Figure 1. Retrait d'un obturateur d'unité de disques
10 Unités de disques
11. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit.
Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de
l'obturateur.
12. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
13. Retirez l'unité de disques de son emballage antistatique.
14. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas
complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité
d'extension.
15. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à
l'étape suivante.
Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 2. Retrait du panneau frontal d'un obturateur d'unité de disques
Unités de disques 11
16. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à
l'étape suivante.
Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal (A) sur l'unité
de disques.
17. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale. Ne la tenez pas par la poignée.
18. Faites glisser complètement l'unité de disques dans l'unité centrale (voir figure 4, à la page 13).
19. Poussez complètement la poignée de l'unité de disques. Le voyant de l'unité de disques s'allume.
Figure 3. Installation d'un panneau frontal sur une unité de disques
12 Unités de disques
Figure 4. Installation d'une unité de disques sur un modèle
Figure 5. Installation d'une unité de disques à encombrement réduit
Unités de disques 13
20. Appuyez sur Entrée.
21. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 9 à 20 jusqu'à installation
de toutes les unités.
22. Annulez pour revenir au menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
23. Sélectionnez Configuration des unités ajoutées/remplacées.
24. Retournez à la ligne de commande.
Fin de la procédure
Pourquoi et quand exécuter cette tâche
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet du système, de l'unité d'extension ou de l'armoire.
3. Vérifiez le composant installé. Pour obtenir des instructions, consultez la rubrique «Vérification d'un
dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX», à la
page 195.
4. Pour préparer l'utilisation de la nouvelle unité de disques installée, voir «Préparation d'une unité de
disques dans un système ou une partition logique AIX», à la page 100.
Figure 6. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
14 Unités de disques
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Installation d'une unité de disques ou d'une unité SSD sur un système
ou une partition logique Linux sous tension
Il peut être nécessaire d'installer une unité de disques ou une unité SSD sur un système ou une unité
d'extension alors que le système d'exploitation ou la partition logique Linux qui contrôle l'emplacement
de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette
tâche.
Avant de commencer
Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de
maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique Linux
sous tension», à la page 55. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau
drive de mémoire.
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez
celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des
informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un
dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas
d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la
partition qui contrôle l'unité de disques.
Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus
d'informations, voir «Unités SSD (Solid State Drive)», à la page 1.
Unités de disques 15
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Avertissement : Avant de commencer, assurez-vous d'avoir pris connaissance des présentes instructions
et de bien maîtriser la complexité de la procédure utilisant le mode Sous tension.
Si vous êtes dans l'un des cas suivants, voir «Installation d'une unité de disques ou d'une unité SSD sur
un système ou une partition logique hors tension», à la page 37.
v Vous installez une unité de disques qui sera connectée à l'adaptateur SCSI 6203 ou 6204.
v Vous ne vous sentez pas à l'aise avec la procédure d'installation en mode sous tension.
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
16 Unités de disques
Pourquoi et quand exécuter cette tâche
Préparation du système
Procédure
1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
2. Le cas échéant, retirez le volet de l'unité centrale, de l'unité d'extension ou de l'armoire.
3. Le cas échéant, retirez le panneau avant.
v Si vous avez un panneau avant d'un autre modèle, retirez-le.
4. Placez-vous devant l'unité centrale ou l'unité d'extension et identifiez le premier emplacement d'unité
de disques disponible. Pour plus d'informations, voir «Emplacements des unités de disque et voyants
de maintenance simultanée», à la page 90.
Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de
l'unité centrale. Toutefois, vous pouvez choisir une autre position pour l'unité de disques, en fonction
de votre stratégie de protection des données.
5. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques
peut être positionnée en P3–D4.
Résultats
Installez une unité de disques SCSI dans une unité centrale ou une unité d'extension via un noyau 2.4
Linux ou un noyau 2.6 Linux.
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Installation d'une unité de disques utilisant le noyau 2.4 Linux
Si vous installez une unité de disques à un emplacement contrôlé par le noyau 2.4 Linux, appliquez la
procédure décrite dans cette section pour effectuer cette tâche.
Installation d'une unité de disques :
Procédure
1. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci
pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape 7, à la page 20.
Unités de disques 17
Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement (voir figure 7).
b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir
figure suivante).
Figure 7. Retrait du panneau obturateur sur un modèle
18 Unités de disques
c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) de l'unité de disques
du bord avant de l'obturateur.
2. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
3. Retirez l'unité de disques de son emballage antistatique. Inscrivez le numéro de série (les quatre
derniers chiffres) de l'unité de disques.
4. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas
complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité
d'extension.
5. Si votre unité de disques n'est pas dotée d'un panneau frontal, procédez comme suit pour l'installer :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 8. Retrait du panneau frontal de l'unité de disques
Unités de disques 19
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité
de disques.
6. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
7. Faites glisser l'unité de disques dans le système.
Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
bloquez-la dans son emplacement en poussant la poignée (voir figure 10, à la page 21).
Figure 9. Installation d'un panneau frontal sur une unité de disques
20 Unités de disques
Figure 10. Installation d'une unité de disques sur un modèle
Figure 11. Installation d'une unité de disques à encombrement réduit
Unités de disques 21
Remarque : Le voyant de maintenance simultanée ne clignote plus et reste allumé lorsque l'unité de
disques est correctement installée.
Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle
approprié dans la liste suivante :
v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers
d'unité SCSI 31D/24 et 31T/24», à la page 98
v «Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du
disque 58/86», à la page 96
v «Emplacements des unités de disques et voyants de maintenance simultanée sur une unité
d'extension 58/02», à la page 100
8. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 17 à 7, à la page 20.
9. Connectez-vous en tant que superutilisateur.
10. Après avoir installé l'unité de disques, attendez 30 secondes, puis tapez iprconfig sur la ligne de
commande de la session Linux. Appuyez sur Entrée. L'écran Work with Disk Units apparaît.
11. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur Entrée.
L'écran Display Hardware Status apparaît (voir figure suivante).
Figure 12. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
22 Unités de disques
12. Localisez l'unité de disques que vous avez installée. Le numéro de série peut être 0.
Si l'unité de disques apparaît en étant définie comme disque de modèle 50, vous avez terminé. Si tel
n'est pas le cas, procédez comme suit :
a. Recherchez le nom /dev/sdx associé à l'unité de disques que vous avez installée.
b. Retournez à la ligne de commande.
c. Tapez sg_map sur la ligne de commande et appuyez sur Entrée.
d. Utilisez la sortie de la commande sg_map pour comparer l'unité /dev/sdx à une unité /dev/sgx.
e. Tapez sg_inq /dev/sgx sur la ligne de commande en utilisant la valeur /dev/sgx de l'étape
précédente. Appuyez sur Entrée.
f. Tapez iprconfig sur la ligne de commande et appuyez sur Entrée.
g. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur
Entrée.
h. Le numéro de série de la nouvelle unité de disques s'affiche dans l'écran Display Hardware
Status.
13. Notez le nom de ressource lorsque vous configurez l'unité de disques.
Suivez la procédure :
Procédure
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire.
3. Configurez la nouvelle unité de disques en fonction de votre environnement. Pour plus
d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux.
Display Disk Hardware Status
Serial Vendor Product Resource Hardware
Number ID ID Model Name Status
03132041 IBM 2780001 001 /dev/ipr3 Operational
000015A5 IBM ST373453LC 079 DPY/Active
E3V1WEAB IBM IC35L036UCDY10-0 070 DPY/Active
E3V0J55B IBM IC35L036UCDY10-0 079 DPY/Active
E3V1W1UB IBM IC35L036UCDY10-0 070 DPY/Active
04B81193 IBM 3C98D203 210 /dev/sdi Operational
000227CE IBM ST336605LC 020 /dev/sdh Operational
00000001 IBM 570B001 001 /dev/ipr2 Operational
00006719 IBM ST336607LC 020 /dev/sdd Operational
E3V0E77B IBM IC35L036UCDY10-0 020 /dev/sde Operational
0007E0C4 IBM ST373453LC 020 /dev/sdf Operational
00000001 IBM 5702001 001 /dev/ipr1 Operational
03060038 IBM 5703001 001 /dev/ipr0 Operational
More...
Press Enter to continue.
e=Exit q=Cancel r=Refresh f=PageDn b=PageUp
d=Display disk unit details p=Display device parity status
Figure 13. Ecran Display Disk Hardware Status
Unités de disques 23
Concepts associés
Retrait et remplacement de volets et carters
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Installation d'une unité de disques utilisant le noyau 2.6 Linux
Si vous installez une unité de disques à un emplacement contrôlé par le noyau 2.6 Linux, appliquez la
procédure décrite dans cette section pour effectuer cette tâche.
Installation d'une unité de disques :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Tapez lsslot sur la ligne de commande de la session Linux et appuyez sur Entrée.
Un écran similaire à la figure suivante apparaît.
3. Notez le nom Linux de l'emplacement de la carte, sous lequel vous voulez installer l'unité de
disques.
4. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran
Configuration Utility apparaît.
5. Sélectionnez Work with disk unit recovery sur l'écran Configuration Utility. Appuyez sur Entrée.
6. Sélectionnez Concurrent add device dans l'écran Work with Disk Unit Recovery. Appuyez sur
Entrée.
Un écran Concurrent Device Add similaire à la figure suivante apparaît.
7. Tapez l'option 1 à côté de l'emplacement dans lequel vous voulez installer l'unité de disques, puis
appuyez sur Entrée.
x0537p1:~ # lsslot
# Slot Description Linux Name Device(s)
U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet
U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci
U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166
U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166
U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180
U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet
U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb
U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi
U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide
U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty
Figure 14. Nom Linux correspondant à l'emplacement d'installation approprié pour l'unité de disques.
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 15. Exemple d'écran Concurrent Device Remove
24 Unités de disques
L'écran Verify Device Concurrent Add apparaît.
Un voyant de maintenance simultanée indique l'emplacement où l'unité peut être installée.
8. Appuyez sur Entrée dans l'écran Verify Device Concurrent Add pour vérifier l'emplacement.
9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci
pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape 16, à la page 27.
Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement (voir figure 16).
b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir
figure suivante).
Figure 16. Retrait du panneau obturateur sur un modèle
Unités de disques 25
c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) de l'unité de disques
du bord avant de l'obturateur.
10. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
11. Retirez l'unité de disques de son emballage antistatique. Inscrivez le numéro de série (les quatre
derniers chiffres) de l'unité de disques.
12. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas
complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité
d'extension.
13. Si votre unité de disques n'est pas dotée d'un panneau frontal, procédez comme suit pour l'installer :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 17. Retrait du panneau frontal sur l'unité de disques.
26 Unités de disques
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité
de disques.
14. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
15. L'écran Complete Device Concurrent Add apparaît.
16. Faites glisser l'unité de disques dans le système.
Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
bloquez-la dans son emplacement en poussant la poignée (voir figure 19, à la page 28).
Figure 18. Fixation du panneau frontal à l'unité de disques
Unités de disques 27
Figure 19. Installation d'une unité de disques sur un modèle
Figure 20. Installation d'une unité de disques à encombrement réduit
28 Unités de disques
17. Appuyez sur Entrée. Le voyant de maintenance simultanée ne clignote plus et reste allumé lorsque
l'unité de disques est correctement installée.
Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle
approprié dans la liste suivante :
v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers
d'unité SCSI 31D/24 et 31T/24», à la page 98
v «Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du
disque 58/86», à la page 96
v «Emplacements des unités de disques et voyants de maintenance simultanée sur une unité
d'extension 58/02», à la page 100
18. Si vous installez plusieurs unités de disques, répétez les étapes 2, à la page 24 à 17 jusqu'à
installation de toutes les unités.
Vérification du fonctionnement de la nouvelle unité de disques :
Pourquoi et quand exécuter cette tâche
Pour vérifier si l'unité de disques fonctionne, procédez comme suit :
1. Connectez-vous en tant que superutilisateur.
2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée.L'écran
Configuration Utility apparaît.
3. Sélectionnez Display hardware status.
L'écran Display Hardware Status apparaît (voir figure suivante).
Figure 21. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
Unités de disques 29
4. Vérifiez que l'unité de disques installée apparaît à l'écran.
Suivez la procédure :
Pourquoi et quand exécuter cette tâche
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire.
3. Configurez la nouvelle unité de disques en fonction de votre environnement. Pour plus
d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux.
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Installation d'une unité de disques ou d'une unité SSD sur un système
ou une partition logique Virtual I/O Server sous tension
Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension
alors que le système d'exploitation ou la partition logique Virtual I/O Server qui contrôle l'emplacement
de l'unité de disques est sous tension. La procédure décrite dans cette information permet d'effectuer
cette tâche.
Avant de commencer
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez
celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des
informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un
dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 22. Exemple d'écran Display Hardware Status
30 Unités de disques
d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la
partition qui contrôle l'unité de disques.
Pourquoi et quand exécuter cette tâche
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui
doivent être exécutées avec précision pour éviter des pertes de données ou un endommagement du
système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en
termes de pertes de données ou d'endommagement du système. Pour plus d'informations, voir
«Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors
tension», à la page 37.
Unités de disques 31
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Préparation du système
Procédure
1. Effectuez les tâches prérequises décrites dans «Avant de commencer», à la page 143.
2. Retirez les capots avant. Pour plus d'informations, voir «Carters système», à la page 157
3. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension.
Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance
simultanée», à la page 90.
Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de
l'unité centrale. Toutefois, vous pouvez sélectionner une autre position pour l'unité de disques, en
fonction de votre stratégie de protection des données.
4. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques
peut être placée en P3-D4.
5. Pour installer une unité de disques, procédez comme suit.
Installation d'une unité de disques
Pourquoi et quand exécuter cette tâche
1. Connectez-vous en tant que superutilisateur.
2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée.
3. Appuyez sur Entrée pour continuer.
4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches.
5. Sur l'écran de sélection des tâches, sélectionnez Remplacement à chaud.
6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
7. Sélectionnez Connexion d'une unité à une unité SES. La liste des emplacements vides du boîtier de
connexion à chaud SCSI s'affiche.
8. Sélectionnez l'emplacement dans lequel vous allez installer l'unité de disques, puis appuyez sur
Entrée.
L'emplacement de l'unité de disques affiche l'état Identification. Le voyant de maintenance
simultanée de l'emplacement commence à clignoter. Pour afficher l'emplacement des voyants de
maintenance simultanée, choisissez le modèle sur lequel vous souhaitez installer une unité de
disques. Pour une liste des modèles, voir «Emplacements des unités de disque et voyants de
maintenance simultanée», à la page 90.
9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci
pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion de l'unité de
disque.
Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques.
Important : .
Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 23, à
la page 33).
32 Unités de disques
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
10. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit.
Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir
figure suivante).
Figure 23. Retrait d'un obturateur d'unité de disques
Unités de disques 33
11. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit.
Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de
l'obturateur.
12. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
13. Retirez l'unité de disques de son emballage antistatique.
14. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas
complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité
d'extension.
15. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à
l'étape suivante.
Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 24. Retrait du panneau frontal d'un obturateur d'unité de disques
34 Unités de disques
16. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à
l'étape suivante.
Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal (A) sur l'unité
de disques.
17. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale. Ne la tenez pas par la poignée.
18. Faites glisser complètement l'unité de disques dans l'unité centrale (voir figure 26, à la page 36).
19. Poussez complètement la poignée de l'unité de disques. Le voyant de l'unité de disques s'allume.
Figure 25. Installation d'un panneau frontal sur une unité de disques
Unités de disques 35
Figure 26. Installation d'une unité de disques sur un modèle
Figure 27. Installation d'une unité de disques à encombrement réduit
36 Unités de disques
20. Appuyez sur Entrée.
21. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 32 à 20 jusqu'à
installation de toutes les unités.
22. Annulez pour revenir au menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
23. Sélectionnez Configuration des unités ajoutées/remplacées.
24. Retournez à la ligne de commande.
Suivez la procédure
Pourquoi et quand exécuter cette tâche
1. Réinstallez le panneau avant. Pour plus d'informations sur le remplacement du capot, voir «Carters
système», à la page 157.
2. Vérifiez le composant installé. Pour plus d'informations, voir «Vérification du composant installé», à
la page 195.
Installation d'une unité de disques ou d'une unité SSD sur un système
ou une partition logique hors tension
Découvrez comment installer une unité de disques sur un système ou une partition logique qui contrôle
l'emplacement de l'unité de disques une fois l'alimentation coupée. La procédure décrite dans cette
information permet d'effectuer cette tâche.
Avant de commencer
Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de
maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique hors
tension», à la page 73. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau
drive de mémoire.
Figure 28. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
Unités de disques 37
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez
celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des
informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir
http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/hmcinstall.htm. Si vous ne
disposez pas d'une console HMC, procédez comme suit pour installer une unité de disque du système ou
de la partition qui contrôle l'unité de disques.
Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus
d'informations, voir «Unités SSD (Solid State Drive)», à la page 1.
38 Unités de disques
Pourquoi et quand exécuter cette tâche
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Préparation du système
Procédure
1. Effectuez les tâches préalablement requises décrites dans la section «Avant de commencer», à la page
143.
Remarque : Ne coupez pas l'alimentation lorsque vous arrêtez le système ou la partition logique afin
d'installer une unité de disque ou une unité SSD.
2. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez l'armoire.
3. Le cas échéant, retirez le panneau avant.
Unités de disques 39
4. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension.
Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance
simultanée», à la page 90.
Remarque : Si le système est doté d'une unité d'extension, vous devez d'abord combler les
emplacements de l'unité centrale. Toutefois, vous pouvez sélectionner une autre position pour l'unité
de disques, en fonction de votre stratégie de protection des données.
5. Arrêtez le système ou la partition qui contrôle l'emplacement de l'unité de disques. Pour obtenir des
instructions, voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/
crustopsys.htm.
Installation d'une unité de disques
Pourquoi et quand exécuter cette tâche
1. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci
pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion du disque.
Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 29).
b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir
figure suivante).
Figure 29. Retrait d'un obturateur d'unité de disques d'un modèle
40 Unités de disques
c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de
l'obturateur.
2. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
3. Retirez l'unité de disques de son emballage protecteur. Inscrivez le numéro de série (les quatre
derniers chiffres) de l'unité de disques.
4. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de
disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques
dans l'unité centrale ou l'unité d'extension.
5. Si un panneau frontal n'est pas préinstallé sur l'unité de disques, procédez comme suit pour l'installer
sur la nouvelle unité :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 30. Retrait du panneau frontal d'une unité de disques
Unités de disques 41
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de
disques.
6. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
7. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension.
Remarque : Lors de l'installation de l'unité de disques, vérifiez qu'elle est complètement et
parfaitement insérée dans le système.
8. Poussez complètement la poignée de l'unité de disques (voir figure 32, à la page 43).
Figure 31. Installation d'un panneau frontal sur une unité de disques
42 Unités de disques
Figure 32. Installation d'une unité de disques sur un modèle
Figure 33. Installation d'une unité de disques à encombrement réduit
Unités de disques 43
9. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 40 à 8, à la page 42
jusqu'à installation de toutes les unités.
Suivez la procédure
Pourquoi et quand exécuter cette tâche
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire.
3. Si les câbles d'alimentation système ont été retirés, rebranchez-les.
4. Démarrez le système ou la partition logique. Pour plus d'informations, voir «Démarrage du système
ou de la partition logique», à la page 151.
5. Pour configurer la nouvelle unité de disques, sélectionnez la procédure correspondant au système
d'exploitation qui contrôle l'unité de disques.
v «Préparation d'une unité de disques dans un système ou une partition logique AIX», à la page 100
v Pour plus d'informations sur la configuration de la nouvelle unité de disques dans un
environnement Linux, voir PCI-X SCSI RAID Controller Reference Guide for Linux.
Figure 34. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
44 Unités de disques
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Retrait d'une unité de disques ou d'une unité SSD
Il peut être nécessaire de retirer une unité de disques ou une unité SSD du serveur ou de l'unité
d'extension. La présente section fournit des informations sur le boîtier d'unités de disques et les
procédures permettant d'effectuer ces tâches.
Pour retirer et remplacer une unité de disques défaillante, voir «Remplacement d'une unité de disques».
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer
l'unité de disques. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page
194.
Cette rubrique explique comment retirer une unité de disques sur une unité centrale ou une unité
d'extension contrôlée par différents systèmes d'exploitation.
«Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension», à la
page 46
«Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension», à la
page 55
Remplacement d'une unité de disques
Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS
(serial-attached Attached SCSI) sur le serveur ou l'unité d'extension. Les procédures de cette section
permettent d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Remarque : Ces instructions (les trois premières sections) ne concernent que le remplacement d'une unité
de disques SAS.
Pour retirer et remplacer une unité d'extension dans un boîtier différent, utilisez les informations du
boîtier.
Suivez cette procédure pour retirer une unité de disques défaillante et la remplacer par le même type. Si
vous envisagez de retirer une unité de disques et d'installer un type différent, retirez l'unité existante
(voir «Retrait d'une unité de disques ou d'une unité SSD») et installez la nouvelle unité de disques (voir
«Installation d'une unité de disques ou d'une unité SSD», à la page 1). Si le système est géré par la
console HMC (Hardware Management Console), utilisez-la pour remplacer l'unité de disques. Pour plus
d'informations, voir «Remplacement d'un composant via la console HMC», à la page 194.
Votre système ou votre partition peut être indifféremment hors tension ou sous tension lorsque vous
remplacez l'unité de disques. Si l'unité de disques remplacée se trouve dans le groupe de volumes racine
(rootvg) AIX ou Linux et n'est pas protégée, mettez le système hors tension pour remplacer le disque
(voir la procédure appropriée).
Unités de disques 45
Remplacement d'une unité de disques sur un système ou une partition
logique AIX sous tension
Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS
(serial-attached Attached SCSI) sur un système ou une unité d'extension alors que le système
d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de disques est sous
tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche.
Avant de commencer
Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une
unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous tension», à la
page 7. Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de
suivre la procédure suivante.
46 Unités de disques
Pourquoi et quand exécuter cette tâche
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui
doivent être exécutées avec exactitude pour éviter des pertes de données ou un endommagement du
système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en
termes de pertes de données ou d'endommagement du système. Si le système ou la partition logique qui
contrôle l'unité de disques à remplacer est sur la ligne de commande AIX, vous pouvez utiliser cette
procédure avec l'alimentation sous tension. Si l'unité de disques à remplacer se trouve dans le groupe de
volumes racine (rootvg) et qu'elle n'est pas protégée par la technologie RAID (Redundant Array of
Independent Disks) ou la mise en miroir, ou pour utiliser la procédure hors tension, voir «Remplacement
d'une unité de disques sur un système ou une partition logique hors tension», à la page 73.
Unités de disques 47
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Préparation du système
Procédure
1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la
page 143.
2. Préparez-vous à retirer l'unité de disques de l'unité centrale ou de l'unité d'extension (voir
«Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le
système d'exploitation AIX», à la page 100).
3. Le cas échéant, retirez le volet de l'unité centrale, de l'unité d'extension ou de l'armoire.
4. Le cas échéant, retirez le panneau avant.
v Si vous avez un panneau avant d'un autre modèle, retirez-le.
Résultats
Pour remplacer une unité de disques JBOD (Just a Bunch of Disks) connectée à une carte SCSI ou SCSI
RAID, voir «Remplacement d'une unité de disques SCSI», à la page 81.
Retrait d'une unité de disques
Procédure
1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à
Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de
l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes
suivantes.
2. Connectez-vous en tant que superutilisateur.
3. A l'invite, tapez diag et appuyez sur Entrée.
4. Appuyez sur Entrée pour continuer.
5. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches.
6. Sélectionnez Remplacement à chaud.
7. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud ou Unités RAID
remplaçables à chaud, en fonction de l'adaptateur auquel votre unité est connectée.
Conseil : L'une, l'autre ou ces deux options peuvent être utilisées, en fonction de la configuration de
votre système. Si l'unité que vous désirez remplacer ne figure pas dans la liste lorsque vous suivez la
procédure du Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud, passez à l'étape 8.
v Si une unité de disques RAID est connectée à une carte PCI SCSI RAID 2498, passez à l'étape 8.
v Si une unité de disques JBOD est connectée à une carte SCSI ou SCSI RAID, voir «Remplacement
d'une unité de disques SCSI», à la page 81.
v Si des unités RAID sont connectées à d'autres cartes, voir PCI-X SCSI RAID Controller Reference
Guide for AIX.
8. Sélectionnez Unités RAID remplaçables à chaud.
9. Dans la liste de sélection des ressources, sélectionnez la carte connectée à l'ensemble contenant
l'unité de disques à remplacer. Appuyez sur Entrée.
10. Sélectionnez Validation.
11. Dans le menu Identification et retrait de ressources, sélectionnez Retrait. Appuyez sur Entrée.
La liste des unités de disques pouvant être retirées apparaît. Si l'unité de disques à retirer ne figure
pas dans la liste, demandez à l'administrateur système de la placer dans l'état Incident avant de
poursuivre cette procédure. Pour plus d'informations, voir AIX System Management Guide: Operating
System and Devices.
48 Unités de disques
12. Sélectionnez l'unité à retirer et appuyez sur Entrée. Le voyant de maintenance simultanée
correspondant à l'emplacement clignote rapidement.
Avertissement : Si ce n'est pas le cas, laissez l'unité à sa place.
Pour afficher l'emplacement des voyants de maintenance simultanée, consultez «Emplacements des
unités de disque et voyants de maintenance simultanée», à la page 90.
13. Déverrouillez la poignée de l'unité de disques en appuyant dessus et en la tirant vers vous (voir
figure 35).
Figure 35. Retrait d'une unité de disques d'un modèle
Unités de disques 49
14. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de
l'unité d'extension. Ne la tenez pas par la poignée.
Figure 36. Retrait d'une unité de disques à encombrement réduit
Figure 37. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
50 Unités de disques
Le voyant de maintenance simultanée de l'emplacement s'éteint lorsque vous retirez l'unité de
disques.
15. Votre nouvelle unité de disques est-elle déjà équipée d'un panneau frontal ?
v Non : continuez la procédure à ce stade.
v Oui : Passez à l'étape 6, à la page 52 de la procédure de remplacement.
Important : Si le panneau frontal de l'unité de disques n'est pas doté d'une vis moletée, il s'agit
d'une unité de disques SAS (serial-attached Attached SCSI) non amovible. L'unité de remplacement
doit alors être pourvue d'un panneau frontal fixé.
Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques,
comme illustré dans la figure suivante.
Remplacement d'une unité de disque
Procédure
1. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
2. Retirez l'unité de disques de son emballage protecteur.
3. Déverrouillez la poignée de l'unité de disques de remplacement en la tirant vers vous. Si la poignée
n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans le système.
4. Si la nouvelle unité de disques n'est pas déjà équipée d'un panneau frontal, procédez comme suit :
Fixez le panneau frontal (A) à l'unité de disques de remplacement, comme illustré dans la figure
suivante.
5. Si la nouvelle unité de disques n'est pas déjà équipée d'un panneau frontal, procédez comme suit :
Figure 38. Retrait du panneau frontal d'une unité de disques
Unités de disques 51
Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de
disques.
6. Installez l'unité de disques (voir figure 40, à la page 53).
a. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale. Ne la tenez pas par la poignée.
b. Faites glisser complètement l'unité de disques dans l'unité centrale.
Important : Lorsque vous installez une unité de disques, vous devez vérifier que l'unité est
complètement et parfaitement insérée dans le système.
c. Poussez complètement la poignée de l'unité de disques.
Figure 39. Installation d'un panneau frontal sur une unité de disques
52 Unités de disques
Figure 40. Installation d'une unité de disques sur un modèle
Figure 41. Installation d'une unité de disques à encombrement réduit
Unités de disques 53
7. Appuyez sur Entrée. L'emplacement de l'unité de disques passe de l'état Retrait à l'état Normal.
8. Sélectionnez le menu Unités RAID remplaçables à chaud. Appuyez sur F3 pour revenir à l'écran
précédent.
9. Passez à l'écran de sélection des tâches.
10. Sélectionnez Consignation d'action corrective.
11. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée.
12. Sélectionnez Validation après la sélection et appuyez sur Entrée.
13. Retournez à la ligne de commande.
Fin de la procédure
Procédure
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant de l'unité centrale, de l'unité d'extension ou de l'armoire.
3. Pour préparer l'unité de disques, voir «Régénération des données d'une unité de disques de
remplacement sous AIX», à la page 101.
Figure 42. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
54 Unités de disques
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Remplacement d'une unité de disques sur un système ou une partition
logique Linux sous tension
Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS
(serial-attached Attached SCSI) sur un système ou une unité d'extension alors que le système
d'exploitation ou la partition logique Linux qui contrôle l'emplacement de l'unité de disques est sous
tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche.
Avant de commencer
Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une
unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous tension», à la
page 15. Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de
suivre la procédure suivante.
Unités de disques 55
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Avertissement : Avant de commencer, assurez-vous d'avoir pris connaissance des présentes instructions
et de bien maîtriser la complexité de la procédure utilisant le mode Sous tension. Si le système ou la
partition logique qui contrôle l'unité de disques à remplacer est sur la ligne de commande Linux, vous
pouvez utiliser cette procédure avec l'alimentation sous tension.
Si vous êtes dans l'un des cas suivants, voir «Remplacement d'une unité de disques sur un système ou
une partition logique hors tension», à la page 73 :
v Vous installez une unité de disques qui sera connectée à l'adaptateur SCSI 6203 ou 6204
v Vous ne vous sentez pas à l'aise avec la procédure d'installation sur un système ou une partition
logique sous tension.
56 Unités de disques
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Pourquoi et quand exécuter cette tâche
Préparation du système
Procédure
1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la
page 143.
2. L'unité de disques à remplacer se trouve-t-elle dans une batterie de disques RAID (Redundant Array
of Independent Disk) de niveau 0 ou dans une batterie de disques RAID de niveau 5 ou 10 défaillante
?
v Si oui, vous devez supprimer le module avant de remplacer le disque. Pour plus d'informations,
voir PCI-X SCSI RAID Controller Reference Guide for Linux.
v Si non, votre unité est configurée comme JBOD (Just a Bunch of Disks) ou dans un module de
disques endommagé. Passez à l'étape suivante.
3. Si l'unité de disques à remplacer se trouve dans un module de disques RAID (Redundant Array of
Independent Disk) de niveau 0 ou dans un module de disques RAID de niveau 5 ou 10 défaillant,
vous devez supprimer le module de disques avant de remplacer le disque. Pour plus d'informations,
voir PCI-X SCSI RAID Controller Reference Guide for Linux.
4. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez le volet avant
de l'armoire.
5. Le cas échéant, retirez le panneau avant.
v Si vous avez un panneau avant d'un autre modèle, retirez-le.
Résultats
Pour remplacer une unité de disques sur un système ou une partition logique Linux qui contrôle
l'emplacement de cette unité sous tension, sélectionnez la procédure appropriée dans la liste suivante :
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Remplacement d'une unité de disques utilisant le noyau 2.4 Linux
Les procédures suivantes expliquent comment remplacer une unité de disques à un emplacement contrôlé
par un système ou une partition logique qui exécute le noyau 2.4 Linux.
Préparation du retrait de l'unité de disques :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Work
with Disk Units apparaît.
3. Dans l'écran Work with Disk Units, sélectionnez Analyze log. Appuyez sur Entrée. L'écran Kernel
Messages Log similaire à l'écran suivant apparaît.
Unités de disques 57
4. Dans l'écran Kernel Messages Log, sélectionnez Use vi to view most recent error messages.
Appuyez sur Entrée.
5. Recherchez l'entrée de journal de l'unité à remplacer.
6. Notez les informations d'emplacement de l'unité. Les informations d'emplacement ont le format
P1–I2/Z1–A5.
7. Revenez à l'écran Work with Disk Units.
8. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur Entrée.
9. Notez l'état matériel de l'unité de disques à remplacer.
10. Tapez d (affichage des informations sur l'unité de disques) à côté de l'unité de disques à remplacer.
11. Dans l'écran Display Disk Unit Details, tapez 5 (affichage des informations sur l'unité de disques) à
côté de l'unité de disques à remplacer.
L'écran Disk Unit Hardware Resource Information Details, similaire à l'écran suivant, apparaît.
12. Notez les informations d'emplacement physique.
13. Revenez à l'écran Work with Disk Units.
14. Si l'unité de disques à remplacer n'est pas protégée ou si elle est en cours d'utilisation, transférez les
données de l'unité de disques avant de poursuivre la procédure.
Kernel Messages Log
Select one of the following:
1. Use vi to view most recent error messages
2. Use vi to view Storage error messages
3. Use vi to view all kernel error messages
4. Use vi to view sisconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. Use vi to view Storage boot time messages
Selection:
e=Exit
Figure 43. Kernel Messages Log
Disk Unit Hardware Resource Information Details
Manufacturer . . . . . . . . : IBM
Machine Type and Model. . . : IC35L073UCDY10-0
Firmware Version . . . . . . : 53323846
Model . . . . . . . . . . . : 020
Serial Number. . . . . . . : E6VKUNPD
Capacity . . . . . . . . . . : 73,41 Go
Resource Name. . . . . . . : /dev/sda
Physical location:
Location . . . . . . . : U1.9-P1/Z1-A8
PCI Bus. . . . . . . . . . : 257
PCI Device . . . . . . . . : 1
SCSI Host Number . . . . . : 0
SCSI Channel . . . . . . . : 0
SCSI Id. . . . . . . . . . : 8
SCSI Lun . . . . . . . . . : 0
Press Enter to continue.
e=Exit q=Cancel v=Display Extended VPD
Figure 44. Ecran Disk Unit Hardware Resource Information Details
58 Unités de disques
Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux.
Retrait d'une unité de disques :
Procédure
1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à
Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de
l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes
suivantes.
2. Dans l'écran Work with Disk Units, sélectionnez Work with disk unit recovery. Appuyez sur Entrée.
3. Dans l'écran Work with Disk Unit Recovery, sélectionnez Device Concurrent maintenance. Appuyez
sur Entrée.
Un écran Device Concurrent Maintenance similaire à l'écran suivant apparaît.
4. Entrez les informations d'emplacement physique ou l'emplacement PCI ou SCSI de l'unité de disques
à remplacer. Appuyez sur Entrée.
5. Spécifiez 1 (Remove device) pour que le système effectue l'action.
6. Saisissez un délai en minutes ; soit le temps nécessaire pour retirer l'unité de disques.
Le voyant de maintenance simultanée indique que vous pouvez retirer l'unité de disques. Pour
afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle approprié
dans la liste suivante :
v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers
d'unité SCSI 31D/24 et 31T/24», à la page 98
7. Appuyez sur Entrée.
Un écran Device Concurrent Maintenance Action similaire à l'écran suivant apparaît.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either Physical Location or PCI/SCSI location.
Physical Location: U0.1-P1-I4/Z1-A8
OU
PCI/SCSI . . . PCI Bus: PCI Device:
SCSI Channel: SCSI Id: SCSI Lun:
Specify action as 1=Remove device 2=Install device
Action à exécuter. . . . . . . . . : 1
Enter a time value between 01 and 19.
Time needed in minutes . . . . . . . . . : 1
Or leave blank and press Enter to cancel
Figure 45. Exemple d'écran Device Concurrent Maintenance
Unités de disques 59
8. Entrez c pour confirmer que vous voulez retirer l'unité de disques dans l'emplacement défini.
Un écran Device Concurrent Maintenance Action In Progress similaire à l'écran suivant apparaît.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
9. Appuyez sur la poignée de l'unité de disques et tirez-la vers vous pour retirer l'unité de disques
(voir figure 48, à la page 61). Si la poignée n'est pas complètement sortie, vous ne pouvez pas retirer
l'unité de disques de l'unité centrale ou de l'unité d'extension.
Device Concurrent Maintenance Action
Removal of device:
Location: U0.1-P1-I4/Z1-A8
You have 1 minute to perform the operation
During this time, your system can seem unresponsive
c=Confirm q=Cancel
Figure 46. Exemple d'écran Device Concurrent Maintenance Action
Device Concurrent Maintenance Action In Progress
Please remove the device located in:
Location: U0.1-P1-I4/Z1-A8
Figure 47. Exemple d'écran Device Concurrent Maintenance Action In Progress
60 Unités de disques
Figure 48. Retrait d'une unité de disques d'un modèle ou 11D/20
Figure 49. Retrait d'une unité de disques à encombrement réduit
Unités de disques 61
10. Retirez l'unité de disques en la maintenant par-dessous. Ne la tenez pas par la poignée.
Remarque : Le voyant de maintenance simultanée de l'emplacement s'arrête de clignoter et reste
éteint lorsque vous retirez l'unité de disques.
11. Appuyez sur Entrée.
12. Votre unité de disques est-elle déjà équipée d'un panneau frontal ?
v Non : continuez la procédure à ce stade.
v Oui : passez à l'étape 5, à la page 64 de la procédure de remplacement.
Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir
figure suivante).
Figure 50. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
62 Unités de disques
Remplacement d'une unité de disques :
Procédure
1. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
2. Retirez l'unité de disques de son emballage protecteur.
3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de
disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques
dans l'unité centrale ou l'unité d'extension.
4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme
suit pour l'installer :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
Figure 51. Retrait du panneau frontal d'une unité de disques
Unités de disques 63
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité
de disques.
5. Sélectionnez Device Concurrent maintenance dans l'écran Work with Disk Unit Recovery. Appuyez
sur Entrée.
6. Entrez les informations d'emplacement physique ou l'emplacement PCI/SCSI de l'unité de disques
que vous avez retirée. Appuyez sur Entrée.
7. Définissez l'action 2 (installation d'unité).
8. Saisissez un délai en minutes ; soit le temps nécessaire pour installer la nouvelle unité de disques.
Le voyant de maintenance simultanée de l'emplacement indique que vous pouvez installer l'unité de
disques.
9. Appuyez sur Entrée. Un écran Device Concurrent Maintenance Action similaire à l'écran suivant
apparaît.
Figure 52. Fixation d'un panneau frontal à une unité de disques
64 Unités de disques
10. Entrez c pour confirmer que vous voulez installer l'unité de disques dans l'emplacement défini.
Un écran Device Concurrent Maintenance Action In Progress similaire à l'écran suivant apparaît.
11. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
12. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
bloquez-la dans son emplacement en poussant la poignée (voir figure 55).
Device Concurrent Maintenance Action
Installation of device:
Location: U0.1-P1-I4/Z1-A8
You have 1 minute to perform the operation
During this time, your system can seem unresponsive
c=Confirm q=Cancel
Figure 53. Exemple d'écran Device Concurrent Action Maintenance
Device Concurrent Maintenance Action In Progress
Please insert the device located in:
Location: U0.1-P1-I4/Z1-A8
Figure 54. Exemple d'écran Device Concurrent Maintenance Action In Progress
Figure 55. Installation d'une unité de disques sur un modèle ou 11D/20
Unités de disques 65
Le voyant de maintenance simultanée de l'emplacement ne clignote plus et reste allumé lorsque
l'unité de disques est correctement installée.
Figure 56. Installation d'une unité de disques à encombrement réduit
Figure 57. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
66 Unités de disques
Suivez la procédure :
Procédure
1. Régénérez les données sur la nouvelle unité de disques. Pour plus d'informations, voir «Régénération
des données d'une unité de disques de remplacement sous Linux», à la page 101.
2. Remettez en place le panneau avant.
3. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire.
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Remplacement d'une unité de disques utilisant le noyau 2.6 Linux
Les procédures suivantes expliquent comment remplacer une unité de disques à un emplacement contrôlé
par un système ou une partition logique qui exécute le noyau 2.6 Linux.
Préparation du retrait de l'unité de disques :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée.
L'écran Configuration Utility apparaît.
3. Sélectionnez Analyze log. Appuyez sur Entrée. L'écran Kernel Messages Log apparaît.
4. Sélectionnez View most recent ipr error messages dans l'écran the Kernel Messages Log. Appuyez
sur Entrée.
5. Recherchez l'entrée de journal de l'unité de disques à remplacer.
6. Notez les informations d'emplacement de l'unité.
Remarque : Les informations d'emplacement ont le format 2:0:8:0. Dans cet exemple, 2 correspond
au numéro d'hôte SCSI, 0 au bus SCSI, 8 à l'ID cible SCSI et 0 à l'unité logique.
7. Retournez à la ligne de commande.
8. Entrez la commande suivante :
Kernel Messages Log
Select one of the following:
1. View most recent ipr error messages
2. View ipr error messages
3. View all kernel error messages
4. View iprconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. View ipr boot time messages
Selection:
e=Exit
Figure 58. Kernel Messages Log
Unités de disques 67
ls -ld /sys/class/scsi_host/host#/device
Où # correspond au numéro d'hôte SCSI. Appuyez sur Entrée.
9. Notez les informations d'emplacement PCI.
Remarque : Les informations d'emplacement PCI ont le format 61:01:0:2.
10. Tapez iprconfig sur la ligne de commande et appuyez sur Entrée.
L'écran Configuration Utility apparaît.
11. Sélectionnez Display hardware status dans l'écran Configuration Utility. Appuyez sur Entrée.
L'écran Display Hardware Status apparaît.
12. Recherchez l'unité de disques dans l'emplacement PCI que vous avez noté. L'unité de disques peut
avoir l'état Incident.
13. Si l'unité de disques à remplacer n'est pas protégée ou si elle est en cours d'utilisation, transférez les
données de l'unité de disques avant de poursuivre la procédure.
Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux.
14. Tapez l'option 1 (affichage des informations de ressource matérielle) à côté de l'unité de disques à
remplacer. Appuyez sur Entrée.
Un écran Disk Hardware Resource Information Details similaire à l'exemple suivant apparaît.
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 59. Exemple d'écran Display Hardware Status
68 Unités de disques
15. Notez les informations d'emplacement physique.
16. Revenez à l'écran Configuration Utility.
Retrait d'une unité de disques :
Procédure
1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à
Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de
l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes
suivantes.
2. Dans l'écran Configuration Utility, sélectionnez Work with disk unit recovery. Appuyez sur Entrée.
3. Dans l'écran Work with Disk Unit Recovery, sélectionnez Concurrent remove device. Appuyez sur
Entrée. Un écran Concurrent Device Remove similaire à l'écran suivant apparaît.
4. Tapez l'option 1 (sélection) à côté de l'unité de disques à remplacer. Appuyez sur Entrée.
5. L'écran Verify Device Concurrent Remove apparaît. Le voyant de maintenance simultané
correspondant à l'emplacement choisi s'allume. Pour afficher l'emplacement des voyants de
maintenance simultanée, sélectionnez le modèle approprié dans la liste suivante :
Disk Unit Hardware Resource Information Details
Manufacturer . . . . . . . . . . . . . . : IBM
Product ID . . . . . . . . . . . . . . . : ST336607LC
Firmware Version . . . . . . . . . . . . : 43353048 (C50H)
Serial Number. . . . . . . . . . . . . . : 00006719
Capacity . . . . . . . . . . . . . . . . : 36.40 GB
Resource Name. . . . . . . . . . . . . . : /dev/sdd
Physical location
PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0
SCSI Host Number . . . . . . . . . . . . : 2
SCSI Channel . . . . . . . . . . . . . . : 0
SCSI Id. . . . . . . . . . . . . . . . . : 4
SCSI Lun . . . . . . . . . . . . . . . . : 0
Extended Details
FRU Number . . . . . . . . . . . . . . . : 00P2676
More...
Press Enter to Continue
e=Exit q=Cancel f=PageDn b=PageUp
Figure 60. Exemple d'écran Disk Hardware Resource Information
Concurrent Device Remove
Choose a single location for remove operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
0000:41:01.0.1/0:5: Empty
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
0000:41:01.0.1/1:8: Empty
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
0001:61:01.0.2/0:8: Empty
sdh 0001:61:01.0.2/0:9:0 Physical Disk Active
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
Figure 61. Exemple d'écran Concurrent Device Remove
Unités de disques 69
v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers
d'unité SCSI 31D/24 et 31T/24», à la page 98
6. Dans l'écran Verify Device Concurrent Remove, vérifiez que l'unité de disques correspond bien à
celle à remplacer et appuyez sur Entrée. Le voyant d'identification de l'unité de disques s'allume.
Remarque : Vérifiez que l'unité de disques n'est pas en cours d'utilisation afin de ne pas perdre des
données.
7. L'écran Device Concurrent Remove apparaît.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Appuyez sur la poignée de l'unité de disques et tirez-la vers vous pour retirer l'unité de disques
(voir figure 62). Si la poignée n'est pas complètement sortie, vous ne pouvez pas retirer l'unité de
disques de l'unité centrale ou de l'unité d'extension.
9. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de
l'unité d'extension. Ne la tenez pas par la poignée.
10. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme
suit :
Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir
figure suivante).
Figure 62. Retrait d'une unité de disques d'un modèle ou 11D/20
70 Unités de disques
11. Appuyez sur Entrée dans l'écran Configuration Utility pour indiquer que vous avez retiré l'unité de
disques.
Remplacement d'une unité de disques :
Procédure
1. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
2. Retirez l'unité de disques de son emballage protecteur.
3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de
disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques
dans l'unité centrale ou l'unité d'extension.
4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme
suit pour l'installer :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure 64, à la page 72).
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité
de disques.
Figure 63. Retrait du panneau frontal d'une unité de disques
Unités de disques 71
5. Dans l'écran Configuration Utility, sélectionnez Work with disk unit recovery. Appuyez sur Entrée.
6. Dans l'écran Work with Disk Unit Recovery, sélectionnez Concurrent add device. Appuyez sur
Entrée.
Un écran Concurrent Device Add similaire à l'exemple suivant apparaît.
7. Tapez l'option 1 (sélection) à côté de l'emplacement de l'unité de disques que vous avez retirée.
L'écran Verify Device Concurrent Add apparaît.
8. Appuyez sur Entrée dans l'écran Verify Device Concurrent Add. L'écran Complete Device
Concurrent Add apparaît.
9. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
10. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
bloquez l'unité dans son emplacement en poussant la poignée (voir figure 66, à la page 73).
Figure 64. Fixation d'un panneau frontal à une unité de disques
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 65. Exemple d'écran Concurrent Device Add
72 Unités de disques
11. Appuyez sur Entrée dans l'écran Complete Device Concurrent Add pour indiquer que l'unité de
disques est installée.
Suivez la procédure :
Procédure
1. Régénérez les données sur la nouvelle unité de disques. Pour plus d'informations, voir «Régénération
des données d'une unité de disques de remplacement sous Linux», à la page 101.
2. Remettez en place le panneau avant.
3. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire.
Concepts associés
Retrait et remplacement de portes et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Remplacement d'une unité de disques sur un système ou une partition
logique hors tension
Il peut être nécessaire d'installer une unité de disques SCSI (Small Computer System Interface) ou SAS
(serial-attached Attached SCSI) sur un système ou une partition logique contrôlant l'emplacement de
l'unité de disques une fois l'alimentation coupée. Exécutez la procédure décrite dans cette section pour
effectuer cette tâche.
Avant de commencer
Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une
unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37.
Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de suivre la
procédure suivante.
Figure 66. Installation d'une unité de disques sur un modèle ou 11D/20
Unités de disques 73
Pourquoi et quand exécuter cette tâche
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
Préparation du système
Procédure
1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la
page 143.
2. Si vous ne pouvez pas démarrer le système ou la partition logique qui contrôle l'emplacement de
l'unité de disques ou que le système ou la partition logique est hors tension, voir étape 6, à la page
75.
3. Recherchez l'unité de disques à remplacer et notez les informations d'emplacement. Pour plus
d'informations, voir «Identification d'un composant défaillant», à la page 146.
74 Unités de disques
4. Préparez-vous à retirer l'unité de disques de l'unité centrale ou de l'unité d'extension, comme suit :
v Si le système ou la partition logique qui contrôle l'emplacement de l'unité de disques exécute AIX,
exécutez la procédure de la section «Préparation du retrait d'une unité de disques d'une unité
centrale ou d'extension contrôlée par le système d'exploitation AIX», à la page 100.
v Si le système ou la partition logique qui contrôle l'emplacement de l'unité de disques exécute Linux
et que l'unité de disques à remplacer est protégée ou en cours d'utilisation, transférez les données
de l'unité de disques avant de poursuivre la procédure. Pour plus d'informations, voir PCI-X SCSI
RAID Controller Reference Guide for Linux.
5. Arrêtez le système ou la partition qui contrôle l'emplacement de l'unité de disques. Pour obtenir des
instructions, voir «Arrêt d'un système ou d'une partition logique», à la page 153.
6. Si vous avez arrêté la partition logique, ne déconnectez pas la source d'alimentation. Si vous avez
arrêté le système, débranchez-le de la source d'alimentation.
Remarque : Le système est peut-être équipé d'un second bloc d'alimentation. Avant de continuer la
procédure, vérifiez que toutes les sources d'alimentation ont été débranchées du système.
(L003)
ou
7. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez le volet avant de l'armoire. Pour
plus d'informations, sélectionnez la procédure appropriée dans la liste suivante :
v «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
8. Retirez le panneau avant.
Retrait d'une unité de disques
Procédure
1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à
Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de
l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes
suivantes.
Unités de disques 75
2. Déverrouillez la poignée de l'unité de disques en appuyant dessus et en la tirant vers vous (voir
figure 67).
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
Figure 67. Retrait d'une unité de disques sur un modèle ou 11D/20
76 Unités de disques
3. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de
l'unité d'extension. Ne la tenez pas par la poignée.
4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme
suit :
Dévissez puis retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir
figure suivante).
Important : Si le panneau frontal de l'unité de disques n'est pas doté d'une vis moletée, il s'agit d'une
unité de disques SAS non amovible. L'unité de remplacement doit alors être pourvue d'un panneau
frontal fixé.
Figure 68. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
Unités de disques 77
Remplacement d'une unité de disques
Procédure
1. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
2. Retirez l'unité de disques de son emballage protecteur.
3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de
disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques
dans l'unité centrale ou l'unité d'extension.
4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme
suit pour l'installer :
a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante).
b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de
disques.
Figure 69. Retrait du panneau frontal d'une unité de disques
78 Unités de disques
5. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale ou de l'unité d'extension. Ne la tenez pas par la poignée.
6. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension.
Remarque : Lors de l'installation d'une unité de disques, il est important de s'assurer que l'unité est
complètement et parfaitement insérée dans le système.
7. Poussez complètement la poignée de l'unité de disques (voir figure 71, à la page 80).
Figure 70. Fixation d'un panneau frontal à une unité de disques
Unités de disques 79
Fin de la procédure
Procédure
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire.
3. Rebranchez tous les câbles d'alimentation système.
4. Reconnectez le système à l'alimentation.
Figure 71. Installation d'une unité de disques sur un modèle ou 11D/20
Figure 72. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
80 Unités de disques
5. Démarrez le système ou la partition logique. Pour obtenir des instructions, voir «Démarrage du
système ou de la partition logique», à la page 151.
6. Si le système ou la partition logique qui contrôle l'unité de disques remplacée fonctionne sous AIX,
vérifiez que l'unité fonctionne (voir «Vérification d'un dispositif installé ou d'un composant remplacé
sur un système ou une partition logique AIX», à la page 195. Si vous ne pouvez pas effectuer de
diagnostics en ligne, par exemple, si vous avez remplacé l'unité de disques du groupe de volumes
racine (rootvg), réalisez des diagnostics autonomes (voir «Vérification d'un composant installé via les
programmes de diagnostic autonomes», à la page 198.
7. Pour régénérer les données sur la nouvelle unité de disques, sélectionnez le système d'exploitation du
système ou de la partition logique qui contrôle l'unité de disques :
v «Régénération des données d'une unité de disques de remplacement sous AIX», à la page 101
v «Régénération des données d'une unité de disques de remplacement sous Linux», à la page 101
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Remplacement d'une unité de disques SCSI
Découvrez comment remplacer une unité de disques SCSI (Small Computer System Interface) dans une
unité centrale ou d'extension. La procédure décrite dans cette information permet d'effectuer cette tâche.
Retrait d'une unité de disques
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Appuyez sur Entrée pour continuer.
4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches.
5. Dans l'écran de sélection des tâches, sélectionnez Remplacement à chaud.
6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
7. Sélectionnez Identification d'une unité connectée à une unité SES.
8. Sélectionnez l'unité de disques à remplacer et appuyez sur Entrée.
L'emplacement de l'unité de disques passe en mode Identification, indiqué par le voyant clignotant
de maintenance simultanée. Pour afficher l'emplacement des voyants de maintenance simultanée,
consultez «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90.
9. Revenez à l'écran Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
10. Dans cet écran, sélectionnez Remplacement/Retrait d'une unité connectée à une unité SES. La liste
de tous les emplacements contenant une unité de disques pouvant être retirée s'affiche.
11. Sélectionnez l'emplacement qui contient l'unité de disques à retirer. L'emplacement de l'unité de
disques passe en mode Retrait et son alimentation est coupée. Le voyant de maintenance simultanée
de l'emplacement commence à clignoter rapidement.
12. Déverrouillez l'unité de disques. Déverrouillez la poignée de l'unité de disques en appuyant dessus
et en la tirant vers vous (voir figures suivantes).
Unités de disques 81
13. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de
l'unité d'extension. Ne la tenez pas par la poignée.
Remarque : Le voyant de maintenance simultanée de l'emplacement s'éteint lorsque vous retirez
l'unité de disques.
14. Dévissez et retirez la vis moletée B pour dégager le panneau frontal C de l'unité de disques.
Figure 73. Retrait d'une unité de disques sur un modèle ou 11D/20
Figure 74. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
82 Unités de disques
Remplacement d'une unité de disques
Procédure
1. Prenez l'emballage contenant la nouvelle unité de disques.
Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution.
2. Retirez l'unité de disques de son emballage protecteur.
3. Déverrouillez la poignée de l'unité de disques de remplacement en la tirant vers vous. Si la poignée
n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans le système.
4. Fixez le panneau frontal de l'unité de disques A sur l'unité de disques de remplacement (voir figure
suivante).
Figure 75. Retrait du panneau frontal d'une unité de disques
Unités de disques 83
5. Appuyez sur la vis moletée B, puis serrez la vis pour maintenir le panneau frontal A en place sur
l'unité de disques.
6. Pour installer l'unité de disques, procédez comme suit :
a. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité
centrale. Ne la tenez pas par la poignée.
b. Faites glisser complètement l'unité de disques dans l'unité centrale.
Remarque : Lors de l'installation d'une unité de disques, il est important de s'assurer que l'unité
est complètement et parfaitement insérée dans le système.
c. Poussez complètement la poignée de l'unité de disques (voir figure 77, à la page 85).
Figure 76. Fixation d'un panneau frontal à une unité de disques
84 Unités de disques
7. Appuyez sur Entrée.
8. Sélectionnez le menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud.
9. Sélectionnez Configuration des unités ajoutées/remplacées. Appuyez sur Entrée.
10. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée.
L'unité de disques est désormais configurée.
11. Passez à l'écran de sélection des tâches.
Figure 77. Installation d'une unité de disques sur un modèle ou 11D/20
Figure 78. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803
Unités de disques 85
12. Sélectionnez Consignation d'action corrective.
13. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée.
14. Sélectionnez Validation après la sélection et appuyez sur Entrée.
15. Retournez à la ligne de commande.
Suivez la procédure
Procédure
1. Remettez en place le panneau avant.
2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire.
3. Vérifiez si l'unité de disques fonctionne. Pour obtenir des instructions, consultez la rubrique
«Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition
logique AIX», à la page 195. Si vous ne pouvez pas effectuer de diagnostics AIX en ligne, par
exemple, si vous avez remplacé l'unité de disques du groupe de volumes racine (rootvg), réalisez des
diagnostics autonomes (voir «Vérification d'un composant installé via les programmes de diagnostic
autonomes», à la page 198.
4. Pour préparer l'unité de disques, voir «Régénération des données d'une unité de disques de
remplacement sous AIX», à la page 101.
Concepts associés
Retrait et remplacement de volets et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Retrait ou installation d'un obturateur d'unité de disques
Il se peut que vous deviez retirer ou installer un obturateur d'unité de disques SCSI (Small Computer
System Interface) dans le serveur ou l'unité d'extension. La présente section explique comment effectuer
ces tâches.
Remarque : Installez une autre unité de disques ou un obturateur d'unité de disques dans les
emplacements des unités de disques. Le fait de remplir l'emplacement d'unité de disques permet
d'assurer une ventilation correcte pour le refroidissement et de garantir une conformité EMI optimale.
Retrait d'un obturateur d'unité de disques
Informations sur le retrait d'un obturateur d'unité de disques d'un serveur ou d'une unité d'extension. La
procédure décrite dans cette information permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Remarque : Certaines figures accompagnant cette procédure peuvent ne pas correspondre exactement à
votre unité centrale ou à votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Procédure
1. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez l'armoire. Pour plus
d'informations, voir «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156.
2. Retirez le panneau avant. Pour plus d'informations sur le retrait du capot, voir «Carters système», à la
page 157.Si vous avez un panneau avant d'un autre modèle, retirez-le.
3. Identifiez l'obturateur d'unité de disques à retirer.
86 Unités de disques
4. Tirez l'obturateur d'unité de disques de son emplacement. Tirez l'obturateur d'unité de disques pour
le détacher de l'emplacement approprié (voir figure 79 et figure 80, à la page 88).
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
Figure 79. Retrait d'un obturateur d'unité de disques d'un modèle 33E/8B ou 36E/8C
Unités de disques 87
Installation d'un obturateur d'unité de disques
Il se peut que vous deviez installer un obturateur d'unité de disques dans le serveur ou l'unité
d'extension. La procédure de cette section permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Remarque : Certaines figures accompagnant cette procédure peuvent ne pas correspondre exactement à
votre unité centrale ou à votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Procédure
1. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez l'armoire. Pour plus
d'informations, voir «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156.
2. Retirez le carter avant. Pour plus d'informations sur le retrait du capot, voir «Carters système», à la
page 157. Si vous avez un panneau avant d'un autre modèle, retirez-le.
3. Installez l'obturateur d'unité de disques dans l'unité centrale ou l'unité d'extension (voir figures
suivantes).
Figure 80. Retrait d'un obturateur d'unité de disques d'un modèle 17M/MB ou 79M/HB
88 Unités de disques
4. Réinstallez le panneau avant.
5. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire.
Figure 81. Installation d'un obturateur d'unité de disques d'un modèle 33E/8B ou 36E/8C
Figure 82. Installation d'un obturateur d'unité de disques d'un modèle 17M/MB ou 79M/HB
Unités de disques 89
Concepts associés
Retrait et remplacement de portes et capots
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Référence associée
«Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Emplacements des unités de disque et voyants de maintenance
simultanée
Cette rubrique présente les emplacements des unités de disque et les voyants de maintenance simultanée
sur différents modèles.
Repérez votre unité centrale ou votre unité d'extension pour identifier les emplacements des unités et des
voyants de maintenance simultanée.
Emplacements de l'adaptateur PCIe RAID et SSD SAS
Informations sur les emplacements de l'adaptateur PCIe RAID et SSD SAS sur l'emplacement des voyants
de maintenance simultanée sur l'adaptateur PCIe RAID et SSD SAS
La figure figure 83 montre les emplacements de l'adaptateur PCIe RAID et SSD SAS.
Emplacements des unités de disques et voyants de maintenance
simultanée pour les systèmes 02E/4B et 05F/6B
Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée
sur le système.
Figure 83. Emplacements de l'adaptateur PCIe RAID et SSD SAS
90 Unités de disques
Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance
simultanée pour les modèles 02E/4B ou 05F/6B. Les voyants de maintenance simultanée sont situés
au-dessus de la poignée de verrouillage des unités de disques SAS.
Figure 84. Emplacements des unités de disques sur le modèle d'armoire
Figure 85. Emplacements des unités de disques sur le modèle de tour
Unités de disques 91
Emplacements des unités de disque et voyants de maintenance
simultanée pour 31E/2B
Informations sur les emplacements des unités de disques et des voyants de maintenance simultanée sur
le système.
Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance
simultanée pour le modèle 31E/2B. Les voyants de maintenance simultanée sont situés au-dessus de la
Figure 86. Emplacement des voyants de maintenance simultanée sur le modèle d'armoire
Figure 87. Emplacement des voyants de maintenance simultanée sur le modèle de tour
92 Unités de disques
poignée de verrouillage des unités de disques SAS.
Figure 88. Emplacements des unités de disque sur un modèle de base avec six baies SFF
Figure 89. Emplacements des unités de disque sur un modèle avec trois baies SFF (facultatif)
Figure 90. Emplacements des voyants de maintenance simultanée sur un modèle de base avec six baies SFF
Unités de disques 93
Emplacements des unités de disques et voyants de maintenance
simultanée sur les modèles 33E/8B et 36E/8C
Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée
sur le système.
Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance
simultanée sur les modèles 33E/8B et 36E/8C. Les voyants de maintenance simultanée sont situés
au-dessus de la poignée de verrouillage des unités de disques SAS.
Figure 91. Emplacements des voyants de maintenance simultanée sur un modèle de base avec trois baies SFF
(facultatif)
94 Unités de disques
Emplacements des unités de disques et voyants de maintenance
simultanée pour les modèles 17M/MB et 79M/HB
Cette rubrique décrit les emplacements des unités de disques dans le système.
Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance
simultanée pour les modèles 17M/MB et 79M/HB.
Figure 92. Emplacements des unités de disques
Figure 93. Emplacement des voyants de maintenance simultanée
Unités de disques 95
Emplacements des unités de disques et voyants de maintenance
simultanée sur le boîtier du disque 58/86
Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée
sur le boîtier du disque.
La figure suivante illustre les emplacements des unités de disques sur le modèle 58/86.
Figure 94. Emplacements des unités de disques
Figure 95. Emplacement des voyants de maintenance simultanée
96 Unités de disques
La figure suivante représente les emplacements des voyants de maintenance simultanée des unités de
disques sur le boîtier d'unité de disques 58/86.
Figure 96. Emplacements des unités de disques pour le boîtier du modèle 58/86
Figure 97. Vue de face des voyants de maintenance simultanée sur le boîtier d'unité de disques 58/86
Unités de disques 97
Emplacements des unités de disques et voyants de maintenance
simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24
Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée
sur le boîtier de l'unité de disque.
La disponibilité des emplacements d'unités de disques dépend du nombre de cartes d'interface installées
dans le boîtier d'unités de disques SCSI. Si l'emplacement disponible suivant d'unité de disques
correspond au premier emplacement dans un boîtier, tel que D21, D26 ou D46, vérifiez pour le premier
contrôleur que le boîtier est connecté à un port de bus SCSI. L'extrémité du câble du contrôleur
mentionne clairement le boîtier d'unités de disques auquel il se connecte. Si le boîtier n'est pas connecté à
un contrôleur d'unité de disques, installez l'unité de disques dans un autre boîtier ou connectez le boîtier
à un port de bus SCSI sur un contrôleur. Vous devrez peut-être ajouter un contrôleur d'unité de disques
pour pouvoir ajouter des unités de disques à l'unité d'extension.
Les figures suivantes illustrent les emplacements des unités de disques à l'avant et à l'arrière des boîtiers
d'unités de disques SCSI 31D/24 et 31T/24.
Figure 98. Vue de face des boîtiers d'unités de disques SCSI 31D/24 et 31T/24
98 Unités de disques
Les figures suivantes illustrent l'emplacement des voyants de maintenance simultanée A du boîtier
d'unités de disques SCSI 31D/24 et 31T/24.
Figure 99. Vue arrière des boîtiers d'unités de disques SCSI 31D/24 et 31T/24
Figure 100. Vue de face des voyants de maintenance simultanée sur les boîtiers d'unités de disques 31D/24 et 31T/24
Figure 101. Vue arrière des voyants de maintenance simultanée des boîtiers d'unités de disques SCSI 31D/24 et
31T/24
Unités de disques 99
Emplacements des unités de disques et voyants de maintenance
simultanée sur une unité d'extension 58/02
Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée
sur l'unité d'extension.
La figure suivante illustre les emplacements des unités de disques sur l'unité d'extension 58/02.
Les voyants de maintenance simultanée se trouvent au bas de la poignée de verrouillage de chaque unité
de disques.
Information associée
Câblage du sous-système du disque 5802
Informations connexes
Cette section apporte d'autres informations et instructions associées aux unités de disques dont vous
pourriez avoir besoin.
Préparation d'une unité de disques dans un système ou une partition
logique AIX
Configurez la nouvelle unité de disques en fonction de votre environnement.
Pourquoi et quand exécuter cette tâche
Préparation du retrait d'une unité de disques d'une unité centrale ou
d'extension contrôlée par le système d'exploitation AIX
Les procédures suivantes expliquent comment retirer correctement une unité de disques.
Pourquoi et quand exécuter cette tâche
Avant de retirer une unité de disques sur une unité centrale ou une unité d'extension contrôlée par le
système d'exploitation AIX, vérifiez que toutes les données sur cette unité ou sur le module où elle figure
sont sauvegardées et que l'unité se trouve à l'état Defined.
Figure 102. Emplacements des unités de disques
100 Unités de disques
Régénération des données d'une unité de disques de remplacement
sous AIX
Pour régénérer des données sur la nouvelle unité de disques, consultez les informations du contrôleur
auquel l'unité de disques est connectée.
Pourquoi et quand exécuter cette tâche
v Si l'unité de disques est connectée à un contrôleur RAID PCI-X SCSI, voir PCI-X SCSI RAID Controller
Reference Guide for AIX.
v Si l'unité de disques est connectée à une carte PCI SCSI RAID (code dispositif 2498), voir PCI 4–Channel
Ultra3 SCSI RAID Adapter Reference Guide.
Pour plus d'informations, voir AIX System Management Guide: Operating System and Devices.
Régénération des données d'une unité de disques de remplacement
sous Linux
Utilisez cette procédure pour régénérer les données de l'unité de disque de remplacement.
Pourquoi et quand exécuter cette tâche
Pour reconstruire les données sur l'unité de disque de remplacement, effectuez les étapes ci-après.
Pour une unité de disques non protégée
Si l'unité de disques que vous remplacez se trouve dans une grappe de disques RAID de niveau 0 ou
dans une grappe de disques RAID de niveau 5 ou 10 défaillant, procédez comme suit :
Procédure
1. Recréez le module de disques.
2. Recréez les systèmes de fichiers dans le module de disques.
3. Copiez les données vers le module de disques restaurés depuis le support de sauvegarde.
Résultats
Pour plus d'informations sur ces tâches, voir PCI-X SCSI RAID Controller Reference Guide for Linux. .
Pour une unité de disques dans une grappe de disques RAID de niveau 5 ou 10, sélectionnez la
procédure appropriée :
Régénération de données sur une unité de disques utilisant le noyau 2.4 Linux
Effectuez les étapes ci-après pour régénérer les données.
Pourquoi et quand exécuter cette tâche
Si un disque de secours était disponible et protégeait le module de disques pendant toute la durée de la
défaillance de l'unité de disques, vous pouvez configurer l'unité de disques que vous venez d'installer
comme disque de secours. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for
Linux.
.
Si la régénération des données sur un disque de secours n'a pas été initialisée par le contrôleur, vous
devez la lancer sur la nouvelle unité de disques installée. Pour ce faire, procédez comme suit :
Unités de disques 101
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Work
with Disk Units apparaît.
3. Dans l'écran Work with Disk Units, sélectionnez Work with disk unit recovery. L'écran Work with
Disk Unit Recovery apparaît.
4. Dans l'écran Work with Disk Unit Recovery, sélectionnez Rebuild disk unit data.
Un écran Rebuild Disk Unit Data similaire à la figure suivante apparaît.
5. Tapez l'option 1 (régénération) à côté de l'unité de disques à reconstruire et appuyez sur Entrée.
Remarque : La régénération sur une unité de disques remplace les données qui figure sur l'unité.
Un écran Confirm Rebuild Disk Unit Data similaire à la figure suivante apparaît.
6. Pour confirmer que vous voulez régénérer les données sur l'unité de disques, appuyez sur Entrée.
Le message annonçant que la régénération a commencé apparaît au bas de l'écran Work with Disk
Unit Recovery. L'opération de régénération peut prendre plusieurs minutes.
Régénération de données sur une unité de disques utilisant le noyau 2.6 Linux
Les procédures suivantes expliquent comment régénérer des données sur une unité de disques.
Rebuild Disk Unit Data
Select the units to be rebuilt
Type choice, press Enter.
1=Rebuild
Vendor Product Serial PCI PCI SCSI SCSI SCSI
Option ID ID Number Bus Dev Bus ID Lun
IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0
e=Exit q=Cancel f=PageDn b=PageUp
Figure 103. Exemple d'écran Rebuild Disk Unit Data
Confirm Rebuild Disk Unit Data
Rebuilding the disk unit data could take several minutes for
each unit selected.
Press Enter to confirm having the data rebuilt.
Press q=Cancel to return and change your choice.
Vendor Product Serial PCI PCI SCSI SCSI SCSI
Option ID ID Number Bus Dev Bus ID Lun
1 IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0
q=Cancel f=PageDn b=PageUp
Figure 104. Exemple d'écran Confirm Rebuild Disk Unit Data
102 Unités de disques
Pourquoi et quand exécuter cette tâche
Si un disque de secours était disponible et protégeait le module de disques pendant toute la durée de la
défaillance de l'unité de disques, vous pouvez configurer l'unité de disques que vous venez d'installer
comme disque de secours. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for
Linux.
Si la régénération des données sur un disque de secours n'a pas été initialisée par le contrôleur, vous
devez la lancer sur la nouvelle unité de disques installée. Pour ce faire, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran
Configuration Utility apparaît.
3. Sélectionnez Work with disk unit recovery sur l'écran Configuration Utility. Appuyez sur Entrée.
L'écran Work with Disk Unit Recovery apparaît.
4. Dans l'écran Work with Disk Unit Recovery, sélectionnez Rebuild disk unit data.
Un écran Rebuild Disk Unit Data similaire à la figure suivante apparaît.
5. Tapez l'option 1 (régénération) à côté de l'unité de disques à reconstruire et appuyez sur Entrée.
Remarque : La régénération sur une unité de disques remplace les données qui figure sur l'unité.
Un écran Confirm Rebuild Disk Unit Data similaire à la figure suivante apparaît.
Rebuild Disk Unit Data
Select the disks to be rebuilt
Type choice, press Enter.
1=Rebuild
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
e=Exit q=Cancel t=Toggle
Figure 105. Exemple d'écran Rebuild Disk Unit Data
Confirm Rebuild Disk Unit Data
Rebuilding the disk unit data might take several minutes for each disk
selected.
Press Enter to confirm having the data rebuilt.
q=Cancel to return and change your choice.
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ---------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
q=Cancel t=Toggle
Figure 106. Exemple d'écran Confirm Rebuild Disk Unit Data
Unités de disques 103
6. Pour confirmer que vous voulez régénérer les données sur l'unité de disques, appuyez sur Entrée.
Le message annonçant que la régénération a commencé apparaît au bas de l'écran Work with Disk
Unit Recovery. L'opération de régénération peut prendre plusieurs minutes.
Partage d'unités de disques internes sur le modèle 02E/4B ou 05F/6B
Découvrez comment diviser des disques internes installés dans le boîtier de l'unité centrale en groupes
pouvant être gérés séparément. Utilisez les informations suivantes pour exécuter la tâche.
Avant de commencer
Le fond de panier DASD prend en charge le mode de division DASD. La carte d'activation de division
DASD FC 5631 est utilisée pour mettre le fond de panier DASD en mode de division DASD dans lequel 3
SFF DASD sur la gauche (vue de face) sont affectés à la carte d'activation de division DASD (CCIN 2BE0)
et 3 SFF DASD sur la droite sont affectés au contrôleur de stockage intégré (CCIN 572C). Un adaptateur
SAS PCIe ou PCIX (tel que FC 5901, FC5904 ou FC5912) peut accéder aux 3 SFF DASD via un câble SAS
AI externe, comme indiqué dans figure 107, à la page 105.
Important : Cette configuration n'est pas prise en charge avec le code dispositif de fond de panier 5630.
Utilisez les informations suivantes une fois l'adaptateur installé. Pour plus d'informations, voir
Adaptateurs PCI.
Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
Remarque : Un câble interne FC 3670 pour modèle 02E/4B ou 05F/6B doit être installé pour activer la
configuration DASD partagé. Pour obtenir des instructions d'installation, voir «Installation du port SAS
externe sur un modèle 02E/4B ou 05F/6B», à la page 109.
Pourquoi et quand exécuter cette tâche
Les tâches suivantes permettent de diviser des disques internes installés dans le boîtier de l'unité centrale
en groupes pouvant être gérés séparément.
Procédure
1. Arrêtez et mettez hors tension l'alimentation au système, voir Arrêt du système ou de la partition
logique.
2. Câblez le système en connectant le câble SAS à la carte, puis à la connexion SAS interne (voir
figure 107, à la page 105).
104 Unités de disques
3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page
151.
4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification
du composant installé», à la page 195.
Exemple
Après l'exécution de cette procédure, trois disques du boîtier système sont gérés par la carte d'activation
de division DASD et trois disques sont gérés par le système interne.
Que faire ensuite
Vous pouvez à présent gérer les disques de la même manière que n'importe quels autres disques.
Partage d'unités de disques internes sur un modèle 33E/8B ou 36E/8C
Cette rubrique vous explique la fonction de cette carte et comment la connecter. Suivez les instructions
données ci-après pour effectuer ces tâches.
Avant de commencer
Le fond de panier DASD prend en charge le mode de division DASD. Un câble SAS interne FC 3669 est
utilisé pour mettre le fond de panier DASD en mode de division DASD dans lequel 4 SFF DASD sur la
gauche (vue de face) sont assignés au contrôleur SAS intégré et 4 SFF DASD sur la droite sont assignés
au port SAS arrière interne. Un adaptateur SAS PCIe ou PCIX (tel que FC 5901, FC5904 ou FC5912) peut
accéder aux 4 SFF DASD via un câble SAS AI externe, comme indiqué dans la figure qui suit.
Remarques : Le second connecteur de l'adaptateur peut permettre de connecter un tiroir d'extension de support ou
d'extension de disque.
Figure 107. Connexion d'un adaptateur SAS aux emplacements de disque SAS internes via un port SAS externe
système
Unités de disques 105
Les informations fournies ci-après sont destinées à être utilisées une fois que la carte a été installée.
Installez la carte, puis revenez à cette rubrique. Pour plus d'informations, consultez la rubrique relative
aux cartes PCI (Cartes PCI).
Si vous ne l'avez pas encore fait, passez en revue les tâches décrites dans la section «Avant de
commencer», à la page 143.
Remarque : Le câble interne FC3669 du modèle 33E/8B ou 36E/8C être installé pour activer cette
configuration. Pour plus d'informations sur l'installation, voir Installation du port SAS externe sur un
modèle 33E/8B ou 36E/8C
Pourquoi et quand exécuter cette tâche
Cette carte permet de diviser des disques internes installés dans le boîtier de l'unité centrale en groupes
pouvant être gérés séparément.
Procédure
1. Si ce n'est pas déjà fait, arrêtez et mettez le système hors tension. Voir la rubrique «Arrêt d'un système
ou d'une partition logique», à la page 153.
2. Câblez le système en connectant le câble SAS à la carte, puis à la connexion SAS interne (voir figure
suivante).
3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page
151.
Remarques :
v Le second connecteur de l'adaptateur peut permettre de connecter un tiroir d'extension de support ou d'extension
de disque.
Figure 108. Connexion d'un adaptateur SAS aux emplacements de disque SAS internes via un port SAS externe
système
106 Unités de disques
4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification
du composant installé», à la page 195.
Exemple
Une fois la procédure terminée, quatre des disques du boîtier système seront gérés par le contrôleur SAS
PCI et quatre seront gérés par le contrôleur SAS intégré de l'unité centrale.
Que faire ensuite
Les disques peuvent à présent être gérés de la même manière que n'importe quels autres disques.
Information associée
Contrôleurs SAS RAID pour AIX sur les modèles 8233-E8B et 8236-E8C
Partage d'unités de disques internes sur un modèle 17M/MB ou
79M/HB
Cette rubrique vous explique la fonction de cette carte et comment la connecter. Suivez les instructions
données ci-après pour effectuer ces tâches.
Avant de commencer
Passez en revue les modes des unités de disque et les emplacements d'unités pris en charge, comme
indiqué dans le Tableau 1. La figure suivante présente l'emplacement des unités de disques sur les
modèles 17M/MB et 79M/HB.
Tableau 4. Modes des unités de disque et emplacements des unités prises en charge
Mode de l'unité de
disque Carte Port SAS Carte Câble SAS
Emplacement de
l'unité de disque
Disques internes
RAID
Aucune Contrôleur RAID
intégré double canal
Aucune D1, D2, D3, D4, D5,
et D6
Figure 109. Emplacements des unités de disques
Unités de disques 107
Tableau 4. Modes des unités de disque et emplacements des unités prises en charge (suite)
Mode de l'unité de
disque Carte Port SAS Carte Câble SAS
Emplacement de
l'unité de disque
Disques RAID
internes et externes
Dispositif de port
externe SAS (FC1819)
- Ensemble de câbles
SAS pour port SAS
Contrôleur RAID
intégré double canal
v Câble YI (FC3686) -
Système au boîtier
SAS, Contrôleur
unique / Double
accès - 1,5 mètre
v Câble YI (FC3687) -
Système au boîtier
SAS, Contrôleur
unique / Double
accès - 3 mètres
D1, D2, D3, D4, D5,
et D6
Fond de panier deux
sections
Aucune v Contrôleur SAS
intégré (SAS A)
v Contrôleur SAS
intégré (SAS B)
Aucune v D1, D2 et D3
v D4, D5 et D6
Fond de panier trois
sections
Dispositif de port
interne SAS (FC1815)
- Câble SAS pour
fond de panier trois
sections
v Contrôleur SAS
intégré (SAS A)
v Contrôleur SAS
intégré (SAS B)
v (FC5901) - Carte
SAS PCIe Dual-x4
Câble AI(FC3679) -
Carte à disque interne
- 1 mètre
v D1 et D4
v D2 et D5
v D3 et D6
Les informations suivantes sont destinées à être utilisées une fois que la carte de stockage FC5901 SAS a
été installée. Installez la carte, puis revenez à cette rubrique. Pour plus d'informations, consultez la
rubrique relative aux cartes PCI (Cartes PCI).
Si vous ne l'avez pas encore fait, passez en revue les tâches décrites dans la section «Avant de
commencer», à la page 143.
Pourquoi et quand exécuter cette tâche
Cette carte permet de diviser des disques internes installés dans le boîtier de l'unité centrale en groupes
pouvant être gérés séparément.
Procédure
1. Si ce n'est pas déjà fait, arrêtez et mettez le système hors tension. Voir la rubrique «Arrêt d'un système
ou d'une partition logique», à la page 153.
2. Connectez un boîtier d'unité centrale en procédant comme suit :
a. Branchez le câble au port SAS situé sur la cloison arrière du boîtier de l'unité centrale, au port
supérieur de la carte de stockage SAS, comme illustré dans la figure suivante.
Restriction : Le partage d'unité de disque interne n'est possible que lorsque le dispositif pour
câble interne FC1815 connecte le fond de panier DASD et la cloison arrière du boîtier de l'unité
centrale. En outre, la carte d'activation Cache RAID FC662 175 Mo - Adaptateur d'E-S double ne
doit pas être installée. enablement card must not be installed. La carte de stockage SAS peut être
installé dans n'importe quel autre emplacement la prenant en charge.
108 Unités de disques
b. Fixez tout câble supplémentaire.
3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page
151.
4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification
du composant installé», à la page 195.
Exemple
Lorsque cette fonction est installée, 2 des 6 disques du boîtier système peuvent être gérés par la carte de
stockage SAS.
Remarque : Le support de stockage amovible est toujours contrôlé par un contrôleur SAS distinct
incorporé à la carte système.
Que faire ensuite
Les disques peuvent à présent être gérés de la même manière que n'importe quels autres disques.
Installation du port SAS externe sur un modèle 02E/4B ou 05F/6B
Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond
de panier d'unité de disques ou pour activer le partage de disque interne.
Avant de commencer
Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
Pourquoi et quand exécuter cette tâche
Suivez les étapes ci-après pour installer un port SAS externe dans le serveur :
Unités de disques 109
Procédure
1. Arrêtez le système. Pour obtenir des instructions pour l'arrêt du système, voir «Arrêt d'un système ou
d'une partition logique», à la page 153.
2. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
3. Si nécessaire, retirez le câble SAS existant ou l'obturateur (voir figure 110).
a. Appuyez sur les taquets latéraux et retirez le capuchon SAS (A) de la connexion SAS sur le fond
de panier système.
b. Ouvrez le serre-câble (B) sur la paroi latérale. Le serre-câble s'ouvre à partir du haut.
c. Retirez le câble SAS (D) du système.
4. Installez le nouveau câble SAS (voir figure 111, à la page 111).
a. Connectez le port externe à l'arrière du système jusqu'à ce qu'il s'enclenche.
b. Ouvrez le serre-câble (C) sur la paroi latérale, et placez-y le câble (A). Le serre-câble s'ouvre à
partir du haut.
c. Refermez le serre-câble (C) et continuez à faire glisser le câble vers la connexion SAS sur le fond
de panier système.
d. Poussez le capuchon SAS (D) dans la connexion SAS sur le fond de panier système jusqu'à ce qu'il
s'enclenche.
Figure 110. Retrait du câble SAS
110 Unités de disques
5. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
6. Démarrez le système. Voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/
crustartsys.htm
Résultats
Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système.
Installation du port SAS externe sur un modèle 31E/2B
Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond
de panier d'unité de disques ou pour activer le partage de disque interne.
Avant de commencer
Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
Pourquoi et quand exécuter cette tâche
Suivez les étapes ci-après pour installer le port SAS externe dans le serveur :
Procédure
1. Arrêtez le système. Pour obtenir des instructions pour l'arrêt du système, voir «Arrêt d'un système ou
d'une partition logique», à la page 153.
2. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
3. Si nécessaire, retirez le câble SAS existant ou l'obturateur (voir figure 112, à la page 112).
Figure 111. Remplacement du câble SAS
Unités de disques 111
a. Appuyez sur les taquets latéraux et retirez le capuchon SAS de la connexion SAS sur le fond de
panier système.
b. Ouvrez le serre-câble sur la paroi latérale et sortez le câble (B). Le serre-câble s'ouvre à partir du
haut.
c. Enfoncez le port SAS externe (A), puis sortez le câble SAS du système.
4. Installation du nouveau câble SAS. Pour plus d'informations, voir figure 113, à la page 113.
a. Connectez le port externe à l'arrière du système (B) jusqu'à ce qu'il s'enclenche.
b. Si nécessaire, ouvrez le serre-câble sur la paroi latérale, et placez-y le câble. Le serre-câble s'ouvre
à partir du haut.
c. Refermez le serre-câble et continuez à faire glisser le câble (A) sur la connexion SAS sur le fond de
panier système.
d. Poussez le capuchon SAS dans la connexion SAS (B) sur le fond de panier système jusqu'à ce qu'il
s'enclenche.
Figure 112. Retrait du câble SAS
112 Unités de disques
5. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
6. Démarrez le système. Voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/
crustartsys.htm
Résultats
Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système.
Installation du port SAS externe des modèles 33E/8B ou 36E/8C
Il peut être nécessaire d'installer un port SAS externe dans le cadre de la mise à niveau de votre fond de
panier d'unité de disques ou d'activer le partage de disque interne. La procédure décrite dans cette
section permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Si votre système est géré par la console HMC, utilisez cette dernière pour installer la fonctionnalité SAS
3699 sur le serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la
page 194.
Si vous ne disposez pas d'une console HMC, suivez les étapes ci-après pour installer la fonctionnalité
SAS 3699 dans le serveur :
Procédure
1. Effectuez les tâches de configuration prérequises (voir «Avant de commencer», à la page 143).
2. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition
logique», à la page 153.
Figure 113. Remplacement du câble SAS
Unités de disques 113
3. Mettez le système en position de maintenance s'il est monté en armoire. Pour plus d'informations,
voir «Mise en position de maintenance de l'unité 33E/8B ou 36E/8C», à la page 184.
4. Retirez le capot d'accès. Pour plus d'informations, voir «Retrait du capot d'accès des modèles 33E/8B
ou 36E/8C», à la page 170.
5. Retirez le bloc processeur ou l'obturateur de bloc processeur situé à l'emplacement P1-C16 en
soulevant la poignée A, puis en extrayant le bloc du système (voir figure suivante).
Remarque : Il vous suffit de retirer un assemblage ou un obturateur en P1-C16. Pour retirer un
obturateur, saisissez-le par son milieu et extrayez-le du système.
6. Retirez les ventilateurs et le capot de ventilation en procédant comme suit :
a. Pour retirer les ventilateurs, appuyez sur la patte (A) et soulevez le ventilateur (B) du capot de
ventilation, comme illustré dans la figure suivante.
114 Unités de disques
b. Pour retirer les déflecteurs d'air, saisissez et ôtez-les du boîtier de ventilation, comme illustré
dans la figure suivante.
Figure 114. Retrait du ventilateur du boîtier de ventilation 33E/8B ou 36E/8C
Figure 115. Retrait des déflecteurs d'air du boîtier de ventilation 33E/8B ou 36E/8C
Unités de disques 115
c. Pour retirer le boîtier de ventilation, desserrez les deux vis moletées (A) et ôtez le boîtier de
ventilation (B) du système, comme illustré dans la figure suivante.
7. Faites passer la partie avant du câble doté du connecteur mâle à travers l'ouverture entre le boîtier
du processeur et le côté du système.
Conseil : Pour effectuer les étapes suivantes, reportez-vous à la figure 117, à la page 117.
8. Insérez les broches (A) situées sur le côté du système dans les emplacements (B) de l'ensemble de
câble SAS.
9. Faites de nouveau glisser l'ensemble de câble jusqu'à ce que les broches (A) soient insérées et le
loquet (C) en place.
Conseil : Le port situé à l'arrière de l'ensemble de câble doit être inséré dans l'emplacement à
l'arrière du système.
10. Fixez le support de câble (H) en plaçant l'encoche du support au-dessus de la colonne située sur le
côté du système, alignez le trou de la vis (I) avec celui qui se trouve sur le côté du système, puis
fixez le support à l'aide de la vis.
Remarque : Le support doit reposer confortablement sur l'ensemble de câble SAS.
Figure 116. Retrait du boîtier de ventilation du système 33E/8B ou 36E/8C
116 Unités de disques
11. Selon le fond de panier de l'unité de disques, effectuez l'une des opérations suivantes :
Conseil : Pour effectuer cette étape, reportez-vous à la figure 118, à la page 118.
v Si le raccord SAS est visible au centre du fond de panier de l'unité de disques (D), fixez le câble
au raccord et passez à l'étape 12, à la page 118.
v Si le capuchon de protection du câble SAS (E) ressort du fond de panier de l'unité de disques,
passez à l'étape 11a.
Figure 117. Fixation du câble SAS
Unités de disques 117
a. Recherchez le capuchon de protection du câble SAS (E) sur le fond de panier de l'unité de
disques, puis retirez du système la partie arrière du capuchon pour dégager celui-ci.
b. Retirez le capuchon du port SAS.
c. Fixez le capuchon au connecteur SAS mâle (F) de l'ensemble de câble en la faisant glisser
par-dessus le connecteur de câble (F) depuis l'arrière. Le connecteur SAS du câble doit tenir dans
les deux emplacements situés à l'avant du capuchon, et il doit ressortir de celui-ci afin de pouvoir
être inséré dans le port.
d. Faites glisser le câble et le capuchon de protection (G) dans le port SAS jusqu'à ce qu'ils soient
solidement fixés.
12. Reposez les ventilateurs et le boîtier de ventilation en procédant comme suit :
a. Pour installer le boîtier de ventilation (A) dans le système, alignez les broches coudées avec les
orifices, puis serrez les deux vis moletées (B), comme illustré dans la figure suivante.
Figure 118. Raccordement du câble SAS
118 Unités de disques
b. Pour installer les déflecteurs d'air, mettez-les en place dans le boîtier de ventilation, puis appuyez
dessus jusqu'à ce qu'ils s'enclenchent, comme illustré dans la figure suivante.
Figure 119. Installation du boîtier de ventilation dans le système 33E/8B ou 36E/8C
Figure 120. Installation des déflecteurs d'air dans le boîtier de ventilation 33E/8B ou 36E/8C
Unités de disques 119
c. Pour installer les ventilateurs, mettez-les en place dans le boîtier de ventilation, et appuyez
dessus jusqu'à ce qu'ils s'enclenchent, comme illustré dans la figure suivante.
13. Installez le processeur système (B) et fermez les deux loquets de sécurité (C) comme indiqué à la
figure suivante.
Figure 121. Installation des ventilateurs dans le boîtier de ventilation 33E/8B ou 36E/8C
120 Unités de disques
14. Reposez le capot d'accès. Pour plus d'informations, consultez la rubrique «Installation du capot
d'accès des modèles 33E/8B ou 36E/8C.», à la page 171.
15. Remettez le système en position de fonctionnement. Pour plus d'informations, voir «Placement de
33E/8B ou de 36E/8C en position de fonctionnement», à la page 187.
Résultats
Vous pouvez à présent associer les fonctions SAS prises en charge à votre système. Revenez à la
procédure qui vous a amené vers cette page ou démarrez le système. Pour plus d'informations,
reportez-vous à la rubrique «Démarrage du système ou de la partition logique», à la page 151.
Installation du port SAS externe sur un modèle 17M/MB ou 79M/HB
Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond
de panier d'unité de disques ou pour activer le partage de disque interne.
Pourquoi et quand exécuter cette tâche
Suivez les étapes ci-après pour installer le port SAS externe dans le serveur :
Procédure
1. Effectuez les tâches de configuration prérequises (voir «Avant de commencer», à la page 143).
2. Arrêtez le système. Voir «Arrêt d'un système ou d'une partition logique», à la page 153.
3. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
4. Si nécessaire, retirez le câble SAS existant ou l'obturateur.
a. Appuyez sur les taquets latéraux et retirez le capuchon SAS (A) de la connexion SAS sur le fond
de panier système.
b. Ouvrez le serre-câble sur la paroi latérale et sortez le câble. Le serre-câble s'ouvre à partir du haut.
c. Enfoncez le port SAS externe (B), puis sortez le câble SAS du système.
Unités de disques 121
5. Installation du nouveau câble SAS.
a. Connectez le port externe à l'arrière du système (B) jusqu'à ce qu'il s'enclenche.
b. Si nécessaire, ouvrez le serre-câble sur la paroi latérale, et placez-y le câble. Le serre-câble s'ouvre
à partir du haut.
c. Refermez le serre-câble et continuez à faire glisser le câble sur la connexion SAS sur le fond de
panier système.
d. Poussez le capuchon SAS (A) dans la connexion SAS sur le fond de panier système jusqu'à ce qu'il
s'enclenche.
Figure 122. Retrait du câble SAS
122 Unités de disques
6. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur
d'adaptateur PCI.
7. Démarrez le système. Voir «Démarrage du système ou de la partition logique», à la page 151
Résultats
Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système.
Remise en place d'une carte d'extension SAS dans une unité
d'extension 58/02 hors tension
Les procédures suivantes expliquent comment remettre en place une carte d'extension SAS dans une
unité d'extension 58/02 hors tension.
Avant de commencer
Examinez les conditions préalablement requises décrites dans la section «Avant de commencer», à la page
143.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour remplacer la
carte. Pour obtenir des instructions, voir «Retrait d'un composant via la console HMC», à la page 194
Procédure
1. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition
logique», à la page 153.
2. Débranchez le système pour l'isoler de la source d'alimentation.
Figure 123. Remplacement du câble SAS
Unités de disques 123
Avertissement : Le système est peut-être équipé d'un second bloc d'alimentation. Vous devez
couper l'alimentation du système en débranchant tous les cordons d'alimentation pour empêcher
tout endommagement du système pendant cette opération.
(L003)
ou
3. Placez le connecteur de la carte (C) sur le raccord du fond de panier central (D) en vous aidant du
guide (B) pour assurer un raccordement adéquat. Relevez ensuite le levier (A) pour insérer et fixer la
carte d'extension SAS dans le fond de panier central, comme illustré dans la figure suivante.
124 Unités de disques
4. Reposez le fond de panier central et la carte d'extension SAS dans le système en procédant comme
suit :
a. Alignez le fond de panier central avec le guide et assurez-vous que l'emplacement (A) du fond
de panier est fixé à la colonne (B) du système.
b. Serrez les vis moletées (C) et (D) de la carte d'extension SAS.
c. Serrez les vis moletées (E) de part et d'autre du fond de panier central.
Figure 124. Installation de la carte d'extension SAS dans le fond de panier central
Unités de disques 125
5. Fixez le cache du cordon d'alimentation et serrez la vis moletée (A) du système.
Figure 125. Installation de la carte d'extension SAS et du fond de panier central
Figure 126. Installation de la plaque d'alimentation
126 Unités de disques
6. Insérez avec précaution le contrôleur de gestion du boîtier dans le système et fermez le loquet (A)
comme indiqué dans la figure suivante.
7. Soulevez avec précaution la carte d'entrée-sortie et alignez-la avec l'emplacement situé à l'arrière de
l'unité d'extension.
8. Insérez la carte d'entrée-sortie fermement dans le serveur, comme indiqué dans la figure suivante.
9. Fixez la carte d'entrée-sortie avec les languettes de verrouillage (A) comme indiqué dans la figure
suivante.
Figure 127. Remplacement du contrôleur de gestion du boîtier
Unités de disques 127
10. Pour reposer le bloc d'alimentation, insérez-le dans l'unité d'extension en veillant à ce que le levier
de verrouillage (A) soit en position ouverte, comme illustré dans la figure suivante.
11. Fermez le levier de verrouillage (A) jusqu'à ce que le bloc d'alimentation soit maintenu en place.
Figure 128. Installation de la carte d'entrée-sortie sur un modèle monté en armoire
128 Unités de disques
12. Pour remplacer le fond de panier de l'unité de disques, soulevez le fond de panier sur les deux côtés
et alignez-le avec les rails de guidage situés dans l'unité d'extension.
ATTENTION :
Le fond de panier peut être lourd à porter. Assurez-vous de pouvoir effectuer cette tâche en toute
sécurité avant de commencer.
13. Faites glisser le fond de panier dans l'unité d'extension en le tenant par le bas, comme illustré dans
la figure suivante.
14. Fixez le fond de panier en amenant les pattes de fixation de la position ouverte (B) à la position
verrouillée (A).
Figure 129. Remplacement d'un bloc d'alimentation
Unités de disques 129
15. Reposez toutes les unités de disques et les obturateurs, si vous les avez retirés précédemment.
a. Alignez l'unité de disques avec les rails de guidage du fond de panier en la tenant par le bas.
b. Faites glisser l'unité de disques complètement dans le fond de panier.
c. Appuyez sur le levier (A) vers l'unité de disques pour le verrouiller.
Figure 130. Installation du fond de panier sur un modèle monté en armoire
130 Unités de disques
16. Remettez en place l'ensemble des cartes d'extension et obturateurs, si vous les aviez retirés
précédemment.
a. Alignez la carte d'extension avec les rails de guidage du fond de panier en la tenant par le bas.
b. Faites glisser la carte d'extension complètement dans le fond de panier.
c. Appuyez sur le levier (A) vers la carte pour le verrouiller.
17. Démarrez le système. Pour obtenir des instructions, voir «Démarrage du système ou de la partition
logique», à la page 151.
Que faire ensuite
Procédez à toutes les autres tâches de maintenance nécessaires.
v Pour plus d'informations sur le raccordement du système, voir Connexion des unités d'extension
v Pour plus d'informations sur l'installation des adaptateurs PCI sur la carte, voir Unités d'extension,
cartes PCI et cassettes - Modèles 58/02 et 58/77.
Retrait d'une carte d'extension SAS dans une unité d'extension 58/02
hors tension
Les procédures suivantes expliquent comment retirer une carte d'extension SAS d'une unité d'extension
(montée en armoire)58/02 pour remplacer un composant défectueux nécessitant une mise hors tension.
Avant de commencer
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la
carte d'extension SAS. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la
page 194.
Procédure
1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
Unités de disques 131
2. Identifiez la carte que vous souhaitez retirer. Pour obtenir des instructions, voir «Identification d'un
composant défaillant», à la page 146.
3. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition
logique», à la page 153.
4. Retirez les unités de disques et les obturateurs, comme illustré dans la figure suivante.
a. Appuyez sur le loquet de l'unité de disques et tirez sur la poignée (A) vers vous pour dégager
les unités.
b. Faites glisser l'unité de disques à l'extérieur du fond de panier en la tenant par le bas.
5. Retirez les cartes de port et les obturateurs, comme illustré dans la figure ci-dessous.
a. Saisissez l'extrémité du levier de verrouillage (A) et tirez-le vers le haut pour déverrouiller la
carte de port.
b. Retirez la carte de port du fond de panier en la tenant par le bas.
c. Posez la carte de port sur une surface de protection contre les décharges électrostatiques.
Figure 131. Retrait des unités de disques du fond de panier
132 Unités de disques
6. Retirez le fond de panier, comme illustré dans la figure suivante.
ATTENTION :
Le fond de panier peut être lourd à porter. Assurez-vous de pouvoir effectuer cette procédure en
toute sécurité.
a. Appuyez sur les loquets bleus (A) pour déverrouiller les leviers.
b. Tirez les leviers (B) vers l'extérieur dans le sens indiqué pour libérer le fond de panier.
c. Retirez le fond de panier du boîtier en le tenant par le bas.
Figure 132. Retrait de la carte de port du fond de panier
Unités de disques 133
7. Débranchez les cordons d'alimentation des deux ensembles de conversion hors ligne (OCA) en
procédant comme suit :
a. A l'avant des deux ensembles de conversion hors ligne, faites glisser les supports de cordon
d'alimentation (A) vers la gauche.
b. Etiquetez et débranchez les cordons d'alimentation (B) de chaque ensemble de conversion hors
ligne.
Figure 133. Retrait du fond de panier du boîtier
134 Unités de disques
8. Retirez les ensembles de conversion hors ligne en procédant comme suit :
a. Tirez le levier (A) à fond vers le bas dans le sens indiqué pour dégager l'ensemble de conversion
hors ligne.
b. Saisissez l'ensemble par les extrémités et retirez-le du boîtier.
Remarque : Tenez l'ensemble des deux mains pour l'extraire.
c. Répétez ces opérations pour l'autre ensemble de conversion hors ligne.
Figure 134. Débranchement des cordons d'alimentation des ensembles de conversion hors ligne
Unités de disques 135
9. Retirez le contrôleur de gestion du boîtier en procédant comme suit :
a. Etiquetez et débranchez tous les câbles de la carte du contrôleur de gestion du boîtier.
b. Pincez le loquet (A) pour déverrouiller le levier (B).
c. Tirez le levier vers l'extérieur dans le sens indiqué pour dégager la carte du contrôleur de gestion
du boîtier.
d. Faites glisser la carte du contrôleur de gestion hors du boîtier.
Figure 135. Retirer les ensembles de conversion hors ligne
136 Unités de disques
10. Retirez la plaque d'alimentation en tournant la vis moletée (A) et en ôtant la plaque du système.
11. Déconnectez les câbles de l'arrière de la carte d'entrée-sortie.
Figure 136. Retirer le contrôleur de gestion du boîtier
Figure 137. Retrait de la plaque d'alimentation
Unités de disques 137
12. Appuyez sur les loquets de déverrouillage (A) et tirez sur les leviers (B) pour dégager la carte du
système, comme illustré dans la figure suivante.
13. Retirez la carte du système.
14. Retirez la carte d'extension SAS et le fond de panier central du boîtier en procédant comme suit :
a. Tirez les verrous éclipsables à ressort (A) vers le haut et tournez-les d'un quart vers la gauche.
b. Tournez les vis imperdables (B) et (C) dans le sens inverse des aiguilles d'une montre jusqu'à ce
que qu'elles soient complètement desserrées.
c. Retirez la carte d'extension SAS et le fond de panier central du boîtier.
Figure 138. Retrait de la carte d'entrée-sortie
138 Unités de disques
15. Appuyez sur les leviers (A) de la carte d'extension pour dégager cette dernière du fond de panier
central, puis retirez la carte.
Figure 139. Retrait de la carte d'extension SAS et du fond de panier central
Figure 140. Retrait de la carte d'extension SAS du fond de panier central
Unités de disques 139
Que faire ensuite
Poursuivez la procédure de remplacement de la carte d'extension SAS ou revenez à la procédure qui vous
a amené vers cette page.
Remise en place d'une carte d'extension SAS dans une unité
d'extension 58/02 hors tension
Les procédures suivantes expliquent comment remplacer une carte d'extension SAS dans une unité
d'extension 58/02 hors tension.
Avant de commencer
Examinez les conditions préalables requises décrites dans la section «Avant de commencer», à la page
143.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la
carte. Pour obtenir des instructions, voir «Retrait d'un composant via la console HMC», à la page 194
Procédure
1. Avant d'installer ou de réinstaller une carte d'extension SAS, examinez les connecteurs de signal afin
de vous assurer qu'ils ne sont pas recourbés ou endommagés.
Avertissement : Les connecteurs recourbés risquent d'endommager le réceptacle du fond de panier.
Veillez à ne pas toucher ni endommager les connecteurs de signal.
2. Insérez complètement la carte dans l'emplacement et poussez le levier (A) vers le bas pour maintenir
la carte en place, comme illustré dans la figure suivante.
Figure 141. Installation de la d'extension SAS
140 Unités de disques
3. Démarrez le système. Pour obtenir des instructions, voir «Démarrage du système ou de la partition
logique», à la page 151.
4. Vérifiez que la nouvelle carte est correctement installée. Pour plus d'informations, voir «Vérification
du composant installé», à la page 195.
Que faire ensuite
Procédez à toutes les autres tâches de maintenance nécessaires.
Retrait d'une carte d'extension SAS d'une unité d'extension 58/02 hors
tension
Les procédures suivantes expliquent comment retirer une carte d'extension SAS d'une unité d'extension
(montée en armoire) 58/02 pour remplacer un composant défectueux nécessitant une mise hors tension.
Avant de commencer
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la
carte d'extension SAS. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la
page 194.
Procédure
1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143).
2. Identifiez la carte que vous souhaitez retirer. Pour obtenir des instructions, voir «Identification d'un
composant défaillant», à la page 146.
3. Soulevez le levier (A) et retirez la carte du système, comme illustré dans la figure suivante.
Figure 142. Retrait de la carte d'extension SAS
Unités de disques 141
Que faire ensuite
Poursuivez la procédure de remplacement de la carte d'extension SAS ou revenez à la procédure qui vous
a amené vers cette page.
142 Unités de disques
Procédures communes pour les fonctions installables
Cette section contient toutes les procédures communes concernant l'installation, le retrait et le
remplacement de fonctionnalités.
Avant de commencer
Respectez les précautions suivantes lorsque vous installez, retirez et remplacez des dispositifs et des
composants.
Pourquoi et quand exécuter cette tâche
Ces précautions sont destinées à créer un environnement sûr pour la maintenance de votre système et ne
sont pas des étapes pour la maintenance de votre système. Les procédures d'installation, de retrait et de
remplacement fournissent les étapes requises pour la maintenance de votre système.
143
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
144 Unités de disques
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit :
Procédure
1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en
charge de ce nouveau dispositif.
2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des
données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de
votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données).
Procédures communes pour les fonctions installables 145
3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au
composant.
4. Notez la correspondance des couleurs utilisée sur le système.
La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez
utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La
couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que
le système ou la partition logique est sous tension.
5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire
de ciseaux.
6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme suit
:
v Si vous remplacez un composant, contactez votre fournisseur de composants ou le support
technique.
v Si vous installez un dispositif, contactez l'un des fournisseurs de services suivants :
– Votre fournisseur de composants ou le support technique.
7. Si vous rencontrez des difficultés durant l'installation, contactez votre fournisseur de services, ou le
support technique.
8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre
les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la
rubrique Partitionnement logique.
Identification d'un composant défaillant
Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un
système ou une unité d'extension en utilisant la méthode propre à votre système.
Pourquoi et quand exécuter cette tâche
Pour les serveurs systèmes dotés d'un processeur POWER7, les voyants peuvent être utilisés pour
identifier ou vérifier un composant en cours de maintenance. Le voyant d'erreur (ambre) signale une
erreur et correspond au code d'emplacement du code SRC. Le voyant est activé et désactivé
automatiquement.
Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes.
Voyants du panneau de commande
Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de
commande.
Le panneau de commande est doté de voyants qui indiquent les différents états du système.
146 Unités de disques
v A : Bouton de mise sous-tension
v B: Voyant de système sous tension
– Un voyant fixe indique que l'alimentation système de l'unité est complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe.
Durant cette période de transition, le voyant peut clignoter plus rapidement.
v C : Voyant d'identification de boîtier
– Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce.
– L'absence de lumière indique un fonctionnement normal du système.
v D : Voyant d'informations système
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant signifie qu'une intervention est requise sur le système.
v E : Port USB
v F: Voyant de panne de boîtier
– Un voyant fixe indique une panne dans l'unité centrale
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
Figure 143. Panneau de commande
Procédures communes pour les fonctions installables 147
Concepts associés
Identification d'un composant défaillant
Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un
système ou une unité d'extension en utilisant la méthode propre à votre système.
Identification d'un composant défaillant sur un système AIX ou une
partition logique
Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le
voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX.
Localisation d'un composant défaillant sur un système AIX ou une partition
logique
Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche
d'un composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran
Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
Que faire ensuite
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. «Activation du voyant associé au composant défaillant».
Activation du voyant associé au composant défaillant
Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous
intervenez.
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
Désactivation du voyant d'un composant défaillant
Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance.
148 Unités de disques
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Voyants d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le
code d'emplacement.
6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé
au composant défaillant.
7. Retournez à la ligne de commande.
Identification d'un composant défaillant sur un système Linux ou une
partition logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou
désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance.
Localisation d'un composant défaillant sur un système Linux ou une partition
logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les
voyants pour localiser un composant.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Localisation d'un composant défaillant sur un système ou une partition logique
Linux
Pour retrouver le code d'emplacement d'un composant défaillant si vous ne le connaissez pas, utilisez la
procédure de cette rubrique.
Pourquoi et quand exécuter cette tâche
Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée.
3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code).
4. Notez les informations d'emplacement.
Procédures communes pour les fonctions installables 149
Activation du voyant associé au composant défaillant
Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à
rechercher le composant à remplacer.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Résultats
Désactivation du voyant d'un composant défaillant
Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du
composant défaillant.
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s normal -l et
appuyez sur Entrée.
Localisation d'un composant défaillant sur un système Virtual I/O
Server ou une partition logique
Il se peut que vous deviez utiliser les outils Virtual I/O Server (VIOS), avant d'activer le voyant dans le
cadre de la recherche d'un composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du
journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la
liste des événements triée dans l'ordre chronologique.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
150 Unités de disques
Résultats
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server».
Identification d'un composant à l'aide du Virtual I/O Server
Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à
l'aide du Virtual I/O Server (VIOS).
Procédure
1. Connectez-vous en tant que superutilisateur.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement.
Appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
Démarrage du système ou de la partition logique
Apprenez comment démarrer un système ou une partition logique après avoir effectué une procédure de
maintenance ou une mise à niveau du système.
Démarrage d'un système non géré par une console HMC
Le bouton d'alimentation ou l'interface ASMI permet de démarrer un système non géré par une console
HMC (Hardware Management Console).
Pourquoi et quand exécuter cette tâche
Pour démarrer un système non géré par une console HMC (Hardware Management Console), procédez
comme suit :
Procédure
1. Le cas échéant, ouvrez le volet avant de l'armoire.
2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est
bien relié à une source d'alimentation :
v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation.
v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante.
v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01
V=F.
3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante).
Procédures communes pour les fonctions installables 151
v A : Bouton de mise sous-tension
v B : Voyant d'alimentation
– Un voyant fixe indique que l'unité est sous alimentation système complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir
fixe. Durant cette période de transition, le clignotement peut s'accélérer.
v $C : Voyant d'identification de boîtier
– Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce.
– L'absence de lumière indique un fonctionnement normal du système.
v D : Voyant d'incident
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant clignotant signifie qu'une intervention est requise sur le système.
v E : Port USB
v F: Voyant de panne de boîtier
– Un voyant fixe indique une panne dans le système.
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants :
v Le voyant d'alimentation commence à clignoter plus rapidement.
v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus
rapidement.
Figure 144. Panneau de commande
152 Unités de disques
v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage
du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le
système est sous tension.
Que faire ensuite
Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez
la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System
Management Interface) :
1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface
ASMI.
2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la
rubrique Mise sous tension et arrêt du système.
Démarrage d'un système ou d'une partition logique via la console
HMC (Hardware Management Console)
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
démarrer le système ou la partition logique après avoir installé les câbles requis et relié les cordons
d'alimentation à une source d'alimentation.
Pourquoi et quand exécuter cette tâche
Pour plus d'informations sur l'utilisation de la console HMC, reportez-vous à la rubrique Gestion de la
console HMC (Hardware Management Console). Pour plus d'informations sur le démarrage d'une
partition logique, reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le
démarrage du système, reportez-vous à la rubrique Mise sous tension du système géré.
L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du
système. Lorque le voyant du panneau de commande cesse de clignoter et reste allumé, le système est
sous tension.
Arrêt d'un système ou d'une partition logique
La présente section explique comment démarrer un système ou une partition logique dans le cadre d'une
mise à niveau du système ou d'une procédure de maintenance.
Pourquoi et quand exécuter cette tâche
Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC
(Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être
endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si
toutes les applications n'ont pas été fermées avant l'arrêt du système.
Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée.
Tâches associées
Arrêt de partitions logiques à l'aide du gestionnaire Integrated Virtualization Manager
Cette procédure permet d'arrêter une partition logique ou l'ensemble d'un système géré par le
gestionnaire Integrated Virtualization Manager.
Arrêt d'un système non géré par une console HMC
Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter
le système à l'aide du bouton d'alimentation ou de l'interface ASMI.
Procédures communes pour les fonctions installables 153
Avant de commencer
Avant d'arrêter le système, procédez comme suit :
1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications.
2. Assurez-vous que le système d'exploitation est arrêté.
Avertissement : Si vous ne le faites pas, vous risquez de perdre des données.
3. Si une partition logique I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les clients
sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen.
Pourquoi et quand exécuter cette tâche
La procédure ci-après explique comment arrêter un système non géré par une console HMC (Hardware
Management Console).
Procédure
1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour
exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension).
2. A l'invite, tapez la commande appropriée :
v Si le système exécute le système d'exploitation AIX, tapez shutdown.
v Si le système exécute le système d'exploitation Linux, tapez shutdown -h now.
v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est
partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite,
utilisez la commande PWRDWNSYS pour arrêter la partition principale.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
3. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le
type et le mode de l'IPL.
4. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off.
5. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez
que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités
d'extension, par exemple).
Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure,
vérifiez que toutes les sources d'alimentation ont été débranchées du système.
(L003)
ou
154 Unités de disques
Arrêt d'un système à l'aide de la console HMC
Pourquoi et quand exécuter cette tâche
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
arrêter le système ou la partition logique. Pour effectuer cette tâche, procédez comme suit.
Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous
arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système
géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez
utiliser cette procédure pour mettre hors tension le système géré.
Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant
de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez
de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique I/O
Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités
par un autre moyen.
Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants :
v Super administrateur
v Technicien de maintenance
v Opérateur
v Responsable produit au laboratoire
Procédure
1. Dans la zone de navigation, développez le dossier Gestion de systèmes.
2. Cliquez sur l'icône Serveurs.
3. Dans la zone de contenu, sélectionnez le système géré.
4. Sélectionnez Tâches, Opérations, puis Mise hors tension
5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK.
Procédures communes pour les fonctions installables 155
Information associée
Arrêt et redémarrage de partitions logiques
Retrait et remplacement du volet ou panneau d'unité d'extension
Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Retrait du panneau avant sur un modèle 14G/30
Il se peut que vous deviez retirer le panneau pour effectuer des opérations de maintenance du système.
Pourquoi et quand exécuter cette tâche
Pour retirer le panneau avant, procédez comme suit :
Procédure
1. Retirez les deux vis moletées (B) figurant sur la partie gauche et à l'arrière du panneau.
2. Appuyez sur les loquets (C) situés de part et d'autre du panneau pour libérer celui-ci.
3. Soulevez le panneau (A) et retirez-le du châssis.
Conseil : Si un bloc de ventilation est présent dans une partie vide du châssis, ne le retirez que si
vous installez des composants dans cette partie du châssis.
Installation du panneau avant sur un modèle 14G/30
Cette procédure permet d'installer le panneau avant après avoir installé le système ou effectué une
procédure de maintenance.
Figure 145. Retrait du panneau avant sur un modèle 14G/30
156 Unités de disques
Avant de commencer
Si des composants ne sont installés que d'un seul côté du châssis, assurez-vous que le bloc de ventilation
est présent du côté vide.
Procédure
1. Insérez les colonnes situées à l'arrière du panneau (A) dans les crochets du châssis.
2. Appuyez sur les loquets (C) situés de part et d'autre du panneau avant.
3. Insérez le panneau dans le châssis et ouvrez les loquets. Le panneau s'enclenche dans le logement.
4. Remettez en place les deux vis moletées (B) dans les emplacements gauche et droite du panneau
avant.
Carters système
Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le
processeur POWER7.
Retrait et remplacement des carters du système 02E/4B ou 05F/6B
Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des
composants ou effectuer une opération de maintenance.
Retrait du carter avant d'un système monté en armoire 02E/4B ou 05F/6B
Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de
maintenance.
Figure 146. Panneau avant d'un modèle 14G/30
Procédures communes pour les fonctions installables 157
Pourquoi et quand exécuter cette tâche
Pour retirer le carter avant, procédez comme suit :
Procédure
1. Retirez les deux vis qui maintiennent le système fixé à l'armoire.
2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus
facilement.
Retrait du carter avant sur un système autonome 02E/4B ou 05F/6B
Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le carter avant, procédez comme suit :
Procédure
1. Ouvrez le volet avant (A) en le faisant pivoter vers la gauche, comme indiqué dans la figure suivante.
2. Appuyez sur le taquet de verrouillage (B) et retirez le carter du système. Le carter est doté d'un
renfoncement permettant de le tenir plus facilement.
Figure 147. Retrait du carter avant
158 Unités de disques
Installation du carter avant sur un système monté en armoire 02E/4B ou 05F/6B
Cette procédure permet d'installer le carter permettant d'accéder aux composants ou d'effectuer une
opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le carter avant, procédez comme suit :
Procédure
1. Poussez le carter sur le système.
2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés
dans leurs montants de fixation comme indiqué dans la figure suivante. Le carter avant se met en
place ; il est doté d'un renfoncement permettant de le tenir plus facilement.
Figure 148. Retrait du carter avant
Procédures communes pour les fonctions installables 159
Installation du carter avant et du volet avant sur un système autonome 02E/4B ou
05F/6B
Cette procédure permet d'installer le carter permettant d'accéder aux composants ou d'effectuer une
opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le carter avant et le volet avant, procédez comme suit.
Procédure
1. Alignez le carter (A) jusqu'à ce que les deux pattes du couvercle soient placées dans les encoches de
plaque de base (B), comme illustré dans la figure suivante.
2. Faites pivoter le carter vers le haut et vers le système, jusqu'à ce que le taquet de déverrouillage soit
inséré dans son emplacement. Le carter est doté d'un renfoncement permettant de le tenir plus
facilement.
Figure 149. Retrait du carter avant
160 Unités de disques
3. Maintenez le volet avant à un angle d'environ 120 degrés vers le système comme indiqué dans la
figure suivante. Enclenchez les taquets.
Retrait du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B
Cette procédure décrit comment retirer le capot d'accès.
Procédure
1. Desserrez les deux vis moletées (A) situées à l'arrière du capot.
Figure 150. Installation du carter avant
Figure 151. Installation du volet avant
Procédures communes pour les fonctions installables 161
2. Faites glisser le capot vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est
plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Avertissement : Pour assurer un refroidissement et une circulation de l'air corrects, replacez le capot
avant la mise sous tension du système. Vous risquez d'endommager les composants système si vous
utilisez le système sans son capot pendant plus de 30 minutes.
Retrait du capot d'accès d'un système autonome 02E/4B ou 05F/6B
Cette procédure décrit comment retirer le capot d'accès.
Procédure
1. Desserrez la vis de serrage à main (A) située à l'arrière du capot.
2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Figure 152. Retrait du capot d'accès
162 Unités de disques
3. Desserrez les deux vis moletées (A) situées à l'arrière du capot intérieur.
4. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Figure 153. Installation du capot d'accès extérieur
Figure 154. Retrait du capot d'accès intérieur
Procédures communes pour les fonctions installables 163
Installation du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès sur le haut du système, à environ 25 mm du rebord du châssis supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez les deux vis moletées (A) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du
boîtier du système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Installation du capot d'accès d'un système autonome 02E/4B ou 05F/6B
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès interne (A) sur le haut du système, à environ 25 mm du rebord du châssis
supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez les deux vis moletées (B) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du
boîtier du système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Figure 155. Installation du capot d'accès
164 Unités de disques
5. Placez le capot d'accès extérieur (A) sur le haut du système.
6. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
7. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du
système.
8. Serrez la vis moletée afin de fixer le capot d'accès.
Figure 156. Installation du capot d'accès intérieur
Procédures communes pour les fonctions installables 165
Retrait et remplacement des carters du système 31E/2B
Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des
composants ou effectuer une opération de maintenance.
Retrait du capot d'accès du système 31E/2B
Cette procédure décrit comment retirer le capot d'accès.
Procédure
1. Desserrez la vis moletée (A) située à l'arrière du capot.
2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Figure 157. Installation du capot d'accès extérieur
166 Unités de disques
Avertissement : Pour assurer un refroidissement et une circulation de l'air corrects, replacez le capot
avant la mise sous tension du système. Vous risquez d'endommager les composants système si vous
utilisez le système sans son capot pendant plus de 30 minutes.
Installation du capot d'accès du système 31E/2B
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du
système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Figure 158. Retrait du capot d'accès
Procédures communes pour les fonctions installables 167
Retrait et installation des carters sur le système 33E/8B ou 36E/8C
Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des
composants ou effectuer une opération de maintenance.
Retrait du carter avant des modèles 33E/8B ou 36E/8C.
Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le carter avant, procédez comme suit :
Procédure
1. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (B) (voir figure suivante).
2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus
facilement.
Figure 159. Installation du capot d'accès
168 Unités de disques
Installation du carter avant des modèles 33E/8B ou 36E/8C
Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué
une opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le carter avant, procédez comme suit :
Procédure
1. Alignez le carter face au système.
2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés
dans leurs montants de fixation respectifs (B) sur le système.
3. Remettez en place les deux vis (C) qui maintiennent le système fixé à l'armoire (A).
Figure 160. Retrait du carter avant
Procédures communes pour les fonctions installables 169
Retrait du capot d'accès des modèles 33E/8B ou 36E/8C
Cette procédure permet de retirer le capot d'accès pour effectuer des opérations de maintenance ou
accéder à des composants internes.
Pourquoi et quand exécuter cette tâche
Pour retirer le capot d'accès, procédez comme suit :
Procédure
1. Placez le système en position de maintenance. Pour plus d'informations, voir «Mise en position de
maintenance de l'unité 33E/8B ou 36E/8C», à la page 184.
2. Desserrez les deux vis moletées (A) situées à l'arrière du capot.
3. Placez le capot (B) à l'arrière de l'unité centrale. Une fois l'avant du capot d'accès dégagé du bord
supérieur, soulevez et retirez le capot.
Résultats
Avertissement : Pour garantir un niveau de refroidissement et de ventilation approprié, installez le
capot avant de démarrer le système. Vous risquez d'endommager les composants système si vous utilisez
le système sans son capot pendant plus de 30 minutes.
Figure 161. Installation du carter avant
170 Unités de disques
Installation du capot d'accès des modèles 33E/8B ou 36E/8C.
Cette procédure permet d'installer le capot d'accès après avoir effectué une opération de maintenance ou
accédé aux composants internes.
Pourquoi et quand exécuter cette tâche
Pour installer le capot d'accès, procédez comme suit :
Procédure
1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du panneau avant.
2. Tout en maintenant le capot d'accès contre l'unité centrale, faites glisser le capot vers l'avant de l'unité
centrale.
Les taquets du capot d'accès doivent se trouver en-dessous du bord supérieur et les deux vis doivent
être alignées avec les trous situés à l'arrière de l'unité centrale.
Important : Assurez-vous que les câble LED du ventilateur ne sont pas coincés par l'arête avant du
capot d'accès lorsque vous le déplacez vers l'avant.
3. Serrez les deux vis (B) situées à l'arrière du capot.
Figure 162. Retrait du capot d'accès
Procédures communes pour les fonctions installables 171
Retrait et remplacement des carters du système 17M/MB ou 79M/HB
Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des
composants ou effectuer une opération de maintenance.
Retrait du capot avant des modèles 17M/MB ou 79M/HB
Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le panneau avant, procédez comme suit :
Procédure
1. Le cas échéant, ouvrez le volet avant de l'armoire.
2. Tirez les taquets (A) situés de chaque côté du capot (voir figure suivante).
Figure 163. Installation du capot d'accès
172 Unités de disques
3. Tirez le panneau (B) pour l'ôter de l'unité centrale.
Installation du panneau avant du modèle 17M/MB ou 79M/HB
Cette procédure permet d'installer le capot après avoir accédé à des composants ou effectué une
opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le panneau avant, procédez comme suit :
Procédure
1. Placez le panneau (A) sur la face avant de l'unité centrale en alignant les quatre goupilles sur les
quatre emplacements correspondants à l'arrière du panneau.
Figure 164. Retrait du capot avant
Procédures communes pour les fonctions installables 173
2. Appuyez sur les taquets (B) pour enclencher le panneau dans son emplacement.
3. Fermez le volet avant de l'armoire.
Mise en position de maintenance ou de fonctionnement du système
33E/8B ou 36E/8C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance d'un système monté en armoire
31E/2B
Utilisez cette procédure pour placer le système monté en armoire en position de maintenance.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
Figure 165. Installation du panneau avant
174 Unités de disques
3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez
l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Mise en position de fonctionnement d'un système monté en armoire
31E/2B
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouiller les taquets de sécurité en les faisant glisser vers l'avant du système.
2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient verrouillés.
Figure 166. Placement du système en position de maintenance
Procédures communes pour les fonctions installables 175
3. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Mise en position de maintenance d'un système monté en armoire
02E/4B ou 05F/6B
Utilisez cette procédure pour placer le système monté en armoire en position de maintenance.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire.
Figure 167. Mise en position de fonctionnement du système
176 Unités de disques
4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez
l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Figure 168. Placement du système en position de maintenance
Procédures communes pour les fonctions installables 177
Mise en position de fonctionnement du système monté en armoire
02E/4B ou 05F/6B
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouillez les taquets de sécurité (A) en les faisant glisser vers le haut du système. Les taquets de
sécurité sont situés près du milieu du système.
2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient verrouillés.
3. Replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire.
4. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Figure 169. Mise en position de fonctionnement du système
178 Unités de disques
Positions de maintenance et de fonctionnement
Suivez ces procédures pour placer les serveurs systèmes en position de maintenance ou de
fonctionnement.
Mise en position de fonctionnement du système 02E/4B ou 05F/6B
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance d'un système monté en armoire 02E/4B ou
05F/6B
Utilisez cette procédure pour placer le système monté en armoire en position de maintenance.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire.
Figure 170. Mise en position de fonctionnement du système
Procédures communes pour les fonctions installables 179
4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez
l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Figure 171. Placement du système en position de maintenance
180 Unités de disques
Mise en position de fonctionnement du système monté en armoire 02E/4B ou
05F/6B
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouillez les taquets de sécurité (A) en les faisant glisser vers le haut du système. Les taquets de
sécurité sont situés près du milieu du système.
2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient verrouillés.
3. Replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire.
4. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Figure 172. Mise en position de fonctionnement du système
Procédures communes pour les fonctions installables 181
Mise en position de maintenance ou de fonctionnement du système
33E/8B ou 36E/8C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance d'un système monté en armoire 31E/2B
Utilisez cette procédure pour placer le système monté en armoire en position de maintenance.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez
l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Figure 173. Mise en position de fonctionnement du système
182 Unités de disques
Mise en position de fonctionnement d'un système monté en armoire 31E/2B
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouiller les taquets de sécurité en les faisant glisser vers l'avant du système.
2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient verrouillés.
Figure 174. Placement du système en position de maintenance
Procédures communes pour les fonctions installables 183
3. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Mise en position de maintenance ou de fonctionnement du système
33E/8B ou 36E/8C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance de l'unité 33E/8B ou 36E/8C
Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants
internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance.
Pourquoi et quand exécuter cette tâche
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Figure 175. Mise en position de fonctionnement du système
184 Unités de disques
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
Procédures communes pour les fonctions installables 185
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance,
procédez comme suit :
Procédure
1. Le cas échéant, ouvrez le volet avant de l'armoire.
2. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (voir figure suivante).
3. Débloquez les taquets de sécurité (B) situés sur les deux côtés de l'unité (voir figure suivante).
186 Unités de disques
4. Si vous devez sortir l'unité centrale ou l'unité d'extension de l'armoire, veillez à débrancher
auparavant les câbles situés à l'arrière de l'unité. Veillez à ne pas bloquer ou plier les câbles situés à
l'arrière de l'unité centrale ou de l'unité d'extension en tirant l'unité hors de l'armoire.
5. Tirez doucement l'unité centrale ou l'unité d'extension hors de l'armoire jusqu'à ce que les glissières
soient complètement étendues et bloquées. Vérifiez que les glissières sont complètement étendues.
Une fois les glissières complètement étendues, les taquets de sécurité s'enclenchent. Ils évitent de
sortir l'unité centrale ou l'unité d'extension de manière excessive.
Placement de 33E/8B ou de 36E/8C en position de fonctionnement
Cette procédure permet de mettre l'unité centrale (le système) ou l'unité d'extension en position de
fonctionnement pour pouvoir l'utiliser.
Pourquoi et quand exécuter cette tâche
Pour mettre le système en position de fonctionnement, procédez comme suit :
Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre système ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Procédure
1. Libérez simultanément les taquets de sécurité bleus (B) situés vers l'avant de chaque taquet, puis
poussez le système ou l'unité d'extension dans l'armoire (voir la figure suivante).
Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de le système ou de l'unité
d'extension lorsque vous poussez l'unité dans l'armoire.
Figure 176. Libération des loquets de sécurité
Procédures communes pour les fonctions installables 187
2. Réinstallez et serrez les deux vis moletées (C) qui maintiennent le système ou l'unité d'extension (A) à
l'armoire (voir figure suivante).
3. Fermez le volet avant de l'armoire.
Figure 177. Déblocage des taquets de sécurité du rail
Figure 178. Remise en place des vis moletées
188 Unités de disques
Cordons d'alimentation
Suivez ces procédures pour déconnecter et reconnecter les cordons d'alimentation les serveurs systèmes
contenant le processeur POWER7.
Déconnexion des cordons d'alimentation d'un système 02E/4B ou
05F/6B
Utilisez ces procédures pour déconnecter les cordons d'alimentation d'un système 02E/4B ou 05F/6B.
Procédure
1. Pour un système monté en armoire, ouvrez le volet arrière de l'armoire sur l'unité sur laquelle vous
effectuez la maintenance et identifiez l'unité centrale.
2. Pour un système monté en armoire et un système autonome, débranchez tous les cordons
d'alimentation de l'unité sur laquelle vous effectuez la maintenance. Le système est peut-être équipé
de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement,
vérifiez que toutes les deux sources d'alimentation ont été débranchées du système.
Figure 179. Retrait des cordons d'alimentation pour un système monté en armoire
Procédures communes pour les fonctions installables 189
Connexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B
Cette procédure permet de connecter les cordons d'alimentation.
Procédure
1. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
2. Connectez tous les cordons d'alimentation à l'unité dont vous effectuez la maintenance. Vérifiez que
les cordons d'alimentation passent par les poignées.
Figure 180. Retrait des cordons d'alimentation pour un système autonome
190 Unités de disques
3. Fermez le volet arrière de l'armoire.
Déconnexion des cordons d'alimentation du système 31E/2B
Utilisez ces procédures pour déconnecter les cordons d'alimentation du système 31E/2B.
Procédure
1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Le système
est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de
remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système.
Figure 181. Connexion des cordons d'alimentation
Procédures communes pour les fonctions installables 191
Connexion des cordons d'alimentation au système 31E/2B
Utilisez ces procédures pour connecter les cordons d'alimentation au système 31E/2B.
Procédure
1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Vérifiez
que les cordons d'alimentation passent par les poignées.
Figure 182. Retrait des cordons d'alimentation
192 Unités de disques
Installation d'un dispositif via la console HMC (Hardware Management
Console)
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris l'installation d'un nouveau dispositif ou composant.
Avant de commencer
Pour utiliser l'interface utilisateur de la console HMC (Hardware Management Console) afin d'installer un
dispositif ou un composant sur une unité centrale ou une unité d'extension gérée par une console HMC,
procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs.
2. Sélectionnez le système géré sur lequel vous allez installer le composant.
3. Dans la zone des tâches, développez Maintenabilité ? Matériel.
4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site, Field Replaceable Unit).
5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans
lequel vous installez le dispositif.
6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant.
7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du dispositif.
Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif.
Dans ce cas, suivez ces instructions pour installer le dispositif.
Figure 183. Connexion des cordons d'alimentation
Procédures communes pour les fonctions installables 193
Retrait d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Pour utiliser l'interface utilisateur de la console HMC (Hardware Management Console) afin de retirer un
composant d'une unité centrale ou d'une unité d'extension gérée par une console HMC, procédez comme
suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs.
2. Sélectionnez le système géré sont vous souhaitez retirer un composant.
3. Dans la zone des tâches, développez Maintenabilité ? Matériel.
4. Sélectionnez Retrait d'une FRU.
5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de
FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant.
6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant.
7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter.
8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure
et suivez les instructions de retrait du composant.
Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant
comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant.
Remplacement d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris l'échange d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez
ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la
version 7 ou d'une version ultérieure de la console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs.
2. Sélectionnez le système géré dont vous souhaitez remplacer un composant.
3. Dans la zone des tâches, développez Maintenabilité ? Matériel.
4. Sélectionnez Echange de FRU.
5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU,
sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant.
6. Sélectionnez dans la liste le type de composant que vous souhaitez échanger et cliquez sur Suivant.
7. Sélectionnez dans la liste le code d'emplacement du composant que vous souhaitez échanger et
cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'échange du dispositif.
Remarque : La console HMC peut afficher des instructions concernant le replacement du composant.
Dans ce cas, suivez les instructions pour remplacer le composant.
194 Unités de disques
Vérification du composant installé
Vous pouvez vérifier un composant nouvellement installé ou remplacé sur votre système, partition
logique ou unité d'extension à l'aide du système d'exploitation, des diagnostics autonomes ou de la
console HMC.
Vérification d'un dispositif installé ou d'un composant remplacé sur un
système ou une partition logique AIX
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils AIX
pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via AIX
v Vérification du composant de rechange via AIX
Vérifiez le dispositif installé via AIX :
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
correctement installé. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique (LPAR), notez la partition logique sur laquelle vous avez installé la fonctionnalité. Pour
obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérifiez le composant de rechange via AIX
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé,
procédez comme suit :
1. Avez-vous remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de
remplacement à chaud simultané du service de diagnostic en ligne ?
Non : Passez à l'étape 2, à la page 196.
Procédures communes pour les fonctions installables 195
Oui : Passez à l'étape 5.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, voir Effectuer une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou
que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion AIX ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau
composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous
les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite
de connexion ne s'affiche pas, voir la rubrique relative aux Incidents de chargement et de
démarrage du système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5.
Si le menu de sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource,
procédez comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? cliquez sur Oui et appuyez sur
Entrée.
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services
pour obtenir de l'aide..
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en effectuant les opérations suivantes :
a. A l'invite, tapez diag et appuyez sur Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester
uniquement le composant remplacé ou les périphériques qui y sont raccordés et appuyez sur
Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7, à la page 197.
6. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
196 Unités de disques
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0
et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9.
7. Dans le menu Action de réparation de ressource, sélectionnez la ressource du composant remplacé.
En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure
dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme
suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le
système a été remplacé.
Remarque : Sur les systèmes équipés d'un voyant associé au composant défaillant, le voyant passe à
l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez
sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
Oui : Passez à l'étape 8.
8. Dans le menu Action de réparation de ressource, sélectionnez au besoin le parent ou l'enfant de la
ressource pour le composant remplacé. En cas de test réussi sur une ressource en mode de
vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action
corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette
à jour et indique qu'un composant détectable par le système a été remplacé.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
sélectionnez la ressource correspondante. Si la ressource n'apparaît pas dans la liste des
ressources, sélectionnez sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de maintenance ou du réseau (expliqué dans les
procédures précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11.
Oui : Passez à l'étape 12.
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12.
12. Les voyants sont-ils encore allumés ?
v Non. La procédure est terminée.
v Oui : Désactivez les voyants. Pour obtenir des instructions: voir Modification des voyants de
maintenance.
Procédures communes pour les fonctions installables 197
Vérification d'un composant installé sur un système Linux ou une
partition logique
Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir
comment vérifier que le système reconnaît ce composant.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique
«Vérification d'un composant installé via les programmes de diagnostic autonomes».
Vérification d'un composant installé via les programmes de diagnostic
autonomes
Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez
utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système
Linux, une unité d'extension ou une partition logique.
Avant de commencer
v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la
communication avec les autres serveurs est interrompue.
v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition
logique. Aucune autre activité ne peut être exécutée sur la partition.
v Les programmes de diagnostic autonomes doivent accéder à la console système.
Pourquoi et quand exécuter cette tâche
Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network
Installation Management). La procédure suivante explique comment utiliser les programmes de
diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du
serveur NIM, reportez-vous à la rubrique Exécution de diagnostics autonomes à partir d'un serveur NIM
(Network Installation Management).
Pour utiliser les programmes de diagnostic autonomes, procédez comme suit :
Procédure
1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la
partition logique.
2. Retirez les bandes, les disquettes et les CD-ROM.
3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition
logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique
n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez,
procédez comme suit :
a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI,
reportez-vous à la rubrique Accès à l'interface ASMI.
b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage.
c. Cliquez sur Mettre le système sous/hors tension.
d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux,
sélectionnez l'option Amorçage du mode service à partir de la liste par défaut.
e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques
optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes.
f. Passez à l'étape 5, à la page 199.
198 Unités de disques
4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de
diagnostic autonome dans l'unité de disque optique.
5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier
voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour
indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités
d'amorçage en mode service par défaut.
6. Tapez le mot de passe requis.
7. Dans l'écran Diagnostic Operating Instructions, appuyez sur Entrée.
Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un
autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté.
Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous
tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide.
8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection
des fonctions pour initialiser le système d'exploitation.
9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les
ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en
sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée.
12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services.
v Oui : Passez à l'étape 13.
13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
14. Si les voyants sont toujours allumés, procédez comme suit :
a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement
pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée.
b. Sélectionnez Set System Attention Indicator to NORMAL, puis appuyez sur Entrée.
c. Sélectionnez Set All Identify Indicators to NORMAL, puis appuyez sur Entrée.
d. Sélectionnez Validation.
Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal.
e. Retournez à la ligne de commande.
Vérification d'un composant installé via la console HMC (Hardware
Management Console)
Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management
Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une
opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou
de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant installé, procédez comme suit :
Procédures communes pour les fonctions installables 199
Procédure
1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le
journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique
«Visualisation d'événements réparables», à la page 202.
2. Des événements d'action de maintenance sont-ils ouverts ?
Non : Si le voyant d'avertissement système ou d'identification est encore allumé, utilisez la console
HMC pour le désactiver. Consultez la rubrique «Activation et désactivation de voyants». La
procédure est terminée.
Oui : Passez à l'étape suivante.
3. Notez la liste des événements d'action de maintenance ouverts.
4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils
les mêmes que ceux que vous avez collectés auparavant ?
v Non : Sélectionnez l'une des options suivantes :
– Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond
et passez à l'étape suivante.
– Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez
votre fournisseur de services.
v Oui : Passez à l'étape suivante.
5. Dans la fenêtre Error Associated With This Serviceable Event, sélectionnez et mettez en évidence
l'événement d'action de maintenance.
6. Cliquez sur Close Event.
7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet
événement. Cliquez sur OK.
8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action
de maintenance ouvert ?
v Non : Sélectionnez l'option No FRU Replaced for this Serviceable Event et cliquez sur OK pour
fermer l'événement d'action de maintenance.
v Oui : Procédez comme suit :
a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour.
b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes.
c. Cliquez sur OK pour refermer l'événement d'action de maintenance.
9. Si les incidents persistent, contactez votre fournisseur de services.
Activation et désactivation de voyants
Cette procédure permet d'activer ou de désactiver les voyants à l'aide des utilitaires SFP (Service Focal
Point) pour la console HMC.
Pourquoi et quand exécuter cette tâche
Choisissez l'une des options suivantes :
v «Désactivation d'un voyant d'avertissement système ou de partition»
v «Activation ou désactivation de voyants d'identification», à la page 201
Désactivation d'un voyant d'avertissement système ou de partition :
Pourquoi et quand exécuter cette tâche
Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par
exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement.
Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant
d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient.
200 Unités de disques
Procédure
1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes.
2. Ouvrez Serveurs et sélectionnez le système approprié.
3. Dans la zone de contenu, cochez la case correspondant à la partition concernée.
4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système.
5. Sélectionnez la partition appropriée.
6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de
confirmation qui s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant voyant d'avertissement système a été désactivé.
v Un message indique que le système présente encore des incidents non résolus.
v Un message indique que le voyant d'avertissement système ne peut pas être activé.
7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du
voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche
fournit les informations suivantes :
v Une vérification indique que le voyant de la partition logique a été désactivé.
v Un message indique que la partition logique présente encore des incidents non résolus.
v Un message indique que le voyant de la partition logique ne peut pas être activé.
Activation ou désactivation de voyants d'identification :
Pourquoi et quand exécuter cette tâche
Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités
remplaçables sur site (FRU). On les appelle donc voyants d'identification.
Vous pouvez activer ou désactiver les types de voyants d'identification suivants :
v Voyant d'identification d'un boîtier Si vous souhaitez ajouter une carte dans un tiroir précis (boîtier),
vous devez connaître le type et le modèle de l'ordinateur ainsi que le numéro de série (MTMS) du
tiroir. Pour déterminer si vous disposez du MTMS approprié du tiroir nécessitant la nouvelle carte,
vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit
accueillir le nouveau matériel.
v Voyant d'identification d'une unité FRU associée à un boîtier Pour connecter un câble à une carte
d'E-S, vous pouvez activer le voyant de la carte s'il s'agit d'une unité FRU, puis vérifier sur le matériel
où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur
plusieurs cartes.
Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme
suit :
Procédure
1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes.
2. Sélectionnez Serveurs.
3. Dans la zone de contenu, cochez la case correspondant au système concerné.
4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification.
5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le
tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé
s'allume ou s'éteint.
6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le
tableau, puis sélectionnez Sélectionné ? Liste des FRU.
7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
Procédures communes pour les fonctions installables 201
Visualisation d'événements réparables
Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et
historique de maintenance.
Pourquoi et quand exécuter cette tâche
Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez
avoir l'un des rôles suivants :
v Super administrateur
v Technicien de maintenance
v Opérateur
v Responsable produit
v Visionneur
Pour afficher des événements réparables, procédez comme suit :
Procédure
1. Dans la zone de navigation, sélectionnez Gestion de la maintenance.
2. Sélectionnez Gestion des événements réparables.
3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre
Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables
correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur
les événements réparables.
4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez
Sélectionné ? Affichage des détails La fenêtre Détails de l'événement réparable affiche des
informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles
que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables
sur site (FRU) associées à cet événement.
5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis
procédez comme suit :
a. Sélectionnez Actions ? Visualisation des commentaires.
b. Pour quitter les commentaires, cliquez sur Fermeture.
c. Sélectionnez Actions ? Affichage de l'historique de maintenance. La fenêtre Historique de
maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée.
d. Pour quitter l'historique de maintenance, cliquez sur Fermeture.
6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable
et Présentation de l'événement réparable pour les refermer.
Vérification de la présence d'un dispositif installé ou d'un composant
remplacé sur un système ou une partition logique à l'aide des outils
Virtual I/O Server
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans
Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou ce composant est reconnu par le système ou
la partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via VIOS
202 Unités de disques
v Vérification du composant de rechange via VIOS
Vérifiez le dispositif installé via VIOS :
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez diagmenu puis appuyez sur la touche Entrée.
3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide,
contactez votre fournisseur de services.
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérifiez le composant de rechange via VIOS:
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé,
procédez comme suit :
1. Avez-vous remplacé le composant à l'aide d'VIOS ou des opérations de remplacement à chaud du
service de diagnostic en ligne ?
Non : Passez à l'étape 2.
Oui : Passez à l'étape 5, à la page 204.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4, à la page 204.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, voir Effectuer une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche
ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion VIOS ?
v No : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou
qu'un câble ne soit pas bien fixé. Passez en revue les procédures de remplacement pour vérifier
que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident,
rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre
pas ou que l'invite de connexion ne s'affiche pas, voir la rubrique relative aux Incidents de
chargement et de démarrage du système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
Procédures communes pour les fonctions installables 203
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5.
Si le menu de sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource,
procédez comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur
Oui et appuyez sur Entrée.
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services
pour obtenir de l'aide.
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en effectuant les opérations suivantes :
a. Sur la ligne de commande, tapez diagmenu puis appuyez sur la touche Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester
uniquement le composant remplacé ou les périphériques qui y raccordées et appuyez sur Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7.
6. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0
et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9, à la page 205.
7. Dans le menu Action de réparation de ressource, sélectionnez la ressource du composant remplacé.
En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure
dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme
suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le
système a été remplacé.
Remarque : Sur les systèmes équipés d'un voyant associé au composant défaillant, le voyant passe à
l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources,
sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
204 Unités de disques
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
Oui : Passez à l'étape 8.
8. Dans le menu Action de réparation de ressource, sélectionnez au besoin le parent ou l'enfant de la
ressource pour le composant remplacé. En cas de test réussi sur une ressource en mode de
vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action
corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette
à jour et indique qu'un composant détectable par le système a été remplacé.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas
dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de maintenance ou du réseau (expliqué dans les
procédures précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11.
Oui : Passez à l'étape 12.
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12.
12. Les voyants sont-ils encore allumés ?
v Non. La procédure est terminée.
v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une des rubriques suivantes :
Changement des voyants de maintenance.
Procédures communes pour les fonctions installables 205
206 Unités de disques
Remarques
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents
d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute
référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou
service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint
aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier
lui-même le fonctionnement de tout produit, programme ou service.
Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans
le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou
demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences,
veuillez en faire la demande par écrit au fabricant.
Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait
contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE
GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE
RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN
CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions
n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas
applicable.
Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement.
Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels
décrits dans ce document.
Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information
uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments
figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites
relève de votre seule responsabilité.
Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation
de sa part, tout ou partie des informations qui lui seront fournies.
Les données de performance indiquées dans ce document ont été déterminées dans un environnement
contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement
d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont
pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les
résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont
applicables à leur environnement d'exploitation.
Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de
ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant
n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il
ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question
concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces
produits.
Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être
modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif.
207
Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans
préavis. Les tarifs appliqués peuvent varier selon les revendeurs.
Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées
avant la mise à disposition des produits décrits.
Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans
l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de
marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et
toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite.
Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur
n'apparaissent pas à l'écran.
Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites,
même partiellement, sans l'autorisation écrite d'IBM.
Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est
exploitable dans aucun autre but.
Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou
de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension,
de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des
composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été
sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu
de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la
vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le
cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de
support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés.
Marques
IBM, le logo IBM et ibm.com sont des marques d'International Business Machines Corp. aux Etats-Unis
et/ou dans certains autres pays. Les autres noms de produits et de services peuvent appartenir à IBM ou
des tiers. Une liste à jour des marques IBM est disponible sur Internet : Copyright and trademark
information, à l'adresse www.ibm.com/legal/copytrade.shtml.
Linux est une marque de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays.
Les autres noms de sociétés, de produits et de services peuvent appartenir à des tiers.
Bruits radioélectriques
Remarques sur la classe A
Les avis de conformité de classe A suivants s'appliquent aux serveurs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques
définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité
aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques
dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il
risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du
constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et
télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes.
208 Unités de disques
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. IBM® ne peut pas être tenue pour responsable du
brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés
ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra
annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une
modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones commerciales et industrielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour
en éliminer les causes.
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus :
Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI
(Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle,
il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures
nécessaires pour en éliminer les causes.
Remarques 209
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - République populaire de Chine
Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de
perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer
les causes.
Avis d'interférences électromagnétiques (EMI) - Taïwan
Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus.
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer.
210 Unités de disques
Avis d'interférences électromagnétiques (EMI) - Corée
Cet équipement a obtenu l'agrément EMC en vue de son utilisation commerciale. S'il a été vendu ou
acheté par erreur, veuillez l'échanger avec du matériel conforme à un usage domestique.
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Remarques 211
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Avis d'interférences électromagnétiques (EMI) - Russie
Remarques sur la classe B
Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la
compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la
classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications
de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones
résidentielles.
Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les
communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il
n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière.
Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision,
mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet
incident en appliquant une ou plusieurs des mesures suivantes :
v Réorienter ou repositionner l'antenne de réception.
v Eloigner l'appareil du récepteur.
v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct.
v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de
l'aide.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez
votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions
radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification
non autorisée pourra annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
212 Unités de disques
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant
d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones résidentielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - Corée
Remarques 213
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Dispositions
Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes.
Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial,
sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou
publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès
du fabricant.
Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein
de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez
reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise,
ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant.
214 Unités de disques
Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou
autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données,
logiciels ou droits de propriété intellectuelle contenus dans ces publications.
Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si
l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les
instructions susmentionnées n'ont pas été respectées.
Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois
et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à
l'exportation.
LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES
PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE.
LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES
INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A
L'EXECUTION D'UN TRAVAIL DONNE.
Activation SAS (Serial-Attached SCSI)
RAID pour les modèles E2-700, E2-700T,
E2-705, E2-705T, E4-700, E4-700T ou
E4-705
REFERENCE
86 F1 90FF 03
ESCALA Power7
Activation SAS (Serial-Attached SCSI) RAID pour les
modèles E2-700, E2-700T, E2-705, E2-705T, E4-700,
E4-700T ou E4-705
Les publications pour ESCALA Power7 concernent les modèles suivants :
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
Les références au modèle Power 755 / 8236-E8C sont sans objet.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 F1 90FF 03
L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y
limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés.
Copyright Bull SAS 2011
Imprimé en France
Marques déposées
Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs.
Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des
brevets.
La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques
et/ou des brevets.
Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas
être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage
p ouvant résulter de son application.
Table des matières
Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Activation SAS RAID et bloc de batteries de cache pour le modèle 02E/4B,
02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Activation SA RAID - Nouveautés. . . . . . . . . . . . . . . . . . . . . . . . . 3
Installation du fond de panier de stockage - 6 baies SFF/DVD SATA (FC 5618) . . . . . 5
Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et
FC EJ02) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S
double (FC 5630 et FC EJ01) . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Retrait et remise en place des adaptateurs et batteries SAS RAID . . . . . . . . . . 23
Retrait et remplacement du fond de panier d'unités de disque . . . . . . . . . . . . . . . . . . 23
Retrait du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . 24
Remplacement du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . 29
Retrait et remplacement du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . 33
Retrait du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . . . . . 33
Remise en place du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . 44
Considérations sur la maintenance du sous-système SAS 02E/4B ou 05F/6B. . . . . . . . . . . . . . 60
Considérations de maintenance pour le bloc de batterie de cache . . . . . . . . . . . . . . . . 60
Retrait de la carte d'activation RAID 02E/4B ou 05F/6B . . . . . . . . . . . . . . . . . . . . 62
Remplacement de la carte d'activation RAID 02E/4B ou 05F/6B. . . . . . . . . . . . . . . . . . 65
Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . . . 67
Remplacement du contrôleur de stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . 70
Retrait et remise en place de la carte de batterie de cache . . . . . . . . . . . . . . . . . . . . 71
Retrait et installation du bloc de batterie de cache . . . . . . . . . . . . . . . . . . . . . . 77
Procédures communes aux dispositifs installables . . . . . . . . . . . . . . . . 81
Avant de commencer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Identification d'un composant défaillant. . . . . . . . . . . . . . . . . . . . . . . . . . 84
Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 84
Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 86
Localisation d'un composant défaillant sur un système AIX ou une partition logique. . . . . . . . . 86
Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 86
Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 86
Identification d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . . 87
Localisation d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 87
Localisation d'un composant défaillant sur un système ou une partition logique Linux . . . . . . . . 87
Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 88
Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 88
Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server . . . . . 88
Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . . 89
Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 89
Démarrage d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . 89
Démarrage d'un système ou d'une partition logique via la console HMC. . . . . . . . . . . . . . 91
Démarrage d'un système ou d'un serveur virtuel via la console SDMC . . . . . . . . . . . . . . 91
iii
Arrêt d'un système ou d'une partition logique. . . . . . . . . . . . . . . . . . . . . . . . 91
Arrêt d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . . . 92
Arrêt d'un système via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . . 93
Arrêt d'un système via la console SDMC . . . . . . . . . . . . . . . . . . . . . . . . 94
Retrait et remplacement des capots sur le système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . 94
Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . . 95
Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . . . 95
Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . 96
Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome. . . . . . . . 97
Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . . 98
Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . . . 99
Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . 101
Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . 102
Mise en position de maintenance ou de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . 104
Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire. . . . 104
Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . 106
Déconnexion des cordons d'alimentation du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . 107
Connexion des cordons d'alimentation au système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . 109
Installation d'un dispositif via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 109
Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . 110
Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . 110
Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 111
Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 114
Vérification d'un composant installé via les programmes de diagnostic autonome . . . . . . . . . . 114
Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console). . . . . 116
Activation/Désactivation des voyants via la console HMC . . . . . . . . . . . . . . . . . 117
Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 117
Activation ou désactivation d'un voyant d'identification . . . . . . . . . . . . . . . . . 117
Affichage des événements réparables à l'aide de la console HMC . . . . . . . . . . . . . . . 118
Vérification du composant installé via la console SDMC . . . . . . . . . . . . . . . . . . . 119
Activation/Désactivation de voyants via la console de gestion Systems Director . . . . . . . . . . 119
Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 119
Activation ou désactivation de voyants d'identification via la console SDMC . . . . . . . . . . 120
Visualisation d'événements réparables via la console de gestion Systems Director . . . . . . . . . 120
Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à
l'aide des outils Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Installation d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . . 124
Remplacement d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . 124
Retrait d'un composant via la console SDMC. . . . . . . . . . . . . . . . . . . . . . . . 125
Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
iv Activation SAS (Serial-attached SCSI) RAID
Avis aux lecteurs canadiens
Le présent document a été traduit en France. Voici les principales différences et particularités dont vous
devez tenir compte.
Illustrations
Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la
France.
Terminologie
La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au
besoin.
IBM France IBM Canada
ingénieur commercial représentant
agence commerciale succursale
ingénieur technico-commercial informaticien
inspecteur technicien du matériel
Claviers
Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier
français-canadien, de type QWERTY.
OS/2 - Paramètres canadiens
Au Canada, on utilise :
v les pages de codes 850 (multilingue) et 863 (français-canadien),
v le code pays 002,
v le code clavier CF.
Nomenclature
Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit
du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau
pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre
clavier.
v
Recommandations à l'utilisateur
Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications
radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur
(instructions d'utilisation, manuels de référence et manuels d'entretien).
Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors
tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou
plusieurs des mesures suivantes :
v Réorienter l'antenne réceptrice ;
v Déplacer l'équipement par rapport au récepteur ;
v Éloigner l'équipement du récepteur ;
v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent
sur des circuits distincts ;
v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien
serrés ;
v Vérifier la mise en place des obturateurs sur les connecteurs libres.
Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des
câbles blindés mis à la terre, à travers des filtres si nécessaire.
En cas de besoin, adressez-vous à votre détaillant.
Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des
modifications non autorisées ont été effectuées sur l'équipement.
vi Activation SAS (Serial-attached SCSI) RAID
L'obligation de corriger de telles interférences incombe à l'utilisateur.
Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus
amples renseignements.
Brevets
Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur
certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie
pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos
demandes de renseignements relatives aux permis d'utilisation au directeur général des relations
commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7.
Assistance téléphonique
Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des
publications IBM, contactez IBM direct au 1 800 465-1234.
Avis aux lecteurs canadiens vii
viii Activation SAS (Serial-attached SCSI) RAID
Consignes de sécurité
Les consignes de sécurité peuvent être imprimées tout au long de ce guide.
v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles.
v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de
certaines circonstances réunies.
v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une
unité, un système ou des données.
Consignes de sécurité relatives au commerce international
Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du
produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est
inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité
dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication
en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous
familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce
livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez
claires pour vous.
Informations sur les appareils à laser
Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des
lasers ou des diodes électroluminescentes (LED).
Conformité aux normes relatives aux appareils à laser
Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique.
ix
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les cordons d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les cordons d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
x Activation SAS (Serial-attached SCSI) RAID
DANGER
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Consignes de sécurité xi
ATTENTION :
Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du
déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment,
procédez comme suit :
v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si
possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette
configuration, procédez comme suit :
– Retirez toutes les unités de la position 32U et plus.
– Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire.
– Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en
dessous du niveau 32U.
v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier.
v Vérifiez l'itinéraire envisagé pour éliminer tout risque.
v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus
d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre
armoire.
v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm.
v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles
sont bien fixés.
v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute.
v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement.
v N'utilisez pas de rampe inclinée à plus de dix degrés.
v Dès que l'armoire est à son nouvel emplacement, procédez comme suit :
– Abaissez les quatre vérins de mise à niveau.
– Installez des équerres de stabilisation sur l'armoire.
– Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie
inférieure à la partie supérieure de l'armoire.
v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez
l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les
vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez
l'armoire à celle-ci.
(R002)
(L001)
(L002)
xii Activation SAS (Serial-attached SCSI) RAID
(L003)
ou
Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le
sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont
certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes
sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité.
ATTENTION :
Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM,
DVD-RAM ou module à laser. Notez les informations suivantes :
v Ne retirez pas les capots. En ouvrant le produit à laser, vous vous exposez au rayonnement
dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur.
v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation
des commandes, ainsi que les procédures décrites dans le présent manuel.
(C026)
Consignes de sécurité xiii
ATTENTION :
Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux
de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à
l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027)
ATTENTION :
Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028)
ATTENTION :
Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez
connaissance des informations suivantes. Rayonnement laser lorsque le capot est ouvert. Evitez toute
exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide
d'instruments optiques. (C030)
ATTENTION :
Cette pile contient du lithium. Pour éviter tout risque d'explosion, n'essayez pas de la recharger et ne
la faites pas brûler.
Ne pas :
v ___ la jeter à l'eau
v ___ l'exposer à une température supérieure à 100 °C
v ___ chercher à la réparer ou à la démonter
Ne la remplacez que par une pile agréée. Pour le recyclage ou la mise au rebut, reportez-vous à la
réglementation en vigueur. (C003a)
Informations sur l'alimentation électrique et sur le câblage relatives au document
GR-1089-CORE du NEBS (Network Equipment-Building System)
Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document
GR-1089-CORE du NEBS (Network Equipment-Building System) :
Cet équipement peut être installé :
v dans des infrastructures de télécommunications réseau
v aux endroits préconisés dans les directives NEC (National Electrical Code).
Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles
internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux
interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être
exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document
GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de
dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie
métallique au câblage du réseau extérieur.
Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités.
Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif
externe de protection contre les surtensions (SPD).
Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne
de retour de la batterie en courant continu ne doit pas être connectée à la masse.
xiv Activation SAS (Serial-attached SCSI) RAID
Activation SAS RAID et bloc de batteries de cache pour le
modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C
Vous pouvez être amené à configurer des disques RAID (Redundant Array of Independent Disks) sur les
unités de disque SAS (serial-attached SCSI) du boîtier système. Suivez les instructions de cette rubrique
pour installer sur le système les dispositifs d'activation RAID.
Les instructions suivantes permettent d'installer des cartes d'activation SAS RAID afin de préparer le
système à l'utilisation d'unités de disque SAS dans une configuration RAID.
1
2 Activation SAS (Serial-attached SCSI) RAID
Activation SA RAID - Nouveautés
Prenez connaissance des nouveautés et des modifications significatives apportées à "Activation SAS
RAID" depuis la dernière mise à jour de cet ensemble de rubriques.
Octobre 2011
v Ajout d'informations relatives aux serveurs 02E/4C et 05E/6C.
3
4 Activation SAS (Serial-attached SCSI) RAID
Installation du fond de panier de stockage - 6 baies SFF/DVD
SATA (FC 5618)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 6 baies SFF/DVD SATA. Cette procédure s'applique à l'installation de nouveaux dispositifs.
Avant de commencer
Avertissement : Si vous effectuez la maintenance d'une pièce défaillante, consultez la procédure de
maintenance suivante :
v Pour le contrôleur de stockage RAID, voir «Retrait du contrôleur de stockage RAID/cache 02E/4B,
02E/4C, 05F/6B ou 05E/6C», à la page 67 et «Remplacement du contrôleur de stockage RAID et cache
02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 70.
Pourquoi et quand exécuter cette tâche
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Si votre système est géré par la console HMC, utilisez la console HMC pour effectuer la procédure
d'installation d'un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via
la console HMC», à la page 109.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir
«Remplacement d'un composant via la console SDMC», à la page 124.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
3. Retirez le capot avant.
v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C,
05F/6B ou 05E/6C monté en armoire», à la page 95.
5
v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 95.
4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans
«Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
5. Retirez le capot d'accès.
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
7. Attachez le bracelet antistatique.
6 Activation SAS (Serial-attached SCSI) RAID
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond
de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
9. Installez les unités de disque.
v Pour l'unité de disque interne, voir Remplacement de l'unité de disque interne dans un système
02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD.
10. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage
extra-plate sur le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
11. Réinstallez le capot d'accès.
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
13. Remettez en place le capot avant.
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
14. Rebranchez la source d'alimentation.
15. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
16. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Installation du fond de panier de stockage - 6 baies SFF/DVD SATA (FC 5618) 7
8 Activation SAS (Serial-attached SCSI) RAID
Installation des baies d'unité partagée 3x3 sans prise en
charge RAID 5/6 (FC 5631 et FC EJ02)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de baies d'unité partagée
3x3 sans prise en charge RAID 5/6.
Avant de commencer
Remarque : FC 5631 s'applique à 02E/4B ou 05F/6B et FC EJ02 s'applique à 02E/4C ou 05E/6C.
Avertissement : Cette procédure s'applique à l'installation d'une capacité de baies d'unité partagée.
Pourquoi et quand exécuter cette tâche
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Si votre système est géré par la console HMC, utilisez la console HMC pour effectuer la procédure
d'installation d'un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via
la console HMC», à la page 109
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir
«Remplacement d'un composant via la console SDMC», à la page 124.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
3. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans
«Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
4. Retirez le capot d'accès.
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
9
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 84.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Identifiez l'emplacement du contrôleur de stockage RAID-10.
10 Activation SAS (Serial-attached SCSI) RAID
9. Alignez le contrôleur de stockage RAID-10 (A) sur l'emplacement dans le système.
Figure 1. Emplacements de contrôleur de stockage RAID-10 (P1 à C19) pour 02E/4B ou 05F/6B
Figure 2. Emplacements de contrôleur de stockage RAID-10 (P1 à C19) pour 02E/4C ou 05E/6C
Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) 11
10. Faites glisser le contrôleur de stockage RAID-10 dans le système jusqu'au verrouillage du loquet (B),
comme indiqué dans l'illustration.
11. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Retrait du fond de panier
des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
12. Installez les unités de disque.
v Pour l'unité de disque interne, voir Remplacement de l'unité de disque interne dans un système
02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD.
13. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage
extra-plate sur le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
14. Installez le port SAS externe. Pour plus d'informations, voir Installation du port SAS externe sur un
système 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
15. Réinstallez le capot d'accès.
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
16. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
17. Remettez en place le capot avant.
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
Figure 3. Modèle monté en armoire : remplacement du contrôleur de stockage RAID-10
12 Activation SAS (Serial-attached SCSI) RAID
18. Rebranchez la source d'alimentation.
19. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
20. Vérifiez le composant installé ou réparé.
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) 13
14 Activation SAS (Serial-attached SCSI) RAID
Installation du fond de panier de stockage - 8 baies SFF/RAID
175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double. Cette procédure s'applique à l'installation
de nouveaux dispositifs.
Avant de commencer
Remarque : FC 5630 s'applique à 02E/4B ou 05F/6B et FC EJ01 s'applique à 02E/4C ou 05E/6C.
Avertissement : Si vous effectuez la maintenance d'un composant défaillant, voir les procédures de
maintenance suivantes :
v Pour la carte d'activation, voir Retrait de la carte d'activation et Remise en place de la carte
d'activation.
v Pour les contrôleurs de stockage RAID et Cache, voir Retrait du contrôleur de stockage et Remise en
place du contrôleur de stockage.
Pourquoi et quand exécuter cette tâche
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez
la console HMC pour effectuer la procédure d'installation d'un dispositif sur le serveur. Pour plus
d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 109.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir
«Remplacement d'un composant via la console SDMC», à la page 124.
Si vous ne disposez pas d'une console ou SDMC, procédez comme suit.
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81.
2. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 84.
3. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
15
4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans
«Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
5. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
6. Retirez le capot d'accès.
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
16 Activation SAS (Serial-attached SCSI) RAID
8. Pour le système 02E/4B ou 05F/6B, localisez la carte d'activation RAID (P1–C13), le contrôleur de
stockage RAID/cache (P1-C19) et la carte de batterie de cache (P1-C14) et passez à l'étape 10, à la
page 18.
9. Pour le système 02E/4C ou 05E/6C, localisez le contrôleur de stockage RAID/cache (P1-C19) et la
carte de batterie de cache (P1-C14) et passez à l'étape 11, à la page 18.
Figure 4. Emplacement de la carte d'activation RAID, des contrôleurs de stockage RAID et cache et de la carte de
batterie de cache
Figure 5. Emplacement du contrôleur de stockage RAID et cache et de la carte de batterie de cache
Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 17
10. Installez la carte d'activation RAID pour 02E/4B ou 05F/6B en procédant comme suit :
Remarque : Le système 02E/4C ou 05E/6C ne dispose pas de la carte d'activation.
a. Alignez la carte d'activation RAID (A) sur l'emplacement (B) dans le système.
b. Faites glisser la carte d'activation RAID dans le système jusqu'au verrouillage des loquets, comme
indiqué dans l'illustration.
11. Installez les contrôleurs de stockage RAID et cache en procédant comme suit :
a. Alignez les contrôleurs de stockage RAID et cache (A) sur l'emplacement dans le système.
b. Faites glisser les contrôleurs de stockage RAID et cache dans le système jusqu'au verrouillage du
loquet (B), comme indiqué dans l'illustration.
Figure 6. Modèle monté en armoire : remplacement de la carte d'activation RAID
18 Activation SAS (Serial-attached SCSI) RAID
12. Installez la carte de batterie de cache en procédant comme suit :
a. Faites glisser la carte de batterie de cache (A) dans les glissières de montage du contrôleur
jusqu'à ce qu'elle s'insère dans le connecteur de batterie.
b. Lorsque la batterie est placée dans le connecteur, déplacez le levier vers la position de
verrouillage afin d'insérer complètement la batterie dans le connecteur.
Figure 7. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID et cache
Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 19
13. Retirez les unités de disque.
v Pour les unités de disque internes, voir Retrait de l'unité de disque interne dans un système
02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour les unités SSD, voir Retrait d'une unité de disque ou d'une unité SSD.
14. Retirez le fond de panier des unités de disque [CCIN 2BD5]. Pour des instructions, voir Retrait du
fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
15. Installez le fond de panier de l'unité de disque à haute fonctionnalité [CCIN 2BD6]. Pour des
instructions, voir Remplacement du fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou
05E/6C.
16. Remplacez les unités de disque dans le fond de panier d'unité de disque à haute fonctionnalité.
v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un
système 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour les unités SSD, voir Installation d'une unité de disque ou d'une unité SSD.
17. Remettez en place le capot d'accès de maintenance. Voir
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
18. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
19. Remettez en place le capot avant.
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
Figure 8. Modèle monté en armoire : remplacement de la carte de batterie de cache RAID
20 Activation SAS (Serial-attached SCSI) RAID
20. Rebranchez la source d'alimentation.
21. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
22. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 21
22 Activation SAS (Serial-attached SCSI) RAID
Retrait et remise en place des adaptateurs et batteries SAS
RAID
Il se peut que vous deviez retirer ou remettre en place les adaptateurs SAS RAID et les batteries. Cette
section comprend les procédures relatives à ces tâches.
Remarque : Le remplacement du dispositif incombe à l'utilisateur. Vous pouvez soit exécuter la tâche
vous-même, soit faire appel à un prestataire agréé. Il vous imputera probablement les frais
correspondants.
Retrait et remplacement du fond de panier d'unités de disque
Utilisez ces procédures pour retirer, remplacer ou installer le fond de panier d'unités de disques sur les
serveurs 7/20 (02E/4B et 02E/4C) et 7/40 (05F/6B et 05E/6C).
Les figures suivantes représentent les fonds de panier d'unité de disque d'un système monté en armoire
et d'un système autonome.
Figure 9. Fond de panier d'unités de disques d'un système monté en armoire
23
Retrait du fond de panier d'unités de disques sur les modèles 02E/4B,
02E/4C, 05F/6B ou 05E/6C
Cette procédure permet de retirer un fond de panier d'unités de disque du serveur.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour retirer le fond de panier d'unité de disque du serveur. Pour obtenir des informations sur l'utilisation
de la console HMC en vue de retirer le fond de panier d'unité de disque, voir Retrait d'un composant via
la console HMC.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour retirer le fond de panier d'unité de disque du serveur. Pour des instructions, voir Retrait
d'un composant via la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de
panier d'unité de disque :
Avertissement : Veillez à retirer le panneau de commande avant le fond de panier d'unité de disque
pour ne pas endommager les câbles du panneau de commande.
Procédure
1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de
commande.
Figure 10. Fond de panier d'unités de disques d'un système autonome
24 Activation SAS (Serial-attached SCSI) RAID
2. Utilisez les voyants des indicateurs de service pour vous aider à identifier le composant, comme
indiqué dans «Identification d'un composant défaillant», à la page 84.
3. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 81).
4. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
5. Retirez le capot avant :
v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C,
05F/6B ou 05E/6C monté en armoire», à la page 95.
v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 95.
6. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué à la
section «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
7. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans «Déconnexion
des cordons d'alimentation du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 107.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
8. Retirez le capot d'accès :
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
9. Attachez le bracelet antistatique.
Retrait et remise en place des adaptateurs et batteries SAS RAID 25
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
10. Si vous devez remplacer le fond de panier en vue d'une mise à niveau ou d'un incident système,
procédez comme suit afin de retirer les unités de disques, en fonction de votre fond de panier.
Remarque : Si vous retirez un fond de panier comportant des unités de disques, le fond de panier
est lourd. Assurez-vous de pouvoir effectuer cette procédure en toute sécurité.
a. Pour retirer l'unité de disque du fond de panier, procédez comme suit:
Remarque : Notez l'emplacement de chaque unité de disque retirée, de manière à remettre en
place les unités de disque à leur emplacement d'origine.
1) Déverrouillez la poignée de l'unité de disques (A) et tirez-la vers vous pour la dégager.
2) Placez une main sous l'unité de disques et tirez avec précaution l'unité de disques du
système, comme indiqué dans la figure 11.
11. Retirez l'unité de stockage en appuyant sur le loquet (A) et en faisant glisser l'unité en dehors du
système, comme indiqué dans la figure suivante.
Figure 11. Retrait de l'unité de disque du fond de panier
26 Activation SAS (Serial-attached SCSI) RAID
Pour plus d'informations sur le retrait des différents types d'unités de stockage, voir Retrait et
remplacement d'unités de stockage sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
12. Débranchez les câbles situés à l'arrière du fond de panier d'unités de disque. Si vous avez connecté
un câble SAS (serial attached SCSI), pensez à débrancher ce câble du fond de panier d'unités de
disques. Pour plus d'informations, voir Installation du port SAS externe sur un modèle 02E/4B,
02E/4C, 05F/6B ou 05E/6C.
13. Déverrouillez le fond de panier de l'unité de disques en tirant les poignées (A) afin de les mettre en
position ouverte, au centre du fond de panier, comme indiqué dans la figure 13, à la page 28 et
figure 14, à la page 28.
Figure 12. Retrait de l'unité de stockage du fond de panier d'unité de disque
Retrait et remise en place des adaptateurs et batteries SAS RAID 27
14. Tirez le fond de panier en dehors du serveur.
Figure 13. Retrait du fond de panier comportant des unités de disque
Figure 14. Retrait du fond de panier ne comportant pas d'unités de disque
28 Activation SAS (Serial-attached SCSI) RAID
Que faire ensuite
Si vous avez retiré le fond de panier dans le cadre d'une autre procédure, reprenez cette procédure. Pour
remplacer le fond de panier ou en installer un nouveau, voir Remplacement du fond de panier d'unités
de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
Remplacement du fond de panier d'unités de disques sur les modèles
02E/4B, 02E/4C, 05F/6B ou 05E/6C
Utilisez cette procédure pour remplacer un fond de panier d'unités de disque.
Pourquoi et quand exécuter cette tâche
Si vous remplacez un fond de panier d'unités de disques dans le cadre d'une procédure de maintenance
et souhaitez le retirer, voir sur les modèles sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour
plus d'informations, voir la rubrique Remplacement d'un composant via la console HMC.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur.
Pour obtenir des instructions, voir la rubrique Remplacement d'un composant via la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de
panier d'unité de disque :
Procédure
1. Vérifiez la présence éventuelle d'un capot de protection sur les broches pour la partie inférieure
médiane à l'arrière du fond de panier. Si un capot de protection est installé, retirez-le.
2. Vérifiez les taquets de verrouillage (A) visibles dans les figures ci-dessous. Si les taquets sont en
position verrouillée, ouvrez-les.
3. Soulevez délicatement le fond de panier sur les deux côtés et alignez-le avec son emplacement dans
le serveur.
4. Insérez fermement le fond de panier dans le serveur.
5. Fixez le fond de panier à l'aide des taquets de verrouillage (A), comme indiqué dans les figures
suivantes.
Retrait et remise en place des adaptateurs et batteries SAS RAID 29
6. Réinstallez les unités de disques si vous les avez préalablement retirées. Utilisez les informations que
vous avez enregistrées lors du retrait des unités pour les installer dans les emplacements appropriés.
Pour remplacer les unités de disque, procédez comme suit :
Figure 15. Installation d'un fond de panier sans unités de disque dans un modèle monté en armoire
Figure 16. Installation d'un fond de panier doté d'unités de disque dans un modèle monté en armoire
30 Activation SAS (Serial-attached SCSI) RAID
a. Tenez l'unité de disque par les bords hauts et bas pour insérer le lecteur dans l'unité d'extension.
Ne la tenez pas par la poignée.
b. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
verrouillez l'unité dans son emplacement en poussant la poignée, comme indiqué dans la figure
suivante.
7. Réinstallez l'unité de stockage extra-plate en l'insérant dans l'emplacement et en la poussant jusqu'à
ce que le levier (B) soit verrouillé, comme l'illustre la figure suivante.
Pour plus d'informations sur le remplacement des différents types d'unités de stockage, voir Retrait
et remplacement d'unités de stockage sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
Figure 17. Installation des unités de disque d'un fond de panier
Retrait et remise en place des adaptateurs et batteries SAS RAID 31
8. Le cas échéant, connectez le câble au fond de panier. Pour plus d'informations, voir Installation du
port SAS externe sur un modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
9. Le cas échéant, installez la carte d'activation RAID. Pour plus d'informations, voir Activation SAS
RAID et bloc de batteries de cache pour les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
10. Remettez le capot d'accès en place :
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
11. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
12. Connectez les cordons d'alimentation au système, comme décrit dans Connexion des cordons
d'alimentation.
13. Remettez le capot avant en place :
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
14. Démarrez le système ou la partition logique. Voir «Démarrage du système ou de la partition
logique», à la page 89.
15. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Figure 18. Installation de l'unité de stockage
32 Activation SAS (Serial-attached SCSI) RAID
Retrait et remplacement du fond de panier du système 02E/4B, 02E/4C,
05F/6B ou 05E/6C
Découvrez comment retirer ou remplacer le fond de panier système.
Pourquoi et quand exécuter cette tâche
Les procédures suivantes expliquent comment retirer et remplacer le fond de panier système.
Retrait du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou
05E/6C
Prenez connaissance des instructions de retrait du fond de panier système d'un serveur en vue du
remplacement d'un composant défaillant ou dans le cadre d'une autre procédure de maintenance.
Avant de commencer
Remarques :
v Avant de commencer cette procédure, vérifiez que vous disposez des droits de connexion en tant que
fournisseur de services agréé. Les différents niveaux d'autorisation sont décrits dans Niveaux
d'autorisation ASMI.
v Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour de plus amples informations, voir Voyant du panneau de
commande.
Pourquoi et quand exécuter cette tâche
Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer le
fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un composant à
l'aide de la console HMC.
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour
retirer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à
l'aide de la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de
panier système :
Procédure
1. Accédez à l'interface ASMI.
L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service
requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la
lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du
produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface
ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à
l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI,
passez à l'étape suivante.
2. Vérifiez et enregistrez le type de boîtier et le code dispositif.
Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre
placée sur l'unité centrale.
Remarque : Le code dispositif de fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C
est 78AA-001.
3. Vérifiez et enregistrez les paramètres de l'unité carte Ethernet hôte (HEA) serveur.
Retrait et remise en place des adaptateurs et batteries SAS RAID 33
v Pour la console HMC :
a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b. Cliquez deux fois sur le dossier Serveurs.
c. Sélectionnez un serveur dans le panneau central.
d. Dans le menu Tâches, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La
fenêtre Cartes Ethernet hôte apparaît.
e. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du
port physique de la carte HEA apparaît.
f. Enregistrez la valeur dans la zone Pending Port Group Multi-Core Scaling value.
v Pour la console SDMC :
a. Allez dans l'onglet Resources et cliquez sur Hosts.
b. Sélectionnez un serveur dans le panneau de droite.
c. Dans le menu Actions, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La
fenêtre Cartes Ethernet hôte apparaît.
d. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du
port physique de la carte HEA apparaît.
e. Enregistrez la valeur dans la zone Pending Port Group Multi-Core Scaling value.
4. Vérifiez et notez les paramètres du serveur.
Avant de remplacer cette unité remplaçable sur site (FRU), vérifiez et enregistrez tous les paramètres
du serveur étant donné que vous devrez peut-être mettre à jour le système après avoir remplacé
l'unité FRU. Cela peut être effectué via l'interface ASMI (Advanced System Management Interface).
a. Recherchez et notez le niveau de microprogramme du serveur indiqué sur le bord droit de la
fenêtre d'état, la zone dans laquelle se trouve le bouton Déconnexion. Vous devrez peut-être
mettre à jour le système avec le niveau de code de microprogramme de serveur le plus récent
après avoir remplacé l'unité FRU.
b. Vérifiez et notez les paramètres du processeur de service que vous avez précédemment définis à
l'aide de l'interface ASMI. Vous devrez réinitialiser les paramètres du processeur de service après
la mise à jour de cette unité FRU, faute de quoi les paramètres par défaut seront utilisés. Vous
devez noter les paramètres suivants :
v Paramètre de nom de système
v Paramètres de contrôle de l'alimentation système
v Paramètres de l'aide à la maintenance système
v Paramètres de configuration système
v Paramètres des services réseau
v Paramètres des options de performances
v Paramètres de profil de connexion
v Paramètres de liste des unités d'amorçage
v Paramètres de carte Ethernet hôte
c. Notez tous les paramètres du processeur de service que vous avez éventuellement définis à l'aide
des commandes du système d'exploitation.
5. Arrêtez le système ou la partition logique. Pour obtenir des instructions, voir Arrêt du système ou de
la partition logique.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé en option d'un second bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
34 Activation SAS (Serial-attached SCSI) RAID
ou
7. Etiquetez et débranchez les câbles externes connectés à l'unité remplaçable sur site (FRU) dont vous
effectuez la maintenance.
8. Placez le système en position de maintenance. Pour des instructions, voir Mise en position de
maintenance d'un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire.
9. Retirez le capot d'accès de maintenance :
v Pour un système monté en armoire, voir Retrait du capot d'accès sur un système monté en
armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour un système autonome, voir Retrait du capot d'accès sur un système autonome 02E/4B,
02E/4C ou 05F/6B.
10. Retirez la carte d'extension de mémoire. Pour des instructions, voir Retrait de la carte d'extension de
mémoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
11. Retirez les cartes GX. Pour des instructions, voir Retrait des adaptateurs GX pour 02E/4B, 02E/4C,
05F/6B, ou 05E/6C.
12. Retirez les cartes PCI. Pour des instructions, voir Retrait hors tension d'une carte PCI sur le serveur
02E/4B, 02E/4C, 05F/6B ou 05E/6C..
13. Retirez la carte Ethernet hôte (02E/4B ou 05F/6B). Pour obtenir des instructions, voir Retrait hors
tension de l'adaptateur Ethernet hôte 02E/4B ou 05F/6B.
14. Retirez tous les régulateurs de tension. Pour des instructions, voir Retrait du régulateur de tension
02E/4B, 02E/4C, 05F/6B, ou 05E/6C.
15. Retirez les ventilateurs intermédiaires. Pour des instructions, voir Retrait des ventilateurs
intermédiaires 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
16. Retirez les blocs d'alimentation. Pour des instructions, voir Retrait hors tension des blocs
d'alimentation 02E/4B, 02E/4C, 05F/6B, ou 05E/6C ou Retrait sous tension des blocs d'alimentation
02E/4B, 02E/4C, 05F/6B ou 05E/6C.
17. Débranchez les câbles internes du fond de panier.
Retrait et remise en place des adaptateurs et batteries SAS RAID 35
Retirez le câble SPCN à l'arrière du châssis (02E/4C ou 05E/6C).
18. Retirez le boîtier de l'unité de disque :
a. Saisissez les loquets de l'unité de disque et tirez dessus.
b. Saisissez les poignées, puis tirez le boîtier de l'unité de disque vers vous.
Figure 19. Débranchement des câbles internes
36 Activation SAS (Serial-attached SCSI) RAID
19. Retirez le répartiteur d'air de la carte PCI. Saisissez fermement le répartiteur d'air de la carte PCI et
soulevez-le.
Figure 20. Retrait du boîtier de l'unité de disque
Figure 21. Retrait du répartiteur d'air de la carte PCI
Retrait et remise en place des adaptateurs et batteries SAS RAID 37
20. Retirez la carte PCI (en option) :
a. Desserrez les deux vis moletées (A) en les tournant dans le sens inverse des aiguilles d'une
montre.
b. Déverrouillez les taquets de la carte PCI (B) en appuyant sur leurs extrémités et en les soulevant.
c. Retirez la carte PCI en la soulevant.
21. Retirez le bouclier du boîtier de l'unité de disque, le régulateur de tension et le répartiteur d'air.
a. Retirez les cinq vis qui maintiennent le bouclier du boîtier de l'unité de disque, le répartiteur
d'air et le régulateur de tension.
b. Retirez le bouclier du boîtier de l'unité de disque, le régulateur de tension et le répartiteur d'air
en les soulevant.
Figure 22. Retrait de la carte PCI
38 Activation SAS (Serial-attached SCSI) RAID
22. Retirez le logement de barrette DIMM :
a. Retirez les quatre vis qui maintiennent le boîtier de barrette DIMM (vue A).
Figure 23. Retrait du boîtier de l'unité de disque (02E/4B ou 05F/6B)
Figure 24. Retrait du boîtier de l'unité de disque (02E/4C ou 05E/6C)
Retrait et remise en place des adaptateurs et batteries SAS RAID 39
b. Retirez le boîtier de barrette DIMM en le soulevant.
23. Débranchez le câble du ventilateur avant.
Figure 25. Retrait du boîtier de barrette DIMM
40 Activation SAS (Serial-attached SCSI) RAID
24. Retirez le boîtier du ventilateur intermédiaire :
a. Retirez les vis (B) qui maintiennent le boîtier de ventilation.
b. Retirez les deux vis (A) qui maintiennent le boîtier de ventilation à l'extérieur du boîtier du
système.
c. Retirez le boîtier de ventilation en le soulevant.
Figure 26. Retrait du câble du ventilateur avant
Retrait et remise en place des adaptateurs et batteries SAS RAID 41
25. Retirez le commutateur de verrouillage d'alimentation. Pour des instructions, voir Retrait du
commutateur de verrouillage d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
26. Retrait du module du processeur système. Pour des instructions, voir Retrait du module processus
système 02E/4B, 02E/4C, 05F/6B, ou 05E/6C.
27. Retirez le support métallique du régulateur de tension.
a. Localisez les trois vis qui maintiennent le support sur le côté du système.
b. Retirez les trois vis.
28. Retirez le fond de panier système.
Remarque : Retirez le fond du panier système avec précaution. Ne le soulevez pas à l'aide des
modules fixés.
a. Etiquetez et débranchez tous les câbles Ethernet qui sont présents.
b. Retirez les quatre vis (A).
Remarque : Ne retirez aucune des vis Torx. Retirez uniquement les quatre vis à tête hexagonale
et à fente.
Figure 27. Retrait du boîtier du ventilateur intermédiaire
42 Activation SAS (Serial-attached SCSI) RAID
c. Saisissez le fond de panier système par les deux bras de chargement du dissipateur thermique
(B).
Figure 28. Retrait du fond de panier système (02E/4B ou 05F/6B)
Figure 29. Retrait du fond de panier système
Retrait et remise en place des adaptateurs et batteries SAS RAID 43
d. Soulevez l'avant du fond de panier système pour dégager les chevilles de positionnement (6,35
mm environ).
e. Faites coulisser le fond de panier vers l'avant du système de façon à ce que ses connecteurs
laissent apparaître le châssis arrière.
f. Une fois les connecteurs dégagés du châssis, soulevez l'arrière du fond de panier système et
retirez ce dernier en l'inclinant.
Remise en place du fond de panier du système 02E/4B, 02E/4C, 05F/6B
ou 05E/6C
Informations sur le remplacement du fond de panier système dans le cadre d'une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Avant d'effectuer cette procédure, vous devez avoir effectué le retrait. Pour connaître les instructions de la
procédure de retrait, voir Retrait du fond de panier système 02E/4B, 02E/4C, 05F/6B, ou 05E/6C.
Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour
remplacer le fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un
composant à l'aide de la console HMC.
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour
remplacer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à
l'aide de la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de
panier système :
Avertissement :
Attachez un bracelet antistatique à une surface métallique non peinte de votre matériel afin
d'empêcher d'éventuelles décharges électrostatiques d'endommager celui-ci.
Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers
électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente
en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques.
Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer le produit de son emballage antistatique et
d'installer ou de remettre en place le matériel.
Procédure
1. Remplacez le fond de panier système.
Remarque : Remplacez le fond de panier système avec précaution.
a. Saisissez avec précaution le fond de panier système de remplacement par les deux bras de
chargement du dissipateur thermique (A).
b. Abaissez le fond de panier système vers l'avant du boîtier du système.
c. Abaissez l'arrière du fond de panier système, puis faites-le glisser vers l'arrière du système
jusqu'à ce que les connecteurs laissent apparaître les ouvertures dans le boîtier (voir illustration).
d. Abaissez l'avant du fond de panier système sur les broches d'alignement (B).
44 Activation SAS (Serial-attached SCSI) RAID
e. Serrez les quatre vis (C) qui maintiennent le fond de panier système au boîtier du système.
f. Rebranchez tous les câbles Ethernet qui ont été précédemment retirés.
Figure 30. Installation du fond de panier système (02E/4B ou 05F/6B)
Figure 31. Installation du fond de panier système
Retrait et remise en place des adaptateurs et batteries SAS RAID 45
2. Installez le support métallique du régulateur de tension.
a. Alignez les trois trous du support sur les trois trous du côté du système.
b. Serrez les trois vis.
3. Remplacez le module processeur système. Pour des instructions, voir Remplacement du module
processus système 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
4. Remplacez le commutateur de verrouillage d'alimentation. Pour des instructions, voir Remplacement
du commutateur de verrouillage d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
5. Remplacez le boîtier de ventilation :
a. Abaissez le boîtier de ventilation pour le mettre en position, en vous assurant que la rainure
d'alignement (A) et la broche d'alignement (B) sont correctement alignés.
b. Serrez les deux vis (D) qui maintiennent le boîtier de ventilation au boîtier système.
c. Serrez les vis (C) qui maintiennent le boîtier de ventilation.
6. Connectez le câble du ventilateur avant.
Figure 32. Installation du boîtier de ventilation
46 Activation SAS (Serial-attached SCSI) RAID
7. Remplacez le logement de barrette DIMM :
a. Abaissez le boîtier de barrette DIMM pour le mettre en place.
b. Insérez et serrez les quatre vis qui maintiennent le boîtier de barrette DIMM (Vue A).
Figure 33. Connexion du câble du ventilateur avant
Retrait et remise en place des adaptateurs et batteries SAS RAID 47
8. Remplacez le guide plastique du régulateur de tension (A).
a. Abaissez le guide plastique pour le mettre en place.
b. Insérez et serrez les trois vis qui maintiennent le guide.
9. Remplacez le boîtier de l'unité de disque (B).
a. Abaissez le boîtier de l'unité de disque pour le mettre en place.
b. Insérez et serrez les deux vis qui maintiennent le boîtier de l'unité de disque.
Figure 34. Remplacement du boîtier de barrette DIMM
48 Activation SAS (Serial-attached SCSI) RAID
10. Remplacez la carte de connexion PCI :
a. Abaissez la carte de connexion PCI pour la mettre en place.
Figure 35. Remplacement du boîtier de l'unité de disque (02E/4B ou 05F/6B)
Figure 36. Remplacement du boîtier de l'unité de disque (02E/4C ou 05E/6C)
Retrait et remise en place des adaptateurs et batteries SAS RAID 49
b. Appuyez sur les loquets de la carte de connexion PCI (A) jusqu'à ce qu'ils s'enclenchent.
c. Serrez les deux vis moletées (D).
11. Remplacez le répartiteur d'air de la carte de connexion PCI.
a. Abaissez le répartiteur d'air de la carte de connexion PCI et appuyez dessus pour le mettre en
place.
Figure 37. Remplacement de la carte de connexion PCI
50 Activation SAS (Serial-attached SCSI) RAID
12. Remplacez le boîtier de l'unité de disque :
a. Les loquets de l'unité de disque étant ouverts, poussez le boîtier de l'unité de disque à l'intérieur
du système.
b. Poussez les loquets de l'unité de disque vers le système jusqu'à ce qu'ils soient verrouillés.
Figure 38. Remplacement du répartiteur d'air de la carte de connexion PCI
Retrait et remise en place des adaptateurs et batteries SAS RAID 51
13. Connectez les câbles internes au fond de panier.
Replacez le câble SPCN à l'arrière du châssis (02E/4C ou 05E/6C).
Figure 39. Remplacement du boîtier de l'unité de disque
Figure 40. Connexion des câbles internes
52 Activation SAS (Serial-attached SCSI) RAID
14. Remplacez les blocs d'alimentation. Pour des instructions, voir Remplacement hors tension des blocs
d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C ou Remplacement sous tension des blocs
d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
15. Remplacez les ventilateurs. Pour des instructions, voir Remplacement des ventilateurs 02E/4B,
02E/4C, 05F/6B ou 05E/6C.
16. Remplacez la carte d'extension de mémoire. Pour des instructions, voir Remplacement de la carte
d'extension de mémoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
17. Remplacez tous les régulateurs de tension. Pour des instructions, voir Remplacement du régulateur
de tension 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
18. Remplacez la carte Ethernet hôte (02E/4B ou 05F/6B). Pour obtenir des instructions, voir
Remplacement hors tension de la carte Ethernet hôte02E/4B ou 05F/6B.
19. Remplacez les adaptateurs GX. Pour des instructions, voir Remplacement des adaptateurs GX pour
02E/4B, 02E/4C, 05F/6B ou 05E/6C.
20. Remplacez les cartes PCI. Pour des instructions, voir Remplacement hors tension d'une carte PCI sur
le serveur 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
21. Réinstallez le capot d'accès :
v Pour un système monté en armoire, voir Installation du capot d'accès sur un système monté en
armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour un système autonome, voir Installation du capot d'accès sur un système autonome 02E/4B,
02E/4C ou 05F/6B.
22. Remettez en place le capot avant :
v Pour un système monté en armoire, voir Installation du capot avant sur un système monté en
armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C.
v Pour un système autonome, voir Installation du capot avant sur un système autonome 02E/4B,
02E/4C ou 05F/6B.
23. Placez le système en position de fonctionnement. Pour des instructions, voir Mise en position de
fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire.
24. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique.
25. Restaurez la connectivité réseau.
Le processeur de service doit être connecté au réseau pour être accessible à partir des console de
gestion.
Comment l'adresse IP du processeur de service sera-t-elle gérée ?
Obtenir automatiquement une adresse IP à partir d'un
serveur DHCP (Dynamic Host Configuration Protocol)
Passez à l'étape suivante.
Configurer une adresse IP statique Passez à l'étape 27, à la page 54.
26. Restaurez les paramètres du processeur de service.
Activez l'accès réseau sur le processeur de service. Si la connexion réseau utilise le protocole DHCP
(Dynamic Host Configuration Protocol) pour établir une adresse IP, et utilise les Consoles de gestion
comme serveur DHCP, aucune configuration supplémentaire n'est nécessaire pour activer l'accès
réseau sur le processeur de service. Procédez comme suit :
a. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant
que le câble console de gestion est connecté au port console de gestion du fond de panier
système.
b. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : ne démarrez pas le système à ce stade.
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Retrait et remise en place des adaptateurs et batteries SAS RAID 53
Passez à l'étape 29. Passez à l'étape 32, à la page 55.
27. Restaurez les paramètres du processeur de service.
L'interface ASMI (Advanced System Management Interface) est l'interface d'accès au processeur de
service qui est requise pour effectuer des tâches de maintenance générales et de niveau
administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des
données essentielles du produit, la configuration du processeur de service, le contrôle de
l'alimentation système et la configuration des ports réseau du processeur de service. L'interface
ASMI est également désignée comme l'ensemble des menus du processeur de service. L'interface
ASMI est accessible via HTTPS. La connexion réseau utilisant des affectations d'adresses IP statiques,
procédez comme suit.
a. Pour identifier les ports réseau du processeur de service, vous pouvez utiliser le panneau de
commande en mode manuel. Reportez-vous aux fonctions de maintenance pour savoir comment
utiliser le panneau de commande dans ce cas.
b. Connectez un client disposant d'un navigateur Web directement aux ports réseau du processeur
de service que vous avez notés précédemment. Il peut s'agir d'une URL similaire aux exemples
suivants :
v https://169.254.2.147
v https://169.254.3.147
c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : ne démarrez pas le système à ce stade.
d. Connectez-vous à l'interface ASMI à l'aide de l'ID utilisateur admin et du mot de passe par
défaut admin.
e. A l'invite, modifiez le mot de passe de l'ID administrateur et le mot de passe de l'ID utilisateur
général.
f. Pour configurer l'accès réseau, procédez comme suit.
1) Cliquez sur Configuration de réseau sous le noeud Services réseau.
2) Dans le panneau de droite, configurez les interfaces réseau eth0 et eth1, sélectionnez Statique
comme Type d'adresse IP, puis définissez un nom d'hôte, une adresse IP et un masque de
sous-réseau.
28. Restaurez les paramètres du processeur système.
La connexion réseau étant maintenant configurée pour utiliser des affectations d'adresses IP
statiques, essayez d'accéder aux ports réseau du processeur de service.
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Passez à l'étape suivante. Passez à l'étape 33, à la page 55.
29. Lancez l'interface ASMI.
L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service
requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la
lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du
produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface
ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à
l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI,
passez à l'étape suivante.
30. Définissez le type de boîtier et le code dispositif.
Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre
placée sur l'unité centrale. Lorsque le fond de panier système est remplacé, le numéro de série à sept
chiffres du boîtier et le code dispositif doivent être entrés manuellement et stockés dans le nouveau
fond de panier système à l'aide de l'interface ASMI (Advanced System Management Interface).
54 Activation SAS (Serial-attached SCSI) RAID
Remarque : Le code dispositif de fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C
est 78AA-001.
Remarque : Pour effectuer cette opération, vous devez disposer du niveau d'autorisation Prestataire
de services agréé (ID : celogin).
Pour entrer le numéro de série du système :
a. Dans la fenêtre d'accueil de l'interface ASMI, indiquez votre ID utilisateur et votre mot de passe,
et cliquez sur Connexion.
b. Dans la zone de navigation, développez Configuration système et Données techniques
essentielles du programme.
c. Sélectionnez Boîtiers du système. Dans le panneau de droite, les boîtiers du système actuel
s'affichent.
d. Sur le système concerné, enregistrez le numéro de série du boîtier imprimé l'étiquette code barre
apposée sur l'unité centrale.
e. Entrez le numéro de série du boîtier enregistré dans la zone Numéro de série du boîtier de
l'unité centrale.
f. Entrez le numéro du code dispositif dans la zone Code dispositif/Numéro de séquence.
g. Cliquez sur Sauvegarder les paramètres pour mettre à jour et enregistrer le numéro de série du
boîtier et le code dispositif.
31. Passez à l'étape 35, à la page 56.
32. Restaurez les paramètres du processeur de service.
Si la connexion réseau utilise DHCP pour établir une adresse IP, mais n'utilise pas de en tant que
serveur DHCP, procédez comme suit :
a. Effectuez toutes les configurations de réseau requises pour permettre au serveur DHCP
d'identifier une adresse IP et de l'affecter au processeur de service.
b. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant
que le câble réseau est connecté au port réseau.
c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : Ne démarrez pas le système à ce stade.
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Accéder à 29, à la page 54 Passez à l'étape suivante.
33. Restaurez les paramètres du processeur de service.
Vous avez indiqué que vous n'êtes toujours pas en mesure d'accéder au réseau de service. Vous
devrez réinitialiser le processeur de service en appuyant sur le commutateur (E) du panneau de
commande.
Retrait et remise en place des adaptateurs et batteries SAS RAID 55
34. Passez à l'étape 25, à la page 53.
35. Restaurez les paramètres du processeur de service.
Modifiez le mot de passe de l'ID utilisateur d'administrateur lorsque vous y êtes invité en procédant
de la manière suivante.
a. Dans la zone de navigation de l'interface ASMI, développez Profil de connexion.
b. Sélectionnez Modifier le mot de passe sous Profil de connexion.
c. Indiquez les informations requises, puis cliquez sur Continuer.
Par mesure de sécurité, vous êtes invité à entrer votre mot de passe d'utilisateur en cours dans le
champ Mot de passe en cours pour l'ID utilisateur en cours.
Remarque : Les mots de passe peuvent être une combinaison quelconque de 64 caractères
alphanumériques au maximum. Le mot de passe par défaut pour l'ID utilisateur d'administrateur
est admin. Après votre première connexion à l'interface ASMI, le mot de passe admin doit être
modifié.
Figure 41. Commutateur du panneau de commande
56 Activation SAS (Serial-attached SCSI) RAID
Par mesure de sécurité, vous êtes invité à entrer le mot de passe de votre ID utilisateur en cours
dans le champ Mot de passe en cours de l'utilisateur en cours.
36. Restaurez les paramètres du processeur de service.
Définissez le nom du système. Le nom du système peut être modifié pour toute chaîne ASCII
autorisée. Il ne doit pas nécessairement correspondre au format numéro de série_modèle_type de
machine. Pour modifier le nom du système, procédez comme suit.
a. Dans la zone de navigation de l'interface, développez Configuration système.
b. Sélectionnez Nom du système sous Configuration système.
c. Entrez le nom souhaité.
d. Cliquez sur Sauvegarder les paramètres pour mettre à jour le nom du système avec la nouvelle
valeur.
Le nouveau nom du système s'affiche dans le cadre d'état, zone dans laquelle le bouton
Déconnexion se trouve. Si une autre méthode, telle que la console de gestion est utilisée pour
modifier le nom du système, le cadre d'état ne reflète pas la modification.
37. Restaurez les paramètres du processeur de service.
Pour définir l'heure, procédez comme suit.
a. Dans la zone de navigation de l'interface, développez Configuration système.
b. Sélectionnez Heure locale sous Configuration système.
c. Si le système est hors tension, le volet droit affiche un formulaire qui indique la date (mois, jour
et année) et l'heure (heures, minutes, secondes).
d. Modifiez la date ou l'heure (ou les deux) et cliquez sur Sauvegarder les paramètres.
38. Restaurez les paramètres du processeur de service.
Vérifiez que le niveau du microprogramme de serveur est identique à la valeur enregistrée avant le
remplacement du fond de panier système. Si la valeur est différente, elle doit être corrigée. Pour plus
d'informations sur la correction du microprogramme, voir Obtention des correctifs de
microprogramme.
39. Restaurez les paramètres de l'unité carte Ethernet hôte (HEA) serveur.
v Pour la console HMC :
a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b. Cliquez deux fois sur le dossier Serveurs.
c. Sélectionnez un serveur dans le panneau central.
d. Dans le menu Tâches, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La
fenêtre Cartes Ethernet hôte apparaît.
e. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du
port physique de la carte HEA apparaît.
f. Entrez la valeur dans la zone Pending Port Group Multi-Core Scaling value et cliquez sur
OK.
v Pour la console SDMC :
a. Allez dans l'onglet Resources et cliquez sur Hosts.
b. Sélectionnez un serveur dans le panneau de droite.
c. Dans le menu Actions, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La
fenêtre Cartes Ethernet hôte apparaît.
d. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du
port physique de la carte HEA apparaît.
e. Entrez la valeur dans la zone Pending Port Group Multi-Core Scaling value et cliquez sur
OK.
40. Restaurez les paramètres du processeur de service.
Retrait et remise en place des adaptateurs et batteries SAS RAID 57
Entrez à nouveau les paramètres suivants qui ont été précédemment modifiés via l'interface ASMI
(Advanced System Management Interface), sauf si vous souhaitez utiliser les paramètres par défaut.
v Paramètre de nom de système
v Paramètres de contrôle de l'alimentation système
v Paramètres de l'aide à la maintenance système
v Paramètres de configuration système
v Paramètres des services réseau
v Paramètres des options de performances
v Paramètres de profil de connexion
v Paramètres de liste des unités d'amorçage
v Paramètres de carte Ethernet hôte
41. Restaurez les paramètres du processeur de service.
a. Redéfinissez les paramètres du processeur de service que vous aviez éventuellement définis via
les commandes du système d'exploitation. Vous avez enregistré ces paramètres au début de cette
procédure.
b. Si vous choisissez de réinitialiser le mot de passe d'accès aux Consoles de gestion, procédez
comme suit :
Pour modifier le mot de passe d'accès à la console HMC, choisissez l'une des méthodes
suivantes ;
1) A partir de l'interface graphique de la console HMC (méthode recommandée) :
a) Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b) Cliquez deux fois sur le dossier Serveurs.
c) Sélectionnez un serveur dans le panneau central.
d) Sous le dossier Opérations, sélectionnez Modifier le mot de passe.
e) Indiquez les informations souhaitées et cliquez sur OK.
2) Sur la ligne de commande de la console HMC, saisissez :
/usr/hmcrbin/chsyspwd -m système_géré -t access --passwd --newpasswd
nouveau_mot_de_passe où :
v La valeur de système_géré correspond au nom du système géré du nouveau processeur de
service.
v Aucune valeur n'est indiquée pour passwd-- de façon à permettre l'authentification.
v La valeur de newpasswd est celle de nouveau_mot_de_passe.
Pour modifier le mot de passe d'accès à la console SDMC, procédez comme suit :
1) Allez dans l'onglet Resources et cliquez sur Hosts.
2) Sélectionnez un serveur dans le panneau de droite.
3) Dans le menu Actions, cliquez sur Operations > Change Password.
4) Indiquez les informations souhaitées et cliquez sur OK.
42. Récupérez les données de partition sur le processeur de service.
Les données de profil stockées sur le serveur géré ont été supprimées ou endommagées.
Pour restaurer les données de profil à l'aide de la console HMC, procédez comme suit.
a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b. Cliquez deux fois sur le dossier Serveurs.
c. Dans le panneau central, sélectionnez un serveur en cochant la case correspondante.
d. Dans le dossier Configuration, si vous avez accès au dossier Gestion des données de partition,
sélectionnez Restaurer.
Pour restaurer les données de profil à l'aide de la console SDMC, procédez comme suit :
58 Activation SAS (Serial-attached SCSI) RAID
a. Allez dans l'onglet Resources et cliquez sur Hosts.
b. Sélectionnez un serveur dans le panneau de droite.
c. Dans le menu Actions, cliquez sur System Configuration > Manage Partition Data > Restore.
d. Sélectionnez l'option de restauration souhaitée et cliquez sur OK.
43. Mettez à jour les données de produit essentielles.
Si votre système exécute et n'est pas géré par les Consoles de gestion, vous devez mettre à jour la
source de chargement après avoir remplacé le fond de panier système. Pour mettre à jour la source
de chargement sans l'aide des Consoles de gestion, procédez comme suit :
a. Chargez le support optique du microcode sous licence I_BASE_01 dans l'unité qui est définie
pour le système et accédez-y pour effectuer un IPL (procédure de chargement initial).
Remarque : Cet IPL peut prendre plusieurs minutes.
b. Une fois l'écran de sélection de groupe de langues affiché, appuyez sur la touche Entrée pour
sélectionner la langue par défaut 2924 (anglais).
Remarque : Pour modifier la langue, tapez celle qui apparaît sur le support du système
d'exploitation, puis appuyez sur la touche Entrée.
Confirmez la langue en appuyant sur la touche Entrée.
c. A partir de l'écran d'installation du microcode sous licence, sélectionnez l'option 2 (utilisation des
outils de maintenance en mode dédié) et appuyez sur la touche Entrée.
Utilisez les outils de maintenance en mode dédié pour vous assurer que les unités de disque, y
compris la source de chargement, génèrent correctement des rapports et qu'il n'y a pas d'erreurs
susceptibles d'empêcher l'exécution de l'IPL à partir du disque. Lorsque toutes les erreurs ont été
résolues et que le disque source de chargement génère correctement des rapports, passez à l'étape
suivante.
d. Dans l'écran d'utilisation des outils de maintenance en mode dédié (DST), sélectionnez l'option 2
(utilisation des unités de disque) et appuyez sur la touche Entrée.
e. Sélectionnez l'option 8 (mise à jour des données produit essentielles du système) pour identifier
l'emplacement de l'unité de disque source de chargement. Appuyez sur Entrée. L'écran de mise à
jour des données produit essentielles du système s'affiche.
f. Appuyez sur la touche Entrée pour confirmer que vous souhaitez que les données produit
essentielles soient enregistrées.
g. Appuyez deux fois sur F12 (Précédent).
h. Tapez 1 et appuyez sur la touche Entrée pour quitter l'écran des outils de maintenance en mode
dédié. Utilisez le panneau de configuration système pour retarder l'arrêt du système. Passez le
mode IPL sur B et démarrez le système pour effectuer un IPL à partir du disque.
44. Restaurez les paramètres de la liste des unités d'amorçage et de la carte Ethernet hôte.
La liste des unités d'amorçage n'est pas présente sur un nouveau fond de panier système. Par
conséquent, il est nécessaire de créer les unités d'amorçage en démarrant les menus des services
SMS. Voir Sélection des options d'amorçage pour obtenir des informations sur la configuration de la
liste des unités d'amorçage.
Une fois le système d'exploitation démarré, vous pouvez utiliser la Commande bootlist pour vérifier
la liste des unités d'amorçage d'un système ou d'une partition, ou y apporter des modifications
supplémentaires.
Pour restaurer les paramètres de la carte Ethernet hôte, voir Configuration de ports physiques sur
une carte Ethernet hôte.
45. Vérifiez la réparation. Pour plus d'informations, voir Vérification d'une réparation.
Retrait et remise en place des adaptateurs et batteries SAS RAID 59
Considérations sur la maintenance du sous-système SAS 02E/4B ou
05F/6B
Il existe plusieurs éléments à prendre en compte pour la maintenance du sous-système SAS du système
02E/4B ou 05F/6B.
Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des
données en cache sur la carte de P1-C19 ou P1-C14 clignote. Pour voir ce voyant, vous devez arrêter le
système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur
le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de
données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en
cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du
sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batterie de cache lorsque Yes est
affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les
systèmes Linux et IBM® i) ou l'écran de statut de la commande (pour les systèmes AIX).
Avant d'effectuer la maintenance, vérifiez les configurations d'ES de chaque partition logique sur le
système. Il est important de comprendre les différences entre les diverses configurations et les
considérations qui en découlent en matière de maintenance. En outre, vous devez tenir compte des
incidences possibles pour les domaines suivants.
v Partitions logiques et leurs affectations d'adaptateur d'E-S
v Configuration des grappes de disques RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Considérations de maintenance pour le bloc de batterie de cache
Pour éviter la perte de données, suivez les procédures appropriées avant de remplacer le bloc de batterie
de cache sur la carte Cache RAID.
Remarque : La maintenance simultanée de la batterie de l'adaptateur n'est pas prise en charge. Même en
cas de batteries défaillantes ou manquantes, la carte associée à la batterie doit être retirée en premier.
Pour plus d'informations, voir Retrait et remplacement des adaptateurs RAID SAS et des batteries.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS et
l'accessibilité des données du système.
Avertissement : Ne tentez pas de supprimer les composants associés au sous-système SAS si le voyant
des données en cache sur la carte de C19-C19 ou P1-C14 clignote. Pour voir ce voyant, vous devez arrêter
le système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche
sur le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la
perte de données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des
données en cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant
du sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batterie de cache lorsque Yes
est affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour
les systèmes Linux et IBM i) ou l'écran de statut de la commande (pour les systèmes AIX).
La figure suivante indique l'emplacement du voyant de présence de données en cache dans les
sous-systèmes SAS 02E/4B ou 05F/6B.
60 Activation SAS (Serial-attached SCSI) RAID
Le tableau suivant décrit les états du voyant de présence des données en cache et les éléments à prendre
en considération avant de tenter une opération de maintenance.
Tableau 1. Indications et actions de voyants
Carte de batterie de
cache (Un-P1-C14)
Contrôleur de stockage
RAID/cache
(Un-P1-C19) Indication Action
désactivée désactivée
La mémoire cache n'est pas
active.
Les actions de maintenance
peuvent s'effectuer.
désactivée clignote
La mémoire cache est active sur
C19. La mémoire cache peut
avoir été vidée sur C14 en cas
de défaillance de C19.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance.
clignote désactivée
La mémoire cache est active sur
C14. La mémoire cache peut
avoir été vidée sur C19 en cas
de défaillance de C14.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance.
clignote clignote
La mémoire cache est active sur
les contrôleurs doubles.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance. Si
cela ne parvient pas à effacer
les voyants de cache actifs,
contactez le support. Voir
Contacter le support et la
maintenance IBM.
Figure 42. Voyant des données en cache
Retrait et remise en place des adaptateurs et batteries SAS RAID 61
Retrait de la carte d'activation RAID 02E/4B ou 05F/6B
Utilisez cette procédure pour retirer une carte d'activation RAID (Redundant Array of Independent Disks)
avant d'installer une carte d'activation nouvelle ou mise à niveau.
Avant de commencer
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour retirer la carte du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console
HMC», à la page 110.
Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour
retirer la carte du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la
page 125.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer la carte
d'activation :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
3. Vérifiez que le système est hors tension. Sur le panneau de commande, vérifiez que le voyant
d'alimentation clignote.
4. Retirez le capot avant.
v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C,
05F/6B ou 05E/6C monté en armoire», à la page 95.
v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 95.
5. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans
«Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
6. Retirez le capot d'accès.
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
62 Activation SAS (Serial-attached SCSI) RAID
7. Vérifiez l'état des voyants verts du cache d'écriture des cartes placées dans P1-C19 et P1-C14 de la
Figure 1. .
v Si les deux voyants verts du cache clignotent, remettez en place le capot système et suivez la
procédure de «Démarrage du système ou de la partition logique», à la page 89 avant de revenir à
l'étape actuelle.
v Si les deux voyants sont éteints, passez à l'étape suivante.
v S'il s'avère qu'un seul voyant du cache clignote, contactez votre fournisseur de services pour
obtenir de l'aide.
8. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
9. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
10. Identifiez la carte d'activation RAID à retirer.
Retrait et remise en place des adaptateurs et batteries SAS RAID 63
Remarque : Si une unité de bande en option est installée, déplacez-la pour pouvoir accéder à la
carte d'activation RAID.
11. Appuyez sur le loquet et faites glisser la carte en dehors du système, comme illustré. La carte
d'activation RAID est fixée à une plaque en plastique. Vous devez faire glisser cette plaque en dehors
du système pour retirer la carte d'activation RAID.
Figure 43. Emplacement d'installation de la carte d'activation RAID (P1-C13)
64 Activation SAS (Serial-attached SCSI) RAID
Pour remplacer la carte d'activation RAID, voir «Remplacement de la carte d'activation RAID 02E/4B
ou 05F/6B».
12. Remettez en place le capot d'accès de maintenance.
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
14. Remettez en place le capot avant.
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
15. Rebranchez la source d'alimentation.
16. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
17. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Remplacement de la carte d'activation RAID 02E/4B ou 05F/6B
Utilisez cette procédure pour remplacer une carte d'activation RAID (Redundant Array of Independent
Disks.
Figure 44. Retrait d'une carte d'activation RAID d'une unité centrale montée en armoire
Retrait et remise en place des adaptateurs et batteries SAS RAID 65
Pourquoi et quand exécuter cette tâche
Vous devez avoir exécuté la procédure «Retrait de la carte d'activation RAID 02E/4B ou 05F/6B», à la
page 62.
Pour remplacer une carte d'activation RAID, procédez comme suit :
Procédure
1. Identifiez l'emplacement de la carte d'activation RAID.
Remarque : Si une unité de bande en option est installée, déplacez-la pour pouvoir accéder à la carte
d'activation RAID.
2. Alignez la carte d'activation RAID sur l'emplacement dans le système.
3. Faites glisser complètement la carte d'activation RAID dans le système jusqu'au verrouillage du loquet
(A).
4. Remettez le capot d'accès en place :
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
5. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
6. Remettez en place le capot avant :
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
Figure 45. Remplacement d'une carte d'activation RAID sur une unité centrale montée en armoire
66 Activation SAS (Serial-attached SCSI) RAID
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
7. Rebranchez la source d'alimentation.
8. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
9. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B
ou 05E/6C
Utilisez cette procédure pour retirer un contrôleur de stockage RAID (Redundant Array of Independent
Disks) avant d'installer un contrôleur de stockage nouveau ou mis à niveau. Vous pouvez également
utiliser cette procédure pour retirer un contrôleur de stockage RAID-10.
Avant de commencer
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 84.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour plus d'informations, voir «Retrait
d'un composant via la console HMC», à la page 110.
Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour
retirer les contrôleurs de stockage RAID et cache du serveur. Pour des instructions, voir «Retrait d'un
composant via la console SDMC», à la page 125.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le contrôleur de
stockage RAID/cache :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91).
3. Retirez le capot avant.
Retrait et remise en place des adaptateurs et batteries SAS RAID 67
v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C,
05F/6B ou 05E/6C monté en armoire», à la page 95.
v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 95.
4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans
«Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 104.
5. Retirez le capot d'accès.
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
6. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 84.
7. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
8. Attachez le bracelet antistatique.
68 Activation SAS (Serial-attached SCSI) RAID
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
9. Identifiez les contrôleurs de stockage RAID et cache à retirer.
10. Tirez sur le loquet (A) et retirez la carte (B) du système en la faisant glisser, comme l'illustre la figure
suivante.
Pour remplacer les contrôleurs de stockage RAID et cache, voir «Remplacement du contrôleur de
stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 70.
11. Remettez le capot d'accès en place :
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
13. Remettez en place le capot avant :
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
Figure 46. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire
Retrait et remise en place des adaptateurs et batteries SAS RAID 69
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
14. Rebranchez la source d'alimentation.
15. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
16. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Remplacement du contrôleur de stockage RAID et cache 02E/4B,
02E/4C, 05F/6B ou 05E/6C
Utilisez cette procédure pour remplacer un contrôleur de stockage RAID (Redundant Array of
Independent Disks). Vous pouvez également utiliser cette procédure pour remplacer un contrôleur de
stockage RAID-10.
Pourquoi et quand exécuter cette tâche
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 84.
Vous devez avoir exécuté la procédure «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C,
05F/6B ou 05E/6C», à la page 67.
Pour remplacer des contrôleurs de stockage RAID et cache, procédez comme suit :
Procédure
1. Identifiez l'emplacement des contrôleurs de stockage RAID et cache.
2. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système.
3. Faites glisser les contrôleurs de stockage RAID et cache (A) entièrement dans le système jusqu'au
verrouillage du loquet (B).
70 Activation SAS (Serial-attached SCSI) RAID
4. Remettez le capot d'accès en place :
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
5. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en
armoire», à la page 106.
6. Remettez en place le capot avant.
v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96.
v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B,
02E/4C ou 05F/6B autonome», à la page 97.
7. Rebranchez la source d'alimentation.
8. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89).
9. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Retrait et remise en place de la carte de batterie de cache
Pour effectuer la maintenance de la carte de batterie de cache, procédez comme suit :
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Figure 47. Remplacement des contrôleurs de stockage RAID et cache sur une unité centrale montée en armoire
Retrait et remise en place des adaptateurs et batteries SAS RAID 71
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Procédez comme suit pour retirer et installer la carte de batterie de cache :
1. Commencez avec le système sous tension.
Pour éviter la perte de données, la carte de batterie de cache doit être à l'état d'erreur avant d'être
retirée du système.
Si votre système est hors tension, rallumez-le avant de continuer.
2. Choisissez votre système d'exploitation :
v AIX : Passez à l'étape 3.
v Linux : Passez à l'étape 5.
v IBM i : Passez à l'étape 7.
3. Sous AIX, mettez la carte de batterie de cache à l'état d'erreur (afin d'éviter d'éventuelles pertes de
données) en procédant comme suit :
a. Accédez au gestionnaire de grappes de disques SAS IBM, comme indiqué ci-dessous :
1) A l'invite de commande, tapez smit, puis appuyez sur Entrée.
2) Sélectionnez Devices.
3) Sélectionnez Disk Array.
4) Sélectionnez IBM SAS Disk Array.
5) Sélectionnez IBM SAS Disk Array Manager dans le menu comprenant les options de
configuration et de gestion du contrôleur IBM SAS RAID.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez l'option Controller Rechargeable Battery Maintenance.
d. Sélectionnez l'option Force Controller Rechargeable Battery Error.
e. Sélectionnez le contrôleur SAS RAID IBM dont vous souhaitez remplacer la batterie.
Cette option met la batterie à l'état d'erreur, ce qui nécessite son remplacement.
f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir la procédure
Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer la carte
de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack
can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant
d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs
minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse
s'effectuer sans danger.
g. Vérifiez que le voyant Cache Present ne clignote pas.
Affichage des informations relatives aux batteries rechargeables
a. Accédez au gestionnaire de grappes de disques SAS IBM en suivant la procédure précédente.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez l'option Controller Rechargeable Battery Maintenance.
d. Sélectionnez l'option Displaying Rechargeable Battery Information.
e. Sélectionnez le contrôleur IBM SAS RAID.
4. Passez à l'étape 8.
72 Activation SAS (Serial-attached SCSI) RAID
5. Placez la carte de batterie de cache à l'état d'erreur sur le système d'exploitation Linux (afin d'éviter
d'éventuelles pertes de données) en procédant comme suit :
a. Exécutez l'utilitaire iprconfig en saisissant iprconfig.
b. Sélectionnez Work with disk unit recovery.
c. Sélectionnez Work with resources containing cache battery packs.
d. Sélectionnez votre carte et tapez 2. Appuyez ensuite sur Entrée pour mettre la batterie à l'état
d'erreur.
Remarque : Cette option permet de mettre la batterie à l'état d'erreur, ce qui nécessite son
remplacement.
e. Si vous êtes sûr de devoir mettre la batterie à l'état d'erreur, tapez c pour confirmer. Si vous ne
souhaitez pas mettre la batterie à l'état d'erreur, tapez q pour annuler.
f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir Affichage des
informations relatives aux batteries rechargeables.
Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en
regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à
plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie
rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement
de la carte de batterie de cache puisse s'effectuer sans danger.
6. Passez à l'étape 8.
7. Mettez la carte de batteries de cache à l'état d'erreur sous (afin d'éviter d'éventuelles pertes de
données) en appliquant la procédure suivante au système ou à la partition logique qui comporte
l'adaptateur :
a. Vous devez avoir ouvert une session avec au minimum les droits d'accès aux outils de
maintenance.
b. Tapez strsst sur la ligne de commande et appuyez sur Entrée.
c. Sur l'écran System Service Tools (SST) Sign On, tapez votre ID utilisateur et votre mot de passe
d'accès aux outils de maintenance. Appuyez sur Entrée.
d. Sur l'écran System Service Tools (SST), sélectionnez Start a Service Tool et appuyez sur Entrée.
e. Dans l'écran Start a Service Tool, sélectionnez Hardware Service Manager et appuyez sur la
touche Entrée.
f. Sélectionnez Work with resources containing cache battery packs dans l'écran Hardware Service
Manager.
g. Sélectionnez Force battery pack into error state pour l'adaptateur d'entrée-sortie que vous
utilisez dans l'écran Work with Resources containing cache battery packs, puis appuyez sur
Entrée.
h. Dans l'écran Force Battery Packs Into Error State, vérifiez que l'adaptateur d'entrée-sortie
approprié est sélectionné et appuyez sur la touche de fonction correspondant à votre choix pour
confirmer.
i. Revenez à l'écran Work with Resources containing cache battery packs et sélectionnez Display
battery information. Assurez-vous que la zone Safe to replace cache battery a la valeur Yes.
Cette opération peut prendre plusieurs minutes et vous devrez peut-être appuyer sur le bouton
d'actualisation pour que la zone soit mise à jour.
8. Placez le système monté en armoire en position de fonctionnement. Voir «Mise en position de
fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106.
9. Retirez le capot d'accès :
v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98.
v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 99.
Retrait et remise en place des adaptateurs et batteries SAS RAID 73
10. Vérifiez le voyant de présence de données en cache (A) en examinant le voyant sur la carte de
batterie de cache.
v Si le voyant clignote, passez à l'étape suivante.
v Si le voyant ne clignote pas, passez à l'étape 12.
11. Ne retirez pas la carte de batterie de cache si le voyant clignote.
Lorsque le voyant clignote, cela indique qu'il existe des données en cache non sauvegardées sur la
batterie. Ces données seront perdues si la batterie est retirée à ce stade. Effectuez cette procédure de
maintenance sans remplacer la batterie, puis effectuez-la de nouveau pour vous assurer que toutes
les données en cache sont enregistrées sur le disque avant de remplacer la batterie. Passez à l'étape
18.
12. Retirez la carte de batterie de cache en procédant comme suit :
Avertissement : Ne retirez pas la carte de batterie de cache si le voyant de présence de données en
cache clignote, car il indique que des données existent dans le cache.
a. Dégagez la carte de batterie de cache (B) du connecteur (A).
b. Faites glisser la carte de batterie de cache sur les glissières de montage afin de l'extraire du
système.
c. Une fois la carte extraite du système, retirez le bloc de batterie de la carte de batterie :
1) Déconnectez le bloc de batterie de la carte de batterie.
2) Saisissez les pattes de fixation et serrez-les pour pouvoir extraire le bloc de batterie des pattes.
Figure 48. Voyant de présence de données en cache
74 Activation SAS (Serial-attached SCSI) RAID
Si vous remplacez le bloc de batterie de cache, passez à la section «Retrait et installation du bloc de
batterie de cache», à la page 77, puis revenez à cette section et poursuivez avec l'étape suivante.
13. Remplacez la carte de batteries de cache en procédant comme suit :
Avertissement :
Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes avant de connecter
la nouvelle batterie. Il s'agit du temps minimum nécessaire à l'adaptateur pour détecter que la
batterie a été remplacée.
a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce
qu'elle s'insère dans le connecteur de batterie.
Figure 49. Retrait de la carte de batterie de cache
Retrait et remise en place des adaptateurs et batteries SAS RAID 75
14. Mettez le système sous tension et redémarrez le cache d'écriture de l'adaptateur en procédant comme
suit :
Choisissez votre système d'exploitation :
v AIX : Passez à l'étape 15.
v Linux : Passez à l'étape 16.
v IBM i : Passez à l'étape 17.
15. Procédez comme suit :
a. Accédez au gestionnaire de grappes de disques IBM SAS en suivant la procédure de la section
3a.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez Controller Rechargeable Battery Maintenance.
d. Sélectionnez Start Adapter Cache.
e. Sélectionnez le contrôleur dont vous venez de remplacer la batterie et appuyez sur Entrée.
f. Passez à l'étape 18.
16. Procédez comme suit :
a. Exécutez l'utilitaire iprconfig en saisissant iprconfig.
b. Sélectionnez Work with disk unit recovery.
c. Sélectionnez Work with resources containing cache battery packs.
d. Démarrez le cache de l'adaptateur d'E-S sur l'adaptateur correspondant à la batterie que vous
venez de remplacer en saisissant 3, puis en appuyant sur Entrée.
e. Passez à l'étape 18.
17. Procédez comme suit :
a. Revenez à l'écran Work with Resources containing Cache Battery Packs en utilisant les
informations obtenues à l'étape 7a à 7f et sélectionnez Start IOA cache. Appuyez sur Entrée.
b. Vérifiez que vous obtenez bien le message Cache was started.
Figure 50. Installation de la carte de batterie de cache
76 Activation SAS (Serial-attached SCSI) RAID
c. Passez à l'étape 18.
18. Installez le capot d'accès :
v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B,
02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101.
v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B,
02E/4C ou 05F/6B», à la page 102.
19. Mettez-le en position de fonctionnement. Voir «Mise en position de fonctionnement du système
02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106.
20. Fermez le volet arrière de l'unité dont vous effectuez la maintenance.
21. Vérifiez que la nouvelle ressource fonctionne correctement, comme indiqué dans la rubrique
«Vérification du composant installé», à la page 111.
Retrait et installation du bloc de batterie de cache
Cette procédure permet de remplacer le bloc de batterie de cache sur une carte de batterie.
Avant de commencer
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 84.
Avertissement : Avant de poursuivre ces procédures, déterminez si vous pouvez remplacer le bloc de
batterie de cache sans risque. Voir Affichage des informations sur la batterie rechargeable. Vous pouvez
remplacer le bloc de batterie de cache en toute sécurité si la mention Oui apparaît en regard de l'option
indiquant que le bloc de batterie peut être remplacé en toute sécurité.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Procédez comme suit pour remplacer un bloc de batterie n'acceptant pas la maintenance simultanée :
Procédure
1. Retirez le contrôleur du système. Pour plus d'informations, voir «Retrait du contrôleur de stockage
RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67.
2. Placez le contrôleur sur une surface protégée contre les décharges électrostatiques.
3. Débranchez le connecteur de batterie (A) de son connecteur sur la carte, en tirant la languette de
verrouillage tout en tirant doucement sur la fiche. La fiche est connectée à la carte mère d'une seule
façon ; elle ne peut donc être insérée de manière incorrecte lors de la procédure de remplacement.
Retrait et remise en place des adaptateurs et batteries SAS RAID 77
Remarque : Laissez le bloc de batterie de cache déconnecté pendant au moins 60 secondes avant de
connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à la carte pour détecter que la
batterie a été remplacée.
Connecteur de la batterie (A)
Bloc de batterie de cache (B)
4. La batterie est montée sur une plaque en plastique noir avec deux taquets, maintenue par des vis de
butée en plastique blanc à l'aide de goupilles de retenue. Pour déposer le bloc de batterie, retirez la
plaque de montage des vis de butée, une vis à la fois.
5. Appuyez sur la goupille de retenue de la vis de butée et tirez sur le taquet de montage jusqu'à ce que
la vis de butée apparaisse dans le trou du taquet de montage. Répétez cette opération pour la
deuxième vis de butée.
Remarque : Si la goupille de retenue ne s'enfonce pas correctement, faites pivoter la vis de butée
blanche afin que sa goupille de retenue soit opposée au bloc de batterie.
6. Installez la batterie sur les supports de façon à ce que les loquets s'enclenchent.
Figure 51. Retrait de la batterie de cache
78 Activation SAS (Serial-attached SCSI) RAID
7. Réintroduisez les broches dans les rivets à l'arrière de la carte.
8. Connectez le connecteur du bloc de batterie de cache (A) à la carte. La fiche est connectée à la carte
mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte.
9. Réinstallez l'adaptateur et redémarrez le cache. Pour plus d'informations, voir les procédures
suivantes :
v «Retrait et remise en place de la carte de batterie de cache», à la page 71
v «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67.
Figure 52. Remplacement de la batterie de cache
Retrait et remise en place des adaptateurs et batteries SAS RAID 79
80 Activation SAS (Serial-attached SCSI) RAID
Procédures communes aux dispositifs installables
La présente section contient l'ensemble des procédures communes concernant l'installation, le retrait et le
remplacement de dispositifs.
Avant de commencer
Respectez ces précautions lorsque vous installez, retirez ou remplacez des dispositifs ou des composants.
Pourquoi et quand exécuter cette tâche
Ces précautions visent à créer un environnement sûr pour la maintenance de votre système, mais ne
proposent pas de procédure de maintenance du système. Les procédures d'installation, de retrait et de
remplacement fournissent les processus étape par étape pour la maintenance de votre système.
81
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les cordons d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les cordons d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
82 Activation SAS (Serial-attached SCSI) RAID
DANGER
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit :
Procédure
1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en
charge de ce nouveau dispositif.
Procédures communes aux dispositifs installables 83
2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des
données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de
votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données).
3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au
composant.
4. Notez la correspondance des couleurs utilisées sur le système.
La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez
utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La
couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que
le système ou la partition logique est sous tension.
5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire
de ciseaux.
6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme suit
:
v Si vous remplacez un composant, contactez votre fournisseur de services ou le support technique.
v Si vous installez une fonctionnalité, contactez l'un des services suivants :
– Le fournisseur de services ou le support technique
7. Si vous rencontrez des difficultés lors de l'installation, contactez votre fournisseur de services, ou le
support technique.
8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre
les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la
rubrique Partitionnement logique.
Identification d'un composant défaillant
Ces instructions permettent d'apprendre à localiser et identifier un composant défaillant ou à retirer, ou
encore de localiser l'emplacement d'installation d'un nouveau composant sur votre système ou unité
d'extension en utilisant une méthode adaptée à votre système.
Pourquoi et quand exécuter cette tâche
Pour les serveurs systèmes dotés du processeur POWER7, les voyants peuvent être utilisés pour identifier
ou vérifier un composant à retirer, installer ou en cours de maintenance.
La combinaison de la fonction d'identification et du voyant d'incident (couleur ambre) montre
l'emplacement d'une unité remplaçable sur site (unité FRU). Lorsque vous retirez une unité FRU, vérifiez
d'abord que vous travaillez sur l'unité FRU correcte à l'aide de la fonction d'identification dans la console
de gestion ou dans une autre interface utilisateur. Lorsque vous retirez une unité FRU dans la console de
gestion du matériel, la fonction d'identification est activée et désactivée automatiquement aux moments
opportuns.
La fonction d'identification produit le clignotement du voyant jaune. Lorsque vous désactivez la fonction
d'identification, le voyant revient à son état précédent. Pour les composants dotés d'un bouton de
maintenance bleu, la fonction d'identification définit les informations du voyant pour le bouton de
maintenance de sorte que lorsque le bouton est activé, les voyants corrects du composant clignotent.
Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes.
Voyants du panneau de commande
Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de
commande.
Le panneau de commande est doté de voyants qui indiquent les différents états du système.
84 Activation SAS (Serial-attached SCSI) RAID
v A : Bouton de mise sous tension
v B : Voyant d'alimentation
– Un voyant fixe indique que l'unité est sous alimentation système complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe.
Durant cette période de transition, le clignotement peut s'accélérer.
v C : Voyant d'identification du boîtier
– Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce.
– L'absence de lumière indique un fonctionnement normal du système.
v D: Voyant d'information système
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant allumé signifie qu'une intervention est requise sur le système.
v E : Port USB
v F : Voyant de panne de boîtier
– Un voyant fixe indique une panne dans l'unité centrale.
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
Figure 53. Panneau de commande
Procédures communes aux dispositifs installables 85
Concepts associés
Identification d'un composant défaillant
Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un
système ou une unité d'extension en utilisant la méthode propre à votre système.
Identification d'un composant défaillant sur un système AIX ou une
partition logique
Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le
voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX.
Localisation d'un composant défaillant sur un système AIX ou une partition
logique
Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche
d'un composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran
Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
Que faire ensuite
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. Pour plus d'informations, voir «Activation du voyant associé au composant défaillant».
Activation du voyant associé au composant défaillant
Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous
intervenez.
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
Désactivation du voyant d'un composant défaillant
Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance.
86 Activation SAS (Serial-attached SCSI) RAID
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le
code d'emplacement.
6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé
au composant défaillant.
7. Retournez à la ligne de commande.
Identification d'un composant défaillant sur un système Linux ou une
partition logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou
désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance.
Localisation d'un composant défaillant sur un système Linux ou une partition
logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les
voyants pour localiser un composant.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Localisation d'un composant défaillant sur un système ou une partition logique
Linux
Pour retrouver le code d'emplacement d'un composant défaillant si vous avez oublié cette information,
utilisez la procédure décrite dans cette rubrique.
Pourquoi et quand exécuter cette tâche
Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée.
3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code).
4. Notez les informations d'emplacement.
Procédures communes aux dispositifs installables 87
Activation du voyant associé au composant défaillant
Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à
rechercher le composant à remplacer.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Résultats
Désactivation du voyant d'un composant défaillant
Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du
composant défaillant.
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s normal -l et appuyez
sur Entrée.
Localisation d'un composant défaillant sur un système ou une
partition logique Virtual I/O Server
Vous pouvez utiliser les outils Virtual I/O Server (VIOS) avant d'activer le voyant, pour rechercher un
composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du
journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la
liste des événements triée dans l'ordre chronologique.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
88 Activation SAS (Serial-attached SCSI) RAID
Résultats
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server».
Identification d'un composant à l'aide du Virtual I/O Server
Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à
l'aide du Virtual I/O Server (VIOS).
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement.
Appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
Démarrage du système ou de la partition logique
La présente section explique comment démarrer un système ou une partition logique après avoir effectué
une action de maintenance ou une mise à niveau du système.
Démarrage d'un système non géré par une console HMC ou une
console SDMC
Vous pouvez utiliser le bouton d'alimentation ou l'interface ASMI pour démarrer un système qui n'est pas
géré par une console HMC (Hardware Management Console) ni par une console Systems Director
Management Console (SDMC).
Pourquoi et quand exécuter cette tâche
Pour démarrer un système non géré par une console HMC ou SDMC, procédez comme suit :
Procédure
1. Si nécessaire, ouvrez le volet avant de l'armoire.
2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est
bien relié à une source d'alimentation :
v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation.
v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante.
v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01
V=F.
3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante).
Procédures communes aux dispositifs installables 89
v A : Bouton de mise sous tension
v B : Voyant d'alimentation
– Un voyant fixe indique que l'unité est sous alimentation système complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir
fixe. Durant cette période de transition, le clignotement peut s'accélérer.
v C : Voyant d'identification de boîtier
– Une lumière fixe indique l'état d'identification du boîtier ou d'une ressource du boîtier.
– Aucune lumière signifie qu'aucune ressource du boîtier n'est identifiée.
v D : Voyant d'incident
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant fixe indique qu'une intervention est requise sur le système.
v E : Port USB
v F : Voyant de panne de boîtier
– Un voyant fixe correspond à un voyant de panne actif sur le système.
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants :
v Le voyant d'alimentation commence à clignoter plus rapidement.
v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus
rapidement.
Figure 54. Panneau de commande
90 Activation SAS (Serial-attached SCSI) RAID
v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage
du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le
système est sous tension.
Que faire ensuite
Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez
la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System
Management Interface) :
1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface
ASMI.
2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la
rubrique Mise sous tension et arrêt du système.
Démarrage d'un système ou d'une partition logique via la console
HMC
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
démarrer le système ou la partition logique une fois les câbles requis installés et connectés à une source
d'alimentation.
Pourquoi et quand exécuter cette tâche
Pour obtenir des instructions sur l'utilisation de la console HMC, voir Gestion de la console HMC
(Hardware Management Console). Pour plus d'informations sur le démarrage d'une partition logique,
reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le démarrage du
système, reportez-vous à la rubrique Mise sous tension du système géré.
L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du
système. Lorsque le voyant d'alimentation s'arrête de clignoter et reste fixe, cela signifie que le système
est sous tension.
Démarrage d'un système ou d'un serveur virtuel via la console SDMC
Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console)
pour démarrer le système ou le serveur virtuel une fois les câbles requis installés et les cordons
d'alimentation connectés à une alimentation.
Pourquoi et quand exécuter cette tâche
Pour obtenir des instructions sur l'utilisation de la console SDMC, voir Gestion et configuration de la
console SDMC. Pour des instructions sur le démarrage d'un serveur virtuel, voir Gestion de serveurs
virtuels. Pour des instructions sur l'arrêt et le redémarrage de serveurs virtuels, voir Arrêt et redémarrage
de serveurs virtuels.
L'écran du panneau de commande affiche les jauges, ou points de contrôle, pendant le démarrage du
système. Lorsque le voyant de mise sous tension du panneau de commande cesse de clignoter et reste
fixe, le système est sous tension.
Arrêt d'un système ou d'une partition logique
La présente section explique comment arrêter un système ou une partition logique dans le cadre d'une
mise à niveau du système ou d'une procédure de maintenance.
Procédures communes aux dispositifs installables 91
Pourquoi et quand exécuter cette tâche
Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC
(Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être
endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si
toutes les applications n'ont pas été fermées avant l'arrêt du système.
Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée.
Arrêt d'un système non géré par une console HMC ou une console
SDMC
Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter
le système à l'aide du bouton d'alimentation ou de l'interface ASMI.
Avant de commencer
Avant d'arrêter le système, procédez comme suit :
1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications.
2. Assurez-vous que le système d'exploitation est arrêté.
Avertissement : Si vous ne le faites pas, vous risquez de perdre des données.
3. Si une partition logique Virtual I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les
clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen.
Pourquoi et quand exécuter cette tâche
La procédure ci-dessous explique comment arrêter un système non géré par une console HMC (Hardware
Management Console).
Procédure
1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour
exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension).
2. A l'invite, tapez la commande appropriée :
v Si le système exécute le système d'exploitation AIX, tapez shutdown.
v Si votre système exécute le système d'exploitation Linux, tapez shutdown -h now.
v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est
partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite,
utilisez la commande PWRDWNSYS pour arrêter la partition principale.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
3. Sur la ligne de commande Linux, tapez shutdown -h now.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
4. Sur la ligne de commande Linux, tapez shutdown -h now.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
5. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le
type et le mode de l'IPL.
6. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off.
7. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez
que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités
d'extension, par exemple).
92 Activation SAS (Serial-attached SCSI) RAID
Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure,
vérifiez que toutes les sources d'alimentation ont été débranchées du système.
(L003)
ou
Arrêt d'un système via la console HMC
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
arrêter le système ou une partition logique.
Pourquoi et quand exécuter cette tâche
Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous
arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système
géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez
utiliser cette procédure pour mettre hors tension le système géré.
Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant
de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez
de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual
I/O Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs
unités par un autre moyen.
Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
Procédures communes aux dispositifs installables 93
Procédez comme suit pour arrêter le système via la console HMC.
Procédure
1. Dans la zone de navigation, développez le dossier Gestion de systèmes.
2. Cliquez sur l'icône Serveurs.
3. Dans la zone de contenu, sélectionnez le système géré.
4. Sélectionnez Tâches, Opérations, puis Mise hors tension
5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK.
Information associée
Arrêt et redémarrage de partitions logiques
Arrêt d'un système via la console SDMC
Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console)
pour arrêter le système ou un serveur virtuel.
Pourquoi et quand exécuter cette tâche
Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous
arrêtez le dernier serveur virtuel en cours d'exécution. Si vous définissez les propriétés du système géré
sur la console SDMC de façon à empêcher la mise hors tension automatique du système, vous devez
utiliser cette procédure pour mettre hors tension le système géré.
Avertissement : Si possible, arrêtez les serveurs en cours d'exécution sur le système géré avant de
mettre ce dernier hors tension. Si vous n'arrêtez pas d'abord les serveurs virtuels, vous risquez de
provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O
Server (VIOS) assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités
par un autre moyen.
Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
Procédez comme suit pour arrêter le système via la console SDMC.
Procédure
1. Dans la zone des ressources Power Systems sélectionnez le système géré que vous voulez mettre hors
tension.
2. Dans le menu Actions, sélectionnez Opérations > Mise hors tension.
3. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK.
Retrait et remplacement des capots sur le système 02E/4B, 02E/4C,
05F/6B ou 05E/6C
Les présentes instructions permettent de retirer, remettre en place ou installer des capots pour accéder à
des composants ou effectuer des opérations de maintenance.
94 Activation SAS (Serial-attached SCSI) RAID
Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou
05E/6C monté en armoire
Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le capot avant, procédez comme suit :
Procédure
1. Retirez les deux vis de transport (si elles sont présentes) fixant l'unité centrale à l'armoire.
Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas
de zones géographiques soumises à une activité sismique.
2. Retirez le capot du système. Le capot est doté d'un renfoncement permettant de le tenir plus
facilement.
Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou
05F/6B
Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le capot avant, procédez comme suit :
Procédure
1. Ouvrez le volet avant (A) en le faisant pivoter vers la gauche, comme indiqué dans la figure suivante.
Figure 55. Retrait du capot avant
Procédures communes aux dispositifs installables 95
2. Appuyez sur le taquet de verrouillage (B) et retirez le capot du système. Le capot est doté d'un
renfoncement permettant de le tenir plus facilement.
Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou
05E/6C monté en armoire
Cette procédure permet d'installer le capot permettant d'accéder aux composants ou d'effectuer une
opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le capot avant, procédez comme suit :
Procédure
1. Poussez le capot sur le système.
2. Poussez doucement le capot jusqu'à ce que les quatre taquets de déverrouillage soient positionnés
dans leurs montants de fixation comme indiqué dans la figure suivante. Le capot avant se met en
place ; il est doté d'un renfoncement permettant de le tenir plus facilement.
Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas
de zones géographiques soumises à une activité sismique.
Figure 56. Retrait du capot avant
96 Activation SAS (Serial-attached SCSI) RAID
Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou
05F/6B autonome
Cette procédure permet d'installer le capot permettant d'accéder aux composants ou d'effectuer une
opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le capot et le volet avant, procédez comme suit.
Procédure
1. Alignez le capot (A) jusqu'à ce que les deux taquets du couvercle soient placées dans les encoches de
plaque de base (B), comme illustré dans la figure suivante.
2. Faites pivoter le capot vers le haut et vers le système, jusqu'à ce que le taquet de déverrouillage soit
inséré dans son emplacement. Le capot est doté d'un renfoncement permettant de le tenir plus
facilement.
Figure 57. Retrait du capot avant
Procédures communes aux dispositifs installables 97
3. Maintenez le volet avant à un angle d'environ 120 degrés par rapport au système comme l'illustre la
figure suivante. Engagez les taquets.
Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou
05E/6C monté en armoire
Cette procédure décrit comment retirer le capot d'accès.
Procédure
1. Desserrez les deux vis moletées (A) situées à l'arrière du capot.
Figure 58. Installation du capot avant
Figure 59. Installation du volet avant
98 Activation SAS (Serial-attached SCSI) RAID
2. Faites glisser le capot vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est
plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors
tension.
Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou
05F/6B
Cette procédure décrit comment retirer le capot d'accès.
Procédure
1. Desserrez la vis moletée (A) située à l'arrière du capot.
2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Figure 60. Retrait du capot d'accès
Procédures communes aux dispositifs installables 99
3. Desserrez les deux vis moletées (A) situées à l'arrière du capot intérieur.
4. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors
tension.
Figure 61. Installation du capot d'accès extérieur
100 Activation SAS (Serial-attached SCSI) RAID
Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B
ou 05E/6C monté en armoire
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès sur le haut du système, à environ 25 mm du rebord du châssis supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez les deux vis moletées (A) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du
boîtier du système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Figure 62. Retrait du capot d'accès intérieur
Procédures communes aux dispositifs installables 101
Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C
ou 05F/6B
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès interne (A) sur le haut du système, à environ 25 mm du rebord du châssis
supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez les deux vis moletées (B) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du
boîtier du système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Figure 63. Installation du capot d'accès
102 Activation SAS (Serial-attached SCSI) RAID
5. Placez le capot d'accès extérieur (A) sur le haut du système.
6. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
7. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du
système.
8. Serrez la vis moletée afin de fixer le capot d'accès.
Figure 64. Installation du capot d'accès intérieur
Procédures communes aux dispositifs installables 103
Mise en position de maintenance ou de fonctionnement du système
02E/4B, 02E/4C, 05F/6B ou 05E/6C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B
ou 05E/6C monté en armoire
Utilisez cette procédure pour placer en position de maintenance le système monté en armoire.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire.
Figure 65. Installation du capot d'accès extérieur
104 Activation SAS (Serial-attached SCSI) RAID
Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas
de zones géographiques soumises à une activité sismique.
4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez
l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Figure 66. Placement du système en position de maintenance
Procédures communes aux dispositifs installables 105
Mise en position de fonctionnement du système 02E/4B, 02E/4C,
05F/6B ou 05E/6C monté en armoire
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouillez les taquets de sécurité bleus (A) en les levant vers le haut. Les taquets de sécurité sont
situés près du milieu du système.
2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient verrouillés.
3. Le cas échéant, replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire.
Remarque : La remise en place des deux vis de transport est facultative mais doit être effectuée dans
le cas de zones géographiques soumises à une activité sismique.
4. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Figure 67. Mise en position de fonctionnement du système
106 Activation SAS (Serial-attached SCSI) RAID
Déconnexion des cordons d'alimentation du système 02E/4B, 02E/4C,
05F/6B ou 05E/6C
Utilisez ces procédures pour déconnecter les cordons d'alimentation du système.
Procédure
1. Pour un système monté en armoire, ouvrez le volet arrière de l'armoire sur l'unité sur laquelle vous
effectuez la maintenance et identifiez l'unité centrale.
2. Pour un système monté en armoire et un système autonome, débranchez tous les cordons
d'alimentation de l'unité sur laquelle vous effectuez la maintenance. Le système est peut-être équipé
de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement,
vérifiez que toutes les deux sources d'alimentation ont été débranchées du système.
Figure 68. Mise en position de fonctionnement du système
Procédures communes aux dispositifs installables 107
Figure 69. Retrait des cordons d'alimentation pour un système monté en armoire
Figure 70. Retrait des cordons d'alimentation pour un système autonome
108 Activation SAS (Serial-attached SCSI) RAID
Connexion des cordons d'alimentation au système 02E/4B, 02E/4C,
05F/6B ou 05E/6C
Cette procédure permet de connecter les cordons d'alimentation au système.
Procédure
1. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
2. Connectez tous les cordons d'alimentation à l'unité dont vous effectuez la maintenance. Vérifiez que
les cordons d'alimentation passent par les poignées.
3. Fermez le volet arrière de l'armoire.
Installation d'un dispositif via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris l'installation d'un nouveau dispositif ou composant.
Avant de commencer
Pour employer l'interface utilisateur de la console HMC pour installer un dispositif ou un composant
dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré sur lequel vous allez installer le composant.
3. Dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de
matériel.
4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site).
Figure 71. Connexion des cordons d'alimentation
Procédures communes aux dispositifs installables 109
5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans
lequel vous installez le dispositif.
6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant.
7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du dispositif.
Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif.
Dans ce cas, suivez ces instructions pour installer le dispositif.
Retrait d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer de nombreuses
opérations de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un
composant.
Pourquoi et quand exécuter cette tâche
Pour employer l'interface utilisateur de la console HMC afin de retirer un composant dans une unité
centrale ou une unité d'extension gérée par une console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré sont vous souhaitez retirer un composant.
3. Sélectionnez l'une des options suivantes.
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des
tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel.
v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des
tâches, développez, Maintenabilité > Matériel > Retrait de FRU.
4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Retrait de
FRU.
5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de
FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant.
6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant.
7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter.
8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure
et suivez les instructions de retrait du composant.
Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant
comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant.
Remplacement d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez
ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la
version 7 ou d'une version ultérieure de la console HMC, procédez comme suit :
110 Activation SAS (Serial-attached SCSI) RAID
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré dont vous souhaitez remplacer un composant.
3. Sélectionnez l'une des options suivantes.
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des
tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel.
v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des
tâches, développez, Maintenabilité > Matériel > Remplacement de FRU.
4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU,
sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant.
6. Dans le menu, sélectionnez le type du composant et cliquez sur Suivant.
7. Dans le menu, sélectionnez le code emplacement du composant et cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'échange du dispositif.
Remarque : Il se peut que la console HMC ouvre une fenêtre d'instructions pour le remplacement du
composant. Dans ce cas, suivez les instructions pour remplacer le composant.
Vérification du composant installé
Vous pouvez vérifier un nouveau composant installé ou remplacé sur votre système, partition logique ou
unité d'extension à l'aide du système d'exploitation, de diagnostics autonomes ou de la console HMC
(Hardware Management Console).
Vérification d'un dispositif installé ou d'un composant remplacé sur un
système ou une partition logique AIX
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans
le système d'exploitation AIX pour vérifier que ce dispositif ou composant est reconnu par le système ou
la partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via AIX
v Vérification du composant de remplacement via AIX
Vérification du dispositif installé via le système d'exploitation AIX :
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
Procédures communes aux dispositifs installables 111
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide,
contactez votre fournisseur de services.
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérification du composant de remplacement via le système d'exploitation AIX :
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
procédez comme suit :
1. Avez remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de
remplacement à chaud du service de diagnostic en ligne ?
Non : Passez à l'étape 2.
Oui : Passez à l'étape 5, à la page 113.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, consultez la rubrique Exécution d'une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou
que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion AIX ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau
composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous
les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite
de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du
système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5, à la page 113.
Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez
comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? s'affiche, cliquez sur Oui et
appuyez sur Entrée.
112 Activation SAS (Serial-attached SCSI) RAID
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services.
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en procédant comme suit :
a. A l'invite, tapez diag et appuyez sur Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
d. Sélectionnez Toutes les ressources ou sélectionnez les programmes de diagnostic du composant
pour tester uniquement le composant remplacé ou les unités qui y raccordées et appuyez sur
Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7.
6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9, à la page 114.
7. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. En cas de test
réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal
des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des
erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez
comme suit.
Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant
passe à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9, à la page 114.
Oui : Passez à l'étape 8.
8. Dans le menu Action corrective sur ressource, le composant parent ou enfant de la ressource que
vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de
vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action
corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un
composant détectable par le système a été remplacé, procédez comme suit.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
Procédures communes aux dispositifs installables 113
sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la
liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11.
Oui : Passez à l'étape 12.
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12.
12. Les voyants sont-ils encore allumés ?
v Non. Cela marque la fin de la procédure.
v Oui : Désactivez les voyants. Pour plus d'informations, consultez la rubrique suivante :
Modification des indicateurs de service.
Vérification d'un composant installé sur un système Linux ou une
partition logique
Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir
comment vérifier que le système reconnaît ce composant.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique
«Vérification d'un composant installé via les programmes de diagnostic autonome».
Vérification d'un composant installé via les programmes de diagnostic
autonome
Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez
utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système AIX
ou Linux, une unité d'extension ou une partition logique.
Avant de commencer
v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la
communication avec les autres serveurs est interrompue.
v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition
logique. Aucune autre activité ne peut être exécutée sur la partition.
v Les programmes de diagnostic autonomes doivent accéder à la console système.
Pourquoi et quand exécuter cette tâche
Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network
Installation Management). La procédure suivante explique comment utiliser les programmes de
diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du
serveur NIM, voir Exécution de diagnostics autonomes à partir d'un serveur NIM.
Pour utiliser les programmes de diagnostic autonomes, procédez comme suit :
114 Activation SAS (Serial-attached SCSI) RAID
Procédure
1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la
partition logique.
2. Retirez les bandes, les disquettes et les CD-ROM.
3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition
logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique
n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez,
procédez comme suit :
a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI,
reportez-vous à la rubrique Accès à l'interface ASMI.
b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage.
c. Cliquez sur Mettre le système sous/hors tension.
d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux
sélectionnez l'option Amorçage du mode service à partir de la liste par défaut.
e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques
optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes.
f. Passez à l'étape 5.
4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de
diagnostic autonome dans l'unité de disque optique.
5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier
voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour
indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités
d'amorçage en mode service par défaut.
6. Tapez le mot de passe requis.
7. Dans l'écran des instructions d'exécution des tests de diagnostic, appuyez sur Entrée.
Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un
autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté.
Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous
tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide.
8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection
des fonctions pour initialiser le système d'exploitation.
9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les
ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en
sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée.
12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services.
v Oui : Passez à l'étape 13.
13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
14. Si les voyants sont toujours allumés, procédez comme suit :
a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement
pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée.
b. Sélectionnez Mettre l'indicateur d'avertissement système à NORMAL, puis appuyez sur Entrée.
Procédures communes aux dispositifs installables 115
c. Sélectionnez Mettre tous les indicateurs d'identification à NORMAL, puis appuyez sur Entrée.
d. Sélectionnez Validation.
Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal.
e. Retournez à la ligne de commande.
Vérification d'un composant installé à l'aide de la console HMC
(Hardware Management Console)
Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management
Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une
opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou
de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant installé, procédez comme suit :
Procédure
1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le
journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique
«Affichage des événements réparables à l'aide de la console HMC», à la page 118.
2. Des événements d'action de maintenance sont-ils ouverts ?
Non : Si le voyant d'avertissement système est encore allumé, utilisez la console HMC pour le
désactiver. Consultez la rubrique «Activation/Désactivation des voyants via la console HMC», à la
page 117. La procédure est terminée.
Oui : Passez à l'étape suivante.
3. Notez la liste des événements d'action de maintenance ouverts.
4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils
les mêmes que ceux que vous avez collectés auparavant ?
v Non : Sélectionnez l'une des options suivantes :
– Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond
et passez à l'étape suivante.
– Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez
votre fournisseur de services.
v Oui : Passez à l'étape suivante.
5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence
l'événement d'action de maintenance.
6. Cliquez sur Fermer l'événement.
7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet
événement. Cliquez sur OK.
8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action
de maintenance ouvert ?
v Non : Sélectionnez l'option Aucune FRU remplacée pour cet événement réparable et cliquez sur
OK pour fermer l'événement d'action de maintenance.
v Oui : Procédez comme suit :
a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour.
b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes.
c. Cliquez sur OK pour refermer l'événement d'action de maintenance.
9. Si les incidents persistent, contactez votre fournisseur de services.
116 Activation SAS (Serial-attached SCSI) RAID
Activation/Désactivation des voyants via la console HMC
Utilisez cette procédure pour activer ou désactiver des voyants via le Point focal de service pour la
console HMC (Hardware Management Console).
Pourquoi et quand exécuter cette tâche
Choisissez l'une des options suivantes :
v «Désactivation d'un voyant d'avertissement système ou de partition»
v «Activation ou désactivation d'un voyant d'identification»
Désactivation d'un voyant d'avertissement système ou de partition :
Pourquoi et quand exécuter cette tâche
Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par
exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement.
Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant
d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient.
Procédure
1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes.
2. Ouvrez Serveurs et sélectionnez le système approprié.
3. Dans la zone de contenu, cochez la case correspondant à la partition concernée.
4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système.
5. Sélectionnez la partition appropriée.
6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de
confirmation qui s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
v Un message indique que le système présente encore des incidents non résolus.
v Un message indique que le voyant d'avertissement système ne peut pas être activé.
7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du
voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche
fournit les informations suivantes :
v Une vérification indique que le voyant de la partition logique a été désactivé.
v Un message indique que la partition logique présente encore des incidents non résolus.
v Un message indique que le voyant de la partition logique ne peut pas être activé.
Activation ou désactivation d'un voyant d'identification :
Pourquoi et quand exécuter cette tâche
Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités
remplaçables sur site (FRU). On les appelle donc voyants d'identification.
Vous pouvez activer ou désactiver les types de voyants d'identification suivants :
v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique
(boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer
si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous
pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le
nouveau matériel.
v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un
adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier
sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports
ouverts sur plusieurs adaptateurs.
Procédures communes aux dispositifs installables 117
Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme
suit :
Procédure
1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes.
2. Sélectionnez Serveurs.
3. Dans la zone de contenu, cochez la case correspondant au système concerné.
4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification.
5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le
tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé
s'allume ou s'éteint.
6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le
tableau, puis sélectionnez Sélectionné > Liste des FRU.
7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
Affichage des événements réparables à l'aide de la console HMC
Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et
historique de maintenance.
Pourquoi et quand exécuter cette tâche
Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez
avoir l'un des rôles suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
v Visionneur
Pour afficher des événements réparables, procédez comme suit :
Procédure
1. Dans la zone de navigation, sélectionnez Gestion de la maintenance.
2. Sélectionnez Gestion des événements réparables.
3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre
Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables
correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur
les événements réparables.
4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez
Sélectionné > Affichage des détails La fenêtre Détails de l'événement réparable affiche des
informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles
que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables
sur site (FRU) associées à cet événement.
5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis
procédez comme suit :
a. Sélectionnez Actions > Visualisation des commentaires.
b. Pour quitter les commentaires, cliquez sur Fermeture.
c. Sélectionnez Actions > Affichage de l'historique de maintenance. La fenêtre Historique de
maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée.
d. Pour quitter l'historique de maintenance, cliquez sur Fermeture.
118 Activation SAS (Serial-attached SCSI) RAID
6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable
et Présentation de l'événement réparable pour les refermer.
Vérification du composant installé via la console SDMC
Si vous avez installé ou remplacé un composant, utilisez la console Systems Director Management
Console (SDMC) pour mettre à jour vos enregistrements SDMC lorsqu'une opération de maintenance est
terminée sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes
d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant installé, procédez comme suit :
Procédure
1. Depuis la console SDMC, recherchez dans le journal des événements les opérations de maintenance
ouvertes. Pour plus de détails, voir «Visualisation d'événements réparables via la console de gestion
Systems Director», à la page 120.
2. Des événements d'action de maintenance sont-ils ouverts ?
Non : Si le voyant d'avertissement système est encore allumé, utilisez la console SDMC pour
l'éteindre. Pour plus d'informations, voir «Activation/Désactivation de voyants via la console de
gestion Systems Director». La procédure est terminée.
Oui : Passez à l'étape suivante.
3. Notez la liste des événements d'action de maintenance ouverts.
4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils
les mêmes que ceux que vous avez collectés auparavant ?
v Non : Sélectionnez l'une des options suivantes :
– Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond
et passez à l'étape suivante.
– Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez
votre fournisseur de services.
v Oui : Passez à l'étape suivante.
5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence
l'événement d'action de maintenance.
6. Cliquez sur Supprimer ou Ignorer.
Remarque : Ces options ne sont disponibles qu'à partir du journal des événements.
Activation/Désactivation de voyants via la console de gestion Systems Director
Utilisez cette procédure pour activer ou désactiver des voyants via la console Systems Director
Management Console.
Pourquoi et quand exécuter cette tâche
Choisissez l'une des options suivantes :
v «Désactivation d'un voyant d'avertissement système ou de partition»
v «Activation ou désactivation de voyants d'identification via la console SDMC», à la page 120
Désactivation d'un voyant d'avertissement système ou de partition :
Pourquoi et quand exécuter cette tâche
Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par
exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement.
Procédures communes aux dispositifs installables 119
Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant
d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient.
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Service et support > Matériel > Voyant d'avertissement système.
3. Sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui
s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
v Un message indique que le système présente encore des incidents non résolus.
v Un message indique que le voyant d'avertissement système ne peut pas être activé.
4. Sélectionnez l'un des serveurs virtuels, puis sélectionnez Désactivation du voyant d'avertissement
système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
v Un message indique que la partition logique présente encore des incidents non résolus.
v Un message indique que le voyant du serveur virtuel ne peut pas être activé.
Activation ou désactivation de voyants d'identification via la console SDMC :
Pourquoi et quand exécuter cette tâche
Le système comporte différents voyants d'identification de composants, tels que les boîtiers ou les unités
remplaçables sur site (FRU). On les appelle donc voyants d'identification.
Vous pouvez activer ou désactiver les types de voyants d'identification suivants :
v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique
(boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer
si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous
pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le
nouveau matériel.
v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un
adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier
sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports
ouverts sur plusieurs adaptateurs.
Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme
suit :
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Service et support > Matériel > Voyant d'identification.
3. Sous Voyant d'identification, sélectionnez la fenêtre Boîtier puis l'unité centrale ou le boîtier.
4. Pour activer ou désactiver un voyant d'identification, cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
5. Pour activer ou désactiver le voyant d'identification d'une unité FRU, sélectionnez un système ou un
boîtier dans le tableau, puis sélectionnez Liste des unités FRU.
6. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
Visualisation d'événements réparables via la console de gestion Systems Director
Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et
historique de maintenance.
120 Activation SAS (Serial-attached SCSI) RAID
Pourquoi et quand exécuter cette tâche
Pour afficher des événements réparables, procédez comme suit :
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Etat et santé du système > Journal des événements.
3. Facultatif : Vous pouvez limiter les critères de recherche d'événement à l'aide du menu de filtrage des
événements.
4. Sélectionnez une ligne dans la fenêtre des événements puis sélectionnez Actions > Propriétés La
fenêtre Propriétés s'ouvre, affichant les informations détaillées relatives à l'événement réparables. Le
tableau indique des informations telles que le numéro d'incident, le code de référence et les unités
remplaçables sur site (FRU) associées à l'événement.
Vérification d'un dispositif installé ou d'un composant remplacé sur un
système ou une partition logique à l'aide des outils Virtual I/O Server
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils
Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou composant est reconnu par le système ou la
partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif récemment installé ou d'un composant de remplacement,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via VIOS
v Vérification du composant de remplacement via VIOS
Vérifiez le dispositif installé via VIOS :
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée.
3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide,
contactez votre fournisseur de services.
Procédures communes aux dispositifs installables 121
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérifiez le composant de remplacement via VIOS :
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
procédez comme suit :
1. Avez-vous remplacé le composant à l'aide de VIOS ou des opérations de remplacement à chaud du
service de diagnostic en ligne ?
Non : Passez à l'étape 2.
Oui : Passez à l'étape 5.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, reportez-vous à la rubrique Exécution d'une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche
ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion VIOS ?
v Non : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou
un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier
que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident,
rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre
pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement
et de démarrage du système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5.
Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez
comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur
Oui et appuyez sur Entrée.
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services
pour obtenir de l'aide.
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en effectuant les opérations suivantes :
a. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester
uniquement le composant remplacé ou les périphériques qui y sont raccordées et appuyez sur
Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
122 Activation SAS (Serial-attached SCSI) RAID
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7.
6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9.
7. Dans le menu Action corrective sur ressource, sélectionnez la ressource du composant remplacé. En
cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans
le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le
journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé,
procédez comme suit.
Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant
passe à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources,
sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
Oui : Passez à l'étape 8.
8. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en
mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu
Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer
qu'un composant détectable par le système a été remplacé, procédez comme suit.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas
dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11.
Oui : Passez à l'étape 12, à la page 124.
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12, à la page 124.
Procédures communes aux dispositifs installables 123
12. Les voyants sont-ils encore allumés ?
v Non. La procédure est terminée.
v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une de ces rubriques :
Modification des indicateurs de service
Installation d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris l'installation d'une nouvelle unité remplaçable sur site (FRU) ou d'un
nouveau composant.
Avant de commencer
Pour utiliser l'interface utilisateur de la console SDMC pour installer un composant sur un système ou
une unité d'extension géré(e) par une console SDMC, procédez comme suit :
Procédure
1. Dans la zone des ressources Power Systems sélectionnez le système sur lequel vous voulez installer
un composant.
2. Dans le menu Actions, développez Service and Support > Matériel > Tâches de modification de
matériel > Ajout de FRU.
3. Dans l'onglet d'ajout de FRU, sélectionnez le système ou le type de boîtier.
4. Sélectionnez dans le menu le type de FRU que vous souhaitez installer et cliquez sur Suivant.
5. Sélectionnez le code d'emplacement pour l'installation du composant et cliquez sur Ajouter.
6. Une fois l'unité remplaçable sur site (FRU) placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour l'installation du
composant. Dans ce cas, suivez les instructions pour installer le composant.
Remplacement d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un
composant.
Pourquoi et quand exécuter cette tâche
Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez
ces instructions.
Procédure
1. Dans la zone des ressources Power Systems, sélectionnez le système sur lequel vous souhaitez
remplacer un composant.
2. Sélectionnez l'une des options suivantes :
v Si vous remplacez un composant en dehors du cadre d'une opération de maintenance, depuis le
menu Actions, développez Service and Support > Matériel > Remplacement de FRU.
v Si vous procédez à un échange de composant dans le but de corriger un événement réparable, voir
Démarrage d'une réparation.
3. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
124 Activation SAS (Serial-attached SCSI) RAID
4. Dans l'onglet Remplacement de FRU, sélectionnez l'unité centrale ou d'extension sur laquelle vous
souhaitez remplacer un composant.
5. Sélectionnez dans le menu le type de composant que vous souhaitez remplacer et cliquez sur Suivant.
6. Sélectionnez dans le menu le code d'emplacement du composant que vous souhaitez remplacer et
cliquez sur Ajouter.
7. Une fois l'unité remplaçable sur site (FRU) placée dans la fenêtre des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le remplacement du
composant. Dans ce cas, suivez les instructions pour remplacer le composant.
Retrait d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Pour utiliser l'interface utilisateur de la console SDMC pour retirer un composant sur un système ou une
unité d'extension géré(e) par une console SDMC, procédez comme suit :
Procédure
1. Dans la zone des ressources Power Systems, sélectionnez le système duquel vous souhaitez retirer un
composant.
2. Sélectionnez l'une des options suivantes :
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, depuis le menu Actions
développez Service and Support > Matériel > Tâches de modification de matériel.
v Si vous retirez un composant dans le cadre d'une opération de maintenance, depuis le menu
Actions développez Service and Support > Matériel > Remplacement de FRU.
3. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
4. Dans la fenêtre d'ajout ou de remplacement de FRU, sélectionnez le système ou l'unité d'extension
duquel ou de laquelle vous souhaitez retirer le composant.
5. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant.
6. Sélectionnez l'emplacement du composant, puis cliquez sur Ajouter.
7. Une fois que le composant figure dans la fenêtre des actions en attente, cliquez sur Lancer la
procédure et suivez les instructions de retrait du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le retrait du
composant. Dans ce cas, suivez ces instructions pour retirer le composant.
Procédures communes aux dispositifs installables 125
126 Activation SAS (Serial-attached SCSI) RAID
Remarques
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services du fabricant non annoncés dans ce pays.
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents
d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute
référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou
service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint
aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier
lui-même le fonctionnement de tout produit, programme ou service.
Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans
le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou
demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences,
veuillez en faire la demande par écrit au fabricant.
Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait
contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE
GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE
RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN
CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions
n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas
applicable.
Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement.
Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels
décrits dans ce document.
Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information
uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments
figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites
relève de votre seule responsabilité.
Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation
de sa part, tout ou partie des informations qui lui seront fournies.
Les données de performance indiquées dans ce document ont été déterminées dans un environnement
contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement
d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont
pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les
résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont
applicables à leur environnement d'exploitation.
Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de
ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant
n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il
ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question
concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces
produits.
127
Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être
modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif.
Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans
préavis. Les tarifs appliqués peuvent varier selon les revendeurs.
Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées
avant la mise à disposition des produits décrits.
Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans
l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de
marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et
toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite.
Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur
n'apparaissent pas à l'écran.
Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites,
même partiellement, sans l'autorisation écrite d'IBM.
Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est
exploitable dans aucun autre but.
Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou
de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension,
de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des
composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été
sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu
de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la
vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le
cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de
support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés.
Restrictions liées à l'utilisation de connexion Ethernet
Ce produit n'est pas destiné à être connecté, directement ou indirectement, par quelque moyen que ce
soit, à des interfaces de réseaux de télécommunications publiques.
Marques
IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou
dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou
d'autres sociétés. La liste actualisée de toutes les marques d'IBM est disponible sur la page Web
Copyright and trademark information à http://www.ibm.com/legal/copytrade.shtml.
Linux est une marque déposée de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays.
Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés.
Bruits radioélectriques
Lorsque vous connectez un moniteur à l'équipement, vous devez utiliser le câble fourni à cet effet, ainsi
que toute unité de suppression des interférences.
128 Activation SAS (Serial-attached SCSI) RAID
Remarques sur la classe A
Les avis de conformité de classe A suivants s'appliquent aux serveurs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques
définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité
aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques
dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il
risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du
constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et
télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. IBM ne peut pas être tenue pour responsable du
brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés
ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra
annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une
modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones commerciales et industrielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour
en éliminer les causes.
Remarques 129
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus :
Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI
(Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle,
il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures
nécessaires pour en éliminer les causes.
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - République populaire de Chine
Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de
perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer
les causes.
130 Activation SAS (Serial-attached SCSI) RAID
Avis d'interférences électromagnétiques (EMI) - Taïwan
Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus.
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer.
Avis d'interférences électromagnétiques (EMI) - Corée
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Remarques 131
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Avis d'interférences électromagnétiques (EMI) - Russie
Remarques sur la classe B
Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la
compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la
classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications
de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones
résidentielles.
Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les
communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il
n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière.
Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision,
mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet
incident en appliquant une ou plusieurs des mesures suivantes :
v Réorienter ou repositionner l'antenne de réception.
v Eloigner l'appareil du récepteur.
132 Activation SAS (Serial-attached SCSI) RAID
v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct.
v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de
l'aide.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez
votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions
radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification
non autorisée pourra annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant
d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones résidentielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Remarques 133
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - Corée
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
134 Activation SAS (Serial-attached SCSI) RAID
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Dispositions
Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes.
Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial,
sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou
publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès
du fabricant.
Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein
de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez
reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise,
ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant.
Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou
autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données,
logiciels ou droits de propriété intellectuelle contenus dans ces publications.
Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si
l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les
instructions susmentionnées n'ont pas été respectées.
Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois
et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à
l'exportation.
LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES
PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE.
LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES
INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A
L'EXECUTION D'UN TRAVAIL DONNE.
Bull ESCALA PL 3200R
Service Guide
86 A1 82EF 03
ORDER REFERENCE
Bull ESCALA PL 3200R
Service Guide
Hardware
March 2004
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 82EF 03
ORDER REFERENCE
The following copyright notice protects this book under the Copyright laws of the United States of America
and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and
making derivative works.
Copyright Bull S.A. 1992, 2004
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of
this book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
AIX is a registered trademark of International Business Machines Corporation, and is being used under
licence.
UNIX is a registered trademark in the United States of America and other countries licensed exclusively through
the Open Group.
Linux is a registered trademark of Linus Torvalds.
The information in this document is subject to change without notice. Bull will not be liable for errors contained
herein, or for incidental or consequential damages in connection with the use of this material.
Contents Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .ix Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . .xi About This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . .1 406/90 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Hardware Management Console (HMC) . . . . . . . . . . . . . . . . . . . . . . . . .3 Powering the System On and Off . . . . . . . . . . . . . . . . . . . . . . . . . .10 406/90 Data Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11 Power-On Self-Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 POST Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 POST Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 406/90 System Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . .13 406/1D I/O Subsystem Locations . . . . . . . . . . . . . . . . . . . . . . . . . . .24 Power Subsystem Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . .29 Power Distribution Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . .30 I/O Subsystem Power Distribution Cabling . . . . . . . . . . . . . . . . . . . . . . .33 Integrated Battery Feature Locations . . . . . . . . . . . . . . . . . . . . . . . . .35 Subsystem Positioning and Cabling . . . . . . . . . . . . . . . . . . . . . . . . . .36 System Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .60 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62 Logical and Physical Locations . . . . . . . . . . . . . . . . . . . . . . . . . . .64 Physical Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64 AIX Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .129 Service Inspection Guide . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142 Preventive Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142 Chapter 2. Diagnostics Overview . . . . . . . . . . . . . . . . . . . . . . . . .145 Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . .145 Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .145 FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .147 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .147 Running Diagnostics on the Native Serial Port on a Full Machine Partition with an HMC Attached 148 Chapter 3. Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . .149 Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .149 Quick Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .151 MAP 1020: Problem Determination . . . . . . . . . . . . . . . . . . . . . . . . .158 MAP 1230: Linux Platform Error Problem Resolution . . . . . . . . . . . . . . . . . . .163 MAP 1235: System Array Self-Repair Problem Resolution for Linux . . . . . . . . . . . . .168 MAP 1320: Service Focal Point . . . . . . . . . . . . . . . . . . . . . . . . . . .169 MAP 1321: Quick Entry MAP for Systems with Service Focal Point . . . . . . . . . . . . . .170 MAP 1322: End of Call MAP for Systems with Service Focal Point . . . . . . . . . . . . . .183 iii
MAP 1420: Recovery Procedures for Hot-Pluggable PCI Adapters . . . . . . . . . . . . . .188 MAP 1520: Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .192 MAP 1521: The System Will Not Power On And No Error Codes Are Available . . . . . . . . .196 MAP 1522: UEPO Switch On The BPC Is In The Bypass Position . . . . . . . . . . . . . .199 MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault . . . . . . . . . . .200 MAP 1524: An Open Room EPO Switch Has Been Detected From One Side . . . . . . . . . .201 MAP 1525: There Is a 350 Volt Bulk Failure . . . . . . . . . . . . . . . . . . . . . .203 MAP 1526: There Is An Integrated Battery Feature (IBF) Failure . . . . . . . . . . . . . . .206 MAP 1527: An Airflow Loss Has Been Detected . . . . . . . . . . . . . . . . . . . . .209 MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault . . . . . . . . . . .211 MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure . . . . . . . . . .212 MAP 152a: Loss of ac Power or Phase Missing . . . . . . . . . . . . . . . . . . . . .214 MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .216 MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .223 MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem . . . . . . . . . .231 MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem . . . . . . . . . . .240 MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .245 MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .253 MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .259 MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .267 MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .276 MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . .284 MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .293 MAP 152m: Cable Problem in Power Subsystem . . . . . . . . . . . . . . . . . . . .300 MAP 152n: DASD Subsystem Power Problem . . . . . . . . . . . . . . . . . . . . .304 MAP 1540: Problem Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . .308 MAP 1541: JTAG Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . .310 MAP 1542: I/O Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . .318 MAP 1543: MCM Module Problem Isolation . . . . . . . . . . . . . . . . . . . . . .337 MAP 1544: Inner L3 Module Problem Isolation . . . . . . . . . . . . . . . . . . . . .340 MAP 1545: Outer L3 Module Problem Isolation . . . . . . . . . . . . . . . . . . . . .342 MAP 1546: Memory Book Problem Isolation . . . . . . . . . . . . . . . . . . . . . .345 MAP 1547: I/O Book Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .348 MAP 1548: Memory and Processor Problem Isolation . . . . . . . . . . . . . . . . . . .350 MAP 1549: Attention Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .353 MAP 154A: I2C Bus Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .361 MAP 154B: Insufficient Hardware Resources Problem Isolation . . . . . . . . . . . . . . .365 Chapter 4. Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . .369 IPL Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .369 Service Processor Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . .372 Firmware Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .379 Chapter 5. Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . .413 Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .413 Confirming Initial Error Code . . . . . . . . . . . . . . . . . . . . . . . . . . . .414 Four-Character Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Replacing the Media Subsystem Operator Panel . . . . . . . . . . . . . . . . . . . .415 Replacing the Network Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Determining Location Code . . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Checkpoints and Error Codes Index . . . . . . . . . . . . . . . . . . . . . . . . .416 Operator Panel Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . .419 SPCN Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .420 Firmware/POST Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . .488 Service Processor Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . .516 System Firmware Update Messages . . . . . . . . . . . . . . . . . . . . . . . . .736 iv
406/90
Service
Guide
Capacity Upgrade on Demand (CUoD) Messages . . . . . . . . . . . . . . . . . . . .736 Scan Dump Messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .736 Common Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . .737 Problem Determination-Generated Error Codes . . . . . . . . . . . . . . . . . . . . .757 Hypervisor Dump Retrieval Procedure . . . . . . . . . . . . . . . . . . . . . . . .757 Chapter 6. Using the Online and Standalone Diagnostics . . . . . . . . . . . . . . . .759 Operating Considerations for Online and Standalone Diagnostics . . . . . . . . . . . . . .759 Online Diagnostics Modes of Operation . . . . . . . . . . . . . . . . . . . . . . . .760 Standalone Diagnostic Operation . . . . . . . . . . . . . . . . . . . . . . . . . .764 Chapter 7. Using the Service Processor . . . . . . . . . . . . . . . . . . . . . . .769 Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . .769 General User Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .771 Privileged User Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .772 Service Processor Parameters in Service Mode (Full System Partition) . . . . . . . . . . . .801 System Power-On Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . .802 Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . . . . . . . . . .803 System Firmware Updates . . . . . . . . . . . . . . . . . . . . . . . . . . . . .805 Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed . . . . . . . . . .809 Installing Corrective Service on the Frame . . . . . . . . . . . . . . . . . . . . . . .810 Reconfiguration Procedure for SNI Adapters . . . . . . . . . . . . . . . . . . . . . .811 Configuring and Deconfiguring Processors or Memory . . . . . . . . . . . . . . . . . .812 Service Processor System Monitoring - Surveillance . . . . . . . . . . . . . . . . . . .813 Call-Out (Call-Home) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .815 Console Mirroring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .816 Service Processor Error Logs . . . . . . . . . . . . . . . . . . . . . . . . . . .817 LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . . . . . . . . . .818 Resetting the Service Processor . . . . . . . . . . . . . . . . . . . . . . . . . .819 Service Processor Operational Phases . . . . . . . . . . . . . . . . . . . . . . . .820 Chapter 8. Using System Management Services . . . . . . . . . . . . . . . . . . .823 Select Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .824 Change Password Options . . . . . . . . . . . . . . . . . . . . . . . . . . . .825 View Error Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .825 Setup Remote IPL (Initial Program Load) . . . . . . . . . . . . . . . . . . . . . . .826 Change SCSI Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .828 Select Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .828 Select Boot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .829 Select Boot Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .831 Display Current Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . .833 Restore Default Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . .834 Multiboot Startup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .834 Exiting System Management Services . . . . . . . . . . . . . . . . . . . . . . . .834 Chapter 9. Removal and Replacement Procedures . . . . . . . . . . . . . . . . . . .835 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .837 Powering Off and Powering On the System . . . . . . . . . . . . . . . . . . . . . .838 Hot-Pluggable FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .842 Power Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Processor Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .853 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .913 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .998 Chapter 10. Parts Information . . . . . . . . . . . . . . . . . . . . . . . . . .1009 406/1R Rack Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1010 Contents
v
406/81 Processor Subsystem (Front) . . . . . . . . . . . . . . . . . . . . . . . .1012 406/81 Processor Subsystem (Front) . . . . . . . . . . . . . . . . . . . . . . . .1014 406/81 Processor Subsystem (Rear) . . . . . . . . . . . . . . . . . . . . . . . .1018 406/81 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . .1021 Power and SCSI Cables to the Media Subsystem . . . . . . . . . . . . . . . . . . .1023 406/1D I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1025 Power Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1027 RIO Cables and I/O Power Cables . . . . . . . . . . . . . . . . . . . . . . . . .1029 Keyboards and Mouse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1031 System Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1032 Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1033 Chapter 11. SRNs for Linux . . . . . . . . . . . . . . . . . . . . . . . . . . .1035 Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . .1049 Product Recycling and Disposal . . . . . . . . . . . . . . . . . . . . . . . . . .1049 Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . .1054 Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1055 Appendix C. Service Processor Setup and Test . . . . . . . . . . . . . . . . . . .1057 Service Processor Setup Checklist . . . . . . . . . . . . . . . . . . . . . . . . .1057 Testing the Service Processor Setup . . . . . . . . . . . . . . . . . . . . . . . .1058 Appendix D. Modem Configurations . . . . . . . . . . . . . . . . . . . . . . . .1061 Sample Modem Configuration Files . . . . . . . . . . . . . . . . . . . . . . . . .1061 Configuration File Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . .1062 Transfer of a Modem Session . . . . . . . . . . . . . . . . . . . . . . . . . . .1066 Modem Configuration Sample Files . . . . . . . . . . . . . . . . . . . . . . . . .1068 Appendix E. Ground Path . . . . . . . . . . . . . . . . . . . . . . . . . . . .1089 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1091
vi
406/90
Service
Guide
Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v viii v 192 v 837 v 861 v 987 A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury. Caution notices appear on the following pages: v viii v ix v 133 v 192 v 837 v 859 v 908 v 998 v 935 v 946 For a translation of the safety notices contained in this book, see the System Unit Safety Information, order number SA23-2652. Rack Safety Instructions v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C. v Do not install this unit in a rack where the airflow is compromised. Any side, front or back of the unit used for air flow through the unit must not be in direct contact with the rack. v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removing this unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to the rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement for the supply circuit. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock.Electrical Safety Observe the following safety instructions any time you are connecting or disconnecting devices attached to the system. vii
DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a four-wire (three-phase and ground) power cable for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C27 DANGERTo prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system.D01 CAUTION: This unit has more than one power supply cord. Follow procedures for removal of power from the system when directed.C28 CAUTION: Energy hazard, remove power before servicing.C22 CAUTION: Energy hazard, remove all jewelry before servicing.C29 viii
406/90
Service
Guide
Laser Safety Information CAUTION: This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1 laser products.C30 Laser Compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition 1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles.C25, C26
Preface
ix
x
406/90
Service
Guide
Data Integrity and Verification
These computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check our support websites for updated information and fixes applicable to the system and related software.
xi
xii
406/90
Service
Guide
About This Book This book provides maintenance information that is specific to the 406/90, adapters, and attached devices that do not have their own service information. It also contains Maintenance Analysis Procedures (MAPs) that are not common to other systems. MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus Systems. This book is used by the service representative to repair system failures. This book assumes that the service representative has had training on the system. ISO 9000 ISO 9000 registered quality systems were used in the development and manufacturing of this product. Highlighting The following highlighting conventions are used in this book:
Bold
Identifies commands, subroutines, keywords, files, structures, directories, and other items whose names are predefined by the system. Also identifies graphical objects such as buttons, labels, and icons that the user selects.
Italics
Identifies parameters whose actual names or values are to be supplied by the user.
Monospace
Identifies examples of specific data values, examples of text similar to what you might see displayed, examples of portions of program code similar to what you might write as a programmer, messages from the system, or information you should actually type. xiii
In this manual, 406/90 stands for ESCALA PL 3200R.
Related Publications The following publications are available: v The System Unit Safety Information, order number 86 X1 11WD contains translations of safety information used throughout this book. v The Hardware Management Console Operations Guide, order number SA23-1251, contains information on how to set up and cable the hardware management console and verify system operation. v The Hardware Management Console Maintenance Guide, order number SA23-1273, contains MAPS, removal and replacement procedures, error codes, and parts information that help trained service representatives diagnose and repair the hardware management console. v The 45H/W4 Switch Planning, Installation, and Service Guide, order number GA22-7954, contains information on installing and servicing the pSeries High Performance Switch (machine type 7045-SW4), which is an option on this system. v The 406/90, order number SA23-1248, contains information on how to set up and cable the system, install and remove options, and verify system operation. v The 406/90, order number SA23-1249, contains information on how to use the system, use diagnostics, use service aids, and operations. v The Diagnostic Information for Multiple Bus Systems, order number SA23-2769, contains common diagnostic procedures, error codes, service request numbers, and failing function codes. This manual is intended for trained service technicians. v The Adapters, Devices, and Cable Information for Multiple Bus Systems, order number SA23-2778, contains information about adapters, external devices, and cabling. This manual is intended to supplement information found in the Diagnostic Information for Multiple Bus Systems. v The PCI Adapter Placement Reference, order number SA23-2504, contains information regarding slot restrictions for adapters that can be used in this system. v The Site and Hardware Planning Information, order number SA38-0508, contains information to help you plan your installation. v The AIX Installation Guide and Reference, order number SC23-4389, describes how to install systems, and to use a network server to do functions such as installing the operating system or diagnostics on systems connected to the network.Trademarks The following terms are trademarks of International Business Machines Corporation in the United States, other countries, or both: v AIX v Electronic Service Agent v IBM® v LANstreamerOther company, product, and service names may be trademarks or service marks of others. xiv
406/90
Service
Guide
Chapter 1. Reference Information This chapter provides an overview of the 406/90, including a logical description and a physical overview of the system. Additional details pertaining to the server are also provided. They include the following: v Memory overview and ordering rules v General description of the operator panel v Cabling rules v System location rules and descriptions v Powering on and powering off the system v Power flow v Data flow406/90 Overview The 406/90 server is a multiprocessor, multibus system composed of the following subsystems: v 406/1R, a 24-inch wide and 42-EIA-high rack unit that also contains the following: – An 8-EIA unit power subsystem, consisting of two redundant bulk power assemblies that distribute bulk power at 350 V to each of the subsystems. – An optional 2 EIA-unit integrated battery feature (IBF), which is available in either a redundant or non-redundant configuration.v 406/81 Processor Subsystem, a 17 EIA-unit processor subsystem, which contains the processors, L3 cache, memory, and service processor and a 1 EIA-unit media subsystem containing a diskette drive, a CD-ROM, and an operator panel, with space for up to three additional optional media devices. There is an option to expand the number of RIO (in this case RIO-2) ports to . This allows up to RIO-2 I/O subsystems, each with 2 GB-per-second transfers on each planar. Processor speeds up to 1.5 GHz are also available. Additional memory cards that operate at the higher 567 MHz bus speed are available. v 406/1D I/O Subsystem, 4 EIA-units, which can have up to 20 PCI adapters and up to 16 DASD disk drives. This subsystem may also be purchased to allow 10 or 20 PCI-X adapters, for use with the RIO-2 cabling option. Additionally, there is an option that allows 1 GB RIO-2 on each remote I/O port. RIO-2 allows up to 2 GB I/O transfers when cabled as a single planar loop (four CEC RIO-2 ports per I/O subsystem). v A Hardware Management Console (HMC), which provides a standard user interface for configuring and operating partitioned systems. One HMC can support multiple partitions on multiple systems. 1
The basic system consists of the redundant bulk power subsystem, one processor subsystem, a media subsystem, and one I/O subsystem in the same rack, and one HMC, as shown in the following illustration:
1
Hardware Management Console
5, 6*
406/1R primary Integrated Battery Feature (IBF), optional for the primary power subsystem. Redundant IBF, also optional, and installs from the rear.
2
406/1R Bulk Power Subsystem
7
406/1D I/O Subsystem
3
406/81 Processor Subsystem
8
406/1D I/O Subsystem (Optional)
4
406/81 Media Subsystem
9
406/1D I/O Subsystem (Optional)
*
406/1D I/O Subsystem (Optional, If features 5 and 6 are not installed)
You can expand the system by adding I/O subsystems for a total of eight in a minimum of two racks. The preceding figure shows the single rack. Additional I/O drawers and integrated battery features are installed in a second rack. A number of cables connect the power, processor, and I/O subsystems. These cables include the following: v UPIC (Universal Power Interface Controller) cables v RIO (Remote Input Output) cablesThe ac power cords are connected to the bulk power subsystem, which distributes the power throughout the system. Each I/O subsystem contains two I/O backplanes. Each backplane is capable of supporting the following: v Up to 10 adapters each (20 per subsystem) v Up to 8 disk drives (16 per subsystem) in two DASD 4-packs (4 per subsystem) v One distributed converter assembly (DCA) (2 per subsystem) v Two I/O subsystem fans (4 per subsystem)On each I/O backplane, PCI bus slots 1 through 7 are 3.3 V dc connectors running at 66 MHz, supporting 64-bit PCI adapters. Slots 8 through 10 are 64-bit 5 V dc slots, running at 33 MHz. 2
406/90
Service
Guide
The media subsystem provides space for up to three media devices (diskette drive, CD-ROM drive, and optional tape drive or DVD-RAM) in the front and two additional optional devices in the rear. Hardware Management Console (HMC) The Hardware Management Console (HMC) supports your system with features that allow you to manage configuration and operation of partitions in a system, as well as add and remove hardware without interrupting system operation. In this book, a system that is managed by the HMC is referred to as the managed system. The HMC uses its serial connection to the managed system to perform various functions. The HMC’s main functions include the following: v Creating and maintaining a multiple partition environment v Detecting, reporting, and storing changes in hardware conditions v Acting as a service focal point for service representatives to determine an appropriate service strategyPartitioning the system is similar to partitioning a hard drive. When you partition a hard drive, you divide a single hard drive so that the operating system recognizes it as a number of separate hard drives. The same holds true for the HMC’s partitioning capabilities, except the HMC allows you to divide the system’s processors, memory, and I/O. On each of these divisions, you can install an operating system and use each partition as you would a separate physical machine. This division of system resources is called a logical partition. The objective of partitioning is to provide users with the capability to split a single system into several independent systems, each capable of running applications in multiple, independent environments simultaneously. For example, partitioning makes it possible for a user to run a single application using different sets of data on separate partitions, as if it were running independently on separate physical systems. Service representatives use Service Focal Point (SFP), an application that runs on the HMC, to start and end their service calls. SFP provides service representatives with the serviceable event, vital product data (VPD), and diagnostic information. The HMC is a closed system. Additional applications cannot be installed on the HMC. All the tasks needed to maintain the platform, the underlying operating system, and the HMC application code are available by using the HMC’s management applications. Partition Standby and Full System Partition Power-On Options Booting your system in partition standby is markedly different from booting your system in a traditional single-machine full system partition. In partition standby, the system sets aside system memory for partition management. Partition Standby Memory Issues Unique issues are associated with assigning memory to each partition created in partition standby. In partition standby, the HMC allocates a portion of each assigned memory block to the system. Each partition requires a minimum of 1 gigabyte (GB) of system memory. When you start creating partitions, the system sets aside 256 megabytes (MB) of contiguous memory for its own use, and allocates another 256 MB of contiguous memory for each 16 GB allocated. Page Table Memory Usage: Partition page tables are additional memory required for a partition to operate, and is in addition to the total logical memory size of a partition. The partition table is outside of a partition’s accessible memory. The partition page table must be constructed with contiguous real system memory segments. Use the following table to help you keep track of the system’s page-table memory usage:
Chapter
1.
Reference
Information
3
Partition Memory Size (256 MB increments)
Partition Page Table Size (4 16-byte entries per 4 K real page)
Partition Page Table Alignment
Assigned Memory Segments (256 MB)
1 GB
16 MB
16 MB
1
1 GB - 2 GB
32 MB
32 MB
1
2 GB - 3 GB
64 MB
64 MB
1
4 GB - 7 GB
128 MB
128 MB
1
8 GB - 15 GB
256 MB
256 MB
1
16 GB - 31 GB
512 MB
512 MB
2
32 GB - 63 GB
1 GB
1 GB
4
64 GB - 96 GB
2 GB
2 GB
8
In
a full system partition, the operating system uses all of the installed memory; the system does not set aside contiguous memory for its own use. Keep this in mind when you record the amount of memory used when you are completing the planning worksheets in the Hardware Management Console Operations Guide, order number SA23-1251. Partition Requirements To activate a partition, you need a minimum of the following: v 1 GB of available system memory v One available system processor v One available network adapter (for error reporting)If you attempt to activate a partition and the resources that you specified are not available at that time, activation fails. It is important to keep track of your system’s resources and your activation times to avoid failures. Service Focal Point This chapter provides information about using the Service Focal Point application. The Service Focal Point application is used to help the service representative diagnose and repair problems on partitioned systems. Service representatives use the HMC as the starting point for all service issues. The HMC groups various system management issues at one control point, allowing service representatives to use the Service Focal Point application to determine an appropriate service strategy. Traditional service strategies become more complicated in a partitioned environment. Each partition runs on its own, unaware that other partitions exist on the same system. If one partition reports an error for a shared resource, such as a managed system power supply, other active partitions report the same error. The Service Focal Point application enables service representatives to avoid long lists of repetitive call-home information by recognizing that these errors repeat, and by filtering them into one error code. The following types of errors are reported to Service Focal Point: v Permanent hardware errors (detected by the managed system or operating system) v LAN surveillance errors detected by Service Focal Point v Hardware boot failure errorsThe following errors are not reported to Service Focal Point: v Software errors v Temporary hardware errors v Undetermined hardware errors 4
406/90
Service
Guide
v Informational hardware errorsErrors that require service are reported to the HMC as serviceable events. Because the HMC stores these serviceable events for 90 days and then discards them, it is important to have the partition and HMC date and time set correctly. For instance, if the date on a partition’s software is set 90 days behind the HMC’s set time, the serviceable events reported by this partition are immediately discarded. For more information about setting the HMC’s date and time, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. To set the partition’s date and time, see the documentation provided with the operating system that is running on that partition. Getting Started When you are setting up Service Focal Point, keep the following in mind: v If the time configured on a partition is 90 days older than time configured on the HMC, serviceable events cannot be reported. v Verify that the HMC host names are defined. For more information about using fully qualified and short host names, see Hardware Management Console Installation and Operations Guide, order number SA23-1251. v If you need to add or change a partition name, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251.Testing Error Reporting To ensure that Service Focal Point is configured correctly, generate a test error by doing the following: 1. In the partition, run diagnostics to test the managed system’s operator panel. 2. When the diagnostics window asks you if you see 0000 on the managed system’s operator panel, select NO. This action generates an error. 3. In the SRN window, press Enter to proceed. 4. When the system asks you if you want the error sent to Service Focal Point, select YES. 5. Type F3 to exit diagnostics. 6. Wait for one minute while the managed system sends the error to Service Focal Point. 7. Check the Serviceable Event window to ensure that the error was sent to Service Focal Point and that Service Focal Point reported the error. For more information about working with serviceable events, see “Working With Serviceable Events” on page 7.Service Focal Point Settings The Service Focal Point Settings task in the HMC Contents area allows you to configure your Service Focal Point application. Note: The Advanced Operator, Operator, and Viewer roles have read-only access to the following tasks. Automatic Call-Home Feature: You can configure the HMC to automatically call an appropriate service center when it identifies a serviceable event. To enable or disable the call-home feature, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo enable or disable the call-home feature for one or more systems, do the following:Note: It is strongly recommended that you not disable the call-home feature. When you disable the call-home feature, serviceable events are not automatically reported to the service representative. 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Service Focal Point Settings. Chapter
1.
Reference
Information
5
4. The Service Focal Point Settings window opens. Select the CEC Call Home tab at the top of the window. 5. Click on the managed system you want to enable or disable. 6. Click Enable to enable call-home for the selected system, or click Disable to disable call-home for the selected system. 7. Click OK.Setting Up Surveillance: Service Focal Point surveillance generates serviceable events when it detects communication problems between the HMC and its managed systems. You can configure how you want the HMC to survey the following: v The number of disconnected minutes that are considered an outage v The number of connected minutes you want the HMC to consider a recovery v The number of minutes between outages that are considered a new incidentTo set up surveillance, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo set up surveillance, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 4. The Service Focal Point Settings window opens. Select the Surveillance Setup tab on the top of the window. 5. In the first field, select the number of minutes you want the HMC to wait before sending a disconnection error message. 6. In the second field, select the amount of connection time that the HMC is considered to be recovered. This amount is expressed in minutes. 7. In the third field, select the number of minutes between outages that you want the HMC to wait before sending a new incident report. 8. Select one or more managed systems from the table in the lower part of the window, then click Enable or Disable. Surveillance is then either enabled or disabled for the selected managed systems.Enabling Surveillance Notifications: You can enable or disable surveillance-error notification from this HMC to connected managed systems. Enabling this notification causes errors to be passed to the Service Agent application for notification. Note: You must further configure Service Agent to handle notifications sent by Service Focal Point. For more information about Service Agent, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. To set up surveillance, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo set up surveillance-error notification, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 6
406/90
Service
Guide
4. The Service Focal Point Settings window opens. Select the Surveillance Notification tab at the top of the window. 5. Select one or more managed systems from the list, and click Enable or Disable. Surveillance notification is then either enabled or disabled for the selected managed systems.Working With Serviceable Events You can view, add, or update serviceable event information, including error details. Viewing Serviceable Events: To view serviceable events, you must be a member of one of the following roles: v System Administrator v Service Representative v Advanced Operator v Operator v ViewerTo view a serviceable event, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Select Serviceable Event. 4. Designate the set of serviceable events you want to view. When you are finished, click OK. 5. The Serviceable Event Overview window opens, and the entries displayed are ordered by time stamp. Each line in the Serviceable Event Overview window corresponds to one error within a serviceable event. On this window, designate the set of serviceable events you want to view by specifying your search criteria (such as event status or error class). Note: Only events that match all of the criteria that you specify are shown. 6. When you are finished, click OK.When you select a line in the Serviceable Event Overview window, all lines in the same serviceable event are selected. To open the Serviceable Event Details window for the selected event, select the event and click Event Details. Viewing Serviceable Event Details: To view serviceable event details, do the following: 1. Perform the steps in “Viewing Serviceable Events.” 2. The Serviceable Event Details window opens, showing extended serviceable event information, including the following: v Status v Earliest original time stamp of any managed object v AIX error log. (The Linux system error log does not place entries into Service Focal Point.) v Should this error ever get called home? v Error was called home v Pointer to extended error-data collection on the HMCThe window’s lower table displays all of the errors associated with the selected serviceable event. The information is shown in the following sequence: v Failing device system name v Failing device machine type/model/serial v Error class v Descriptive error text
Chapter
1.
Reference
Information
7
Viewing Serviceable Event Error Details: To view serviceable event error details, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Select an error in the lower table, and click Error Details.Viewing Service Processor Error Details: To view service processor error details, do the following: 1. Perform the steps in “Viewing Serviceable Event Error Details.” 2. If the serviceable event error details you are viewing are for a service processor-class error, the lower table on the resulting window contains service processor errors. Select a service processor error from the lower table, and click Service Processor Error Details to see further details.Saving and Managing Extended Error Data: To save extended error (EE) data, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Save EE Data. To save extended error data for only one error associated with the serviceable event (rather than for the entire serviceable event), select the error from the lower table, and click Error Details. In the next menu, click Manage EE Data.Viewing and Adding Serviceable Event Comments: To add comments to a serviceable event, you must be a member of the Service Representative or System Administrator roles. To add comments to a serviceable event, do the following: Note: You cannot edit or delete previous comments. 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Select the error to which you want to add comments to and click Comments. If you want to close the event and add comments, click Close Event from this window. The Serviceable Event Comments window opens. 3. Type your name and add comments as appropriate. You can also review previous comments, but you cannot edit this information. 4. If you clicked Comments on the Serviceable Event Details window, clicking OK commits your entry and returns you to the Serviceable Event Details window.If you clicked Close Event on the Serviceable Event Details window, clicking OK commits all changes and opens the Update FRU Information window. For more information about updating field replaceable unit information, see “Updating Field Replaceable Unit (FRU) Information.” Closing a Serviceable Event: To close a serviceable event, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Close Event from this window. The Serviceable Event Comments window opens. 3. Click OK to commit your comments. The Update FRU Information window displays. See “Updating Field Replaceable Unit (FRU) Information” for information on completing this window. Click OK on the Update FRU Information window to close the serviceable event.Note: You must close a serviceable event after it has been serviced to ensure that if a similar error is reported later, it is called home. If an old problem remains open, the new similar problem is reported as a duplicate. Duplicate errors are neither reported nor called home to a service center. Close a serviceable event when the partition that reports the error is active. Closing the event causes the new status of the serviceable event to be correctly sent to the partition.Updating Field Replaceable Unit (FRU) Information: This task allows you to update the FRU information you changed or modified as a result of this serviceable event. From this panel, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. To update FRU information, do the following: 8
406/90
Service
Guide
1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click FRU Information. The Update FRU Information window opens. The lower table shows any parts that you have replaced or added during your current update session but that have not been committed to the serviceable event. The changes from the lower table are committed by clicking OK or Apply.From this window, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. Replacing an Existing FRU: To replace a part already listed for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 8. 2. In the upper table, double-click the part you want to replace. 3. If the FRU has a new part number, type it in the New FRU Part Number field. 4. Click Replace FRU. The Update FRU Information window displays the FRU replacement information in the lower table. Click OK or Apply to commit the changes to the serviceable event.Adding a New FRU: You can add a part to the serviceable event that was not listed in the upper table of the Update FRU Information window. To add a new FRU for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 8. 2. Click Add New FRU. 3. Type the FRU’s location code and its part number in the appropriate fields. 4. Click Add to List. The Update FRU Information window opens and displays the newly added FRU in the lower table. 5. Click OK or Apply to commit these changes to the serviceable event. Note: After you click OK or Apply, you cannot change this information. If you clicked the Close Event button in the Serviceable Event Details window, then clicking OK also completes the close dialog and changes the status of the serviceable event to Closed.Viewing Serviceable Event Partition Information: You can view partition information associated with this serviceable event. This information includes each affected partition’s state and resource use. 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Partition Information.Activating and Deactivating FRU LEDs This task allows you to activate or deactivate a managed system’s system attention LED or any FRU LED. FRU LEDs are helpful in determining which FRUs need servicing. To activate or deactivate a managed system’s System Attention LED, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one or more managed systems from the table. 5. Select either Activate LED or Deactivate LED. The associated System Attention LED is then either turned on or off.To activate or deactivate a FRU associated with a particular managed system, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one managed system from the table. Chapter
1.
Reference
Information
9
5. Click the List FRUs... button. The list of FRU slot indexes and their respective current LED states display. 6. Select one or more FRU slot indexes. 7. Click either the Activate LED or the Deactivate LED button.The associated FRU LEDs are now either enabled (blinking) or off. Powering the System On and Off For complete procedures to power on and power off the system in various system configurations running various operating systems, see “Powering the System On” on page 841. 10
406/90
Service
Guide
406/90 Data Flow The following figure shows the data flow for the system. Chapter
1.
Reference
Information
11
Power-On Self-Test After power is turned on and before the operating system is loaded, the partition does a power-on self-test (POST). This test performs checks to ensure that the hardware is functioning correctly before the operating system is loaded. During the POST, a POST screen displays, and POST indicators display on the virtual terminal. The next section describes the POST indicators and functions that can be accessed during the POST. POST Indicators POST indicators indicate tests that are being performed as the partition is preparing to load the operating system. The POST indicators are words that display on the virtual terminal. Each time that the firmware starts another step in the POST, a POST indicator word displays on the console. The POST screen displays the following words: Memory Memory test Keyboard Initialize the keyboard and mouse. The time period for pressing a key to access the System Management Services, or to initiate a service mode boot is now open. See “POST Keys” for more information. Network Self-test on network adapters SCSI Adapters are being initialized Speaker Sounds an audible tone at the end of POSTPOST Keys The POST keys, if pressed after the keyboard POST indicator displays and before the last POST indicator (speaker) displays, cause the system to start services or boot modes used for configuring the system and diagnosing problems. The system will beep to remind you to press the POST key (if desired) at the appropriate time. The keys are described as follows: 1 Key The numeric 1 key, when pressed during POST, starts the System Management Services (SMS) interface. 5 Key The numeric 5 key, when pressed during POST, initiates a partition boot in service mode using the default service mode boot list. This mode attempts to boot from the first device of each type found in the list. It does not search for other bootable devices of that type if the first device is not bootable. Instead, it continues to the next device type in the list. The firmware supports up to five entries in the boot list. The default boot sequence is: 1. Diskette 2. CD-ROM 3. Hard file 4. Tape drive (if installed) 5. Network a. Token ring b. Ethernet6 Key The numeric 6 key works like the numeric 5 key, except that firmware uses the customized service mode boot list, which can be set in the AIX service aids. 12
406/90
Service
Guide
8 Key To enter the open firmware command line, press the numeric 8 key after the word keyboard displays and before the last word (speaker) displays during startup. After you press the 8 key, the remaining POST indicators display until initialization completes. This option is used only by service personnel to collect additional debug information. To exit from the open firmware command prompt, type reset-all or power off the system and reboot. 406/90 System Locations The 406/90 system consists of a minimum of four subsystems in one rack as follows: v Processor subsystem v Media subsystem v One I/O subsystem v Power subsystemThese components are connected by cables that transmit data and control signals. I/O subsystems can be added if further expansion of the system is required (up to seven additional I/O subsystems, using two racks). Chapter
1.
Reference
Information
13
In each of the following figures, the minimum system configuration is shown on the left. The minimum system configuration with the integrated battery feature (IBF) is shown on the right in each of the figures. 13326547
1
Rack
5
Media Subsystem
2
Bulk Power Assembly
6
Optional Integrated Battery Feature
3
UEPO Switch
7
First I/O subsystem
4
Memory Books
14
406/90
Service
Guide
1
BPA Fan (U1.35-P1-F1)
4
Processor Subsystem Fan 1, (U1.18-F1)
2
BPA Fan (U1.35-P2-F1)
5
Processor Subsystem Fan 3, (U1.18-F3)
3
Processor Subsystem Fan 2, (U1.18-F2)
6
Processor Subsystem Fan 4, (U1.18-F4)
Note:
The I/O subsystem fan assemblies are inside the I/O subsystems, positioned from left to right behind the DASD 4-packs in order of fan assembly 1, fan assembly 2, fan assembly 3, and fan assembly 4. Chapter
1.
Reference
Information
15
UEPO Switch The UEPO (Unit Emergency Power Off) switch is accessible through the external cover. The switch contains a room emergency power off (EPO) interlock connector and bypass switch, a dual element rack service amber LED (one element driven per bulk power controller, a service complete button (white) and start service button (green), an off button and four thermistors for BPC ambient temperature sensing. The UEPO switch is shown in the following figure. 41235667891011
1
UEPO Front Panel
7
J02 Connector
2
UEPO Side Panel
8
Power Switch
3
UEPO Rear Panel
9
System Fault LED
4
J00 Connector Front BPA (Back)
10
Start Service Button (Green)
5
J01 Connector Rear BPA (Back)
11
Service Complete Button (White)
6
Room EPO Bypass Interlock
The UEPO switch disables 350 V bulk power conversion and isolates all battery power to the battery compartments, without having to disconnect the heavy power cords under the customer’s raised floor. At the system level, the UEPO switch is connected to each of the two BPCs through two interface connectors. The switch can be concurrently replaced by forcing the BPCs into UEPO bypass mode, using the small slide switches on the face of the BPCs. The switch is mounted in the side pocket area of the rack and remains stationary when the front cover of the system is opened. The service buttons are used by service personnel to repair the power subsystem. If the system is off when the service complete button is pressed, the system will power on. A sliding plastic cover over the red switch prevents it from being accidentally activated in the customer environment. 16
406/90
Service
Guide
406/81 Processor Subsystem Locations This section provides a front view and a rear view of the processor subsystem with descriptions and locations for each part of the processor subsystem. Detailed location information for processor subsystem I/O books is also included. 406/81 Processor Subsystem Front View Note: For slot assignments, see “Memory to Processor Relationships” on page 60. 13333222244445555677
1
Processor Subsystem Chassis
5
Fan Controller
2
Outer Memory Book
6
Fan Chassis
3
Inner Memory Book
7
Fan Filter
4
Fan
Chapter
1.
Reference
Information
17
406/81 Processor Subsystem Rear View
1
I/O Book – Slot 1
8
Capacitor Book
2
DCA Book 1
9
DCA Book 6 (32-way, 1.7 GHz MCMs)
3
DCA Book 2
10
I/O book - Slot 0
4
Capacitor Book
11
I/O Book 3 - Slot 3
5
DCA Book 3
12
I/O Book 2 - Slot 2 (Inner Connectors)
6
DCA Book 4
13
I/O Book 2 - Slot 2 (Outer Connectors)
7
DCA Book 5
18
406/90
Service
Guide
I/O Book Connector Locations
1
Primary I/O Book, Slot 0 (U1.18-P1-H2)
19
I/O Book Slot 2 (D0) (U1.18–P1–H3/Q6)I/O Book Slot 1 (D0) (U1.18–P1–H1/Q6)
2
Secondary I/O Book, Slot 1 (U1.18-P1-H1), Slot 2 (U1.18-P1-H3)
20
I/O Book Slot 2 (C1) (U1.18-P1-H3/Q7)I/O Book Slot 1 (C1) (U1.18-P1-H1/Q7)
3
Third I/O Book, Slot 3 (U1.18-P1-H4)
21
I/O Book Slot 2 (C0) (U1.18-P1-H3/Q8)I/O Book Slot 1 (C0) (U1.18-P1-H1/Q8)
4
I/O Book Slot 0 (A0) (U1.18-P1-H2/Q1)
22
I/O Book Slot 2 (A0) (U1.18-P1-H3/Q4)I/O Book Slot 1 (A0) (U1.18-P1-H1/Q4)
5
I/O Book Slot 0 (A1) (U1.18-P1-H2/Q2)
23
I/O Book Slot 2 (A1) (U1.18-P1-H3/Q3)I/O Book Slot 1 (A1) (U1.18-P1-H1/Q3)
6
Operator Panel (U1.18-P1-H2/Q7)
24
Indicator LEDs
7
BPC Y-cable* connector U1.18-P1-H2/Q8 or U1.18-P1-H2/Q9
25
I/O Book Slot 2 (B0) (U1.18-P1-H3/Q2)I/O Book Slot 1 (B0) (U1.18-P1-H1/Q2)
8
I/O Book Slot 0 (B0) (U1.18-P1-H2/Q3)
26
Camming Latches
9
I/O Book Slot 0 (B1) (U1.18-P1-H2/Q4)
27
I/O Book Slot 2 (B1) (U1.18-P1-H3/Q1)I/O Book Slot 1 (B1) (U1.18-P1-H1/Q1)
10
Diskette Drive (U1.18-P1-H2/Q10)
28
I/O Book Slot 3 (D1) (U1.18-P1-H4/Q4)
11
HMC 1 (U1.18-P1-H2/S3)
29
I/O Book Slot 3 (D0) (U1.18-P1-H4/Q3)
12
HMC 2 (U1.18-P1-H2/S4)
30
I/O Book Slot 3 (C1) (U1.18-P1-H4/Q2)
13
Serial Port 1 (U1.18-P1-H2/S1)
31
I/O Book Slot 3 (C0) (U1.18-P1-H4/Q1)
14
Serial Port 2 (U1.18-P1-H2/S2)
32
I/O Book Slot 3 (A0) (U1.18-P1-H4/Q5)
15
SPCN 0 (Manufacturing use only) (U1.18-P1-H2/Q5)
33
I/O Book Slot 3 (A1) (U1.18-P1-H4/Q6)
16
SPCN 1 (Manufacturing use only) (U1.18-P1-H2/Q6)
34
I/O Book Slot 3 (B0) (U1.18-P1-H4/Q7)
17
Debug (Manufacturing use only)
35
I/O Book Slot 3 (B1) (U1.18-P1-H4/Q8)
18
I/O Book Slot 2 (D1) (U1.18-P1-H3/Q5)I/O Book Slot 1 (D1) (U1.18-P1-H1/Q5)
* The Y-cable that attaches to this connector terminates at BPC-A connector U1.35-P1-X4/Q10 and BPC-B connector U1.35-P2-X4/Q10. Chapter
1.
Reference
Information
19
Media Subsystem Locations The following figures show locations for devices installed in the media subsystem, and the SCSI IDs for the media devices.Note: The SCSI IDs shown for media indicate how installed devices are set when shipped from the factory. Media Subsystem Front View 2SCSISCSI314
1
Diskette Drive
3
Optional Media Drive (4 mm Tape, CD-ROM, or DVD-RAM), SCSI Address 06
2
CD-ROM Drive, SCSI Address 05
4
Operator Panel
Media Subsystem Rear View Note: DVD-RAM and 4-mm tape drives can be located in the rear of the media subsystem only if the room is maintained at a nominal room temperature of 24C (75.2F).28361SCSISCSI754
1
Optional Media (4-mm Tape, CD-ROM, or DVD-RAM), SCSI Address 05
5
Front SCSI Power Connection
2
Optional Media (4-mm Tape, CD-ROM, or DVD-RAM), SCSI Address 06
6
Front SCSI Connection
3
Rear SCSI Power Connection
7
Diskette Drive Connection
4
Rear SCSI Connection
8
Operator Panel Connection
20
406/90
Service
Guide
Media Subsystem Operator Panel R125346!
1
Power on/off button
4
Reset button
2
Power on/off LED
5
Service processor reset button (Service use only)
3
Operator panel display
6
Disturbance or system attention LED
Note: You must activate the service processor reset button very carefully. Using an insulated paper clip is recommended. Unbend the clip so that it has a straight section about two inches long. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. When you engage the test reset switch, you should feel the detent of the switch. After you press the switch, the service processor resets and then shuts down the system. Disturbance or System Attention LED The system attention LED lights on solid when an event occurs that either needs customer intervention or service. The system attention LED is turned on when an entry is made in the service processor error log. The error entry is transmitted to the following: v System-level error logs v the operating system error log v As an entry in the service action event log in the Service Focal Point application. For example, the loss of surveillance from the HMC to a logical partition.Continue to the following appropriate system error interrogation section. Accessing System Log Error Information Note: The system attention LED can be reset by following the procedures described in “Resetting the System Attention LED” on page 22. When an error or potential problem is detected by the system, the attention LED on the operator panel is turned on. Information about the error or potential problem is stored in error logs. The following sections explain how to access the logs that are available on a system. The error logs are accessed from the console that is used to manage the system. From the following console descriptions, go to the instructions under the heading that best describes the system on which you are working: v “Accessing Errors when a System is Attached to an HMC.” v “Accessing Errors when a System is Running AIX and the Console is Not an HMC” on page 22. v “Accessing Errors when a System is Running Linux and the Console is Not an HMC” on page 22.Accessing Errors when a System is Attached to an HMC If the system attention LED comes on and the system is attached to an HMC for its console, do the following to access the error logs: 1. Open the Service Action Event Log in the Service Focal Point application on the HMC. Chapter
1.
Reference
Information
21
2. Check the open service events using the procedure described in ?Working with Serviceable Events? in the Hardware Management Console Installation and Operations Guide.Note: For information on these Service Focal Point settings, see ?Setting Up Surveillance and Enabling Surveillance Notifications? in the Hardware Management Console Installation and Operations Guide. Events requiring customer intervention are marked Call Home Candidate? NO. For each of these events, examine the description in the serviceable event error details. If actions are listed in the description, perform those actions. If the error indicates a loss of surveillance between the HMC and a partition, check the status of the partition, the network, and the cabling between the HMC and the partition. Correct any problems found. If a surveillance problem is still indicated, call service support. If the service action event is labeled Call Home Candidate? YES, an error code is supplied to identify the problem. If the system is configured to automatically call home on error, the request for service is placed. If the system is not configured to automatically call home, you should place a call to service support. If the system attention LED comes on and an HMC is used to manage the system, do one of the following: v If you performed “Step 1321-43” on page 182 in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, return to “Step 1321-43” on page 182 to process any other error codes or symptoms. v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, go to step “Step 1321-1” on page 170 and perform the procedures in the MAP.Accessing Errors when a System is Running AIX and the Console is Not an HMC If the system attention LED comes on and the system is running AIX and the console is not an HMC, do the following to access the error logs: 1. If the system is operational, examine the AIX error log to see if user intervention is required. 2. If the system is powered off, examine the service processor error log for entries. 3. If the AIX error log indicates service support is required, or there are entries in the service processor error log, go to “Entry MAP” on page 149 with the information noted from the error log.Accessing Errors when a System is Running Linux and the Console is Not an HMC If the system attention LED comes on and the system is running Linux and is not attached to an HMC, do one of the following: v If you performed “Step 1321-43” on page 182 in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, return to “Step 1321-43” on page 182 to process any other error codes or symptoms. v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, go to step “Step 1321-1” on page 170 and perform the procedures in the MAP.Resetting the System Attention LED Choose one of the following procedures to reset your attention LED. Resetting the LED when a System is Attached To an HMC To reset the system attention LED on HMC-attached systems, do the following: 1. On the HMC graphical interface, click Service Applications. 2. Double-click Service Focal Point. 3. In the Contents area of the screen, select Hardware Service Functions. The LED Management window opens. 22
406/90
Service
Guide
4. In the LED Management window, select one or more managed systems from the table. 5. Select Deactivate LED. The associated system attention LED is turned off.For more information about the virtual operator panel on the HMC, see the Hardware Management Console Installation and Operations Guide. Resetting the LED when a System is Running AIX and the Console is not an HMC As a user with root authority, type diag on the AIX command line, and do the following: 1. Select Task Selection. 2. On the Task Selection Menu, select Identify and Attention Indicators. 3. When the list of LEDs displays, use the cursor to highlight Set System Attention Indicator to Normal. 4. Press Enter, and then press F7 to commit. This action turns off the LED.If the system is powered off, access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED.Resetting the LED when a System is Running Linux and the Console is not an HMC The system must be powered off to access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED. Chapter
1.
Reference
Information
23
406/1D I/O Subsystem Locations This section shows the locations for the I/O subsystem. A front view and a rear view with descriptions and locations for each part of the I/O subsystem are shown. 406/1D I/O Subsystem Front View 111123668557479981010111312141516
1
DASD 4-Pack
9
Drawer Fault/Identify LED (Amber)
2
DASD Disk Drive
10
Power Good Out LED (Green)
3
Disk Activity LED (Green)
11
Fan Fault LED (Amber)Ux.y-F1
4
Disk Drive Fault/Identify LED (Amber)
12
Fan Fault LED (Amber)Ux.y-F3
5
BPD 1 (Front) Cable In
13
Fan Fault LED (Amber)Ux.y-F2
6
BPD 2 (Rear) Cable In
14
Fan Fault LED (Amber)Ux.y-F4
7
Power Good In LED (Green)
15
I/O Subsystem Left DCAUx.y-V1
8
DCA Fault LED (Amber)
16
I/O Subsystem Right DCAUx.y-V2
Note: The fan fault LEDs shown in the preceding illustration indicate if an internal fan has a fault. The fan assemblies inside are positioned from left to right behind the DASD 4-packs in order of fan assembly 1, fan assembly 2, fan assembly 3, and fan assembly 4. 24
406/90
Service
Guide
406/1D I/O Subsystem Rear View 1399811111212456271010
1
I/O port connector 1 (Ux.y-P1/Q2)
7
I/O Card Power On LED (Green)
2
I/O port connector 0 (Ux.y-P1/Q1)
8
I/O Adapter Fault/Identify LED (Amber/Bottom)
3
Media subsystem power connector (Ux.y-P1-V1/Q3)
9
Auxiliary Power Good (Green)
4
I/O port connector 1 (Ux.y-P2/Q2)
10
I/O Subsystem Backplane Fault (Amber)
5
I/O port connector 0 (Ux.y-P2/Q1)
11
I/O Subsystem Backplane Power On (Green)
6
Media subsystem power connector (Ux.y-P2-V1/Q3)
12
I/O LED (Currently Unused)
Chapter
1.
Reference
Information
25
I/O Subsystem DASD Locations The following figures show the DASD drive positions in an I/O subsystem.Note: The SCSI IDs are set when the drive is installed into the DASD 4-pack. 406/1D I/O Subsystem DASD Locations 21368574910111312141516
1
Ux.y-P2/Z2-A8
9
Ux.y-P1/Z2-A8
2
Ux.y-P2/Z2-A9
10
Ux.y-P1/Z2-A9
3
Ux.y-P2/Z2-Aa
11
Ux.y-P1/Z2-Aa
4
Ux.y-P2/Z2-Ab
12
Ux.y-P1/Z2-Ab
5
Ux.y-P2/Z1-A8
13
Ux.y-P1/Z1-A8
6
Ux.y-P2/Z1-A9
14
Ux.y-P1/Z1-A9
7
Ux.y-P2/Z1-Aa
15
Ux.y-P1/Z1-Aa
8
Ux.y-P2/Z1-Ab
16
Ux.y-P1/Z1-Ab
Notes: 1. In the preceding table, x is equal to the rack number and y is equal to the I/O subsystem position in the rack. The SCSI ID for each drive is set when the drive is installed. Each I/O subsystem can have up to four DASD 4-packs installed. The SCSI IDs for the drives in each 4-pack are 8, 9, a, and b, from left to right. 2. The DASD attachment locations are as follows: v DASD 1, 2, 3, 4 are attached to Ux.y-P3 DASD backplane. v DASD 5, 6, 7, 8 are attached to Ux.y-P4 DASD backplane. v DASD 9, 10, 11, 12 are attached to Ux.y-P5 DASD backplane. v DASD 13, 14, 15, 16 are attached to Ux.y-P6 DASD backplane. 26
406/90
Service
Guide
I/O Subsystem PCI and PCI-X PHB and PCI and PCI-X Slot Locations I/O and Media Subsystem Connections RIO BusPCI- PCIBridgeRIO-PCIBridgePCI- PCIBridgePCI Slot - 64 bit 66/33mhz 3.3VPCI Slot - 64 bit 33mhz 5.0VPCI- PCIBridgeUltra3SCSIUltra3SCSI3210 slots10 slotsMedia SubsystemPowerSCSI4 DASDBays4 DASDBaysSESSESI/OSubsystemRearMedia Subsystem FrontI/O PlanarInternal SCSIControllers Chapter
1.
Reference
Information
27
I/O Subsystem Internal SCSI Distribution RearP1P2PPRPPISISPPRPPISISDASD4 packDASD4 packDASD4 packDASD4 packP=PCI-PCI bridgeRP=RIO-PCI bridgeIS=Internal SCSI Ultra 3 I/O Subsystem Power Distribution RearP1P2PPRPPISISPPRPPISISDCA 1Power SupplyDCA 2Power SupplyMidplaneBPABPAFan 2Fan 3Fan 1Fan 4P=PCI-PCI BridgeRP=RIO-PCI BridgeIS=Internal SCSI Ultra 3 28
406/90
Service
Guide
Power Subsystem Locations Note: Use the table located in “I/O Subsystem Power Distribution Cabling” on page 33 to identify the connector ids for BPD 2 and BPD 3.1235781191064I/O-1AJ00I/O-5AJ00I/O-5AJ00DCA-4J08DCA-5J09I/O-8BJ09I/O-8BJ09I/O-3AJ04I/O-2AJ02I/O-1BJ01J01J01J02J02J03J03J04J04J05J05J06J06J07J07J08J08I/O-4BJ07I/O-2BJ03DCA-1J08DCA-3J10DCA-2J09MDA-3J06MDA-1J04BPFJ03J00BPC-BPCSYSTEMJ01P02MDA-4J07MDA-2J05I/O-3BJ05BPD(1)BPD(2)BPD(3)BPCI/O-4AJ06J02EPOUPSooooooooUEPO Note: BPD (2) connectors J04 and J05 (I/O-4A/J04 and I/O-4B/J05 in the preceding figure) are for I/O subsystem 4 when the IBF is installed.
1
BPD Power Good LED
6
ac Power In Connector
2
Bulk Power Distributor (BPD)
7
Connectors to IBF
3
Bulk Power Controller (BPC)
8
UEPO Power (PWR) LED (Top LED)UEPO Power Complete (CMPLT) LED
4
Bulk Power Fan (BPF)
9
BPC Power Good LED
5
Bulk Power Regulators (BPR)
10
BPD for 32-way 1.7 GHz MCM systems
11
System Power (SP) LED(Top LED) Standby Power
Note: For 32-way 1.7 GHz MCM systems a third BPD was added to supply the additional power needed by the CEC. To get the correct connector location numbers, refer to the table located in “Power Distribution Cabling” on page 30. Chapter
1.
Reference
Information
29
Power Distribution Cabling Power to the I/O subsystems, DCAs, Fans, and IBFs is connected from the bulk power distribution connectors to each subsystem through cables. The cable connectors are labeled on each end. The following describes the cables that connect from the BPC BPDs, and BPRs to the DCAs, fans, power supplies, I/O subsystems, and integrated battery features. 30
406/90
Service
Guide
Key
From
To
Description
Label Color
1
Bulk Power BPE A Side
Bulk power enclosure on the A side of the system
2
Bulk Power BPE B Side
Bulk power enclosure on the B side of the system
3
A35-BPC-J03
A35-P1FAN (BPF-A)
A35-BPC-P03 to Bulk Power Fan P1-F1
Red with white stripe
4
B35-BPC-J03
B35-P2FAN (BPF-B)
B35-BPC-P03 to Bulk Power Fan P2-F1
Red
5
B35-BPC-J08
DCA-1 J01
B35-BPC-P08 to DCA-1-P01
Red
6
A35-BPC-J08
DCA-1 J00
A35-BPC-P08 to DCA-1-P00
Red with white stripe
7
B35-BPC-J09
DCA-2 J01
B35-BPC-P09 to DCA-2-P01
Red
8
A35-BPC-J09
DCA-2 J00
A35-BPC-P09 to DCA-2-P00
Red with white stripe
9
B35-BPC-J10
DCA-3 J01
B35-BPC-P10 to DCA-3-P01
Red
10
A35-BPC-J10
DCA-3 J00
A35-BPC-P10 to DCA-3-P00
Red with white stripe
11
B35-BPD-J08
DCA-4 J01
B35-BPD1-P08 to DCA-4-P01
Red
12
A35-BPD-J08
DCA-4 J00
A35-BPD1-P08 to DCA-4-P00
Red with white stripe
13
B35-BPD-J09
DCA-5 J01
B35-BPD1-P09 to DCA-5-P01
Red
14
A35-BPD-J09
DCA-5 J00
A35-BPD1-P09 to DCA-5-P00
Red with white stripe
14a
A35-BPD-J06
DCA-6-J00*
A35-BPD1-P06 to DCA-6-P01 (32-way, 1.7 GHz)
Red with white stripe
14b
A35-BPD-J06
DCA-6-J01*
A35-BPD1-P06 to DCA-6-P00(32-way, 1.7 GHz)
Red
15, 17
B17-Media-J02
I/O Subsystem 1B-J01
I/OB9-P01 to Media P02
16, 18
B17-Media-J00
I/O Subsystem 1B-J00
I/OB9-P00 to Media P00
19, 20, 21, 22
B35-BPDx-JxxSee “I/O Subsystem Power Distribution Cabling” on page 33 for I/O subsystem power cabling.
I/O SubsystemSee “I/O Subsystem Power Distribution Cabling” on page 33 for I/O subsystem power cabling.
B35-BPDx-P0x to I/O Subsystem #x
23
B35-BPC-J05
A09-MDA2-J02
A35-BPC-P05 to (AMD2-P01) J02
Red
24
A35-BPC-J05
A09-MDA2-J01
B35-BPC-P05 to (AMD2-P00) J01
Red with white stripe
25
B35-BPC-J04
A09-MDA1-J02
A35-BPC-P04 to (AMD1-P01) J02
Red Chapter
1.
Reference
Information
31
Key
From
To
Description
Label Color
26
A35-BPC-J04
A09-MDA1-J01
A35-BPC-P04 to (AMD1-P00) J01
Red with white stripe
27
A35-BPC-J06
A18-MDA3-J01
A35-BPC-P06 to (AMD3-P00) J01
Red with white stripe
28
B35-BPC-J06
A18-MDA3-J02
B35-BPC-P06 to (AMD3-P01) J02
Red
29
B35-BPC-J07
A18-MDA4-J02
B35-BPC-P07 to (AMD4-P00) J02
Red
30
A35-BPC-J07
A18-MDA4-J01
B35-BPC-P07 to (AMD4-P01) J01
Red with white stripe
31
B35-BPC-J02
UEPO Switch J01
B35-BPC-P02 to UEPO J01
Red
32
A35-BPC-J02
UEPO Switch J00
A35-BPC-P02 to UEPO J00
Red with white stripe
33
A35-BPR-J00
A13-IBF1-J01
A35-BPR-P00 to A13-IBF1-P00
B35-BPR-J00
B13-IBF1-J01
B35-BPR-P00 to B13-IBF1-P00
34
A35-BPR-J00
A13-IBF2-J01
A35-BPR-P00 to A13-IBF2-P00
B35-BPR-J00
B13-IBF2-J01
B35-BPR-P00 to B13-IBF2-P00
35
A35-BPR-J00
A17-IBF3-J01(Secondary Frame)
A35-BPR-P00 to A-IBF3-P00 (Secondary Frame)
B35-BPR-J00
B17-IBF3-J01(Secondary Frame)
B35-BPR-P00 to B-IBF3-P00 (Secondary Frame)
Note:
* DCA 6 is populated only for the 32-way 1.7 GHz MCMs. When DCA 6 is installed, BPD ports A35-BPD1-P07 and B35-BPD1-P07 are not used. Attention: For systems configured with the 1.5 GHz MCM, the total number of CEC DCAs installed must be one more than the number of 1.5 GHz MCMs installed. For systems configured with the 1.7 GHz MCMs, the total number of CEC DCAs installed must be two more than the number of 1.7 GHz MCMs installed. 32
406/90
Service
Guide
I/O Subsystem Power Distribution Cabling Power to the I/O subsystems is connected from the bulk power distribution connectors to each subsystem through cables. The cable connectors are labeled on each end. The following table describes the cables that connect from the BPDs to the I/O Subsystem DCAs (power supplies).
From
To(A=Primary)(Z=Secondary)
Description
Label Color
A35B-BPD1A-P00
A09B-DCA1-P01
BPD1A to I/O Subsystem #1
Purple With Stripes
A35P-BPD1B-P00
A09B-DCA1-P00
BPD1B to I/O Subsystem #1
Purple
A35B-BPD1A-P01
A09B-DCA2-P01
BPD1A to I/O Subsystem #1
Purple With Stripes
A35P-BPD1B-P01
A09B-DCA2-P00
BPD1B to I/O Subsystem #1
Purple
A35B-BPD1A-P02
A05B-DCA1-P01
BPD1A to I/O Subsystem #2
Yellow With Stripes
A35P-BPD1B-P02
A05B-DCA1-P00
BPD1B to I/O Subsystem #2
Yellow
A35B-BPD1A-P03
A05B-DCA2-P01
BPD1A to I/O Subsystem #2
Yellow With Stripes
A35P-BPD1B-P03
A05B-DCA2-P00
BPD1B to I/O Subsystem #2
Yellow
A35B-BPD1A-P04
A01B-DCA1-P01
BPD1A to I/O Subsystem #3
Green With Stripes
A35P-BPD1B-P04
A01B-DCA1-P00
BPD1B to I/O Subsystem #3
Green
A35B-BPD1A-P05
A01B-DCA2-P01
BPD1A to I/O Subsystem #3
Green With Stripes
A35P-BPD1B-P05
A01B-DCA2-P00
BPD1B to I/O Subsystem #3
Green
The following 4 rows apply to systems that are not configured with 32-way with 1.7 GHz MCMs.
A35B-BPD1A-P06
A13B-DCA1-P01
BPD1A to I/O Subsystem #4 (no IBF)
Blue With Stripes
A35P-BPD1B-P06
A13B-DCA1-P00
BPD1B to I/O Subsystem #4 (no IBF)
Blue
A35B-BPD1A-P07
A13B-DCA2-P01
BPD1A to I/O Subsystem #4 (no IBF)
Blue With Stripes
A35P-BPD1B-P07
A13B-DCA2-P00
BPD1B to I/O Subsystem #4 (no IBF)
Blue
The following 4 rows apply to systems that are configured with 32-way with 1.7 GHz MCMs.
A35B-BPD1A-P06
Goes to CEC DCA-6 (see item 14b in the previous table)
BPD1A to CEC DCA-6 (see item 14b in the previous table)
Blue With Stripes
A35P-BPD1B-P06
Goes to CEC DCA-6 (see item 14a in the previous table)
BPD1B to CEC DCA-6 (see item 14b in the previous table)
Blue
A35B-BPD1A-P07
No Connection Chapter
1.
Reference
Information
33
From
To(A=Primary)(Z=Secondary)
Description
Label Color
A35P-BPD1B-P07
No Connection
A35B-BPD2A-P04
Z09B-DCA1-P01
BPD1A to I/O Subsystem #4 (with IBF)
Yellow With Stripes
A35P-BPD2B-P04
Z09B-DCA1-P00
BPD1B to I/O Subsystem #4 (with IBF)
Yellow
A35B-BPD2A-P05
Z09B-DCA2-P01
BPD1A to I/O Subsystem #4 (with IBF)
Yellow With Stripes
A35P-BPD2B-P05
Z09B-DCA2-P00
BPD1B to I/O Subsystem #4 (with IBF)
Yellow
A35B-BPD2A-P00
Z01B-DCA1-P01
BPD2A to I/O Subsystem #5
Red With Stripes
A35P-BPD2B-P00
Z01B-DCA1-P00
BPD2B to I/O Subsystem #5
Red
A35B-BPD2A-P01
Z01B-DCA2-P01
BPD2A to I/O Subsystem #5
Red With Stripes
A35P-BPD2B-P01
Z01B-DCA2-P00
BPD2B to I/O Subsystem #5
Red
A35B-BPD2A-P02
Z05B-DCA1-P01
BPD2A to I/O Subsystem #6
Purple With Stripes
A35P-BPD2B-P02
Z05B-DCA1-P00
BPD2B to I/O Subsystem #6
Purple
A35B-BPD2A-P03
Z05B-DCA2-P01
BPD2A to I/O Subsystem #6
Purple With Stripes
A35P-BPD2B-P03
Z05B-DCA2-P00
BPD2B to I/O Subsystem #6
Purple
A35B-BPD2A-P06
Z13B-DCA1-P01
BPD2A to I/O Subsystem #7
Yellow With Stripes
A35P-BPD2B-P06
Z13B-DCA1-P00
BPD2B to I/O Subsystem #7
Yellow
A35B-BPD2A-P07
Z13B-DCA2-P01
BPD2A to I/O Subsystem #7
Yellow With Stripes
A35P-BPD2B-P07
Z13B-DCA2-P00
BPD2B to I/O Subsystem #7
Yellow
The following 4 rows apply to systems that are not configured with 32-way with 1.7 GHz MCMs.
A35B-BPD2A-P08
Z19B-DCA1-P01
BPD2A to I/O Subsystem #8
Blue With Stripes
A35P-BPD2B-P08
Z19B-DCA1-P00
BPD2B to I/O Subsystem #8
Blue
A35B-BPD2A-P09
Z19B-DCA2-P01
BPD2A to I/O Subsystem #8
Blue With Stripes
A35P-BPD2B-P09
Z19B-DCA2-P00
BPD2B to I/O Subsystem #8
Blue
The following 8 rows apply to systems that are configured with 32-way with 1.7 GHz MCMs.
A35B-BPD2A-P08
A13B-DCA1-P01
BPD2A to I/O Subsystem #4 (no IBF)
Blue With Stripes 34
406/90
Service
Guide
From
To(A=Primary)(Z=Secondary)
Description
Label Color
A35P-BPD2B-P08
A13B-DCA1-P00
BPD2B to I/O Subsystem #4 (no IBF)
Blue
A35B-BPD2A-P09
A13B-DCA2-P01
BPD2A to I/O Subsystem #4 (no IBF)
Blue With Stripes
A35P-BPD2B-P09
A13B-DCA2-P00
BPD2B to I/O Subsystem #4 (no IBF)
Blue
A35B-BPD3A-P08
Z19B-DCA1-P01
BPD3A to I/O Subsystem #8
Blue With Stripes
A35P-BPD3B-P08
Z19B-DCA1-P00
BPD3B to I/O Subsystem #8
Blue
A35B-BPD3A-P09
Z19B-DCA2-P01
BPD3A to I/O Subsystem #8
Blue With Stripes
A35P-BPD3B-P09
Z19B-DCA2-P00
BPD3B to I/O Subsystem #8
Blue
Integrated
Battery Feature Locations 1
1
Connector to BPR
Chapter
1.
Reference
Information
35
Subsystem Positioning and Cabling This section describes subsystem positioning and cabling for the 406/90. Operator Panel Cable and Diskette Drive Cable to the Media Subsystem (Rear View)
1
Primary I/O Book
3
Diskette Drive Data Cable
2
Cable Connecting the Operator Panel to the Media Subsystem
4
Media Subsystem
36
406/90
Service
Guide
I/O Subsystem Positions, Fully Populated Primary and Secondary Rack (Rear View without IBF)
C1
Primary Rack
3
Third I/O Subsystem
C2
Secondary Rack
4
Fourth I/O Subsystem
C3
Blank Space
5
Fifth I/O Subsystem
C4
Blank Space
6
Sixth I/O Subsystem
1
First I/O Subsystem
7
Seventh I/O Subsystem
2
Second I/O Subsystem
8
Eighth I/O Subsystem
Chapter
1.
Reference
Information
37
I/O Subsystem Positions, Fully Populated Primary and Secondary Rack (Rear View with IBFs)
C1
Primary Rack
4B
Four IBFs (primary rack), Position 4
C2
Primary I/O Book (Book 0)
5
Fifth I/O Subsystem
C3
Secondary Rack
6
Sixth I/O Subsystem
1
First I/O Subsystem
7
Seventh I/O Subsystem
2
Second I/O Subsystem
2B
Two IBFs (secondary rack)
3
Third I/O Subsystem
8
Eighth I/O Subsystem
4
Fourth I/O Subsystem
38
406/90
Service
Guide
Placement of Subsystem 4 in a Two-Rack Configuration (Rear View with IBF)
C1
Primary Rack
3
Third I/O Subsystem
C2
Secondary Rack
4
Fourth I/O Subsystem
1
First I/O Subsystem
4B
Four IBFs
2
Second I/O Subsystem
Note: The physical location shown for I/O Subsystem 4 is populated only when IBFs are present and four or more I/O subsystems are installed. Chapter
1.
Reference
Information
39
RIO and RIO-2 Cabling Rules If the CEC RIO ports are standard RIO, thumbscrew-style connectors, then connect a two-planar I/O loop to the RIO ports of the CEC in the following order: 1. Primary I/O Book Slot 0 (1, 2, 3, or 4 MCMs): RIO 1 A0&A1, RIO 2 B0&B1 2. Secondary I/O Book Slot 2: v 2 MCMs: RIO 3 C0&C1, RIO 4 D0&D1 v 3 or 4 MCMs: RIO 3 C0&C1, RIO 4 D0&D1, RIO 5 A0&A1, RIO 6 B0&B13. Secondary I/O Book Slot 3 (4 MCMs): RIO 7 C0&C1, RIO 8 D0&D1Notes: 1. Secondary I/O Book Slot 1 is not allowed with CEC standard RIO connections. 2. One I/O planar loops are not allowed with CEC standard RIO connections.If the CEC RIO ports are gigabyte RIO, or RIO-2, connectors, then connect either a one I/O planar loop or a two planar I/O loop to the first 16 RIO ports of the CEC in the following order: 1. Primary I/O Book Slot 0 (1, 2, 3, or 4 MCMs): RIO 1 A0&A1, RIO 2 B0&B1 2. Secondary I/O Book Slot 2: v 2 MCMs: RIO 3 C0&C1, RIO 4 D0&D1 v 3 or 4 MCMs: RIO 3 C0&C1, RIO 4 D0&D1, RIO 5 A0&A1, RIO 6 B0&B13. Secondary I/O Book Slot 3: v 2 MCMs: RIO 5 A0&A1, RIO 6 B0&B1 v 3 or 4 MCMs: RIO 7 A0&A1, RIO 8 B0&B1Although two I/O planar loops are not permitted on the last 12 RIO ports of the CEC, you can connect additional one I/O planar loops to the next 12 RIO ports of the CEC in the following order: 1. Secondary I/O Book Slot 3 (4 MCMs): RIO 9 C0&C1, RIO 10 D0&D1 2. Secondary I/O Book Slot 1: v 3 MCMs: RIO 9 C0&C1, RIO 10 D0&D1, RIO 11 A0&A1, RIO 12 B0&B1 v 4 MCMs: RIO 11 C0&C1, RIO 12 D0&D1, RIO 13 A0&A1, RIO 14 B0&B1Note: Use required pairs of RIO ports without omitting any. The following RIO port speeds are available: v Standard RIO uses a thumbscrew retention physical connector and runs at 500 MB per second. v Gigabyte RIO, or RIO-2, uses a bayonette retention physical connector and runs at 1 GB per second.Two I/O planars are available, one for RIO and one for RIO-2, that may be placed in either side of an I/O subsystem. They can be identified by their RIO connector style. Rules for connecting RIO and RIO-2 CEC ports to I/O subsystem RIO and RIO-2 ports are the following: v CEC RIO ports must not be connected to I/O subsystem RIO-2 ports, even with the available RIO-2 to RIO cables. v RIO-2 to RIO cables must only be used to connect CEC RIO-2 ports to I/O subsystem RIO ports. v RIO to RIO cables must only be used to connect CEC RIO ports to I/O subsystem RIO ports. v RIO-2 to RIO-2 cables must only be used to connect CEC RIO-2 ports to I/O subsystem RIO-2 ports.Rules for connecting RIO and RIO-2 I/O planars to the CEC RIO and RIO-2 ports are the following: v CEC RIO ports must be connected to 2 RIO planars per loop. Up to 8 I/O subsystems can be connected. 40
406/90
Service
Guide
v CEC RIO-2 ports can be connected to RIO and RIO-2 I/O planars with appropriate cables in one planar loop or two planar loops in the following order: 1. Cable any I/O subsystems with RIO I/O planars in both the left and the right positions with 2 I/O planar loop connections. The maximum data rate for each RIO I/O planar is 500 MB per second. 2. Cable any I/O subsystems with RIO-2 I/O planars in both the left and right positions with 2 I/O planar loop connections. The maximum data rate for each RIO-2 I/O planar is 1 GB per second. 3. Cable any I/O subsystems with a RIO I/O planar in one side and a RIO-2 I/O planar in the other side with 1 I/O planar loop connection. The maximum data rate for the RIO I/O planar is 500 MB per second, and for the RIO-2 I/O planar, the maximum data rate is 2 GB per second. However, in the event of a cable failure, there is no redundant path for the 2 GB per second RIO-2 I/O planar. Restoring the redundant path for the RIO-2 planar reduces the data rate to 1 GB per second. I/O subsystems with a RIO I/O planar in one side and a RIO-2 I/O planar in the other side with 2 I/O planar loop connections are not allowed. 4. Cable any I/O subsystems with RIO-2 I/O planars in both the left and the right positions with one I/O planar loop connections. The maximum data rate for each RIO-2 I/O planar is 1 GB per second with data path redundancy, or 2 GB per second without data path redundancy.Note: If all I/O planars are connected to the CEC with one I/O planar loops, the maximum number of I/O subsystems connected to the system is reduced to seven. Chapter
1.
Reference
Information
41
I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.A0C0A1C1B0000000001114356711111B1D1D012
1
Primary Rack
5
First I/O Subsystem
2
Primary I/O Book
6
Fourth I/O Subsystem
3
Third I/O Subsystem
7
Secondary I/O Book
4
Second I/O Subsystem
42
406/90
Service
Guide
I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.ShownWithoutIBFs
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Secondary I/O Book
Note: RIO cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs.
Chapter
1.
Reference
Information
43
I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.
C1
Primary Rack
4
Fourth I/O Subsystem
C2
Secondary Rack
4B
Two IBFs (maximum), Position 4
C3
Third I/O Book (Book 3)
5
Fifth I/O Subsystem
1
First I/O Subsystem
6
Sixth I/O Subsystem
2
Second I/O Subsystem
7
Seventh I/O Subsystem
3
Third I/O Subsystem
8
Eighth I/O Subsystem
Note: RIO cabling to I/O subsystems 7 and 8 is the same for configurations with or without the IBFs.
44
406/90
Service
Guide
I/O Subsystems (1 through 3 with IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.A0C0A1C1B00000001143561111B112
1
Primary Rack
A0
Connection to I/O Book (Book 0)
2
Primary I/O Book
A1
Connection to I/O Book (Book 0)
3
Third I/O Subsystem
B0
Connection to I/O Book (Book 0)
4
Second I/O Subsystem
B1
Connection to I/O Book (Book 0)
5
First I/O Subsystem
C0
Connection to I/O Book (Book 2)
6
Secondary I/O Book
C1
Connection to I/O Book (Book 2)
Chapter
1.
Reference
Information
45
I/O Subsystems (4 through 6 with IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.C1C2C353214B4B6478B0A0A1D1D0B1
C1
Secondary Rack
6
Sixth I/O Subsystem
C2
Primary Rack
7
Seventh I/O Subsystem
C3
Secondary I/O Book in I/O Slot 3
8
Eighth I/O Subsystem
C4
Secondary I/O Book in Slot 2
A0
Connection to I/O Book (Book 2)
1
First I/O Subsystem
A1
Connection to I/O Book (Book 2)
2
Second I/O Subsystem
B0
Connection to I/O Book (Book 2)
3
Third I/O Subsystem
B1
Connection to I/O Book (Book 2)
4
Fourth I/O Subsystem
D0
Connection to I/O Book (Book 2)
4B
Integrated Battery Feature (IBF)
D1
Connection to I/O Book (Book 2)
5
Fifth I/O Subsystem
46
406/90
Service
Guide
I/O Subsystems (1 Through 4 without IBFs) RIO-2 Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.A0C0A1C1B0000000001114356711111B1D1D012
1
Primary Rack
5
First I/O Subsystem
2
Primary I/O Book
6
Fourth I/O Subsystem
3
Third I/O Subsystem
7
Secondary I/O Book
4
Second I/O Subsystem
Chapter
1.
Reference
Information
47
I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 2 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Secondary I/O Book
Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs.
48
406/90
Service
Guide
I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.ShownWithoutIBFs
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Secondary I/O Book
Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs.
Chapter
1.
Reference
Information
49
I/O Subsystems (7 and 8) RIO-2 Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention connections.
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Third I/O Book (Book 3)
7
Seventh I/O Subsystem
1
First /O Subsystem
8
Eighth I/O Subsystem
2
Second Subsystem
A0
Connection to I/O Book (Book 3)
3
Third I/O Subsystem
A1
Connection to I/O Book (Book 3)
4
Fourth I/O Subsystem
B0
Connection to I/O Book (Book 3)
4B
Two IBFs (maximum), Position 4
B1
Connection to I/O Book (Book 3)
50
406/90
Service
Guide
I/O Subsystems (1 Through 4 without IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
1
Primary Rack
5
First I/O Subsystem
2
Primary I/O Book
6
Fourth I/O Subsystem
3
Third I/O Subsystem
7
Secondary I/O Book
4
Second I/O Subsystem
Chapter
1.
Reference
Information
51
I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, One I/O Planar Loop Configuration, 3 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Secondary I/O Book
Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs.
52
406/90
Service
Guide
I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, One I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
C1
Primary Rack
5
Fifth I/O Subsystem
C2
Secondary Rack
6
Sixth I/O Subsystem
C3
Secondary I/O Book
Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs.
Chapter
1.
Reference
Information
53
I/O Subsystems (7) RIO-2 Cabling to the Fourth I/O Book, One I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
C1
Primary Rack
4
Fourth I/O Subsystem
C2
Secondary Rack
4B
Two IBFs (maximum), Position 4
C3
Third I/O Book (Book 3)
5
Fifth I/O Subsystem
1
First I/O Subsystem
6
Sixth I/O Subsystem
2
Second I/O Subsystem
7
Seventh I/O Subsystem
3
Third I/O Subsystem
8
Eighth I/O Subsystem
Notes: 1. RIO-2 cabling to I/O subsystem 7 is the same for configurations with or without the IBFs. 2. The maximum number of I/O subsystems allowed is reduced to seven when all I/O planars are connected to their own RIO-2 port pairs.
54
406/90
Service
Guide
I/O Subsystems (1 through 3 with IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
1
Primary Rack
A0
Connection to I/O Book (Book 0)
2
Primary I/O Book
A1
Connection to I/O Book (Book 0)
3
Third I/O Subsystem
B0
Connection to I/O Book (Book 0)
4
Second I/O Subsystem
B1
Connection to I/O Book (Book 0)
5
First I/O Subsystem
C0
Connection to I/O Book (Book 2)
6
Secondary I/O Book
C1
Connection to I/O Book (Book 2)
Chapter
1.
Reference
Information
55
I/O Subsystems (1 through 3 with IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 2 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
1
Primary Rack
A0
Connection to I/O Book (Book 0)
2
Primary I/O Book
A1
Connection to I/O Book (Book 0)
3
Third I/O Subsystem
B0
Connection to I/O Book (Book 0)
4
Second I/O Subsystem
B1
Connection to I/O Book (Book 0)
5
First I/O Subsystem
C0
Connection to I/O Book (Book 2)
6
Secondary I/O Book
C1
Connection to I/O Book (Book 2)
56
406/90
Service
Guide
I/O Subsystem (4 with IBFs Installed) RIO-2 Cabling to I/O Book 3, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.
C1
Secondary Rack
6
Sixth I/O Subsystem
C2
Primary Rack
7
Seventh I/O Subsystem
C3
Secondary I/O Book in I/O Slot 3
8
Eighth I/O Subsystem
C4
Secondary I/O Book in Slot 2
A0
Connection to I/O Book (Book 2)
1
First I/O Subsystem
A1
Connection to I/O Book (Book 2)
2
Second I/O Subsystem
B0
Connection to I/O Book (Book 2)
3
Third I/O Subsystem
B1
Connection to I/O Book (Book 2)
4
Fourth I/O Subsystem
D0
Connection to I/O Book (Book 2)
4B
Integrated Battery Feature (IBF)
D1
Connection to I/O Book (Book 2)
5
Fifth I/O Subsystem
Chapter
1.
Reference
Information
57
Power and SCSI Cables to the Media Subsystem 1235476
1
Primary Rack
5
SCSI Cable for Media Subsystem Rear Drives
2
Primary I/O Book
6
SCSI Cable for Media Subsystem Front Drives
3
Power Cable for Media Subsystem Front Drives
7
Power Cable for Media Subsystem Rear Drives
4
Rear of Media Subsystem
58
406/90
Service
Guide
Hardware Management Console (HMC) to the Primary I/O Book To connect your HMC to the 406/90, connect the serial cable into serial port 1 located on the back of the HMC. Connect the other end of the serial cable into the HMC1 connector located on the primary I/O book in the managed system. The following illustration shows the location of the serial ports on the back of the HMC and the HMC connectors located on the processor subsystem.
1
Primary Rack
4
Primary HMC Connected to HMC #1 in Primary I/O Book (HMC #1 may be labeled HSC #1)
2
Primary I/O Book
5
Optional Second HMC Connected to HMC #2 in Primary I/O Book (HMC #2 may be labeled HSC #2)
3
Power Plug to External Power Source (wall plug)
Optional 8-port or 128 Port Async Adapters Up to two optional async adapters can be installed in the HMC. The adapters have breakout boxes that allow connection to serial cables that connect to the managed system’s HMC 1 serial ports. For information about installing these adapters in the HMC, see the 406/90, order number SA23-1248. Chapter
1.
Reference
Information
59
System Memory All of the main memory for the 406/90 consists of pluggable memory books in reserved slots with two memory controllers per book. The memory books are available in various sizes. Memory modules are not removable from the memory books. A minimum of 8 GB of memory must be installed for the system to operate. Eight slots, located on the front of the processor subsystem, are available for system memory books. These slots are shown in “406/81 Processor Subsystem Front View” on page 17. Memory to Processor Relationships The following illustration shows the relationship between the memory riser cards, L3 cache, and the MCM modules: MemorySlot0(800-PIN)BusBarMemorySlot7(800-PIN)MemorySlot3(800-PIN)MemorySlot6(800-PIN)MemorySlot4(800-PIN)MemorySlot1(800-PIN)MemorySlot5(800-PIN)MemorySlot2(800-PIN)GXSlot0(680-PIN)GXSlot3(680-PIN)GXSlot1(680-PIN)GXSlot2680-PIN)BusBarBusBarBusBarMCMModule0MCMModule3MCMModule2MCMModule1clockACADBL3L3L3L3L3L3L3L3L3L3L3L3L3L3L3L3CABDABCDABCDOR0OR1OR3OR4OR5OR7OR6OR15OR14OR13OR12OR11OR10OR9OR8OR21301234567891011121514 60
406/90
Service
Guide
Memory Requirements Note: The configuration of a system that was ordered with memory Capacity Upgrade on Demand (CUoD) features will appear slightly different than the configuration in this discussion. When the system is ordered, the installed memory is positioned according to the following guidelines: v The 406/90 has eight memory slots. Four memory slots use inward-facing memory cards and four use outward-facing memory cards. The inward-facing memory slots are used by the first and second processor module positions, while the outward-facing memory slots support the third and fourth 8-way processor module positions. v A 406/90 system with only one processor MCM position populated has access to only two inward-facing memory slots. v Minimum system memory is 8 GB. Maximum system memory is 512 GB. v Memory is available in 4 GB, 8 GB, 16 GB, 32 GB, and 64 GB inward-facing and outward-facing increments. v The following memory configuration considerations are recommended to ensure the optimum performance is obtained for the system: – Memory should be installed in identical pairs. – All available memory locations should be populated with memory cards. – Memory sizes should be balanced as closely as possible across all populated MCM locations.v The number of active memory book slots in a system is related to the number of populated MCM locations. Two memory book slots are activated with each MCM that is installed. – The first MCM installed activates the first and second inward-facing memory positions. – The second MCM installed activates the third and fourth inward-facing memory positions. – The third MCM installed activates the first and second outward-facing memory positions. – The fourth MCM installed activates the third and fourth outward-facing memory positions.v It is recommended that all systems have access to a 4-mm or 8-mm tape drive for submission of system dump information, if required. This function can be accomplished through locally attached or network-attached devices as appropriate. Chapter
1.
Reference
Information
61
I/O Subsystem Features I/O subsystems have two redundant power supplies. One of these power supplies is capable of providing the necessary voltages and currents, independent of the other power supply. The left and right power-supply output voltages are connected and monitored by the power distribution board contained in the I/O subsystem. The left and right power supplies are hot-pluggable and may be changed one at a time while the system is operational. Each power supply provides 5V dc, 3.3V dc, 12V dc, and 5V dc standby. I/O to Processor Relationships The following illustration shows the relationship between the I/O books, L3 cache, and the MCM modules: MemorySlot0BusBarMemorySlot7MemorySlot3MemorySlot6MemorySlot4MemorySlot1MemorySlot5MemorySlot2GXSlot0(680-PIN)GXSlot3(680-PIN)GXSlot1(680-PIN)GXSlot2680-PIN)BusBarBusBarBusBarMCMModule0MCMModule3MCMModule2MCMModule1clockACADBL3L3L3L3L3L3L3L3L3L3L3L3L3L3L3L3CABDABCDABCDGX0AGX0CGX3DGX3BGX3AGX3CGX2DGX2BGX0BGX0DGX1CGX1AGX1BGX1DGX2CGX2A3012456789101112131415(0)(1)(3)(2)(5)(4)(7)(6)(9)(11)(10)(13)(NC)(NC)(12)(8) 62
406/90
Service
Guide
I/O Subsystem Power Supply (DCA) LED Status Use the following table to determine the condition of the I/O subsystem power supply (DCA) based on the status of the power supply LEDs.
Status of LED
Operator Panel LED
Right Power Supply LED
Left Power Supply LED
Off
No power connected
No power connected or system power connected, not turned on, power supply detects no faults
No power connected or system power connected, not turned on, power supply detects no faults
On, fast blinking green, on for one second, off for one second
System power connected, power on initiated
System power connected, not turned on, power supply detected fault
System power connected, not turned on, power supply detected fault
On, slow blinking green, on for two seconds, off for two seconds
System power connected, not turned on
System power connected, not turned on, power supply detected system power control network fault
System power connected, not turned on, power supply detected system power control network fault
On, steady green
System power connected and turned on
System power connected and turned on
System power connected and turned on Chapter
1.
Reference
Information
63
Logical and Physical Locations The 406/90 system uses physical location codes in conjunction with AIX location codes to provide mapping of the failing field replaceable units (FRUs). The location codes are produced by the system’s firmware and the operating system. There are no Linux-specific location codes, but physical location codes may appear in the Linux error logs. Physical Location Codes Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical functions, such as connectors or ports) to their specific locations within the physical structure of the platform. Location Code Format The format for the location code is a string of alphanumeric characters separated by a dash (-), slash (/), pound sign (#) or period (.) character. The base location is all of the information preceding the slash (/) or pound sign (#). The base location identifies a device that is connected to or plugged into the parent. Extended location information follows the slash (/). Extended location information identifies a device that is part of the parent, a connector, or a cable. Cable information follows the pound sign (#). Cable information identifies a cable that is connected to parent. The following are examples: v P1-C1 identifies a processor card C1 (MCM) plugged into planar (backplane) P1. v P1-M1 identifies a memory card (book) M1 plugged into planar (backplane) P1. v P1-H2/S1 identifies serial port 1 controller on planar (backplane) P1, the connector for serial port 1, the cable attached to serial port 1 would be represented by a # sign after the S1. v P1-I2/E3 identifies a Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1, the connector for Ethernet controller 3, or the cable attached to Ethernet controller 3. v P1-I2/E3# identifies the cable attached to Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1.The period (.) identifies sublocations (SCSI addresses, cables). The following are examples: v P1-C1.1 identifies processor 1 on processor card (MCM) plugged into planar (backplane) P1. v P2-Z1-A3.1 identifies a SCSI device with SCSI address of LUN 1 at SCSI ID 3 attached to SCSI bus 1 from planar (backplane) 2. v P1-I2/E3.2# identifies the second in a series of cables attached to Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1.Depending on the AIX and firmware levels, AIX diagnostics may include the extended location information when identifying a planar (backplane) or card. The extended location information or cable information is always included when identifying a cable or connector. Location codes with extended location information that are displayed without a description identifying the devices always identify the cable attached to the port. 64
406/90
Service
Guide
Multiple FRU Callout Instructions If an eight-digit error code appears in the operator panel display or in Chapter 5, “Error Code to FRU Index,” on page 413, a location code for a failing part may also be specified. If the location code includes a blank space followed by a lowercase x followed by a number, this is an error code with multiple FRU callouts. This error can typically happen with memory books or processors, and may involve mixed types of parts. In this case, check the system’s configuration for FRU part numbers to determine the appropriate set of FRUs. You can determine the FRU part numbers of the electronic assemblies in the entire system by using the service processor menus. From the General User menu, select Read VPD Image from Last System Boot, then enter 90 to display detailed Vital Product Data (VPD). You can determine the FRU part numbers of the electronic assemblies in a partition by using the lscfg -vp | pg command on the AIX command line to display the detailed VPD of all assemblies. Notice that the FRU part number information for processors and memory books may be at the bottom of the command output. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to: v Rack identify LED v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – Processor subsystem FRUs – I/O subsystem FRUsv I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack-identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed (when the processor subsystem is in the failed state and power is on) by using the service processor LED Control Menu contained in the System Information Menu of the Privileged User Menus. For information about using the LED Control Menu, see page 788. Any identify LED in the system can also be flashed by using the Identify and Attention Indicators task in diagnostics. The procedure to use the Identify and Attention Indicators task in diagnostics is outlined in Diagnostic Information for Multiple Bus Systems. If the service processor menus and AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of the following procedures: v If the system is configured as a full partition, the system may be booted to the open firmware prompt and the command FRU-LED-MENU entered. A menu displays that allows you to enable the desired FRU identify LED. See “System Power Control Menu” on page 778 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. v If the system is logically partitioned, the HMC must be attached. You can use the HMC to enable any FRU identify LED to be flashed. See the Hardware Management Console Operations Guide for instructions on activating and deactivating a FRU identify LED.
Chapter
1.
Reference
Information
65
AIX Location Codes Note: AIX location codes are not available if Linux is the operating system. The basic formats of the AIX location codes are as follows: v For non-SCSI devices/drives: AB-CD-EF-GHv For SCSI devices/drives: AB-CD-EF-G,HNon-SCSI Devices For planars (backplanes), cards, and non-SCSI devices, the location code is defined as follows: AB-CD-EF-GH | | | | | | | Device/FRU/Port ID | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus v The AB value identifies a bus type or PCI parent bus as assigned by the firmware. v The CD value identifies adapter number, adapter’s devfunc number, or physical location. The devfunc number is defined as the PCI device number times 8, plus the function number. v The EF value identifies a connector. v The GH value identifies a port, address, device, or FRU.Adapters and cards are identified only with AB-CD. The possible values for CD depend on the adapter/card. For pluggable PCI adapters/cards, CD is the device’s devfunc number (PCI device number times 8, plus the function number). The C and D are characters in the range of 0-9, and A-F (hex numbers). The location codes therefore uniquely identify multiple adapters on individual PCI cards. EF is the connector ID, used to identify the adapter’s connector to which a resource is attached. GH is used to identify a port, device, or FRU. For example: v For asynchronous devices, GH defines the port on the fanout box. The values are 00 to 15. v For a diskette drive, H identifies either diskette drive 1 or 2. G is always 0. v For all other devices, GH is equal to 00.For integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For example, the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01. 66
406/90
Service
Guide
SCSI Devices For SCSI devices, the location code is defined as follows: AB-CD-EF-G,H | | | | | | | | | Logical Unit address of the SCSI Device | | | Control Unit Address of the SCSI Device | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus Where: v AB-CD-EF are the same as non-SCSI devices. v G defines the control unit address of the device. Values of 0 to 15 are valid. v H defines the logical unit address of the device. Values of 0 to 255 are valid.A bus location code is also generated as ’00-XXXXXXXX’, where XXXXXXXX is equivalent to the node’s unit address. Examples of physical location codes and AIX location codes are: PCI adapter in first I/O subsystem, slot 1 (primary rack): – Physical location code U1.9-P1-I1 – AIX location code 2V-08 PCI adapter in fifth I/O subsystem, slot 1 (secondary rack): – Location Code U2.1-P1-I1 – AIX location Code 8V-08 Chapter
1.
Reference
Information
67
AIX and Physical Location Code Reference Tables Attention: AIX location codes are not available when the operating system is Linux. The following illustrations show the location codes that are used to identify the major functional units in the 406/90 primary and secondary racks. The following figure shows the primary rack locations. V7U1.18-P1-V8REARFRONTU1.35-P1-X2U1.35-P2-X2U1.35-P1-X4U1.35-P2-X4U1.17U1.17U1.9U1.9U1.1U1.1U1.5U1.5U1.13-P1-U1.15-P1-V1orU1.13V2orU1.13U1.13-P2-V3orU1.13U1.15-P2-V4orU1.13U1.18-F1U1.18-X1U1.18-F2U1.18-X2U1.35-P1-F1U1.35-P2-F1U1.18-P1-M3U1.18-P1-M4U1.18-P1-M2U1.18-P1-M1U1.18-P1-M7U1.18-P1-M8U1.18-P1-M6U1.18-P1-M5U1.18-F3U1.18-X3U1.18-X4U1.18-F4U1.35-P1-X1UnusedU1.35-P2-X1UnusedU1.35-P1-U1.35-P1-V3U1.35-P1-V2V1U1.35-P2-U1.35-P2-V2U1.35-P2-V3V1U1.35-P1-X3U1.35-P2-X3U1.18-P1-V3U1.18-P1-V6U1.18-P1-V5U1.18-P1-H2U1.18-P1-H4U1.18-P1-H3U1.18-P1-H1U1.18-P1-V4U1.18-P1-U1.18-P1-V1U1.18-P1-V2 68
406/90
Service
Guide
The following figure shows the secondary rack locations. REARFRONTU2.17-P1-V5U2.13U2.13U2.17-P2-V6U2.9U2.1U2.1U2.5U2.5U2.9U2.19U2.19 Chapter
1.
Reference
Information
69
The following illustration shows the location codes that are used to identify the MCM modules and L3 modules (located behind fans 3--U1.18-P1-F3 and 4--U1.18-P1-F4). AA123456789101112131415161718192021
1
L3, Location: U1.18-P1-C5
12
L3, Location: U1.18-P1-C16
2
L3, Location: U1.18-P1-C6
13
L3, Location: U1.18-P1-C20
3
L3, Location: U1.18-P1-C10
14
L3, Location: U1.18-P1-C19
4
MCM 0, Location: U1.18-P1-C1
15
MCM 2, Location: U1.18-P1-C4
5
Clock Card, Location: U1.18-P1-X5
16
L3, Location: U1.18-P1-C18
6
L3, Location: U1.18-P1-C7
17
L3, Location: U1.18-P1-C17
7
L3, Location: U1.18-P1-C8
18
L3, Location: U1.18-P1-C13
8
L3, Location: U1.18-P1-C12
19
L3, Location: U1.18-P1-C14
9
L3, Location: U1.18-P1-C11
20
MCM 3, Location: U1.18-P1-C2
10
MCM 1, Location: U1.18-P1-C3
21
L3, Location: U1.18-P1-C9
11
L3, Location: U1.18-P1-C15
70
406/90
Service
Guide
The following tables contain location codes that are used to identify functional units in the 406/90 systems. Each table shows the locations for a physical part of the system.
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
System Rack
(MT/M Serial#) U1
Bulk Power Subsystem 1 Locations
Bulk Power Enclosure A (front)
U1.35
BPE Backplane A
U1.35-P1
AC to BPE (A) Line Cord Connector
U1.35-P1/Q1
AC to BPE (A) Line Cord
U1.35-P1/Q1#
Bulk Power Reg 1
U1.35-P1-V1
BPR 1 Connector
U1.35-P1-V1/Q1
BPR 1 Cable to IBF 1A
U1.35-P1-V1/Q1#
Bulk Power Distribution 3 (Unused)
U1.35-P1-X1
SPARE
U1.35-P1-X1/Q1
SPARE Cable
U1.35-P1-X1/Q1#
SPARE
U1.35-P1-X1/Q2
SPARE Cable
U1.35-P1-X1/Q2#
SPARE
U1.35-P1-X1/Q3
SPARE Cable
U1.35-P1-X1/Q3#
SPARE
U1.35-P1-X1/Q4
SPARE Cable
U1.35-P1-X1/Q4#
SPARE
U1.35-P1-X1/Q5
SPARE Cable
U1.35-P1-X1/Q5#
SPARE
U1.35-P1-X1/Q6
SPARE Cable
U1.35-P1-X1/Q6#
SPARE
U1.35-P1-X1/Q7
SPARE Cable
U1.35-P1-X1/Q7#
SPARE
U1.35-P1-X1/Q8
SPARE Cable
U1.35-P1-X1/Q8#
SPARE
U1.35-P1-X1/Q9
SPARE Cable
U1.35-P1-X1/Q9#
SPARE
U1.35-P1-X1/Q10
SPARE Cable
U1.35-P1-X1/Q10#
Bulk Power Reg 2 (BPR)
U1.35-P1-V2
BPR 2 (BPR) Connector
U1.35-P1-V2/Q1
BPR 2 Cable to IBF 2A
U1.35-P1-V2/Q1#
Bulk Power Distribution 1
U1.35-P1-X3 Chapter
1.
Reference
Information
71
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem -1A Connector 0 I/O Subsystem 1 (DCA 1)
U1.35-P1-X3/Q1
I/O Subsystem -1A Cable
U1.35-P1-X3/Q1#
I/O Subsystem -1B Connector 1 I/O Subsystem 1 (DCA 2)
U1.35-P1-X3/Q2
I/O Subsystem -1B Cable
U1.35-P1-X3/Q2#
I/O Subsystem -2A Connector 2 I/O Subsystem 2 (DCA 1)
U1.35-P1-X3/Q3
I/O Subsystem -2A Cable
U1.35-P1-X3/Q3#
I/O Subsystem -2B Connector 3 I/O Subsystem 2 (DCA 2)
U1.35-P1-X3/Q4
I/O Subsystem -2B Cable
U1.35-P1-X3/Q4#
I/O Subsystem -3A Connector 4 I/O Subsystem 3 (DCA 1)
U1.35-P1-X3/Q5
I/O Subsystem -3A Cable
U1.35-P1-X3/Q5#
I/O Subsystem -3B Connector 5 I/O Subsystem 3 (DCA 2)
U1.35-P1-X3/Q6
I/O Subsystem -3B Cable
U1.35-P1-X3/Q6#
I/O Subsystem -4A Connector 6 I/O Subsystem 4 (DCA 1)
U1.35-P1-X3/Q7#
I/O Subsystem -4A Cable
U1.35-P1-X3/Q7#
I/O Subsystem -4B Connector 7 I/O Subsystem 4 (DCA 2)
U1.35-P1-X3/Q8
I/O Subsystem -4B Cable
U1.35-P1-X3/Q8#
DCA-4 Connector
U1.35-P1-X3/Q9
DCA-4 Cable
U1.35-P1-X3/Q9#
DCA-5 Connector
U1.35-P1-X3/Q10
DCA-5 Cable
U1.35-P1-X3/Q10#
Bulk Power Distribution 2
U1.35-P1-X2
I/O Subsystem -5A Connector 0 I/O Subsystem 5 (DCA 1)
U1.35-P1-X2/Q1
I/O Subsystem -5A Cable
U1.35-P1-X2/Q1#
I/O Subsystem -5B Connector 1 I/O Subsystem 5 (DCA 2)
U1.35-P1-X2/Q2
I/O Subsystem -5B Cable
U1.35-P1-X2/Q2# 72
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem -6A Connector 2 I/O Subsystem 6 (DCA 1)
U1.35-P1-X2/Q3
I/O Subsystem -6A Cable
U1.35-P1-X2/Q3#
I/O Subsystem -6B Connector 3 I/O Subsystem 6 (DCA 2)
U1.35-P1-X2/Q4
I/O Subsystem -6B Cable
U1.35-P1-X2/Q4#
I/O Subsystem -7A Connector 4 I/O Subsystem 7 (DCA 1)
U1.35-P1-X2/Q5
I/O Subsystem -7A Cable
U1.35-P1-X2/Q5#
I/O Subsystem -7B Connector 5 I/O Subsystem 7 (DCA 2)
U1.35-P1-X2/Q6
I/O Subsystem -7B Cable
U1.35-P1-X2/Q6#
SPARE (Future Subsystem 8)
U1.35-P1-X2/Q7
SPARE cable
U1.35-P1-X2/Q7#
SPARE 5 (Future Subsystem 8)
U1.35-P1-X2/Q8
SPARE cable
U1.35-P1-X2/Q8#
SPARE connector
U1.35-P1-X2/Q9
SPARE cable
U1.35-P1-X2/Q9#
DCA-6 Connector
U1.35-P1-X2/Q10
DCA-6 Cable
U1.35-P1-X2/Q10#
Bulk Power Regulator 3
U1.35-P1-V3
BPR 3 Connector
U1.35-P1-V3/Q1
BPR 3 Cable to IBF 3A
U1.35-P1-V3/Q1#
Bulk Power Fan (BPF)
U1.35-P1-F1
BPF TO BPC Connector
U1.35-P1-F1/Q1
BPF TO BPC Cable
U1.35-P1-F1/Q1#
Bulk Power Controller A
U1.35-P1-X4
BPC A - MDA-1 Connector
U1.35-P1-X4/Q1
BPC A - MDA-1 Cable
U1.35-P1-X4/Q1#
BPC A - MDA-2 Connector
U1.35-P1-X4/Q2
BPC A - MDA-2 Cable
U1.35-P1-X4/Q2#
BPC A - MDA-3 Connector
U1.35-P1-X4/Q3
BPC A - MDA-3 Cable
U1.35-P1-X4/Q3# Chapter
1.
Reference
Information
73
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
BPC A - MDA-4 Connector
U1.35-P1-X4/Q4
BPC A - MDA-4 Cable
U1.35-P1-X4/Q4#
BPC A - DCA-1 Connector
U1.35-P1-X4/Q5
BPC A - DCA-1 Cable
U1.35-P1-X4/Q5#
BPC A - DCA-2 Connector
U1.35-P1-X4/Q6
BPC A - DCA-2 Cable
U1.35-P1-X4/Q6#
BPC A - DCA-3 Connector
U1.35-P1-X4/Q7
BPC A - DCA-3 Cable
U1.35-P1-X4/Q7#
BPC A TO BPC B Connector
U1.35-P1-X4/Q8
BPC A TO BPC B Cable
U1.35-P1-X4/Q8#
BPC A To Primary I/O Book Connector U1.18-P1-H2/Q8
U1.35-P1-X4/Q10
BPC A ?Y? Cable To Primary I/O Book Connector U1.18-P1-H2/Q8
U1.35-P1-X4/Q10#
BPC A TO BPF A Connector
U1.35-P1-X4/Q11
BPC A TO BPF A Cable
U1.35-P1-X4/Q11#
UEPO Switch (attached to frame)
U1-V1
UEPO Connector J00
U1-V1/Q1
UEPO Cable to BPC A
U1-V1/Q1#
UEPO Connector J01
U1-V1/Q2
UEPO Cable to BPC B
U1-V1/Q2#
Bulk Power Enclosure B (back)
U1.35
BPE Backplane B
U1.35-P2
AC TO BPE (A) Line Cord Connector
U1.35-P2/Q1
AC TO BPE (A) Line Cord
U1.35-P2/Q1#
Bulk Power Regulator (BPR) 1
U1.35-P2-V1
BPR 1 Connector
U1.35-P2-V1/Q1
BPR 1 cable to IBF 1A
U1.35-P2-V1/Q1#
Bulk Power Distribution 2
U1.35-P2-X2 74
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem -5A Connector 0 I/O Subsystem 5 (DCA 1)
U1.35-P2-X2/Q1
I/O Subsystem -5A Cable
U1.35-P2-X2/Q1#
I/O Subsystem -5B Connector 1 I/O Subsystem 5 (DCA 2)
U1.35-P2-X2/Q2
I/O Subsystem -5B Cable
U1.35-P2-X2/Q2#
I/O Subsystem -6A Connector 2 I/O Subsystem 6 (DCA 1)
U1.35-P2-X2/Q3
I/O Subsystem -6A Cable
U1.35-P2-X2/Q3#
I/O Subsystem -6B Connector 3 I/O Subsystem 6 (DCA 2)
U1.35-P2-X2/Q4
I/O Subsystem -6B Cable
U1.35-P2-X2/Q4#
I/O Subsystem -7A Connector 4 I/O Subsystem 7 (DCA 1)
U1.35-P2-X2/Q5
I/O Subsystem -7A cable
U1.35-P2-X2/Q5#
I/O Subsystem -7B Connector 5 I/O Subsystem 7 (DCA 2)
U1.35-P2-X2/Q6
I/O Subsystem -7B cable
U1.35-P2-X2/Q6#
SPARE (Future Drawer 8)
U1.35-P2-X2/Q7
SPARE cable
U1.35-P2-X2/Q7#
SPARE 5 (Future Drawer 8)
U1.35-P2-X2/Q8
SPARE cable
U1.35-P2-X2/Q8#
SPARE
U1.35-P2-X2/Q9
SPARE cable
U1.35-P2-X2/Q9#
DCA-6 Connector / F UTURE USE
U1.35-P2-X2/Q10
DCA-6 Cable / FUTURE USE
U1.35-P2-X2/Q10#
Bulk Power Regulator 2
U1.35-P2-V2
BPR 2 Connector
U1.35-P2-V2/Q1
BPR 2 Cable to IBF 2A
U1.35-P2-V2/Q1#
Bulk Power Distribution 1
U1.35-P2-X3
I/O Subsystem -1A Connector 0 I/O Subsystem 1 (DCA 1)
U1.35-P2-X3/Q1
I/O Subsystem -1A Cable
U1.35-P2-X3/Q1# Chapter
1.
Reference
Information
75
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem -1B Connector 1 I/O Subsystem 1 (DCA 2)
U1.35-P2-X3/Q2
I/O Subsystem -1B Cable
U1.35-P2-X3/Q2#
I/O Subsystem -2A Connector 2 I/O Subsystem 2 (DCA 1)
U1.35-P2-X3/Q3
I/O Subsystem -2A Cable
U1.35-P2-X3/Q3#
I/O Subsystem -2B Connector 3 I/O Subsystem 2 (DCA 2)
U1.35-P2-X3/Q4
I/O Subsystem -2B Cable
U1.35-P2-X3/Q4#
I/O Subsystem -3A Connector 4 I/O Subsystem 3 (DCA 1)
U1.35-P2-X3/Q5
I/O Subsystem -3A Cable
U1.35-P2-X3/Q5#
I/O Subsystem -3B Connector 5 I/O Subsystem 3 (DCA 2)
U1.35-P2-X3/Q6
I/O Subsystem -3B Cable
U1.35-P2-X3/Q6#
I/O Subsystem -4A Connector 6 I/O Subsystem 4 (DCA 1)
U1.35-P2-X3/Q7
I/O Subsystem -4A Cable
U1.35-P2-X3/Q7#
I/O Subsystem -4B Connector 7 I/O Subsystem 4 (DCA 2)
U1.35-P2-X3/Q8
I/O Subsystem -4B Cable
U1.35-P2-X3/Q8#
DCA-4 Connector
U1.35-P2-X3/Q9
DCA-4 Cable
U1.35-P2-X3/Q9#
DCA-5 Connector
U1.35-P2-X3/Q10
DCA-5 Cable
U1.35-P2-X3/Q10#
Bulk Power Regulator 3
U1.35-P2-V3
BPR 2 Connector
U1.35-P2-V3/Q1
BPR 2 Cable to IBF 3A
U1.35-P2-V3/Q1#
Bulk Power Distribution (Unused)
U1.35-P2-X1
SPARE
U1.35-P2-X1/Q1
SPARE cable
U1.35-P2-X1/Q1#
SPARE
U1.35-P2-X1/Q2
SPARE cable
U1.35-P2-X1/Q2#
SPARE
U1.35-P2-X1/Q3
SPARE cable
U1.35-P2-X1/Q3#
SPARE
U1.35-P2-X1/Q4
76
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SPARE cable
U1.35-P2-X1/Q4#
SPARE
U1.35-P2-X1/Q5
SPARE cable
U1.35-P2-X1/Q5#
SPARE
U1.35-P2-X1/Q6
SPARE cable
U1.35-P2-X1/Q6#
SPARE
U1.35-P2-X1/Q7
SPARE cable
U1.35-P2-X1/Q7#
SPARE
U1.35-P2-X1/Q8
SPARE cable
U1.35-P2-X1/Q8#
SPARE
U1.35-P2-X1/Q9
SPARE cable
U1.35-P2-X1/Q9#
SPARE
U1.35-P2-X1/Q10
SPARE cable
U1.35-P2-X1/Q10#
Bulk Power Fan
U1.35-P2-F1
BPF TO BPC Connector
U1.35-P2-F1/Q1
BPF TO BPC Cable
U1.35-P2-F1/Q1#
Bulk Power Controller B
U1.35-P2-X4
BPC B - MDA-1 Connector
U1.35-P2-X4/Q1
BPC B - MDA-1 Cable
U1.35-P2-X4/Q1#
BPC B - MDA-2 Connector
U1.35-P2-X4/Q2
BPC B - MDA-2 Cable
U1.35-P2-X4/Q2#
BPC B - MDA-3 Connector
U1.35-P2-X4/Q3
BPC B - MDA-3 Cable
U1.35-P2-X4/Q3#
BPC B - MDA-4 Connector
U1.35-P2-X4/Q4
BPC B - MDA-4 Cable
U1.35-P2-X4/Q4#
BPC B - DCA-1 Connector
U1.35-P2-X4/Q5
BPC B - DCA-1 Cable
U1.35-P2-X4/Q5#
BPC B - DCA-2 Connector
U1.35-P2-X4/Q6
BPC B - DCA-2 Cable
U1.35-P2-X4/Q6#
BPC B - DCA-3 Connector
U1.35-P2-X4/Q7
BPC B - DCA-3 Cable
U1.35-P2-X4/Q7#
BPC TO BPC Connector
U1.35-P2-X4/Q8
BPC TO BPC Cable
U1.35-P2-X4/Q8# Chapter
1.
Reference
Information
77
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
BPC TO EPOC Connector
U1.35-P2-X4/Q9
BPC TO EPOC Cable
U1.35-P2-X4/Q9#
BPC TO Primary I/O Book Connector U1.18-P1-H2/Q8
U1.35-P2-X4/Q10
BPC B ?Y? Cable To Primary I/O Book Connector U1.18-P1-H2/Q8
U1.35-P2-X4/Q10#
BPC TO BPF Connector
U1.35-P2-X4/Q11
BPC TO BPF Cable
U1.35-P2-X4/Q11#
UEPO TO BPC A Connector
U1-V1/Q1
UEPO TO BPC A Cable
U1-V1/Q1#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Processor Subsystem Locations
Processor Subsystem
(MT/M Serial #) U1.18
System Board (backplane)
U1.18-P1
Memory Book - Slot 0
U1.18-P1-M2
00-00
Memory Book - Slot 1
U1.18-P1-M3
00-00
Memory Book - Slot 3
U1.18-P1-M6
00-00
Memory Book - Slot 2
U1.18-P1-M7
00-00
Memory Book - Slot 7
U1.18-P1-M1
00-00
Memory Book - Slot 4
U1.18-P1-M4
00-00
Memory Book - Slot 6
U1.18-P1-M5
00-00
Memory Book - Slot 5
U1.18-P1-M8
00-00
Primary I/O book - Slot 0
U1.18-P1-H2
System VPD Module (Backup)
U1.18-P1-H2-N1
Diskette Drive Controller
U1.18-P1-H2/D1
01-D1
Serial Port 1 Connector
U1.18-P1-H2/S1
01-S1
Serial Port 1 Cable
U1.18-P1-H2/S1#
Serial Port 2 Connector
U1.18-P1-H2/S2
01-S2
Serial Port 2 Cable
U1.18-P1-H2/S2#
HMC Serial Port 1 Connector
U1.18-P1-H2/S3
01-S3
HMC Serial Port 1 Cable
U1.18-P1-H2/S3#
HMC Serial Port 2 Connector
U1.18-P1-H2/S4
01-S4
HMC Serial Port 2 Cable
U1.18-P1-H2/S4#
Debug Port Only
U1.18-P1-H2/R1
01-R1 78
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Operator Panel Connector
U1.18-P1-H2/Q7
Diskette Drive Connector
U1.18-P1-H2/Q10
Diskette Drive Cable
U1.18-P1-H2/Q10#
BPC A & B Connector
U1.18-P1-H2/Q8
BPC A Connector
U1.18-P1-H2/Q8
BPC A Cable
U1.18-P1-H2/Q8#
BPC B Connector
U1.18-P1-H2/Q9
BPC B Cable
U1.18-P1-H2/Q9#
SPCN Port 0 Connector
U1.18-P1-H2/Q5
SPCN Port 0 Cable
U1.18-P1-H2/Q5#
SPCN Port 1 Connector
U1.18-P1-H2/Q6
SPCN Port 1 Cable
U1.18-P1-H2/Q6#
JTAG
U1.18-P1-H2
NVRAM
U1.18-P1-H2
System Firmware
U1.18-P1-H2/Y1
Primary I/O book-0 Port 0 Connector 0 (1-2) (A0),
U1.18-P1-H2/Q1
Primary I/O book-0 Port 0 (1-2) Cable 1, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H2/Q1#
Primary I/O book-0 Port 0 Connector 1 (1-3) (A1)
U1.18-P1-H2/Q2
Primary I/O book-0 Port 0 Connector 1 (1-3) Cable 2, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H2/Q2#
Primary I/O book-1 Port 1 Connector 2 (2-0) (B0)
U1.18-P1-H2/Q3
Primary I/O book-1 Port 1 Connector 2 (2-0) Cable 3, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H2/Q3#
Primary I/O book-1 Port 1 Connector 3 (2-1) (B1),
U1.18-P1-H2/Q4
Primary I/O book-1 Port 1 Connector 3 (2-1) Cable 4 see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H2/Q4#
Chapter
1.
Reference
Information
79
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
ISA Bridge
U1.18-P1-H2
Real Time Clock (RTC)
U1.18-P1-H2
Battery (TOD)
U1.18-P1-H2-V1
Error logging VPD Card (MCM -0)
U1.18-P1-H2.3
Error logging VPD Card (MCM -1)
U1.18-P1-H2.5
Error logging VPD Card (MCM -2)
U1.18-P1-H2.4
Error logging VPD Card (MCM -3)
U1.18-P1-H2.6
Error logging VPD Card (L3)
U1.18-P1-H2.1
COD Security VPD Card
U1.18-P1-H2.2
SNI Adapter for High Performance Switch Book 1, Slot 1 Note: There are two types of SNI adapter books. One type has two ports and the other has four ports. Either type can be plugged in U1.18-P1-H1.
U1.18-P1-H1
SNI adapter book 1, connector 1
U1.18-P1-H1/Q1
SNI adapter book 1, cable attached to connector 1
U1.18-P1-H1/Q1#
SNI adapter book 1, connector 2
U1.18-P1-H1/Q2
SNI adapter book 1, cable attached to connector 2
U1.18-P1-H1/Q2#
SNI adapter book 1, connector 3
U1.18-P1-H1/Q3
SNI adapter book 1, cable attached to connector 3
U1.18-P1-H1/Q3#
SNI adapter book 1, connector 4
U1.18-P1-H1/Q4
SNI adapter book 1, cable attached to connector 4
U1.18-P1-H1/Q4#
I/O Book 2, Slot 2
U1.18-P1-H3
I/O Book 2, Port 11 Connector 3 (1-1) (B1)
U1.18-P1-H3/Q1
I/O Book 2, Port 11 Connector 3 (1-1) Cable 1, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections.
U1.18-P1-H3/Q1#
I/O Book 2, Port 11 Connector 2 (1-0) (B0)
U1.18-P1-H3/Q2
I/O Book 2, Port 11 Connector 2 (1-0) Cable 2, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections.
U1.18-P1-H3/Q2#
I/O Book 2, Port 10 Connector 1 (2-1) (A1)
U1.18-P1-H3/Q3 80
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Book 2, Port 10 Connector 1 (2-1) Cable 3, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections.
U1.18-P1-H3/Q3#
I/O Book 2, Port 10 Connector 0 (2-0) (A0)
U1.18-P1-H3/Q4
I/O Book 2, Port 10 Connector 0 (2-0) Cable 4, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections.
U1.18-P1-H3/Q4#
I/O Book 2, Port 5 Connector 3 (1-1) (D1)
U1.18-P1-H3/Q5
I/O Book 2, Port 5 Connector 3 (1-1) Cable 5, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H3/Q5#
I/O Book 2, Port 5 Connector 2 (1-0) (D0)
U1.18-P1-H3/Q6
I/O Book 2, Port 5 Connector 2 (1-0) Cable 6, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H3/Q6#
I/O Book 2, Port 4 Connector 1 (2-1) (C1)
U1.18-P1-H3/Q7
I/O Book 2, Port 4 Connector 1 (2-1) Cable 7, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H3/Q7#
I/O Book 2, Port 4 Connector 0 (2-0) (C0)
U1.18-P1-H3/Q8
I/O Book 2, Port 4 Connector 1 (2-0) Cable 8 , see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections.
U1.18-P1-H3/Q8#
I/O Book 3 or SNI Adapter Book- Slot 3 Note: This slot may have either a secondary I/O adapter book or an SNI adapter book for the High Performance Switch. If there is an SNI adapter in U1.18-P1-H4, go to ?SNI Adapter for High Performance Switch? on page 82
U1.18-P1-H4
I/O Book 3, Port 12 Connector 2 (1-0) (C0)
U1.18-P1-H4/Q1 Chapter
1.
Reference
Information
81
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Book 3, Port 12 Connector 2 (1-0) Cable 2 , see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections.
U1.18-P1-H4/Q1#
I/O Book 3, Port 12 Connector 3 (1-1) (C1)
U1.18-P1-H4/Q2
I/O Book 3, Port 12 Connector 3 (1-1) Cable 1, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections.
U1.18-P1-H4/Q2#
I/O Book 3, Port 13 Connector 0 (2-0) (D0)
U1.18-P1-H4/Q3
I/O Book 3, Port 13 Connector 0 (2-0) Cable 4, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections.
U1.18-P1-H4/Q3#
I/O Book 3, Port 13 Connector 1 (2-1) (D1)
U1.18-P1-H4/Q4
I/O Book 3, Port 13 Connector 1 (2-1) Cable 3, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections.
U1.18-P1-H4/Q4#
I/O Book 3, Port 6 Connector 3 (1-1) (A0)
U1.18-P1-H4/Q5
I/O Book 3, Port 6 Connector 3 (1-1) Cable 5
U1.18-P1-H4/Q5#
I/O Book 3, Port 6 Connector 2 (1-0) (A1)
U1.18-P1-H4/Q6
I/O Book 3, Port 6 Connector 2 (1-0) Cable 6
U1.18-P1-H4/Q6#
I/O Book 3, Port 7 Connector 1 (2-1) (B0)
U1.18-P1-H4/Q7
I/O Book 3, Port 7 Connector 1 (2-1) Cable 7
U1.18-P1-H4/Q7#
I/O Book 3, Port 7 Connector 0 (2-0) (B1)
U1.18-P1-H4/Q8
I/O Book 3, Port 7 Connector 1 (2-0) Cable 8
U1.18-P1-H4/Q8#
SNI Adapter for High Performance Switch Book 1, Slot 1 Note: There are two types of SNI adapter books. One type has two ports and the other four ports. Either type can be plugged into U1.18-P1-H4.
U1.18-P1-H4
SNI adapter book 3, connector 1
U1.18-P1-H4/Q1
SNI adapter book 3, cable attached to connector 1
U1.18-P1-H4/Q1#
SNI adapter book 3, connector 2
U1.18-P1-H4/Q2 82
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SNI adapter book 3, cable attached to connector 2
U1.18-P1-H4/Q2#
SNI adapter book 3, connector 3
U1.18-P1-H4/Q3
SNI adapter book 3, cable attached to connector 3
U1.18-P1-H4/Q3#
SNI adapter book 3, connector 4
U1.18-P1-H4/Q4
SNI adapter book 3, cable attached to connector 4
U1.18-P1-H4/Q4#
DCA 1-RH
U1.18-P1-V1
DCA 1- BPA A - BPC - 5 Connector
U1.18-P1-V1/Q1
DCA 1- BPA A - BPC -5 Cable
U1.18-P1-V1/Q1#
DCA 1- BPA B - BPC - 5 Connector
U1.18-P1-V1/Q2
DCA 1- BPA B - BPC -5 Cable
U1.18-P1-V1/Q2#
DCA 2-RH
U1.18-P1-V2
DCA 2- BPA A - BPC - 6 Connector
U1.18-P1-V2/Q1
DCA 2- BPA A - BPC -6 Cable
U1.18-P1-V2/Q1#
DCA 2- BPA B - BPC - 6 Connector
U1.18-P1-V2/Q2
DCA 2- BPA B - BPC -6 Cable
U1.18-P1-V2/Q2#
DCA 3-RH
U1.18-P1-V4
DCA 3- BPA A - BPC - 7 Connector
U1.18-P1-V4/Q1
DCA 3- BPA A - BPC - 7 Cable
U1.18-P1-V4/Q1#
DCA 3- BPA B - BPC - 7 Connector
U1.18-P1-V4/Q2
DCA 3- BPA B - BPC - 7 Cable
U1.18-P1-V4/Q2#
DCA 4-RH
U1.18-P1-V5
DCA 4- BPA A - BPD1 - 9 Connector
U1.18-P1-V5/Q1
DCA 4- BPA A - BPD1 - 9 Cable
U1.18-P1-V5/Q1#
DCA 4- BPA B - BPD1 - 9 Connector
U1.18-P1-V5/Q2
DCA 4- BPA B - BPD1 - 9 Cable
U1.18-P1-V5/Q2#
DCA 5-RH
U1.18-P1-V6
DCA 5- BPA A - BPD1 - 10 Connector
U1.18-P1-V6/Q1
DCA 5- BPA A - BPD1 - 10 Cable
U1.18-P1-V6/Q1#
DCA 5- BPA B - BPD1 - 10 Connector
U1.18-P1-V6/Q2
DCA 5- BPA B - BPD1 - 10 Cable
U1.18-P1-V6/Q2#
DCA 6-RH
U1.18-P1-V8
DCA 6- BPA A - BPD2 - 10 Connector
U1.18-P1-V8/Q1
DCA 6- BPA A - BPD2 - 10 Cable
U1.18-P1-V8/Q1#
DCA 6- BPA B - BPD2 - 10 Connector
U1.18-P1-V8/Q2
DCA 6- BPA B - BPD2 - 10 Cable
U1.18-P1-V8/Q2#
RH CAP Card
U1.18-P1-V3
RH CAP Card
U1.18-P1-V7 Chapter
1.
Reference
Information
83
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Hourglass Card Assembly
U1.18-P1-X5
MCM Module 0 4 & 8 way
U1.18-P1-C1
Processor 0 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-00
Processor 1 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-01
Processor 2 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-02
Processor 3 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-03
Processor 4 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-04
Processor 5 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-05
Processor 6 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-06
Processor 7 (W/ L1 & L2 Cache)
U1.18-P1-C1
00-07
MCM Module 3
U1.18-P1-C2
Processor 24 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-24
Processor 25 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-25
Processor 26 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-26
Processor 27 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-27
Processor 28 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-28
Processor 29 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-29
Processor 30 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-30
Processor 31 (W/ L1 & L2 Cache)
U1.18-P1-C2
00-31
MCM Module 1
U1.18-P1-C3
Processor 08 (W/ L1 & L2 Cache)
U1.18-P1-C3
00-08
Processor 09 (W/ L1 & L2 Cache)
U1.18-P1-C3
00-09
Processor 10 (W/ L1 & L2 Cache)
U1.18-P1-C3
00-10
Processor 11 (W/ L1 & L2 Cache
U1.18-P1-C3
00-11
Processor 12 (W/ L1 & L2 Cache
U1.18-P1-C3
00-12
Processor 13 (W/ L1 & L2 Cache
U1.18-P1-C3
00-13
Processor 14 (W/ L1 & L2 Cache
U1.18-P1-C3
00-14
Processor 15 (W/ L1 & L2 Cache
U1.18-P1-C3
00-15
MCM Module 2
U1.18-P1-C4
Processor 16 (W/ L1 & L2 Cache
U1.18-P1-C4
00-16
Processor 17 (W/ L1 & L2 Cache
U1.18-P1-C4
00-17
Processor 18 (W/ L1 & L2 Cache
U1.18-P1-C4
00-18
Processor 19 (W/ L1 & L2 Cache
U1.18-P1-C4
00-19
Processor 20 (W/ L1 & L2 Cache
U1.18-P1-C4
00-20
Processor 21 (W/ L1 & L2 Cache)
U1.18-P1-C4
00-21
Processor 22 (W/ L1 & L2 Cache)
U1.18-P1-C4
00-22
Processor 21 (W/ L1 & L2 Cache)
U1.18-P1-C4
00-23
L3 Module (Proc 3D)
U1.18-P1-C5
00-00
L3 Module (Proc 0A)
U1.18-P1-C6
00-00 84
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
L3 Module (Proc 0B)
U1.18-P1-C7
00-00
L3 Module (Proc 1C)
U1.18-P1-C8
00-00
L3 Module (Proc 3B)
U1.18-P1-C9
00-00
L3 Module (Proc 0C)
U1.18-P1-C10
00-00
L3 Module (Proc 0D)
U1.18-P1-C11
00-00
L3 Module (Proc 1A)
U1.18-P1-C12
00-00
L3 Module (Proc 3A)
U1.18-P1-C13
00-00
L3 Module (Proc 2D)
U1.18-P1-C14
00-00
L3 Module (Proc 2C)
U1.18-P1-C15
00-00
L3 Module (Proc 1B)
U1.18-P1-C16
00-00
L3 Module (Proc 3C)
U1.18-P1-C17
00-00
L3 Module (Proc 2B)
U1.18-P1-C18
00-00
L3 Module (Proc 2A)
U1.18-P1-C19
00-00
L3 Module (Proc 1D)
U1.18-P1-C20
00-00
Passthru Card Module 1
U1.18-P1-C3
00-00
Passthru Card Module 3
U1.18-P1-C2
00-00
Fan Controller Assembly 1
U1.18-X1
Fan Controller 1 TO BPC A Connector
U1.18-X1/Q1
Fan Controller 1 TO BPC A Cable
U1.18-X1/Q1#
Fan Controller 1 TO BPC B Connector
U1.18-X1/Q2
Fan Controller 1 TO BPC B Cable
U1.18-X1/Q2#
Fan Controller Assembly 2
U1.18-X2
Fan Controller 2 TO BPC A Connector
U1.18-X2/Q1
Fan Controller 2 TO BPC A Connector
U1.18-X2/Q1#
Fan Controller 2 TO BPC B Connector
U1.18-X2/Q2#
Fan Controller 2 TO BPC B Connector
U1.18-X2/Q2#
Fan Controller Assembly 3
U1.18-X3
Fan Controller 3 TO BPC A Connector
U1.18-X3/Q1
Fan Controller 3 TO BPC A Cable
U1.18-X3/Q1#
Fan Controller 3 TO BPC B Connector
U1.18-X3/Q2
Fan Controller 3 TO BPC B Cable
U1.18-X3/Q2#
Fan Controller Assembly 4
U1.18-X4
Fan Controller 4 TO BPC A Connector
U1.18-X4/Q1
Fan Controller 4 TO BPC A Cable
U1.18-X4/Q1#
Fan Controller 4 TO BPC B Connector
U1.18-X4/Q2
Fan Controller 4 TO BPC B Cable
U1.18-X4/Q2#
Fan Assembly 1
U1.18-F1
Fan Assembly 2
U1.18-F2
Fan Assembly 3
U1.18-F3 Chapter
1.
Reference
Information
85
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Fan Assembly 4
U1.18-F4
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Media Subsystem Locations
Media Drawer chassis
U1.17
Media Drawer - Operator Panel,
U1.17-L1
Operator Panel Connector
U1.17-L1/Q1
Operator Panel Cable
U1.17-L1/Q1#
System VPD Module
U1.17-L1-N1
Media Drawer - Diskette
U1.17-D1
Diskette Drive Connector
U1.17-D1/Q2
Diskette Drive Cable
U1.17-D1/Q2#
Media Drawer - DVD-RAM/CD-ROM
U1.x-Px-Ix/Z1-A1
Media Drawer - Tape Drive
U1.x-Px-Ix/Z1-A0
Media Drawer connector
U1.17-A1/Q3
Media Drawer cable
U1.17-A1/Q3#
Media Drawer Power Connector
U1.17-A1/Q4
Media Drawer Power Cable
U1.17-A1/Q4#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Integrated Battery Feature Locations (If Installed)
IBF (Battery) 1A to BPA A
U1.13-P1-V1
If Installed
IBF 1A Connector
U1.13-P1-V1/Q1
If Installed
IBF 1A Cable to BPA A - BPR 1
U1.13-P1-V1/Q1#
If Installed
IBF (Battery) 2A to BPA A
U1.15-P1-V2
If Installed
IBF 2A Connector
U1.15-P1-V2/Q1
If Installed
IBF 2A Cable to BPA A - BPR 2
U1.15-P1-V2/Q1#
If Installed
IBF (Battery) 1B to BPA B
U1.13-P2-V3
If Installed
IBF 1B Connector
U1.13-P2-V3/Q1
If Installed
IBF 1B Cable to BPA B -BPR 1
U1.13-P2-V3/Q1#
If Installed 86
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
IBF (Battery) 2B to BPA B
U1.15-P2-V4
If Installed
IBF 2B Connector
U1.15-P2-V4/Q1
If Installed
IBF 2B Cable to BPA B -BPR 2
U1.15-P2-V4/Q1#
If Installed
IBF (Battery) 3A to BPA A
U2.17-P1-V5
If Installed
IBF 3A Connector
U2.17-P1-V5/Q1
If Installed
IBF 3A Cable to BPA A - BPR 3
U2.17-P1-V5/Q1#
If Installed
IBF (Battery) 3B to BPA B
U2.17-P2-V6
If Installed
IBF 3B Connector
U2.17-P2-V6/Q1
If Installed
IBF 3B Cable to BPA B - BPR 3
U2.17-P2-V6/Q1#
If Installed
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 1 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U1.9
I/O Subsystem Left IO backplane assembly
U1.9-P1
I/O Subsystem Left IO backplane assembly VPD
U1.9-P1-N1
EADS 1 - PCI Controller
U1.9-P1
2U-58, 2U-5A, 2U-5C, 2U-5E
PCI Slot 1 Content
U1.9-P1-I1
2V-08 to 2V-0For 2W-xx or 2X-xx
PCI Slot 2 Content
U1.9-P1-I2
2Y-08 to 2Y-0For 2Z-xx or 2a-xx
PCI Slot 3 Content
U1.9-P1-I3
2b-08 to 2b-0For 2c-xx or 2d-xx
PCI Slot 4 Content
U1.9-P1-I4
2e-08 to 2e-0For 2f-xx or 2g-xx
EADS 2 - PCI Controller
U1.9-P1
2j-58, 2j-5A, 2j-5E
PCI Slot 5 Content
U1.9-P1-I5
2k-08 to 2k-0For 2m-xx or 2n-xx
PCI Slot 6 Content
U1.9-P1-I6
2p-08 to 2p-0For 2q-xx or 2r-xx
PCI Slot 7 Content
U1.9-P1-I7
2v-08 to 2v-0For 2w-xx or 2x-xx
EADS 3 - PCI Controller
U1.9-P1
30-58, 30-5A, 30-5E
PCI Slot 8 Content
U1.9-P1-I8
31-08 to 31-0For 32-xx or 32-xx
PCI Slot 9 Content
U1.9-P1-I9
34-08 to 34-0For 35-xx or 36-xx Chapter
1.
Reference
Information
87
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 10 Content
U1.9-P1-I10
3A-08 to 3A-0For 3B-xx or 3C-xx
I/O Riser Card
U1.9-P1
I/O Port Connector
U1.9-P1/Q1
I/O Port Cable From GX1A-port 1
U1.9-P1/Q1#
I/O Port Connector
U1.9-P1/Q2
I/O Port Cable to port 0 on next Riser card U1.9-P2.1/Q1
U1.9-P1/Q2#
I/O Subsystem Right IO backplane assembly
U1.9-P2
I/O Subsystem Right IO backplane assembly VPD
U1.9-P2-N1
EADS 1 - PCI Controller
U1.9-P2
3E-58, 3E-5A, 3E-5C, 3E-5E
PCI Slot 1 Content
U1.9-P2-I1
3F-08 to 3F-0For 3G-xx or 3H-xx
PCI Slot 2 Content
U1.9-P2-I2
3J-08 to 3J-0For 3K-xx or 3L-xx
PCI Slot 3 Content
U1.9-P2-I3
3M-08 to 3M-0For 3N-xx or 3P-xx
PCI Slot 4 Content
U1.9-P2-I4
3Q-08 to 3Q-0For 3R-xx or 3S-xx
EADS 2 - PCI Controller
U1.9-P2
3U-58, 3U-5A, 3U-5E
PCI Slot 5 Content
U1.9-P2-I5
3V-08 to 3V-0For 3W-xx or 3X-xx
PCI Slot 6 Content
U1.9-P2-I6
3Y-08 to 3Y-0For 3Z-xx or 3a-xx
PCI Slot 7 Content
U1.9-P2-I7
3e-08 to 3e-0For 3f-xx or 3a-xx
EADS 3 - PCI Controller
U1.9-P2
3j-58, 3j-5A, 3j-5E
PCI Slot 8 Content
U1.9-P2-I8
3k-08 to 3k-0For 3m-xx or 3n-xx
PCI Slot 9 Content
U1.9-P2-I9
3p-08 to 3p-0For 3q-xx or 3r-xx
PCI Slot 10 Content
U1.9-P2-I10
3v-08 to 3v-0For 3w-xx or 3x-xx
I/O Riser Card
U1.9-P2
I/O Port Connector
U1.9-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U1.9-P2/Q1#
I/O Port Connector
U1.9-P2/Q2 88
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Port Cable to GX1A-port 2
U1.9-P2/Q2#
I/O Subsystem DCA 1
U1.9-V1
DCA 1 to (BPA B) BPD1- 1A Connector P00
U1.9-V1/Q1
DCA 1 to (BPA B) BPD1- 1A Cable
U1.9-V1/Q1#
DCA 1 to (BPA A) BPD1- 1A Connector P00
U1.9-V1/Q2
DCA 1 to (BPA A) BPD1- 1A Cable
U1.9-V1/Q2#
Thermal Sensor
U1.9-V1
I/O Subsystem DCA 2
U1.9-V2
DCA 2 to (BPA B) BPD1- 1B Connector P01
U1.9-V2/Q1
DCA 2 to (BPA B) BPD1- 1B Cable
U1.9-V2/Q1#
DCA 2 to (BPA A) BPD1- 1B Connector P01
U1.9-V2/Q2
DCA 2 to (BPA A) BPD1- 1B Cable
U1.9-V2/Q2#
Thermal Sensor
U1.9-V2
I/O Subsystem SCSI controller 1 on P1
U1.9-P1/Z1
2s-08
I/O Subsystem SCSI controller 2 on P1
U1.9-P1/Z2
37-08
I/O Subsystem SCSI controller 1 on P2
U1.9-P2/Z1
3b-08
I/O Subsystem SCSI controller 2 on P2
U1.9-P2/Z2
3s-08
DASD 4-pack Cage and card (1)
U1.9-P3
DASD 4-pack Cage and card (1) VPD
U1.9-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U1.9-P2/Z2-A8
3s-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U1.9-P2/Z2-A9
3s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U1.9-P2/Z2-Aa
3s-08-00-10,0
SCSI DASD 4 hdisk at ID A connected to controller 2 on P2
U1.9-P2/Z2-Ab
3s-08-00-11,0 Chapter
1.
Reference
Information
89
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI Enclosure Services SES connected to controller 2 on P2
U1.9-P2/Z2-Bf
3s-08-00-15,0
DASD 4-pack Cage and card (2)
U1.9-P4
DASD 4-pack Cage and card (2) VPD
U1.9-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U1.9-P2/Z1-A8
3b-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U1.9-P2/Z1-A9
3b-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U1.9-P2/Z1-Aa
3b-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U1.9-P2/Z1-Ab
3b-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U1.9-P2/Z1-Af
3b-08-00-15,0
DASD 4-pack Cage and card (3)
U1.9-P5
DASD 4-pack Cage and card (3) VPD
U1.9-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U1.9-P1/Z2-A8
37-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U1.9-P1/Z2-A9
37-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U1.9-P1/Z2-Aa
37-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U1.9-P1/Z2-Ab
37-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U1.9-P1/Z2-Af
37-08-00-15,0
DASD 4-pack Cage and card (4)
U1.9-P6
DASD 4-pack Cage and card (4) VPD
U1.9-P6-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U1.9-P1/Z1-A8
2s-08-00-8,0 90
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U1.9-P1/Z1-A9
2s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U1.9-P1/Z1-Aa
2s-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U1.9-P1/Z1-Ab
2s-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U1.9-P1/Z1-Af
2s-08-00-15,0
I/O Subsystem Fan (MSA)
U1.9-F1
I/O Subsystem Fan (MSA)
U1.9-F2
I/O Subsystem Fan (MSA)
U1.9-F3
I/O Subsystem Fan (MSA)
U1.9-F4
Media Drawer Power Connector
U1.9-P1-V1/Q3
Media Drawer Power Cable
U1.9-P1-V1/Q3#
Media Drawer Power Connector
U1.9-P2-V2/Q3
Media Drawer Power Cable
U1.9-P2-V2/Q3#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 2 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U1.5
I/O Subsystem Left IO backplane assembly
U1.5-P1
I/O Subsystem Left IO backplane assembly VPD
U1.5-P1-N1
EADS 1 - PCI Controller
U1.5-P1
40-58, 40-5A, 40-5C, 40-5E
PCI Slot 1 Content
U1.5-P1-I1
41-08 to 41-0For 42-xx or 43-xx
PCI Slot 2 Content
U1.5-P1-I2
44-08 to 44-0For 45-xx or 47-xx
PCI Slot 3 Content
U1.5-P1-I3
47-08 to 47-0For 48-xx or 49-xx Chapter
1.
Reference
Information
91
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 4 Content
U1.5-P1-I4
4A-08 to 4A-0For 4B-xx or 4C-xx
EADS 2 - PCI Controller
U1.5-P1
4E-58, 4E-5A, 4E-5E
PCI Slot 5 Content
U1.5-P1-I5
4F-08 to 4F-0For 4G-xx or 4H-xx
PCI Slot 6 Content
U1.5-P1-I6
4J-08 to 4J-0For 4K-xx or 4L-xx
PCI Slot 7 Content
U1.5-P1-I7
4Q-08 to 4Q-0For 4R-xx or 4S-xx
EADS 3 - PCI Controller
U1.5-P1
4U-58, 4U-5A, 4U-5E
PCI Slot 8 Content
U1.5-P1-I8
4V-08 to 4V-0For 4W-xx or 4X-xx
PCI Slot 9 Content
U1.5-P1-I9
4Y-08 to 4Y-0For 4Z-xx or 4a-xx
PCI Slot 10 Content
U1.5-P1-I10
4e-08 to 4e-0For 4f-xx or 4g-xx
I/O Riser Card
U1.5-P1
I/O Port Connector
U1.5-P1/Q1
I/O Port Cable From GX1A-port 1
U1.5-P1/Q1#
I/O Port Connector
U1.5-P1/Q2
I/O Port Cable to port 0 on next Riser card U1.5-P2.1/Q1
U1.5-P1/Q2#
I/O Subsystem Right IO backplane assembly
U1.5-P2
I/O Subsystem Right IO backplane assembly VPD
U1.5-P2-N1
EADS 1 - PCI Controller
U1.5-P2
4j-58, 4j-5A, 4j-5C, 4j-5E
PCI Slot 1 Content
U1.5-P2-I1
4k-08 to 4k-0For 4m-xx or 4n-xx
PCI Slot 2 Content
U1.5-P2-I2
4p-08 to 4p-0For 4q-xx or 4r-xx
PCI Slot 3 Content
U1.5-P2-I3
4s-08 to 4s-0For 4t-xx or 4t-xx
PCI Slot 4 Content
U1.5-P2-I4
4v-08 to 4v-0For 4t-xx or 4u-xx
EADS 2 - PCI Controller
U1.5-P2
50-58, 50-5A, 50-5E
PCI Slot 5 Content
U1.5-P2-I5
51-08 to 51-0For 52-xx or 53-xx
PCI Slot 6 Content
U1.5-P2-I6
54-08 to 54-0For 55-xx or 56-xx 92
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 7 Content
U1.5-P2-I7
5A-08 to 5A-0For 5B-xx or 5C-xx
EADS 3 - PCI Controller
U1.5-P2
5E-58, 5E-5A, 5E-5E
PCI Slot 8 Content
U1.5-P2-I8
5F-08 to 5F-0For 5G-xx or 5H-xx
PCI Slot 9 Content
U1.5-P2-I9
5J-08 to 5J-0For 5K-xx or 5L-xx
PCI Slot 10 Content
U1.5-P2-I10
5Q-08 to 5Q-0For 5R-xx or 5S-xx
I/O Riser Card
U1.5-P2.1
I/O Port Connector
U1.5-P2.1/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U1.5-P2.1/Q1#
I/O Port Connector
U1.5-P2.1/Q2
I/O Port Cable to GX1A-port 2
U1.5-P2.1/Q2#
I/O Subsystem DCA 1
U1.5-V1
DCA 1 to (BPA B) BPD1- 1A Connector P02
U1.5-V1/Q1
DCA 1 to (BPA B) BPD1- 1A Cable
U1.5-V1/Q1#
DCA 1 to (BPA A) BPD1- 1A Connector P02
U1.5-V1/Q2
DCA 1 to (BPA A) BPD1- 1A Cable
U1.5-V1/Q2#
Thermal Sensor
U1.5-V1
I/O Subsystem DCA 2
U1.5-V2
DCA 2 to (BPA B) BPD1- 1B Connector P03
U1.5-V2/Q1
DCA 2 to (BPA B) BPD1- 1B Cable
U1.5-V2/Q1#
DCA 2 to (BPA A) BPD1- 1B Connector P03
U1.5-V2/Q2
DCA 2 to (BPA A) BPD1- 1B Cable
U1.5-V2/Q2#
Thermal Sensor
U1.5-V2
I/O Subsystem SCSI controller 1 on P1
U1.5-P1/Z1
4M-08
I/O Subsystem SCSI controller 2 on P1
U1.5-P1/Z2
4b-08
I/O Subsystem SCSI controller 1 on P2
U1.5-P2/Z1
57-08
I/O Subsystem SCSI controller 2 on P2
U1.5-P2/Z2
5M-08 Chapter
1.
Reference
Information
93
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DASD 4-pack Cage and card (1)
U1.5-P3
DASD 4-pack Cage and card (1) VPD
U1.5-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U1.5-P2/Z2-A8
5M-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U1.5-P2/Z2-A9
5M-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U1.5-P2/Z2-Aa
5M-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U1.5-P2/Z2-Ab
5M-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U1.5-P2/Z2-Af
5M-08-00-15,0
DASD 4-pack Cage and card (2)
U1.5-P4
DASD 4-pack Cage and card (2) VPD
U1.5-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U1.5-P2/Z1-A8
57-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U1.5-P2/Z1-A9
57-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U1.5-P2/Z1-Aa
57-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U1.5-P2/Z1-Ab
57-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U1.5-P2/Z1-Af
57-08-00-15,0
DASD 4-pack Cage and card (3)
U1.5-P5
DASD 4-pack Cage and card (3) VPD
U1.5-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U1.5-P1/Z2-A8
4b-80-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U1.5-P1/Z2-A9
4b-80-00-9,0 94
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U1.5-P1/Z2-Aa
4b-80-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U1.5-P1/Z2-Ab
4b-80-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U1.5-P1/Z2-Af
4b-80-00-15,0
DASD 4-pack Cage and card (4)
U1.5-P6
DASD 4-pack Cage and card (4) VPD
U1.5-P6-N1
SCSI DASD 1 hdisk at ID8 connected to controller 1 on P1
U1.5-P1/Z1-A8
4M-08-00-8,0
SCSI DASD 2 hdisk at ID9 connected to controller 1 on P1
U1.5-P1/Z1-A9
4M-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U1.5-P1/Z1-Aa
4M-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U1.5-P1/Z1-Ab
4M-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U1.5-P1/Z1-Af
4M-08-00-15,0
I/O Subsystem Fan (MSA)
U1.5-F1
I/O Subsystem Fan (MSA)
U1.5-F2
I/O Subsystem Fan (MSA)
U1.5-F3
I/O Subsystem Fan (MSA)
U1.5-F4
Media Drawer Power Connector
U1.5-P1-X1/Q3
Media Drawer Power Cable
U1.5-P1-X1/Q3#
Media Drawer Power Connector
U1.5-P2-X2/Q3
Media Drawer Power Cable
U1.5-P2-X2/Q3#
Chapter
1.
Reference
Information
95
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 3 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U1.1
I/O Subsystem Left IO backplane assembly
U1.1-P1
I/O Subsystem Left IO backplane assembly VPD
U1.1-P1-N1
EADS 1 - PCI Controller
U1.1-P1
5U-58, 5U-5A, 5U-5C, 5U-5E
PCI Slot 1 Content
U1.1-P1-I1
5V-08 to 5V-0For 5W-xx or 5X-xx
PCI Slot 2 Content
U1.1-P1-I2
5Y-08 to 5Y-0For 5Z-xx or 5a-xx
PCI Slot 3 Content
U1.1-P1-I3
5b-08 to 5b-0For 5c-xx or 5d-xx
PCI Slot 4 Content
U1.1-P1-I4
5e-08 to 5e-0For 5f-xx or 5g-xx
EADS 2 - PCI Controller
U1.1-P1
5j-58, 5j-5A, 5j-5E
PCI Slot 5 Content
U1.1-P1-I5
5k-08 to 5k-0For 5m-xx or 5n-xx
PCI Slot 6 Content
U1.1-P1-I6
5p-08 to 5p-0For 5q-xx or 5r-xx
PCI Slot 7 Content
U1.1-P1-I7
5v-08 to 5v-0For 5w-xx or 5w-xx
EADS 3 - PCI Controller
U1.1-P1
60-58, 60-5A, 60-5E
PCI Slot 8 Content
U1.1-P1-I8
61-08 to 61-0For 62-xx or 63-xx
PCI Slot 9 Content
U1.1-P1-I9
64-08 to 64-0For 65-xx or 66-xx
PCI Slot 10 Content
U1.1-P1-I10
6A-08 to 6A-0For 6B-xx or 6C-xx
I/O Riser Card
U1.1-P1
I/O Port Connector
U1.1-P1/Q1
I/O Port Cable From GX1A-port 1
U1.1-P1/Q1#
I/O Port Connector
U1.1-P1/Q2
I/O Port Cable to port 0 on next Riser card U1.1-P2.1/Q1
U1.1-P1/Q2#
I/O Subsystem Right IO backplane assembly
U1.1-P2
I/O Subsystem Right IO backplane assembly VPD
U1.1-P2-N1
EADS 1 - PCI Controller
U1.1-P2
6E-58, 6E-5A, 6E-5C, 6E-5E 96
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 1 Content
U1.1-P2-I1
6F-08 to 6F-0For 6G-xx or 6H-xx
PCI Slot 2 Content
U1.1-P2-I2
6J-08 to 6J-0For 6K-xx or 6L-xx
PCI Slot 3 Content
U1.1-P2-I3
6M-08 to 6M-0For 6N-xx or 6P-xx
PCI Slot 4 Content
U1.1-P2-I4
6Q-08 to 6Q-0For 6R-xx or 6S-xx
EADS 2 - PCI Controller
U1.1-P2
6U-58, 6U-5A, 6U-5E
PCI Slot 5 Content
U1.1-P2-I5
6V-08 to 6V1-0For 6W-xx or 6F-xx
PCI Slot 6 Content
U1.1-P2-I6
6Y-08 to 6Y-0For 6Z-xx or 6a-xx
PCI Slot 7 Content
U1.1-P2-I7
6e-08 to 6Y-0For 6f-xx or 6g-xx
EADS 3 - PCI Controller
U1.1-P2
6j-58, 6j-5A, 6j-5E
PCI Slot 8 Content
U1.1-P2-I8
6k-08 to 6Y-0For 6m-xx or 6n-xx
PCI Slot 9 Content
U1.1-P2-I9
6p-08 to 6p-0For 6q-xx or 6r-xx
PCI Slot 10 Content
U1.1-P2-I10
6v-08 to 6v-0For 6w-xx or 6x-xx
I/O Riser Card
U1.1-P2
I/O Port Connector
U1.1-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U1.1-P2/Q1#
I/O Port Connector
U1.1-P2/Q2
I/O Port Cable to GX1A-port 2
U1.1-P2/Q2#
I/O Subsystem DCA 1
U1.1-V1
DCA 1 to (BPA B) BPD1- 1A Connector P04
U1.1-V1/Q1
DCA 1 to (BPA B) BPD1- 1A Cable
U1.1-V1/Q1#
DCA 1 to (BPA A) BPD1- 1A Connector P04
U1.1-V1/Q2
DCA 1 to (BPA A) BPD1- 1A Cable
U1.1-V1/Q2#
Thermal Sensor
U1.1-V1
I/O Subsystem DCA 2
U1.1-V2
DCA 2 to (BPA B) BPD1- 1B Connector P05
U1.1-V2/Q1
DCA 2 to (BPA B) BPD1- 1B Cable
U1.1-V2/Q1#
Chapter
1.
Reference
Information
97
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DCA 2 to (BPA A) BPD1- 1B Connector P05
U1.1-V2/Q2
DCA 2 to (BPA A) BPD1- 1B Cable
U1.1-V2/Q2#
Thermal Sensor
U1.1-V2
I/O Subsystem SCSI controller 1 on P1
U1.1-P1/Z1
5s-08
I/O Subsystem SCSI controller 2 on P1
U1.1-P1/Z2
67-08
I/O Subsystem SCSI controller 1 on P2
U1.1-P2/Z1
6b-08
I/O Subsystem SCSI controller 2 on P2
U1.1-P2/Z2
6s-08
DASD 4-pack Cage and card (1)
U1.1-P3
DASD 4-pack Cage and card (1) VPD
U1.1-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U1.1-P2/Z2-A8
6s-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U1.1-P2/Z2-A9
6s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U1.1-P2/Z2-Aa
6s-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U1.1-P2/Z2-Ab
6s-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U1.1-P2/Z2-Bf
6s-08-00-15,0
DASD 4-pack Cage and card (2)
U1.1-P4
DASD 4-pack Cage and card (2) VPD
U1.1-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U1.1-P2/Z1-A8
6b-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U1.1-P2/Z1-A9
6b-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U1.1-P2/Z1-Aa
6b-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U1.1-P2/Z1-Ab
6b-08-00-11,0 98
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI Enclosure Services SES connected to controller 1 on P2
U1.1-P2/Z1-Af
6b-08-00-15,0
DASD 4-pack Cage and card (3)
U1.1-P5
DASD 4-pack Cage and card (3) VPD
U1.1-P5/Z1-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U1.1-P1/Z2-A8
67-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U1.1-P1/Z2-A9
67-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U1.1-P1/Z2-Aa
67-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U1.1-P1/Z2-Ab
67-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U1.1-P1/Z2-Af
67-08-00-15,0
DASD 4-pack Cage and card (4)
U1.1-P6
DASD 4-pack Cage and card (4) VPD
U1.1-P6/Z2-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U1.1-P1/Z1-A8
5s-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U1.1-P1/Z1-A9
5s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U1.1-P1/Z1-Aa
5s-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U1.1-P1/Z1-Ab
5s-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U1.1-P1/Z1-Af
5s-08-00-15,0
I/O Subsystem Fan (MSA)
U1.1-F1
I/O Subsystem Fan (MSA)
U1.1-F2
I/O Subsystem Fan (MSA)
U1.1-F3
I/O Subsystem Fan (MSA)
U1.1-F4 Chapter
1.
Reference
Information
99
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
Media Drawer Power Connector
U1.1-P1-X1/Q3
Media Drawer Power Cable
U1.1-P1-X1/Q3#
Media Drawer Power Connector
U1.1-P2-X2/Q3
Media Drawer Power Cable
U1.1-P2-X2/Q3#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 4 Locations (If No IBF Installed) Note: If the Integrated Battery Feature (IBF) is installed, I/O subsystem 4 is in I/O subsystem 7 position. See page 105.
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U1.13
I/O Subsystem Left IO backplane assembly
U1.13-P1
I/O Subsystem Left IO backplane assembly VPD
U1.13-P1-N1
EADS 1 - PCI Controller
U1.13-P1
70-58, 70-5A, 70-5C, 70-5E
PCI Slot 1 Content
U1.13-P1-I1
71-08 to 71-0For 72-xx or 73-xx
PCI Slot 2 Content
U1.13-P1-I2
74-08 to 74-0For 75-xx or 76-xx
PCI Slot 3 Content
U1.13-P1-I3
77-08 to 77-0For 78-xx or 79-xx
PCI Slot 4 Content
U1.13-P1-I4
7A-08 to 7F-0For 7B-xx or 7C-xx
EADS 2 - PCI Controller
U1.13-P1
7E-58, 7E-5A, 7E-5E
PCI Slot 5 Content
U1.13-P1-I5
7F-08 to 7F-0For 7G-xx or 7H-xx
PCI Slot 6 Content
U1.13-P1-I6
7J-08 to 7J-0For 7K-xx or 7L-xx
PCI Slot 7 Content
U1.13-P1-I7
7Q-08 to 7Q-0For 7R-xx or 7S-xx
EADS 3 - PCI Controller
U1.13-P1
7U-58, 7U-5A, 7U-5E
PCI Slot 8 Content
U1.13-P1-I8
7V-08 to 7V-0For 7W-xx or 7X-xx
PCI Slot 9 Content
U1.13-P1-I9
7Y-08 to 7Y-0For 7Z-xx or 7a-xx
PCI Slot 10 Content
U1.13-P1-I10
7e-08 to 7e-0For 7f-xx or 7g-xx 100
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Riser Card
U1.13-P1
I/O Port Connector
U1.13-P1/Q1
I/O Port Cable From GX1A-port 1
U1.13-P1/Q1#
I/O Port Connector
U1.13-P1/Q2
I/O Port Cable to port 0 on next Riser card U1.13-P2.1/Q1
U1.13-P1/Q2#
I/O Subsystem Right IO backplane assembly
U1.13-P2
I/O Subsystem Right IO backplane assembly VPD
U1.13-P2-N1
EADS 1 - PCI Controller
U1.13-P2
7j-58, 7j-5A, 7j-5C, 7j-5E
PCI Slot 1
U1.13-P2-I1
7k-08 to 7k-0For 7m-xx or 7n-xx
PCI Slot 2
U1.13-P2-I2
7p-08 to 7p-0For 7q-xx or 7r-xx
PCI Slot 3
U1.13-P2-I3
7s-08 to 7s-0For 7t-xx or 7u-xx
PCI Slot 4
U1.13-P2-I4
7v-08 to 7v-0For 7w-xx or 7x-xx
EADS 2 - PCI Controller
U1.13-P2
80-58, 80-5A, 80-5E
PCI Slot 5
U1.13-P2-I5
81-08 to 81-0For 82-xx or 83-xx
PCI Slot 6
U1.13-P2-I6
84-08 to 84-0For 85-xx or 86-xx
PCI Slot 7
U1.13-P2-I7
8A-08 to 8A-0For 8B-xx or 8C-xx
EADS 3 - PCI Controller
U1.13-P2
8E-58, 8E-5A, 8E-5E
PCI Slot 8
U1.13-P2-I8
8F-08 to 8F-0For 8G-xx or 8H-xx
PCI Slot 9
U1.13-P2-I9
8J-08 to 8J-0For 8K-xx or 8L-xx
PCI Slot 10
U1.13-P2-I10
8Q-08 to 8Q-0For 8R-xx or 8S-xx
I/O Riser Card
U1.13-P2
I/O Port Connector
U1.13-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U1.13-P2/Q1#
I/O Port Connector
U1.13-P2/Q2
I/O Port Cable to GX1A-port 2
U1.13-P2/Q2#
I/O Subsystem DCA 1
U1.13-V1 Chapter
1.
Reference
Information
101
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DCA 1 to (BPA B) BPD1- 1A Connector P06
U1.13-V1/Q1
DCA 1 to (BPA B) BPD1- 1A Cable
U1.13-V1/Q1#
DCA 1 to (BPA A) BPD1- 1A Connector P06
U1.13-V1/Q2
DCA 1 to (BPA A) BPD1- 1A Cable
U1.13-V1/Q2#
Thermal Sensor
U1.13-V1
I/O Subsystem DCA 2
U1.13-V2
DCA 2 to (BPA B) BPD1- 1B Connector P07
U1.13-V2/Q1
DCA 2 to (BPA B) BPD1- 1B Cable
U1.13-V2/Q1#
DCA 2 to (BPA A) BPD1- 1B Connector P07
U1.13-V2/Q2
DCA 2 to (BPA A) BPD1- 1B Cable
U1.13-V2/Q2#
Thermal Sensor
U1.13-V2
I/O Subsystem SCSI controller 1 on P1
U1.13-P1/Z1
7M-08
I/O Subsystem SCSI controller 2 on P1
U1.13-P1/Z2
7b-08
I/O Subsystem SCSI controller 1 on P2
U1.13-P2/Z1
87-08
I/O Subsystem SCSI controller 2 on P2
U1.13-P2/Z2
8M-08
DASD 4-pack Cage and card (1)
U1.13-P3
DASD 4-pack Cage and card (1) VPD
U1.13-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U1.13-P2/Z2-A8
8M-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U1.13-P2/Z2-A9
8M-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U1.13-P2/Z2-Aa
8M-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U1.13-P2/Z2-Ab
8M-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U1.13-P2/Z2-Af
8M-08-00-15,0 102
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DASD 4-pack Cage and card (2)
U1.13-P4
DASD 4-pack Cage and card (2) VPD
U1.13-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U1.13-P2/Z1-A8
87-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U1.13-P2/Z1-A9
87-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U1.13-P2/Z1-Aa
87-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U1.13-P2/Z1-Ab
87-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U1.13-P2/Z1-Af
87-08-00-15,0
DASD 4-pack Cage and card (3)
U1.13-P5
DASD 4-pack Cage and card (3) VPD
U1.13-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U1.13-P1/Z2-A8
7b-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U1.13-P1/Z2-A9
7b-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U1.13-P1/Z2-Aa
7b-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U1.13-P1/Z2-Ab
7b-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U1.13-P1/Z2-Af
7b-08-00-15,0
DASD 4-pack Cage and card (4)
U1.13-P6
DASD 4-pack Cage and card (4) VPD
U1.13-P6-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U1.13-P1/Z1-A8
7M-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U1.13-P1/Z1-A9
7M-08-00-9,0 Chapter
1.
Reference
Information
103
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U1.13-P1/Z1-Aa
7M-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U1.13-P1/Z1-Ab
7M-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U1.13-P1/Z1-Af
7M-08-00-15,0
I/O Subsystem Fan (MSA)
U1.13-F1
I/O Subsystem Fan (MSA)
U1.13-F2
I/O Subsystem Fan (MSA)
U1.13-F3
I/O Subsystem Blower (MSA)
U1.13-F4
Media Drawer Power Connector
U1.13-P1-X1/Q3
Media Drawer Power Cable
U1.13-P1-X1/Q3#
Media Drawer Power Connector
U1.13-P2-X2/Q3
Media Drawer Power Cable
U1.13-P2-X2/Q3#
104
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 4 Locations (If IBF Installed) Note: If the Integrated Battery Feature (IBF) is not installed, I/O Subsystem 4 is in I/O Subsystem 4 position. See page 100.
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U2.9
I/O Subsystem Left IO backplane assembly
U2.9-P1
I/O Subsystem Left IO backplane assembly VPD
U2.9-P1-N1
EADS 1 - PCI Controller
U2.9-P1
70-58, 70-5A, 70-5C, 70-5E
PCI Slot 1 Content
U2.9-P1-I1
71-08 to 71-0For 72-xx or 73-xx
PCI Slot 2 Content
U2.9-P1-I2
74-08 to 74-0For 75-xx or 76-xx
PCI Slot 3 Content
U2.9-P1-I3
77-08 to 77-0For 78-xx or 79-xx
PCI Slot 4 Content
U2.9-P1-I4
7A-08 to 7A-0For 7B-xx or 7C-xx
EADS 2 - PCI Controller
U2.9-P1
7E-58, 7E-5A, 7E-5E
PCI Slot 5 Content
U2.9-P1-I5
7F-08 to 7F-0For 7G-xx or 7H-xx
PCI Slot 6 Content
U2.9-P1-I6
7J-08 to 7J-0For 7K-xx or 7L-xx
PCI Slot 7 Content
U2.9-P1-I7
7Q-08 to 7Q-0For 7R-xx or 7S-xx
EADS 3 - PCI Controller
U2.9-P1
7U-58, 7U-5A, 7U-5E
PCI Slot 8 Content
U2.9-P1-I8
7V-08 to 7V-0For 7W-xx or 7X-xx
PCI Slot 9 Content
U2.9-P1-I9
7Y-08 to 7Y-0For 7Z-xx or 7a-xx
PCI Slot 10 Content
U2.9-P1-I10
7e-08 to 7e-0For 7f-xx or 7g-xx
I/O Riser Card
U2.9-P1
I/O Port Connector
U2.9-P1/Q1
I/O Port Cable From GX1A-port 3
U2.9-P1/Q1#
I/O Port Connector
U2.9-P1/Q2
I/O Port Cable to port 0 on next Riser card U2.9-P2/Q1
U2.9-P1/Q2#
I/O Subsystem Right IO backplane assembly
U2.9-P2 Chapter
1.
Reference
Information
105
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem Right IO backplane assembly VPD
U2.9-P2-N1
EADS 1 - PCI Controller
U2.9-P2
7j-58, 7j-5A, 7j-5C, 7j-5E
PCI Slot 1 Content
U2.9-P2-I1
7k-08 to 7k-0For 7m-xx or 7n-xx
PCI Slot 2 Content
U2.9-P2-I2
7p-08 to 7p-0For 7q-xx or 7r-xx
PCI Slot 3 Content
U2.9-P2-I3
7s-08 to 7s-0For 7t-xx or 7u-xx
PCI Slot 4 Content
U2.9-P2-I4
7v-08 to 7v-0For 7w-xx or 7x-xx
EADS 2 - PCI Controller
U2.9-P2
80-58, 80-5A, 80-5E
PCI Slot 5 Content
U2.9-P2-I5
81-08 to 81-0For 82-xx or 83-xx
PCI Slot 6 Content
U2.9-P2-I6
84-08 to 84-0For 85-xx or 86-xx
PCI Slot 7 Content
U2.9-P2-I7
8A-08 to 8A-0For 8B-xx or 8C-xx
EADS 3 - PCI Controller
U2.9-P2
8E-58, 8E-5A, 8E-5E
PCI Slot 8 Content
U2.9-P2-I8
8F-08 to 8F-0For 8G-xx or 8H-xx
PCI Slot 9 Content
U2.9-P2-I9
8J-08 to 8J-0For 8K-xx or 8L-xx
PCI Slot 10 Content
U2.9-P2-I10
8Q-08 to 8Q-0For 8R-xx or 8S-xx
I/O Riser Card
U2.9-P2
I/O Port Connector
U2.9-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U2.9-P2/Q1#
I/O Port Connector
U2.9-P2/Q2
I/O Port Cable to GX1A-port 2
U2.9-P2/Q2#
I/O Subsystem DCA 1
U2.9-V1
DCA 1 to (BPA B) BPD2- 4A Connector P04
U2.9-V1/Q1
DCA 1 to (BPA B) BPD2- 4A Cable
U2.9-V1/Q1#
DCA 1 to (BPA A) BPD2- 4A Connector P04
U2.9-V1/Q2
DCA 1 to (BPA A) BPD2- 4A Cable
U2.9-V1/Q2#
Thermal Sensor
U2.9-V1
I/O Subsystem DCA 2
U2.9-V2 106
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DCA 2 to (BPA B) BPD2- 4B Connector P05
U2.9-V2/Q1
DCA 2 to (BPA B) BPD2- 4B Cable
U2.9-V2/Q1#
DCA 2 to (BPA A) BPD2- 4B Connector P05
U2.9-V2/Q2
DCA 2 to (BPA A) BPD2- 4B Cable
U2.9-V2/Q2#
Thermal Sensor
U2.9-V2
I/O Subsystem SCSI controller 1 on P1
U2.9-P1/Z1
7M-08
I/O Subsystem SCSI controller 2 on P1
U2.9-P1/Z2
7b-08
I/O Subsystem SCSI controller 1 on P2
U2.9-P2/Z1
87-08
I/O Subsystem SCSI controller 2 on P2
U2.9-P2/Z2
8M-08
DASD 4-pack Cage and card (1)
U2.9-P3
DASD 4-pack Cage and card (1) VPD
U2.9-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U2.9-P2/Z2-A8
8M-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U2.9-P2/Z2-A9
8M-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U2.9-P2/Z2-Aa
8M-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U2.9-P2/Z2-Ab
8M-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U2.9-P2/Z2-Af
8M-08-00-15,0
DASD 4-pack Cage and card (2)
U2.9-P4
DASD 4-pack Cage and card (2) VPD
U2.9-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U2.9-P2/Z1-A8
87-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U2.9-P2/Z1-A9
87-08-00-9,0 Chapter
1.
Reference
Information
107
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U2.9-P2/Z1-Aa
87-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U2.9-P2/Z1-Ab
87-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U2.9-P2/Z1-Af
87-08-00-15,0
DASD 4-pack Cage and card (3)
U2.9-P5
DASD 4-pack Cage and card (3) VPD
U2.9-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U2.9-P1/Z2-A8
7b-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U2.9-P1/Z2-A9
7b-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U2.9-P1/Z2-Aa
7b-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U2.9-P1/Z2-Ab
7b-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U2.9-P1/Z2-Af
7b-08-00-15,0
DASD 4-pack Cage and card (4)
U2.9-P6
DASD 4-pack Cage and card (4) VPD
U2.9-P6-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U2.9-P1/Z1-A8
7M-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U2.9-P1/Z1-A9
7M-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U2.9-P1/Z1-Aa
7M-08-0010,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U2.9-P1/Z1-Ab
7M-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U2.9-P1/Z1-Af
7M-08-00-15,0
I/O Subsystem Fan (MSA)
U2.9-F1 108
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem Fan (MSA)
U2.9-F2
I/O Subsystem Fan (MSA)
U2.9-F3
I/O Subsystem Fan (MSA)
U2.9-F4
Media Drawer Power Connector
U2.9-P1-X1/Q3
Media Drawer Power Cable
U2.9-P1-X1/Q3#
Media Drawer Power Connector
U2.9-P2-X2/Q3
Media Drawer Power Cable
U2.9-P2-X2/Q3#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 5 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U2.1
I/O Subsystem Left IO backplane assembly
U2.1-P1
I/O Subsystem Left IO backplane assembly VPD
U2.1-P1-N1
EADS 1 - PCI Controller
U2.1-P1
8U-58, 8U-5A, 8U-5C, 8U-5E
PCI Slot 1 Content
U2.1-P1-I1
8V-08 to 8V-0For 8W-xx or 8X-xx
PCI Slot 2 Content
U2.1-P1-I2
8Y-08 to 8Y-0For 8Z-xx or 8a-xx
PCI Slot 3 Content
U2.1-P1-I3
8b-08 to 8b-0For 8c-xx or 8d-xx
PCI Slot 4 Content
U2.1-P1-I4
8e-08 to 8e-0For 8f-xx or 8g-xx
EADS 2 - PCI Controller
U2.1-P1
8j-58, 8j-5A, 8j-5E
PCI Slot 5 Content
U2.1-P1-I5
8k-08 to 8k-0For 8m-xx or 8n-xx
PCI Slot 6 Content
U2.1-P1-I6
8p-08 to 8p-0For 8q-xx or 8r-xx
PCI Slot 7 Content
U2.1-P1-I7
8v-08 to 8v-0For 8w-xx or 8x-xx
EADS 3 - PCI Controller
U2.1-P1
90-58, 90-5A, 90-5E
PCI Slot 8 Content
U2.1-P1-I8
91-08 to 91-0For 92-xx or 93-xx Chapter
1.
Reference
Information
109
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 9 Content
U2.1-P1-I9
94-08 to 94-0For 95-xx or 96-xx
PCI Slot 10 Content
U2.1-P1-I10
9A-08 to 9A-0For 9B-xx or 9C-xx
I/O Riser Card
U2.1-P1
I/O Port Connector
U2.1-P1/Q1
I/O Port Cable From GX1A-port 1
U2.1-P1/Q1#
I/O Port Connector
U2.1-P1/Q2
I/O Port Cable to port 0 on next Riser card U2.1-P2.1/Q1
U2.1-P1/Q2#
I/O Subsystem Right IO backplane assembly
U2.1-P2
I/O Subsystem Right IO backplane assembly VPD
U2.1-P2-N1
EADS 1 - PCI Controller
U2.1-P2
9E-58, 9E-5A, 9E-5C, 9E-5E
PCI Slot 1 Content
U2.1-P2-I1
9F-08 to 9F-0For 9G-xx or 9H-xx
PCI Slot 2 Content
U2.1-P2-I2
9J-08 to 9J-0For 9K-xx or 9L-xx
PCI Slot 3 Content
U2.1-P2-I3
9M-08 to 9M-0For 9N-xx or 9P-xx
PCI Slot 4 Content
U2.1-P2-I4
9Q-08 to 9Q-0For 9R-xx or 9S-xx
EADS 2 - PCI Controller
U2.1-P2
9U-58, 9U-5A, 9U-5E
PCI Slot 5 Content
U2.1-P2-I5
9V-08 to 9V-0For 9W-xx or 9X-xx
PCI Slot 6 Content
U2.1-P2-I6
9Y-08 to 9Y-0For 9Z-xx or 9a-xx
PCI Slot 7 Content
U2.1-P2-I7
9e-08 to 91-0For 9Z-xx or 9a-xx
EADS 3 - PCI Controller
U2.1-P2
9j-58, 9j-5A, 9j-5E
PCI Slot 8 Content
U2.1-P2-I8
9k-08 to 9k-0For 9m-xx or 9n-xx
PCI Slot 9 Content
U2.1-P2-I9
9p-08 to 9p-0For 9q-xx or 9r-xx
PCI Slot 10 Content
U2.1-P2-I10
9v-08 to 9v-0For 9w-xx or 9x-xx
I/O Riser Card
U2.1-P2
I/O Port Connector
U2.1-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U2.1-P2/Q1# 110
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Port Connector
U2.1-P2/Q2
I/O Port Cable to GX1A-port 2
U2.1-P2/Q2#
I/O Subsystem DCA 1
U2.1-V1
DCA 1 to (BPA B) BPD1- 5A Connector P00
U2.1-V1/Q1
DCA 1 to (BPA B) BPD1- 5A Cable
U2.1-V1/Q1#
DCA 1 to (BPA A) BPD1- 5A Connector P00
U2.1-V1/Q2
DCA 1 to (BPA A) BPD1- 5A Cable
U2.1-V1/Q2#
Thermal Sensor
U2.1-V1
I/O Subsystem DCA 2
U2.1-V2
DCA 2 to (BPA B) BPD1- 5B Connector P01
U2.1-V2/Q1
DCA 2 to (BPA B) BPD1- 5B Cable
U2.1-V2/Q1#
DCA 2 to (BPA A) BPD1- 5B Connector P01
U2.1-V2/Q2
DCA 2 to (BPA A) BPD1- 5B Cable
U2.1-V2/Q2#
Thermal Sensor
U2.1-V2
I/O Subsystem SCSI controller 1 on P1
U2.1-P1/Z1
8s-08
I/O Subsystem SCSI controller 2 on P1
U2.1-P1/Z2
97-08
I/O Subsystem SCSI controller 1 on P2
U2.1-P2/Z1
9b-08
I/O Subsystem SCSI controller 2 on P2
U2.1-P2/Z2
9s-08
DASD 4-pack Cage and card (1)
U2.1-P3
DASD 4-pack Cage and card (1) VPD
U2.1-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U2.1-P2/Z2-A8
9s-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U2.1-P2/Z2-A9
9s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U2.1-P2/Z2-Aa
9s-08-00-10,0 Chapter
1.
Reference
Information
111
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U2.1-P2/Z2-Ab
9s-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U2.1-P2/Z2-Af
9s-08-00-15,0
DASD 4-pack Cage and card (2)
U2.1-P
DASD 4-pack Cage and card (2) VPD
U2.1-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U2.1-P2/Z1-A8
9b-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U2.1-P2/Z1-A9
9b-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U2.1-P2/Z1-Aa
9b-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U2.1-P2/Z1-Ab
9b-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U2.1-P2/Z1-Af
9b-08-00-15,0
DASD 4-pack Cage and card (3)
U2.1-P5
DASD 4-pack Cage and card (3) VPD
U2.1-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U2.1-P1/Z2-A8
97-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U2.1-P1/Z2-A9
97-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U2.1-P1/Z2-Aa
97-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U2.1-P1/Z2-Ab
97-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U2.1-P1/Z2-Af
97-08-00-15,0
DASD 4-pack Cage and card (4)
U2.1-P6
DASD 4-pack Cage and card (4) VPD
U2.1-P6-N1 112
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U2.1-P1/Z1-A8
8s-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U2.1-P1/Z1-A9
8s-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U2.1-P1/Z1-Aa
8s-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U2.1-P1/Z1-Ab
8s-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U2.1-P1/Z1-Af
8s-08-00-15,0
I/O Subsystem Fan (MSA)
U2.1-F1
I/O Subsystem Fan (MSA)
U2.1-F2
I/O Subsystem Fan (MSA)
U2.1-F3
I/O Subsystem Fan (MSA)
U2.1-F4
Media Drawer Power Connector
U2.1-P1-X1/Q3
Media Drawer Power Cable
U2.1-P1-X1/Q3#
Media Drawer Power Connector
U2.1-P2-X2/Q3
Media Drawer Power Cable
U2.1-P2-X2/Q3#
Chapter
1.
Reference
Information
113
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 6 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U2.5
I/O Subsystem Left IO backplane assembly
U2.5-P1
I/O Subsystem Left IO backplane assembly VPD
U2.5-P1-N1
EADS 1 - PCI Controller
U2.5-P1
A0-58, A0-5A, A0-5C, A0-5E
PCI Slot 1 Content
U2.5-P1-I1
A1-08 to A1-0For A2-xx or A3-xx
PCI Slot 2 Content
U2.5-P1-I2
A4-08 to A4-0For A5-xx or A6-xx
PCI Slot 3 Content
U2.5-P1-I3
A7-08 to A7-0For A8-xx or A9-xx
PCI Slot 4 Content
U2.5-P1-I4
AA-08 to AA-0For AB-xx or AC-xx
EADS 2 - PCI Controller
U2.5-P1
AE-58, AE-5A, AE-5E
PCI Slot 5 Content
U2.5-P1-I5
AF-08 to AF-0For AG-xx or AH-xx
PCI Slot 6 Content
U2.5-P1-I6
AJ-08 to AJ-0For AK-xx or AL-xx
PCI Slot 7 Content
U2.5-P1-I7
AQ-08 to AQ-0For AR-xx or AS-xx
EADS 3 - PCI Controller
U2.5-P1
AU-58, AU-5A, AU-5E
PCI Slot 8 Content
U2.5-P1-I8
AV-08 to AV-0For AW-xx or AX-xx
PCI Slot 9 Content
U2.5-P1-I9
AY-08 to AY-0For AZ-xx or Aa-xx
PCI Slot 10 Content
U2.5-P1-I10
Ae-08 to Ae-0For Af-xx or Ag-xx
I/O Riser Card
U2.5-P1
I/O Port Connector
U2.5-P1/Q1
I/O Port Cable From GX1A-port 1
U2.5-P1/Q1#
I/O Port Connector
U2.5-P1/Q2
I/O Port Cable to port 0 on next Riser card U2.5-P2.1/Q1
U2.5-P1/Q2#
I/O Subsystem Right IO backplane assembly
U2.5-P2
I/O Subsystem Right IO backplane assembly VPD
U2.5-P2-N1 114
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
EADS 1 - PCI Controller
U2.5-P2
Aj-58, Aj-5A, Aj-5C, Aj-5E
PCI Slot 1 Content
U2.5-P2-I1
Ak-08 to Ak-0For Am-xx or An-xx
PCI Slot 2 Content
U2.5-P2-I2
Ap-08 to Ap-0For Aq-xx or Ar-xx
PCI Slot 3 Content
U2.5-P2-I3
As-08 to As-0For At-xx or Au-xx
PCI Slot 4 Content
U2.5-P2-I4
Av-08 to Av-0For Aw-xx or Ax-xx
EADS 2 - PCI Controller
U2.5-P2
B0-58, B0-5A, B0-5E
PCI Slot 5 Content
U2.5-P2-I5
B1-08 to B1-0For B2-xx or B3-xx
PCI Slot 6 Content
U2.5-P2-I6
B4-08 to B4-0For B5-xx or B6-xx
PCI Slot 7 Content
U2.5-P2-I7
BA-08 to BA-0For BB-xx or BC-xx
EADS 3 - PCI Controller
U2.5-P2
BE-58, BE-5A, BE-5E
PCI Slot 8 Content
U2.5-P2-I8
BF-08 to BF-0For BG-xx or BH-xx
PCI Slot 9 Content
U2.5-P2-I9
BJ-08 to BJ-0For BK-xx or BL-xx
PCI Slot 10 Content
U2.5-P2-I10
BQ-08 to BQ-0For BR-xx or BS-xx
I/O Riser Card
U2.5-P2
I/O Port Connector
U2.5-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U2.5-P2/Q1#
I/O Port Connector
U2.5-P2/Q2
I/O Port Cable to GX1A-port 2
U2.5-P2/Q2#
I/O Subsystem DCA 1
U2.5-V1
DCA 1 to (BPA B) BPD2- 6A Connector P02
U2.5-V1/Q1
DCA 1 to (BPA B) BPD2- 6A Cable
U2.5-V1/Q1#
DCA 1 to (BPA A) BPD2- 6A Connector P02
U2.5-V1/Q2
DCA 1 to (BPA A) BPD2- 6A Cable
U2.5-V1/Q2#
Thermal Sensor
U2.5-V1
I/O Subsystem DCA 2
U2.5-V2 Chapter
1.
Reference
Information
115
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
DCA 2 to (BPA B) BPD2- 6B Connector P03
U2.5-V2/Q1
DCA 2 to (BPA B) BPD2- 6B Cable
U2.5-V2/Q1#
DCA 2 to (BPA A) BPD2- 6B Connector P03
U2.5-V2/Q2
DCA 2 to (BPA A) BPD2- 6B Cable
U2.5-V2/Q2#
Thermal Sensor
U2.5-V2
I/O Subsystem SCSI controller 1 on P1
U2.5-P1/Z1
AM-08
I/O Subsystem SCSI controller 2 on P1
U2.5-P1/Z2
Ab-08
I/O Subsystem SCSI controller 1 on P2
U2.5-P2/Z1
B7-08
I/O Subsystem SCSI controller 2 on P2
U2.5-P2/Z2
BM-08
DASD 4-pack Cage and card (1)
U2.5-P3
DASD 4-pack Cage and card (1) VPD
U2.5-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U2.5-P2/Z2-A8
BM-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U2.5-P2/Z2-A9
BM-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U2.5-P2/Z2-Aa
BM-08-00-A,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U2.5-P2/Z2-Ab
BM-08-00-B,0
SCSI Enclosure Services SES connected to controller 2 on P2
U2.5-P2/Z2-Af
BM-08-00-F,0
DASD 4-pack Cage and card (2)
U2.5-P4
DASD 4-pack Cage and card (2) VPD
U2.5-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U2.5-P2/Z1-A8
B7-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U2.5-P2/Z1-A9
B7-08-00-9,0 116
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U2.5-P2/Z1-Aa
B7-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U2.5-P2/Z1-Ab
B7-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U2.5-P2/Z1-Af
B7-08-00-15,0
DASD 4-pack Cage and card (3)
U2.5-P5
DASD 4-pack Cage and card (3) VPD
U2.5-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U2.5-P1/Z2-A8
Ab-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U2.5-P1/Z2-A9
Ab-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U2.5-P1/Z2-Aa
Ab-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U2.5-P1/Z2-Ab
Ab-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U2.5-P1/Z2-Af
Ab-08-00-15,0
DASD 4-pack Cage and card (4)
U2.5-P6
DASD 4-pack Cage and card (4) VPD
U2.5-P6-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U2.5-P1/Z1-A8
AM-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U2.5-P1/Z1-A9
AM-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U2.5-P1/Z1-Aa
AM-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U2.5-P1/Z1-Ab
AM-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U2.5-P1/Z1-Af
AM-08-00-15,0
I/O Subsystem Fan (MSA)
U2.5-F1 Chapter
1.
Reference
Information
117
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem Fan (MSA)
U2.5-F2
I/O Subsystem Fan (MSA)
U2.5-F3
I/O Subsystem Fan (MSA)
U2.5-F4
Media Drawer Power Connector
U2.5-P1-X1/Q3
Media Drawer Power Cable
U2.5-P1-X1/Q3#
Media Drawer Power Connector
U2.5-P2-X2/Q3
Media Drawer Power Cable
U2.5-P2-X2/Q3#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 7 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U2.13
I/O Subsystem Left IO backplane assembly
U2.13-P1
I/O Subsystem Left IO backplane assembly VPD
U2.13-P1-N1
EADS 1 - PCI Controller
U2.13-P1
BU-58, BU-5A, BU-5C, BU-5E
PCI Slot 1 Content
U2.13-P1-I1
BV-08 to BV-0For BW-xx or BX-xx
PCI Slot 2 Content
U2.13-P1-I2
BY-08 to BY-0F or BZ-xx or Ba-xx
PCI Slot 3 Content
U2.13-P1-I3
Bb-08 to Bb-0F or Bc-xx or Bd-xx
PCI Slot 4 Content
U2.13-P1-I4
Be-08 to Be-0F or Bf-xx or Bg-xx
EADS 2 - PCI Controller
U2.13-P1
Bj-58, Bj-5A, Bj-5E
PCI Slot 5 Content
U2.13-P1-I5
Bk-08 to Bk-0F or Bm-xx or Bn-xx
PCI Slot 6 Content
U2.13-P1-I6
Bp-08 to Bp-0F or Bq-xx or Br-xx
PCI Slot 7 Content
U2.13-P1-I7
Bv-08 to Bv-0F or Bw-xx or Bx-xx
EADS 3 - PCI Controller
U2.13-P1
C0-58, C0-5A, C0-5E
PCI Slot 8 Content
U2.13-P1-I8
C1-08 to C1-0F or C2-xx or C3-xx 118
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 9 Content
U2.13-P1-I9
C4-08 to C4-0F or C5-xx or C6-xx
PCI Slot 10 Content
U2.13-P1-I10
CA-08 to CA-0F or CB-xx or CC-xx
I/O Riser Card
U2.13-P1
I/O Port Connector
U2.13-P1/Q1
I/O Port Cable From GX1A-port 1
U2.13-P1/Q1#
I/O Port Connector
U2.13-P1/Q2
I/O Port Cable to port 0 on next Riser card U2.13-P2.1/Q1
U2.13-P1/Q2#
I/O Subsystem Right IO backplane assembly
U2.13-P2
I/O Subsystem Right IO backplane assembly VPD
U2.13-P2-N1
EADS 1 - PCI Controller
U2.13-P2
CE-58, CE-5A, CE-5C, CE-5E
PCI Slot 1 Content
U2.13-P2-I1
CF-08 to CF-0F or CG-xx or CH-xx
PCI Slot 2 Content
U2.13-P2-I2
CJ-08 to CJ-0F or CK-xx or CL-xx
PCI Slot 3 Content
U2.13-P2-I3
CM-08 to CM-0F or CN-xx or CP-xx
PCI Slot 4 Content
U2.13-P2-I4
CQ-08 to CQ-0F or CR-xx or CS-xx
EADS 2 - PCI Controller
U2.13-P2
CU-58, CU-5A, CU-5E
PCI Slot 5 Content
U2.13-P2-I5
CV-08 to CV-0F or CW-xx or CX-xx
PCI Slot 6 Content
U2.13-P2-I6
CY-08 to CY-0F or CZ-xx or Ca-xx
PCI Slot 7 Content
U2.13-P2-I7
Ce-08 to Ce-0F or Cf-xx or Cg-xx
EADS 3 - PCI Controller
U2.13-P2
Cj-58, Cj-5A, Cj-5E
PCI Slot 8 Content
U2.13-P2-I8
Ck-08 to Ck-0F or Cm-xx or Cn-xx
PCI Slot 9 Content
U2.13-P2-I9
Cp-08 to Cp-0F or Cq-xx or Cr-xx
PCI Slot 10 Content
U2.13-P2-I10
Cv-08 to Cv-0F or Cw-xx or Cx-xx
I/O Riser Card
U2.13-P2
I/O Port Connector
U2.13-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U2.13-P2/Q1# Chapter
1.
Reference
Information
119
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Port Connector
U2.13-P2/Q2
I/O Port Cable to GX1A-port 2
U2.13-P2/Q2#
I/O Subsystem DCA 1
U2.13-V1
DCA 1 to (BPA B) BPD2- 7A Connector P02
U2.13-V1/Q1
DCA 1 to (BPA B) BPD2- 7A Cable
U2.13-V1/Q1#
DCA 1 to (BPA A) BPD2- 7A Connector P02
U2.13-V1/Q2
DCA 1 to (BPA A) BPD2- 7A Cable
U2.13-V1/Q2#
Thermal Sensor
U2.13-V1
I/O Subsystem DCA 2
U2.13-V2
DCA 2 to (BPA B) BPD2- 7B Connector P03
U2.13-V2/Q1
DCA 2 to (BPA B) BPD2- 7B Cable
U2.13-V2/Q1#
DCA 2 to (BPA A) BPD2- 7B Connector P03
U2.13-V2/Q2
DCA 2 to (BPA A) BPD2- 7B Cable
U2.13-V2/Q2#
Thermal Sensor
U2.13-V2
I/O Subsystem SCSI controller 1 on P1
U2.13-P1/Z1
Bs-08
I/O Subsystem SCSI controller 2 on P1
U2.13-P1/Z2
C7-08
I/O Subsystem SCSI controller 1 on P2
U2.13-P2/Z1
Cb-08
I/O Subsystem SCSI controller 2 on P2
U2.13-P2/Z2
Cs-08
DASD 4-pack Cage and card (1)
U2.13-P3
DASD 4-pack Cage and card (1) VPD
U2.13-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U2.13-P2/Z2-A8
Cs-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U2.13-P2/Z2-A9
Cs-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U2.13-P2/Z2-Aa
Cs-08-00-10,0 120
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U2.13-P2/Z2-Ab
Cs-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U2.13-P2/Z2-Af
Cs-08-00-15,0
DASD 4-pack Cage and card (2)
U2.13-P4
DASD 4-pack Cage and card (2) VPD
U2.13-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U2.13-P2/Z1-A8
Cb-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U2.13-P2/Z1-A9
Cb-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U2.13-P2/Z1-Aa
Cb-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U2.13-P2/Z1-Ab
Cb-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U2.13-P2/Z1-Af
Cb-08-00-15,0
DASD 4-pack Cage and card (3)
U2.13-P5
DASD 4-pack Cage and card (3) VPD
U2.13-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U2.13-P1/Z2-A8
C7-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U2.13-P1/Z2-A9
C7-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U2.13-P1/Z2-Aa
C7-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U2.13-P1/Z2-Ab
C7-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U2.13-P1/Z2-Af
C7-08-00-15,0
DASD 4-pack Cage and card (4)
U2.13-P6
DASD 4-pack Cage and card (4) VPD
U2.13-P6-N1 Chapter
1.
Reference
Information
121
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U2.13-P1/Z1-A8
Bs-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U2.13-P1/Z1-A9
Bs-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U2.13-P1/Z1-Aa
Bs-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U2.13-P1/Z1-Ab
Bs-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U2.13-P1/Z1-Af
Bs-08-00-15,0
I/O Subsystem Fan (MSA)
U2.13-F1
I/O Subsystem Fan (MSA)
U2.13-F2
I/O Subsystem Fan (MSA)
U2.13-F3
I/O Subsystem Fan (MSA)
U2.13-F4
Media Drawer Power Connector
U2.13-P1-X1/Q3
Media Drawer Power Cable
U2.13-P1-X1/Q3#
Media Drawer Power Connector
U2.13-P2-X2/Q3
Media Drawer Power Cable
U2.13-P2-X2/Q3#
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem 8 Locations
I/O Subsystem 1 Chassis & Midplane Card
(MT/M Serial #) U2.19
I/O Subsystem Left IO backplane assembly
U2.19-P1
I/O Subsystem Left IO backplane assembly VPD
U2.19-P1-N1
EADS 1 - PCI Controller
U2.19-P1
D0-58, D0-5A, D0-5C, D0-5E
PCI Slot 1 Content
U2.19-P1-I1
D1-08 to D1-0F or D2-xx or D3-xx 122
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 2 Content
U2.19-P1-I2
D4-08 to D4-0F or D5-xx or D6-xx
PCI Slot 3 Content
U2.19-P1-I3
D7-08 to D7-0F or D8-xx or D9-xx
PCI Slot 4 Content
U2.19-P1-I4
DA-08 to DA-0F or DB-xx or DC-xx
EADS 2 - PCI Controller
U2.19-P1
DE-58, DE-5A, DE-5E
PCI Slot 5 Content
U2.19-P1-I5
DF-08 to DF-0F or DG-xx or DH-xx
PCI Slot 6 Content
U2.19-P1-I6
DJ-08 to DJ-0F or DK-xx or DL-xx
PCI Slot 7 Content
U2.19-P1-I7
DQ-08 to DQ-0F or DR-xx or DS-xx
EADS 3 - PCI Controller
U2.19-P1
DU-58, DU-5A, DU-5E
PCI Slot 8 Content
U2.19-P1-I8
DV-08 to DV-0F or DW-xx or DX-xx
PCI Slot 9 Content
U2.19-P1-I9
DY-08 to DY-0F or DZ-xx or Da-xx
PCI Slot 10 Content
U2.19-P1-I10
De-08 to De-0F or Df-xx or Dg-xx
I/O Riser Card
U2.19-P1
I/O Port Connector
U2.19-P1/Q1
I/O Port Cable From GX1A-port 1
U2.19-P1/Q1#
I/O Port Connector
U2.19-P1/Q2
I/O Port Cable to port 0 on next Riser card U2.19-P2.1/Q1
U2.19-P1/Q2#
I/O Subsystem Right IO backplane assembly
U2.19-P2
I/O Subsystem Right IO backplane assembly VPD
U2.19-P2-N1
EADS 1 - PCI Controller
U2.19-P2
Dj-58, Dj-5A, Dj-5C, Dj-5E
PCI Slot 1 Content
U2.19-P2-I1
Dk-08 to Dk-0F or Dm-xx or Dn-xx
PCI Slot 2 Content
U2.19-P2-I2
Dp-08 to Dp-0F or Dq-xx or Dr-xx
PCI Slot 3 Content
U2.19-P2-I3
Ds-08 to Ds-0F or Dt-xx or Du-xx
PCI Slot 4 Content
U2.19-P2-I4
Dv-08 to Dv-0F or Dw-xx or Dx-xx
EADS 2 - PCI Controller
U2.19-P2
E0-58, E0-5A, E0-5E Chapter
1.
Reference
Information
123
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
PCI Slot 5 Content
U2.19-P2-I5
E1-08 to E1-0F or E2-xx or E3-xx
PCI Slot 6 Content
U2.19-P2-I6
E4-08 to E4-0F or E5-xx or E6-xx
PCI Slot 7 Content
U2.19-P2-I7
EA-08 to EA-0F or EB-xx or EC-xx
EADS 3 - PCI Controller
U2.19-P2
EE-58, EE-5A, EE-5E
PCI Slot 8 Content
U2.19-P2-I8
EF-08 to EF-0F or EG-xx or EH-xx
PCI Slot 9 Content
U2.19-P2-I9
EJ-08 to EJ-0F or EK-xx or EL-xx
PCI Slot 10 Content
U2.19-P2-I10
EQ-08 to EQ-0F or ER-xx or ES-xx
I/O Riser Card
U2.19-P2
I/O Port Connector
U2.19-P2/Q1
I/O Port Cable From Riser port 1 P1.1/Q1
U2.19-P2/Q1#
I/O Port Connector
U2.19-P2/Q2
I/O Port Cable to GX1A-port 2
U2.19-P2/Q2#
I/O Subsystem DCA 1
U2.19-V1
DCA 1 to (BPA B) BPD2- 8A Connector P02
U2.19-V1/Q1
DCA 1 to (BPA B) BPD2- 8A Cable
U2.19-V1/Q1#
DCA 1 to (BPA C) BPD2- 8A Connector P02
U2.19-V1/Q2
DCA 1 to (BPA C) BPD2- 8A Cable
U2.19-V1/Q2#
Thermal Sensor
U2.19-V1
I/O Subsystem DCA 2
U2.19-V2
DCA 2 to (BPA B) BPD2- 8B Connector P03
U2.19-V2/Q1
DCA 2 to (BPA B) BPD2- 8B Cable
U2.19-V2/Q1#
DCA 2 to (BPA A) BPD2- 8B Connector P03
U2.19-V2/Q2
DCA 2 to (BPA A) BPD2- 8B Cable
U2.19-V2/Q2#
Thermal Sensor
U2.19-V2
I/O Subsystem SCSI controller 1 on P1
U2.19-P1/Z1
DM-08
I/O Subsystem SCSI controller 2 on P1
U2.19-P1/Z2
Db-08 124
406/90
Service
Guide
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
I/O Subsystem SCSI controller 1 on P2
U2.19-P2/Z1
E7-08
I/O Subsystem SCSI controller 2 on P2
U2.19-P2/Z2
EM-08
DASD 4-pack Cage and card (1)
U2.19-P3
DASD 4-pack Cage and card (1) VPD
U2.19-P3-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2
U2.19-P2/Z2-A8
EM-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2
U2.19-P2/Z2-A9
EM-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P2
U2.19-P2/Z2-Aa
EM-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P2
U2.19-P2/Z2-Ab
EM-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P2
U2.19-P2/Z2-Af
EM-08-00-15,0
DASD 4-pack Cage and card (2)
U2.19-P4
DASD 4-pack Cage and card (2) VPD
U2.19-P4-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2
U2.19-P2/Z1-A8
E7-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2
U2.19-P2/Z1-A9
E7-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P2
U2.19-P2/Z1-Aa
E7-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P2
U2.19-P2/Z1-Ab
E7-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P2
U2.19-P2/Z1-Af
E7-08-00-15,0
DASD 4-pack Cage and card (3)
U2.19-P5
DASD 4-pack Cage and card (3) VPD
U2.19-P5-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1
U2.19-P1/Z2-A8
Db-08-00-8,0 Chapter
1.
Reference
Information
125
FRU Name
Location Code
AIX Location Code
Physical Connection
Logical Connection
SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1
U2.19-P1/Z2-A9
Db-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 2 on P1
U2.19-P1/Z2-Aa
Db-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 2 on P1
U2.19-P1/Z2-Ab
Db-08-00-11,0
SCSI Enclosure Services SES connected to controller 2 on P1
U2.19-P1/Z2-Af
Db-08-00-15,0
DASD 4-pack Cage and card (4)
U2.19-P6
DASD 4-pack Cage and card (4) VPD
U2.19-P6-N1
SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1
U2.19-P1/Z1-A8
DM-08-00-8,0
SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1
U2.19-P1/Z1-A9
DM-08-00-9,0
SCSI DASD 3 hdisk at ID A connected to controller 1 on P1
U2.19-P1/Z1-Aa
DM-08-00-10,0
SCSI DASD 4 hdisk at ID B connected to controller 1 on P1
U2.19-P1/Z1-Ab
DM-08-00-11,0
SCSI Enclosure Services SES connected to controller 1 on P1
U2.19-P1/Z1-Af
DM-08-00-15,0
I/O Subsystem Fan (MSA)
U2.19-F1
I/O Subsystem Fan (MSA)
U2.19-F2
I/O Subsystem Fan (MSA)
U2.19-F3
I/O Subsystem Fan (MSA)
U2.19-F4
Media Drawer Power Connector
U2.19-P1-X1/Q3
Media Drawer Power Cable
U2.19-P1-X1/Q3#
Media Drawer Power Connector
U2.19-P2-X2/Q3
Media Drawer Power Cable
U2.19-P2-X2/Q3#
126
406/90
Service
Guide
System Requirements for Adding Processor MCMs
Processor MCM
Total BUS Pass-Thru Modules (FC5257) Required, Locations
Total DCA, Power Converter (FC 6170 and 6189) Required, Locations
Total Capacitor Books (FC 6198) Required Locations
L3 Cache Plug Positions
VPD Locations
Memory Book Plug Locations
Add 1st MCM Position C1 Note: 1.7 GHz MCMs require 1x FC 6189 at location U1.18-P1-V4. All other MCM speeds do not require FC 6189.
Qty = 0
FC 6170
(2X) U1.18-P1-V1 U1.18-P1-V2
Qty = 1 U1.18-P1-V3
C6, C7, C10, C11
U1.18-P1-H2.3
U1.18-P1-M2
U1.18-P1-M3
Add 2nd MCM Position C4 Note: 1.7 GHz MCMs require additional FC 6189 (2x total) at location U1.18-P1-V5. All other MCM speeds require 1x FC 6189 as shown.
Qty = 2 U1.18-P1-C3 U1.18-P1-C2
FC 6170
(2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (1X) U1.18-P1-V4
Qty = 1 U1.18-P1-V3
C14, C15, C18, C19
U1.18-P1-H2.4
U1.18-P1-M7
U1.18-P1-M6
Add 3rd MCM Position C3 Note: 1.7 GHz MCMs require additional FC 6189 (3x total) at location U1.18-P1-V6. All other MCM speeds require 2x FC 6189 as shown.
Qty = 1 U1.18-P1-C2
FC 6170
(2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (2X) U1.18-P1-V4 U1.18-P1-V5
Qty = 2 U1.18-P1-V3 U1.18-P1-V7
C8, C12, C16, C20
U1.18-P1-H2.5
U1.18-P1-M4
U1.18-P1-M8 Chapter
1.
Reference
Information
127
Add 4th MCM Position C2 Note: 1.7 GHz MCMs require additional FC 6189 (4x total) at location U1.18-P1-V8. All other MCM speeds require (3x) FC 6189 as shown.
Qty = 0
FC 6170
(2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (3X) U1.18-P1-V4 U1.18-P1-V5 U1.18-P1-V6
Qty = 2 U1.18-P1-V3 U1.18-P1-V7
C5, C9, C13, C17
U1.18-P1-H2.6
U1.18-P1-M5
U1.18-P1-M1
Number
of Bulk Power Regulators (BPR) FC 6186 Required as a Function of MCMs and I/O Drawers
Number of I/O Drawers
MCM Sizes and Quantities
1 MCM(1.1 or 1.3 GHz)
1 MCM(1.5, 1.7, 1.9 GHz)
2 MCMs(1.1 or 1.3 GHz)
2 MCMs(1.5, 1.7, 1.9 GHz)
0
1
1
1
1
1
1
1
1
1*
2
1
1
1
2
3
2
2
4
2
2
* = MDA-G would require an additional BPR
Number of I/O Drawers
MCM Sizes and Quantities
3 MCM(1.1 or 1.3 GHz)
3 MCM(1.5, 1.7, 1.9 GHz)
4 MCM(1.1 or 1.3 GHz)
4 MCM(1.5, 1.7, 1.9 GHz)
0
1
2
2
2
1
2
2
2
2
2
2
2
2
2
3
2
2
2
2*
4
2
2
2
3
5
2
2
3
3
6
3
2*
3
3
7
3
3
3
3
8
3
3
3
* = MDA-G would require an additional BPR
128
406/90
Service
Guide
Specifications This section contains specifications for the 406/90 system. 406/90 Physical Specifications and Loads The following tables illustrate the physical, electrical and thermal, acoustical, and environmental characteristics of various 406/90 system configurations. Power and Electrical Requirements Redundant power line cords are standard on the 406/90. The system uses dual ac power cords. For maximum availability, each of the power cords should be fed from independent power grids. The following table illustrates electrical and thermal characteristics for the 406/90.
Electrical/Thermal Characteristic
Rated Voltage (V ac, 3 phase)
200 to 240
380 to 415
480
Rated Current (A, per phase)
45
25
20
Frequency (Hertz)
50 to 60
50 to 60
50 to 60
Power (Maximum in kVA)
15.7
15.7
15.7
Typical, full load power factor (pf)
0.99
0.97
0.93
Inrush current (Amps)
162 maximum (see note below)
Thermal output (Maximum kBtu/hr)
53.3
53.3
53.3
Notes: 1. Inrush currents occur only at initial application of power (very short duration for charging capacitors). No inrush currents occur during the normal power off-on cycle. 2. System will function normally with a nominal input voltage in the range of 200-480 V, AC, three phase. The following table illustrates the line cord options for the 406/90 with their geographic, breaker rating, and cord information.
3-Phase Supply Voltage (50/60 Hz)
200-240 V
380-415 V
480 V
Geography
United States, Canada, Japan
Europe, Middle East, Africa, Asia Pacific
United States, Canada
Customer Circuit Breaker Rating (see Note 1 below)
60 A
30 A
30 A
Cord Information
6 and 14 foot, 6 AWG line cord
14 foot, 6 or 8 AWG line cord, (electrician installed)
6 and 14 foot, 10 AWG line cord
Recommended Receptacle
IEC309, 60 A, type 460R9W (not provided)
Not specified, electrician installed
IEC309, 30 A, type 430R7W (not provided)
Notes: 1. The exact circuit breaker ratings may not be available in all countries. Where the specified circuit breaker ratings are not acceptable, use the nearest available rating. Use of a time delayed circuit breaker is recommended. 2. In two-frame systems, frame B receives its power from frame A. The power to frame B is 350 V dc fed from the BPD through UPIC cables. Phase Imbalance and BPR Configuration: Depending on the number of Bulk Power Regulators (BPRs) in your system, phase imbalance can occur in line currents. All systems are provided with two bulk power assemblies (BPAs), with separate line cords. Phase currents will be divided between two line cords in Chapter
1.
Reference
Information
129
normal operation. The following table illustrates phase imbalance as a function of BPR configuration.
Number of BPRs per BPA
Phase A Line Current
Phase B Line Current
Phase C Line Current
1
Power / Vline
Power / Vline
0
2
0.5 Power / Vline
0.866 Power / Vline
0.5 Power / Vline
3
0.577 Power / Vline
0.577 Power / Vline
0.577 Power / Vline
Note: Power is calculated from “Total System Power Consumption” on page 139. Vline is line-to-line nominal input voltage. Balancing Power Panel Loads: The system requires three-phase power. Depending on the system configuration, the phase currents can be fully balanced or unbalanced. System configurations with three BPRs per BPA have balanced power panel loads, while configurations with only one or two BPRs per BPA have unbalanced loads: v With two BPRs per BPA, two of the three phases will draw an equal amount of current, and will be, nominally, 57.8% of the current on the third phase. v With one BPR per BPA, two of three phases will carry an equal amount of current, with no current drawn on the third phase. The following figure is an example of feeding several loads of this type from two power panels in a way that balances the load among the three phases. The method illustrated in the preceding figure requires that the connection from the three poles of each breaker to the three phase pins of a connector be varied. Some electricians may prefer to maintain a consistent wiring sequence from the breakers to the connectors. The following figure shows a way to balance the load without changing the wiring on the output of any breakers. The three-pole breakers are alternated with single-pole breakers, so that the three-pole breakers do not all begin on Phase A. 130
406/90
Service
Guide
The following figure shows another way of distributing the unbalanced load evenly. Using this method, the three-pole breakers are alternated with two-pole breakers. Chapter
1.
Reference
Information
131
Power Cord Configuration: The power cords exit the system from different points of the frame as indicated in the following illustration.AAllSingle-FrameSystems(TopDownView)ABAllDouble-FrameSystems(TopDownView)FrontFrontRearRear 132
406/90
Service
Guide
Checking the Facility Outlets and Power Source: CAUTION: Do not touch the receptacle or the receptacle faceplate with anything other than your test probes before you have met the requirements in “Checking the Facility Outlets and Power Source” below. Performing the following will ensure that appropriate power will be used by the 406/90. The following checklist is for reference purposes, and will likely be performed by a service engineer prior to installation. __ 1. The 406/90 is equipped to use 200-240 V / 380-415 V / 480 V ac, three-phase. Check that the correct power source is available. __ 2. Before system installation, locate and turn off the branch circuit CB (circuit breaker). Attach tag S229-0237, which reads “Do Not Operate.” Note: All measurements are made with the receptacle faceplate in the normally installed position. __ 3. Some receptacles are enclosed in metal housings. On receptacles of this type, perform the following steps: a. Check for less than 1 volt from the receptacle case to any grounded metal structure in the building, such as a raised-floor metal structure, water pipe, building steel, or similar structure. b. Check for less than 1 volt from receptacle ground pin to a grounded point in the building.Note: If the receptacle case or faceplate is painted, be sure the probe tip penetrates the paint and makes good electrical contact with the metal. __ 4. Check the resistance from the ground pin of the receptacle to the receptacle case. Check resistance from the ground pin to building ground. The reading should be less than 1.0 ohm, which indicates the presence of a continuous grounding conductor. __ 5. If any of the checks made in substeps 2 and 3 are not correct, remove the power from the branch circuit and make the wiring corrections; then check the receptacle again. Note: Do not use the digital multimeter to measure grounding resistance. __ 6. Check for infinite resistance between the phase pins. This is a check for a wiring short. CAUTION: If the reading is other than infinity, do not proceed! You must make the necessary wiring corrections to satisfy the above criteria before continuing. Do not turn on the branch circuit CB until all the above steps are satisfactorily completed. __ 7. Remove tag S229-0237, which reads “Do Not Operate.” __ 8. Turn on the branch circuit CB. Measure for appropriate voltages between phases. If no voltage is present on the receptacle case or grounded pin, the receptacle is safe to touch. __ 9. With an appropriate meter, verify that the voltage at the outlet is correct. __ 10. Verify that the grounding impedance is correct by using the ECOS 1020, 1023, B7106, or an appropriately approved ground impedance tester. __ 11. Turn off the branch circuit CB. __ 12. Attach tag S229-0237, which reads “Do Not Operate.” __ 13. You are now ready to install and connect the power cables to the 406/90.Additional Installation Considerations: In the United States, installation must be made in accordance with Article 645 of the National Electric Code (NEC). In Canada, installation must be in accordance with Article 12-020 of the Canadian Electrical Code (CEC). Dimensions and Weight
Physical Characteristic
Slimline Doors
Acoustical Doors
1 Frame
2 Frames
1 Frame
2 Frames Chapter
1.
Reference
Information
133
Height
2025 mm (79.72 in.)
2025 mm (79.72 in.)
2025 mm (79.72 in.)
2025 mm (79.72 in.)
Width
785 mm (30.91 in.)
1575 mm (62.00 in.)
785 mm (30.91 in.)
1575 mm (62.00 in.)
Depth
1342 mm (52.83 in.)
1342 mm (52.83 in.)
1494 mm (58.83 in.)
1494 mm (58.83 in.)
Weight (maximum configuration)
1170 kg (2580 lbs.)
1973 kg (4349 lbs.)
1184 kg (2610 lbs.)
2000 kg (4409 lbs.)
Notes: 1. Doors are not installed during product shipment to the customer. A maximum configured system with batteries may exceed 1134 kg (2500 lbs.). 2. When moving or relocating certain configurations of the system, the Bulk Power Regulators (BPRs) must be removed from the top of the rack (front and rear) to ensure product stability. Specifically, removal of BPRs from frame A and B in the front and rear is required in systems that have a single I/O drawer, with more than 2 BPRs installed per BPA in the primary rack.
System Weights by Configuration
Total System Weight (Pounds)
Number of I/O Subsystems
1
2
3
4
5
6
7
8
Slimline Doors With IBF
2250
2415
2580
3633
3854
4019
4184
4349
Slimline Doors Without IBF
1865
2030
2195
2418
3266
3431
3596
3761
Acoustical Doors With IBF
2280
2445
2610
3693
3914
4079
4244
4409
Acoustical Doors Without IBF
1923
2088
2253
2506
3326
3491
3656
3821
No Doors With IBF
2192
2357
2522
3517
3738
3903
4068
4233
No Doors Without IBF
1807
1972
2137
2302
3150
3315
3480
3645
Note: Italicized numbers indicate single-frame systems.
Total System Weight (Kilograms)
Number of I/O Subsystems
1
2
3
4
5
6
7
8
Slimline Doors With IBF
1021
1095
1170
1648
1748
1823
1898
1973
Slimline Doors Without IBF
846
921
996
1097
1481
1556
1631
1706
Acoustical Doors With IBF
1034
1109
1184
1675
1775
1850
1925
2000
Acoustical Doors Without IBF
872
947
1022
1137
1509
1583
1658
1733
No Doors With IBF
994
1069
1144
1595
1696
1770
1845
1920
No Doors Without IBF
820
894
969
1044
1429
1504
1579
1653
Note: Italicized numbers indicate single-frame systems.
134
406/90
Service
Guide
Acoustical Noise Emissions
Acoustical Characteristic
Product Configuration
Declared A-Weighted Sound Power Level, LWAd (B)
Declared A-Weighted Sound Pressure Level, LpAm (dB)
Operating
Idle
Operating
Idle
7040 Acoustical Doors
7.5
7.5
57
57
7040 Slimline Doors
7.9
7.9
62
62
Notes: 1. Noise levels cited are for a typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with thin covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions; 1 B = 10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296.
Environmental Specifications
Environmental Specification
Operating
Non-Operating
Storage
Shipping
Temperature
10 to 32°C(50 to 90°F) Max. of 24 ° C (75.2 ° F) with 4mm tape or DVD RAM in rear positions of the Media Subsystem
10 to 43°C(50 to 109°F)
1 to 60°C(34 to 140°F)
–40 to 60°C(–40 to 140°F)
Relative Humidity (Noncondensing)
8 to 80 %
8 to 80 %
5 to 80 %
5 to 100 %
Maximum Wet Bulb
23°C (73°F)
27°C (73°F)
29°C (84°F)
29°C (84°F)
Notes: 1. Storage and shipping specifications are valid for a maximum duration of two weeks each. 2. The upper limit of the dry bulb temperature must be derated 1 degree C per 189 m (619 ft.) above 1295 m (4250 ft.). Maximum altitude for 1.1 GHz modules is 3048 m (10,000 ft.) and for 1.3 GHz modules is 2134 m (7000 ft)
Weight Distribution The following table shows dimensions and weights used to calculate floor loading for the 406/90. All floor-loading calculations are intended for a raised-floor environment.
1 Frame with Slimming Covers
2 Frames with Slimming Covers
1 Frame with Acoustical Covers
2 Frames with Acoustical Covers
Weight
1170 kg (2580 lbs.)
1973 kg (4349 lbs.)
1184 kg (2610 lbs.)
2000 kg (4409 lbs.)
Width
750 mm (29.5 in.)
1539 mm (60.6 in.)
750 mm (29.5 in.)
1539 mm (60.6 in.)
Depth
1173 mm (46.2 in.)
1173 mm (46.2 in.)
1173 mm (46.2 in.)
1173 mm (46.2 in.) Chapter
1.
Reference
Information
135
Notes: 1. For 2 frame systems, widths of Frame A and Frame B. were added (the depth remains 1069 mm (42.1 in.), not including frame extenders). 2. For 2 frame systems, weights are based on maximum configuration (less than addition of maximum weights for each frame). 3. All floor-loading calculations are intended for a raised-floor environment.
The following table shows floor-loading specifications for systems with slimline covers. The values contained in the Condition column are described following the table.
Condition
a (sides)mm (in.)
b (front)mm (in.)
c (backmm (in.)
1 Frame kg/m2 (lb./ft.2)
2 Frameskg/m2 (lb./ft.2)
1
25 (1.0)
135 (5.3)
135 (5.3)
1080.1 (221.2)
924.4 (189.3)
2
25 (1.0)
554 (21.8)
655 (25.8)
702.2 (143.8)
607.9 (124.5)
3
25 (1.0)
762 (30.0)
762 (30.0)
634.5 (129.9)
551.2 (112.9)
4
254 (10.0)
554 (21.8)
655 (25.8)
491.2 (100.6)
499.4 (102.3)
5
254 (10.0)
762 (30.0)
762 (30.0)
448.1 (91.8)
455.3 (93.3)
6
508 (20.0)
554 (21.8)
655 (25.8)
385.0 (78.9)
424.4 (86.9)
7
508 (20.0)
762 (30.0)
762 (30.0)
354.3 (72.6)
389.1 (79.7)
8
554 (21.8)
554 (21.8)
655 (25.8)
372.1 (76.2)
413.9 (84.8)
9
559 (22)
762 (30.0)
762 (30.0)
341.7 (70.0)
378.9 (77.6)
10
762 (30.0)
521 (20.5)
521 (20.5)
341.7 (70.0)
393.3 (80.6)
11
762 (30.0)
762 (30.0)
762 (30.0)
302.4 (61.9)
344.8 (70.6)
Definition of Conditions: v Condition 1 indicates maximum floor loading when systems are stored cover-to-cover on all four sides with covers installed. v Conditions 2 and 3 indicate floor loading when the system has no side clearance (beyond side covers) on both sides while front/back distances varied. v Conditions 4 through 8 indicate floor loading at various points below the maximum weight-distribution distance of 762 mm (30.0 in.) from each edge of the frame. v Conditions 9 through 10 indicate floor-loading options when the installation is limited to 342.0 kg/m2 (70.0 lb/ft2). v Condition 11 is the minimum floor loading required, based on the maximum weight-distribution area (30.0 in. from each side of the base frame).Notes: 1. Service clearance is independent from weight distribution distance and must be at least 45 in. for the front of the frame and 36 in. for the rear of the frame (measured from the base frame). 2. Weight-distribution areas should not be overlapped. 3. Floor-loading weight distribution distances should not exceed 762 mm (30 in.) in any direction when measured from the base frame. 136
406/90
Service
Guide
The following table shows floor-loading specifications for systems with acoustical covers. The values contained in the Condition column are described following the table.
Condition
a (sides)mm (in.)
b (front)mm (in.)
c (backmm (in.)
1 Framekg/m2(lb./ft. 2)
2 Frameskg/m.2(lb./ft 2)
1
25 (1.0)
135 (5.3)
135 (5.3)
1091.9 (223.6)
936.2 (191.7)
2
25 (1.0)
554 (21.8)
757 (29.8)
685.3 (140.4)
594.9 (121.8)
3
25 (1.0)
762 (30.0)
762 (30.0)
640.8 (131.2)
557.5 (114.2)
4
254 (10.0)
554 (21.8)
757 (29.8)
480.5 (98.4)
489.3 (100.2)
5
254 (10.0)
762 (30.0)
762 (30.0)
452.2 (92.6)
460.3 (94.3)
6
508 (20.0)
554 (21.8)
757 (29.8)
377.4 (77.3)
416.3 (85.3)
7
508 (20.0)
762 (30.0)
762 (30.0)
357.2 (73.2)
393.0 (80.5)
8
569 (22.4)
762 (30.0)
762 (30.0)
342.0 (70.0)
380.7 (78.0)
9
762 (30.0)
554 (21.8)
757 (29.8)
320.3 (65.6)
367.5 (75.3)
10
762 (30.0)
533 (21.0)
533 (21.0)
342.0 (70.0)
394.2 (80.7)
11
762 (30.0)
762 (30.0)
762 (30.0)
304.6 (62.4)
348.1 (71.3)
Definition of Conditions: v Condition 1 indicates maximum floor loading when systems are stored cover-to-cover on all four sides with covers installed. v Conditions 2 and 3 indicate floor loading when the system has no side clearance (beyond side covers) on both sides while front/back distances varied. v Conditions 4 through 8 indicate floor loading at various points below the maximum weight-distribution distance of 762 mm (30.0 in.) from each edge of the frame. v Conditions 9 through 10 indicate floor-loading options when the installation is limited to 342.0 kg/m2 (70.0 lb/ft2). v Condition 11 is the minimum floor loading required, based on the maximum weight-distribution area (30.0 in. from each side of the base frame).Notes: 1. Service clearance is independent from weight-distribution distance and must be at least 45 in. at the front of the frame and 36 in. at the rear of the frame (measured from the base frame). 2. Weight-distribution areas should not be overlapped. 3. Floor-loading weight distribution distances should not exceed 762 mm (30 in.) in any direction when measured from the base frame. Chapter
1.
Reference
Information
137
Plan Views The following illustration shows dimensional planning information for single-frame systems and double-frame systems.Double-FrameSystemsAB1575mm(62.0in.)A785mm(30.9)Single-FrameSystemsFRAMEENTRY/EXITDIMENSION(mm)(in.)FRONT117by4034.6by15.9REAR117by4034.6by15.9A750mm(29.5in.)1174mm(46.2in.)1342mm(52.8in.)SlimlineCoversor1494mm(58.8in.)AcousticalCovers109mm(4.3in.)UniversalFrontCoverFRONT18mm(0.7in.)58mm(2.3in.)SlimlineRearCoveror211mm(8.3in.)AcousticalRearCover 138
406/90
Service
Guide
Total System Power Consumption The following tables contain minimum and maximum power consumption for the 1.1 and 1.3 GHz 406/90. Minimum power consumption is based on a configuration consisting of a single 4 GB memory card, 1 PCI card per I/O subsystem, and 1 DASD device per I/O subsystem. Maximum power consumption is based on a configuration consisting of two 32 GB memory cards per MCM module, maximum PCI cards (20 per I/O drawer), and maximum DASD (16 per I/O drawer). Power consumption calculations are estimates. Actual values may vary. Calculate heat load (Btu per hour) by multiplying the power (in watts) for the configuration by a factor of 3.4. Note: Certain system configurations only support a maximum number of I/O drawers. If a system does not support a particular count of I/O drawers, it is indicated with ?N/A?.
Number of I/O Drawers (7040-61D)
1.1 GHz 8-way Modules (minimum power consumption, in watts)
1.1 GHz 8-way Modules (maximum power consumption, in watts)
8-way
16-way
24-way
32-way
8-way
16-way
24-way
32-way
1
1911
2867
3823
4779
3042
4586
6130
7674
2
2279
3235
4191
5147
4090
5634
7178
8722
3
N/A
3603
4559
5515
N/A
6682
8226
9770
4
N/A
3971
4927
5883
N/A
7730
9274
10818
5
N/A
N/A
5295
6251
N/A
N/A
10322
11866
6
N/A
N/A
5663
6619
N/A
N/A
11370
12914
7
N/A
N/A
N/A
6987
N/A
N/A
N/A
13962
8
N/A
N/A
N/A
7355
N/A
N/A
N/A
15010
Number of I/O Drawers (7040-61D)
1.3 GHz 4-way Modules (minimum power consumption, in watts)
1.3 GHz 4-way Modules (maximum power consumption, in watts)
8-way
16-way
8-way
16-way
1
3213
5471
4932
8366
2
3581
5839
5980
9414
3
3949
6207
7028
10462
4
4317
6575
8076
11510
5
N/A
6943
N/A
12558
6
N/A
7311
N/A
13606
7
N/A
7679
N/A
14654
8
N/A
8047
N/A
15702
Chapter
1.
Reference
Information
139
Number of I/O Drawers (7040-61D)
1.3 GHz 8-way Modules (minimum power consumption, in watts)
1.3 GHz 8-way Modules (maximum power consumption, in watts)
8-way
16-way
24-way
32-way
8-way
16-way
24-way
32-way
1
2084
3213
4342
5471
3215
4932
6649
8366
2
2452
3581
4710
5839
4263
5980
7697
9414
3
N/A
3949
5078
6207
N/A
7028
8745
10462
4
N/A
4317
5446
6575
N/A
8076
9793
11510
5
N/A
N/A
5814
6943
N/A
N/A
10841
12558
6
N/A
N/A
6182
7311
N/A
N/A
11889
13606
7
N/A
N/A
N/A
7679
N/A
N/A
N/A
14654
8
N/A
N/A
N/A
8047
N/A
N/A
N/A
15702
Number of I/O Drawers (7040-61D)
1.5 GHz 4-way Modules (minimum power consumption, in watts)
1.5 GHz 4-way Modules (maximum power consumption, in watts)
4-way
8-way
12-way
16-way
4-way
8-way
12-way
16-way
1
1714
2473
3232
4946
2931
4364
5797
7230
2
2082
2841
3600
5314
3979
5412
6845
8278
3
N/A
3209
3968
5682
N/A
6460
7893
9326
4
N/A
3577
4336
6050
N/A
7508
8941
10374
5
N/A
N/A
4704
6418
N/A
N/A
9989
11422
6
N/A
N/A
5072
6786
N/A
N/A
11037
12470
7
N/A
N/A
N/A
7154
N/A
N/A
N/A
13518
8
N/A
N/A
N/A
7522
N/A
N/A
N/A
14566
Number of I/O Drawers (7040-61D)
1.5 GHz 8-way Modules (minimum power consumption, in watts)
1.5 GHz 8-way Modules (maximum power consumption, in watts)
8-way
16-way
24-way
32-way
8-way
16-way
24-way
32-way
1
1839
2723
3607
4491
3056
4614
6172
7730
2
2207
3091
3975
4859
4104
5662
7220
8778
3
N/A
3459
4343
5227
N/A
6710
8268
9826
4
N/A
3827
4711
5595
N/A
7758
9316
10874
5
N/A
N/A
5079
5963
N/A
N/A
10364
11922
6
N/A
N/A
5447
6331
N/A
N/A
11412
12970
7
N/A
N/A
N/A
6699
N/A
N/A
N/A
14018
8
N/A
N/A
N/A
7067
N/A
N/A
N/A
15066
140
406/90
Service
Guide
Number of I/O Drawers (7040-61D)
1.7 GHz 8-way Modules (minimum power consumption, in watts)
1.7 GHz 8-way Modules (maximum power consumption, in watts)
8-way
16-way
24-way
32-way
8-way
16-way
24-way
32-way
1
2017
3079
4141
5203
3234
4970
6706
8442
2
2385
3447
4509
5571
4282
6018
7754
9490
3
N/A
3815
4877
5939
N/A
7066
8802
10538
4
N/A
4183
5245
6307
N/A
8114
9850
11586
5
N/A
N/A
5613
6675
N/A
N/A
10898
12634
6
N/A
N/A
5981
7043
N/A
N/A
11946
13682
7
N/A
N/A
N/A
7411
N/A
N/A
N/A
14730
8
N/A
N/A
N/A
7779
N/A
N/A
N/A
15778
W
attage Addition/Subtraction for Minimum and Maximum Configurations Minimum configurations are based on a single 4 GB memory card and a single DASD/PCI card in each I/O subsystem. Maximum configurations are based on two 64 GB memory cards per MCM module, sixteen DASD per I/O subsystem and twenty PCI cards per I/O subsystem. To determine the typical power consumption for a specific configuration, use the following typical power values: v 4 GB memory book - 137 Watts v 8 GB memory book - 151 Watts v 16 GB memory book - 235 Watts v 32 GB memory book - 294 Watts v 64 GB memory book - 320 Watts v Each PCI card - 20 Watts v DASD - 20 Watts Chapter
1.
Reference
Information
141
Service Inspection Guide Perform a service inspection on the system when: v The system is inspected for a maintenance agreement. v Service is requested and service has not recently been performed. v An alterations and attachments review is performed. v Changes have been made to the equipment that may affect its safe operation. v External devices with their own power cables have those cables attached.If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone can service the machine. Note: The owner of the system is responsible to correct any unsafe conditions. Perform the following checks: 1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the system. 2. Check the covers for proper fit to the rack. They should be in place and secure. 3. Gently rock the rack from side to side to verify the rack is stable. 4. Turn off the UEPO switch, turn off the circuit breakers on all optional IBF subsystems (if provided), then disconnect both power cords from the customer’s receptacle. 5. Remove the covers. 6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as broken wires, sharp edges, or broken insulation. 7. Check the internal cables for damage. 8. Check for dirt, water, and any other contamination within the system. 9. Check the air filters in the processor subsystem to ensure they are clean. Replace if necessary. 10. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet. 11. Check the external power cable for damage. 12. With the external power cable connected to the system, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame. 13. Perform the following checks on each device that has its own power cables: a. Check for damage to the power cord. b. Check for the correct grounded power cable. c. With the external power cable connected to the device, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame of the device.14. Install the covers.Preventive Maintenance Perform preventive maintenance on filters as indicated in the following section. Processor Subsystem Air Filters Attention: To maintain proper airflow, the air filters (part number 11P3781) must be inspected every six months and replaced as necessary. To replace the air filters, it is not necessary to remove power from the system or blower assemblies.
142
406/90
Service
Guide
Removal To remove the air filters from the processor subsystem, grasp the finger tabs located on the air filter and pull the air filter straight out from the processor subsystem.
1
Top Filter Pull Tab
5
Bottom Filter
2
Top Filter Key
6
Bottom Filter Key
3
Processor Subsystem
7
Bottom Filter Pull Tab
4
Top Filter
Replacement To replace an air filter in the processor subsystem, slide the air filter into the slot. The filter is keyed to ensure correct orientation. Chapter
1.
Reference
Information
143
144
406/90
Service
Guide
Chapter 2. Diagnostics Overview 406/90 servers use an integrated set of software diagnostic procedures to facilitate system maintenance. This book and the Diagnostic Information for Multiple Bus Systems are the basis of the diagnostic procedures for 406/90 systems. In particular, chapters 3, 4, 5, and 10 in this book are important for the trained service representative to understand and use when isolating a failure on the system. The manufacturer recommends that systems configured with 4 GB of memory or greater have access to a 4-mm or 8-mm tape drive for submission of system dump information if it is required. This function can be accomplished through locally attached or network-attached devices, as appropriate. Maintenance Analysis Procedures (MAPs) Maintenance Analysis Procedures (MAPs) guide trained service representatives through the complex multirack 406/90 system. These MAPs are the entry point for all isolation and error-recovery procedures. 406/90 systems use a set of integrated procedures, to which the MAPS are the primary entry point. The MAPS are as follows: v Entry MAP v Quick Entry MAP v Service Focal Point Procedures v Power MAP v Problem Isolation MAPThe Entry Map is the starting point for problem determination. The purpose of this MAP is to quickly point to the appropriate MAP or service reference information either in this book, or in the common book set, which includes the Diagnostic Information for Multiple Bus Systems and PCI Adapter Placement Reference. The Quick Entry MAP is a subset of the Entry MAP and helps to save time for some types of problems. The Power MAP deals with isolation of components and cables to diagnose a power problem. The nature of power problems can be related to powering the system on and off, or power failures that can occur after power is turned on. The Problem Isolation MAP is used to locate defective components not found by normal diagnostics or error-isolation methods. This MAP provides a systematic method of isolation to the failing item or items. Checkpoints 406/90 systems use various types of checkpoints, error codes, and SRNs that are referred to throughout this book (primarily in Chapters 3, 4 and 5). On an HMC-managed system, the checkpoints and error codes displayed on the physical operator panel are also displayed on the HMC virtual terminal associated with the managed system object. On a system operating as a full system partition, checkpoints and error codes will appear on the physical operator panel from the beginning of the power-on process through the completion of booting AIX. On a partitioned system, checkpoints and error codes for a partition are displayed on the HMC virtual operator panel associated with that logical partition. Checkpoints and error codes generated by logical partitions are not displayed on the managed system’s physical operator panel. 145
During boot up of a partitioned system, there is a global initialization phase where one of the system processors is in control and initializing all of the system resources. During this phase, Exxx codes will be displayed on both the physical operator panel and the HMC managed-system object. After the global initialization phase completes, ?LPAR...? is displayed on the physical operator panel and the HMC managed-system object. Each logical partition also has a partition initialization phase where one of the system processors assigned to that partition is initializing all the resource of that partition. Exxx codes will be displayed on the virtual terminal of the HMC logical partition object during this partition initialization phase. Codes that may appear in the service processor boot progress log, the operating system error log, the virtual terminal, and the operator panel display are as follows: Checkpoints Checkpoints display in the operator panel from the time ac power is connected to the system until the operating system login prompt is displayed after a successful operating system boot. These checkpoints have the following forms: 8xxx 8xxx checkpoints are displayed from the time ac power is connected to the system until the OK prompt is displayed by the service processor on the operator panel display. (Several 9xxx checkpoints are displayed just before the OK prompt displays.) 9xxx 9xxx checkpoints are displayed by the service processor after the power-on sequence is initiated. A system processor takes control when 91FF displays on the operator panel display.Note: Certain checkpoints may remain in the display for long periods of time. A spinning cursor is visible in the upper-right corner of the display during these periods to indicate that system activity is continuing.Exxx Exxx checkpoints indicate that a system processor is in control and is initializing the system resources. Control is being passed to the operating system when E105 displays on the operator panel display. Location code information may also display on the operator panel during this time. 0xxxand2xxx 0xxx codes are AIX progress codes and configuration codes. Location codes may also display on the operator panel display during this time.Note: AIX progress codes will not display when the Linux operating system is booting.Error Codes If a fault is detected, an 8-digit error code is displayed in the operator panel display. A location code may be displayed at the same time on the second line. SRNs Service request numbers, in the form xxx-xxx or xxx-xxxx, may also be displayed on the operator panel display and be noted in the AIX error log.Note: If the operating system is Linux, SRNs will not be logged in the Linux system error log.Checkpoints can become error codes if the system fails to advance past the point at which the code is presented. For a list of the checkpoints, see Chapter 4, “Checkpoints,” on page 369. Each entry provides a description of the event and the recommended action if the system fails to advance. SRNs are listed in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. 146
406/90
Service
Guide
FRU Isolation For a list of error codes and recommended actions for each code, see Chapter 5, “Error Code to FRU Index,” on page 413. These actions can refer to Chapter 10, “Parts Information,” on page 1009, Chapter 3, “Maintenance Analysis Procedures (MAPs),” on page 149, or provide informational message and directions. If a replacement part is indicated, direct reference is made to the part name. The respective AIX and physical location codes are listed for each occurrence as required. For a list of location codes, see “AIX and Physical Location Code Reference Tables” on page 68. Note: AIX location codes are not available when the running operating system is Linux. To look up part numbers and view component diagrams, see Chapter 10, “Parts Information,” on page 1009. The beginning of the chapter provides a parts index with the predominant field replaceable units (FRUs) listed by name. The remainder of the chapter provides illustrations of the various assemblies and components that compose 406/90 servers. The illustrations are ordered with processor subsystem components listed first, then I/O subsystem, and then accessories. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to: v Rack identify LED v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – Processor subsystem FRUs – I/O subsystem FRUsv I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack-identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed (when the processor subsystem is in the failed state and power is on) by using the service processor LED Control Menu contained in the System Information Menu of the privileged user menus. To use operate the LED Control Menu, see page 788. Any identify LED in the system can also be flashed by using the Identify and Attention Indicators task in diagnostics. The procedure to use the Identify and Attention Indicators task in diagnostics is outlined in Diagnostic Information for Multiple Bus Systems. If the service processor menus and AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of the following procedures: v If the system is configured as a full partition, the system may be booted to the open firmware prompt and the command FRU-LED-MENU entered. A menu displays that allows you to enable the desired FRU identify LED. See “System Power Control Menu” on page 778 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. v If the system is logically partitioned, the HMC must be attached. You can use the HMC to enable any FRU identify LED to be flashed. See the Hardware Management Console Operations Guide for instructions on activating and deactivating a FRU identify LED.
Chapter
2.
Diagnostics
Overview
147
Running Diagnostics on the Native Serial Port on a Full Machine Partition with an HMC Attached For a full system partition, the output of the S1 serial port is redirected, or wrapped to the virtual console window. When the S1 serial port is wrapped, the output of any command is directed from the S1 serial port to the virtual console terminal. If you close the virtual console window on the managed system, normal function is restored to the S1 serial port. The implication of this is that the serial port wrap diagnostics may fail with a false SRN if the virtual console window on the managed system is not closed. Another method to prevent the false SRN is to unplug the HMC from its port before running the wrap tests. After running diagnostics on the serial port, make sure that the S1 serial port is enabled for login; use the AIX command chdev -l ’ttyx’ -a login=’enable’ where x is the tty number corresponding to the S1 serial port. 148
406/90
Service
Guide
Chapter 3. Maintenance Analysis Procedures (MAPs) This chapter contains Maintenance Analysis Procedures (MAPs) for the 406/90. Entry MAP When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. This ensures that the error state of the system that was captured in NVRAM is available for your use in fixing the problem. The AIX error log and SMIT are only available when diagnostics are run from the hard drive. Always check Service Focal Point for any open service events in Service Action Event log. Note: Online AIX diagnostics from the hard drive are not available when the running operating system is Linux. If this system has a Hardware Management Console (HMC) with Service Focal Point (SFP) and you have not previously been directed to MAP 1321, go to “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170. Otherwise, use the following table to help determine your next step.Notes: 1. Licensed programs frequently rely on system information stored in the VPD module on the media subsystem operator panel. If the MAPs indicate that the media subsystem operator panel should be replaced, swap the VPD module from the old media subsystem operator panel to the new one (see “Operator Panel” on page 1000). If the old VPD module has to be replaced, call technical support for recovery instructions. If recovery is not possible, notify the system owner that new keys for licensed programs may be required. 2. If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the system can be changed. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or system that addresses this system is updated.Note: The various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table.
SFP Name
Number of Digits in Error Code
Error code
Service Guide Name
Error Code
Any
Contains #
Menu Goal
Error Code
Any
Contains - (hyphen)
SRN
Error Code
5
Does not contain # or -
SRN
Error Code
6
Does not contain # or -
Error Code
Error Code
8
Does not contain # or -
Error Code, SRC, or Ref Code
149
Symptom
Starting Point
You have a problem that does not prevent the system from booting and the media subsystem operator panel is functional.
Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
You do not have a symptom.
Go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems.
You have an SRN, from a partition running Linux, that is one of the following: 651-88x, 651-89x, 652-88x, 652-89x, Axx-xxx, or Axx-xxxx.
Go to Chapter 11, “SRNs for Linux,” on page 1035 and perform the actions listed there.
You have an SRN, from a partition running Linux, that is not one of the following: 651-88x, 651-89x, 652-88x, 652-89x, Axx-xxx, or Axx-xxxx.
Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
You have an SRN from a partition running AIX or AIX standalone diagnostics
Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
The system stops and a 3-digit number is displayed in the operator panel display.
Record SRN 101-xxx, where xxx is the 3-digit number displayed in the operator panel display, then go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
The system stops and a 4-digit number beginning with 0 or 2 is displayed in the operator panel display.
Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
All other symptoms.
Go to “Quick Entry MAP” on page 151.
150
406/90
Service
Guide
Quick Entry MAP The Quick Entry MAP is provided to help you quickly resolve a problem. Use the Quick Entry MAP when you know or have been provided with a symptom. Quick Entry MAP Table of Contents
Problem Description
Page No.
Service actions
152
The system attention LED on the media subsystem operator panel is on.
152
OK does not appear in the media subsystem operator panel display before pressing the power-on button. Other symptoms appear in the media subsystem operator panel display or LEDs before the power-on button is pressed.
152
8-Digit error codes
153
System stops with an 8-digit number displayed
153
System stops with a 4-digit number displayed that does not begin with 0 or 2.
153
System stops with a 3-digit number displayed (see below for 888 sequence).
153
Hardware Management Console (HMC) Problem
153
There appears to be a display problem (distortion, blurring, etc.)
154
Power and cooling problems
155
888 sequence in operator panel display
155
Other symptoms or problems
155
Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call for support if the actions for an error code do not resolve the problem. If you replace FRUs or perform an action, and the problem is corrected, go to ?MAP 410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Note: If the only service actions that you perform are on the HMC, do not use MAP 410 to do the repair checkout. Chapter
3.
Maintenance
Analysis
Procedures
151
Symptom
Action
Service Actions
You have an open service event in the service action event log.
Go to “Service Focal Point” on page 4.
You have parts to exchange or a corrective action to perform.
1. Go to Chapter 9, “Removal and Replacement Procedures,” on page 835. 2. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
You need to verify that a part exchange or corrective action corrected the problem.
Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
You need to verify correct system operation.
Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
The system attention LED on the media subsystem operator panel is on.
Go to “Disturbance or System Attention LED” on page 21 and perform any actions indicated.
OK does not appear in the media subsystem operator panel display before pressing the power-on button. Other symptoms appear in the media subsystem operator panel display or LEDs before the power on button is pressed.
A bouncing or scrolling ball remains on the media subsystem operator panel display or the media subsystem operator panel display is filled with dashes
1. Verify the operator panel cable from the primary I/O book to the media subsystem is connected and correctly seated at both ends. 2. If an ASCII terminal is available, connect it to the primary I/O book through serial port 1. a. If the service processor menu is displayed, then: 1) Replace the operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000. 2) Replace the primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854.b. If the service processor menu is not displayed, then: 1) Replace the primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854. 2) Replace the operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000.3. If an ASCII terminal is not available, replace the following, one at a time, in the sequence listed: a. Operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000. b. Primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854.
You have a blank display on the operator panel. Other LEDs on the operator panel appear to behave normally.
1. Check the processor subsystem to media subsystem cable. 2. Replace the operator panel assembly, Location: U1.17-L1. Refer to “Operator Panel” on page 1000. 3. Replace the primary I/O book, Location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854.
You have a blank display on the operator panel. Other LEDs on the operator panel are off.
Go to “MAP 1520: Power” on page 192. 152
406/90
Service
Guide
Symptom
Action
8-Digit Error Codes
You have an 8-digit error code displayed
1. Look up the error code in the table in “Checkpoints and Error Codes Index” on page 416. 2. Look up the service action event (SAE) log. If the FRU has been changed other than the one in step 1, look at the most recent one.
System stops with an 8-digit number displayed
The system stops with an 8-digit error code displayed when booting.
1. Look up the error code in the table in “Checkpoints and Error Codes Index” on page 416. 2. Look up the SAE log. If the FRU has been changed other than the one in step 1, look at the most recent one.
System stops with a 4-digit number displayed that does not begin with 0 or 2.
The system stops with a 4-digit checkpoint.
Go to “Checkpoints and Error Codes Index” on page 416. If you do not find the error code there, go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
System stops with a 3-digit number displayed (see below for 888 sequence)
The system stops with a 3-digit error code.
Add 101- to the left of the three digits to create an SRN. Go to Diagnostic Information for Multiple Bus Systems. If there is a location code displayed under the 3-digit error code, then look at location to see if it matches failing component that the SRN pointed to. If they do not match, perform the action from Diagnostic Information for Multiple Bus Systems first. If the problem still exists, then replace the failing component from that location code.
Hardware Management Console (HMC) Problem Chapter
3.
Maintenance
Analysis
Procedures
153
Symptom
Action
Hardware Management Console (HMC) cannot be used to manage a managed system, or the connection to the managed system is failing.
If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the managed system might be damaged or incorrectly cabled. Do the following: 1. Check the connections (serial cable) between the HMC and the managed system. Correct any cabling errors if found. If another serial cable is available, connect it in place of the existing cable and refresh the HMC graphical user interface. You may have to wait up to 30 seconds for the managed system to reconnect. 2. Verify that any connected HMC is connected to the managed system by checking the Management Environment of the HMC. Note: The managed system must have power connected and the system running, or waiting for a power-on instruction (the OK prompt is in the operator panel.) If the managed system does not appear in the Navigation area of the HMC Management Environment, then the HMC or the connection to the managed system might be failing. 3. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273. 4. If you cannot fix the problem using the HMC tests in the Hardware Management Console Maintenance Guide, there might be a problem with the primary I/O book; replace the primary I/O book.
Hardware management console (HMC) cannot call out using the attached modem and the customer’s telephone line.
If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the modem and telephone line may have a problem. Do the following: 1. Check the connections between the HMC and the modem and telephone line. Correct any cabling errors if found. 2. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273.
There is a Display Problem (Distortion, Blurring, Etc.) 154
406/90
Service
Guide
Symptom
Action
All display problems.
1. If using the Hardware Management Console, go to the Hardware Management Console Maintenance Guide, order number SA23-1273. 2. If using a graphics display: a. Go to the problem determination procedures for the display. b. If you do not find a problem, replace the display adapter.3. If using an ASCII terminal: a. Make sure that the ASCII terminal is connected to S1. b. If problems persist, go to the problem determination procedures for the terminal. c. If you do not find a problem, replace the primary I/O book.
Power and Cooling Problems
The system will not power on and no error codes are available.
Go to “MAP 1521: The System Will Not Power On And No Error Codes Are Available” on page 196.
The power light does not come on or stay on. (both the operator panel power-on LED and the I/O subsystem indicator panel power LEDs do not come on or stay on).
1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192.
A rack or a rack-mounted unit will not power on.
1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192.
You have a power problem.
1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192.
The system attention LED on the media subsystem operator panel is on and there is no error code displayed.
1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192.
888 Sequence in Operator Panel Display
An 888 sequence is displayed in the media subsystem operator panel display.
Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems.
Other Symptoms or Problems
One or more partitions do not boot. You cannot activate one or more partitions.
Go to “MAP 1020: Problem Determination” on page 158.
The operator panel displays OK and the fans and blowers are off.
The service processor is ready. The system is waiting for power-on.
All of the system POST indicators are displayed on the system console, the system pauses and then restarts. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST).
Go to “Boot Problems” on page 409.
The system stops and all of the POST indicators are displayed on the system console. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST).
Go to “Boot Problems” on page 409.
The system stops and the message STARTING SOFTWARE PLEASE WAIT... is displayed on the firmware console.
Go to Chapter 4, “Checkpoints,” on page 369. Chapter
3.
Maintenance
Analysis
Procedures
155
Symptom
Action
The system does not respond to the password being entered or the system login prompt is displayed when booting in service mode.
1. If the password is being entered from the Hardware Management Console (HMC), go to the Hardware Management Console Maintenance Guide. 2. If the password is being entered from a keyboard attached to the system, then the keyboard or its controller may be faulty. In this case, replace these parts in the following order: a. Keyboard cable b. Keyboard c. USB card the keyboard is attached to.3. If the password is being entered from an ASCII terminal, then use the problem determination procedures for the ASCII terminal. Make sure the ASCII terminal is connected to S1. If the problem persists, replace the primary I/O book.
No codes are displayed on the operator panel within a few seconds of turning on the system. The operator panel is blank before the system is powered on.
Reseat the operator panel cable. If the problem is not resolved, replace in the following order: 1. Operator panel assembly. Swap the VPD module from the old operator panel to the new operator panel. 2. Primary I/O book. (See notes on page 149.)
The SMS configuration list or boot sequence selection menu shows more SCSI devices attached to a controller/adapter than are actually attached.
A device may be set to use the same SCSI bus ID as the control adapter. Note the ID being used by the controller/adapter (this can be checked and/or changed through an SMS utility), and verify that no device attached to the controller is set to use that ID. If settings do not appear to be in conflict: 1. Go to “MAP 1542: I/O Problem Isolation” on page 318 2. Replace the SCSI cable. 3. Replace the device. 4. Replace the SCSI adapterNote: In a ?twin-tailed? configuration where there is more than one initiator device (normally another system) attached to the SCSI bus, it may be necessary to use SMS utilities to change the ID of the SCSI controller or adapter.
The System Management Services menu is displayed.
The device or media you are attempting to boot from may be faulty. 1. Check the SMS error log for any errors. To check the error log: v Choose error log v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index” and do the listed action. v If no recent error is logged in the error log, continue to the next step below.2. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot succeeds, replace the original boot device (for removable media devices, try the media first). 3. Go to “Boot Problems” on page 409. 156
406/90
Service
Guide
Symptom
Action
You suspect a cable problem.
Go to Adapters, Devices, and Cable Information for Multiple Bus Systems.
All other problems.
Go to “MAP 1020: Problem Determination” on page 158.
Chapter
3.
Maintenance
Analysis
Procedures
157
MAP 1020: Problem Determination Purpose of This MAP Use this MAP to get an error code if you were not provided one by the customer or you are unable to load diagnostics. If you are able to load the diagnostics, go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems. The service processor may have recorded one or more symptoms in its error log. Examine this error log before proceeding (see “System Information Menu” on page 781). The server may have been set up by using the HMC. Check the Service Action Event (SAE) log in the Service Focal Point. The SAE log may have recorded one or more symptoms in the Service Focal Point. To avoid unnecessary replacement of the same FRU for the same problem, it is necessary to check the SAE log for evidence of prior service activity on the same subsystem. If you find any error codes in the service processor error log or the Service Focal Point, record all error codes and any location codes in the order you find them and process each error code and location code beginning at the Entry MAP. Otherwise, continue with the directions for MAP 1020. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If the system was set up according to the recommendations of the 406/90, all the settings of the service processor (except language) were saved by using the SAVE/RESTORE HARDWARE MAINTENANCE POLICIES service aid. You can use the same service aid to restore the settings at the conclusion of your service action. In case the service processor settings were not saved by the user, if you disable them, make notes of their current settings for restoration before you leave. In addition to the parameters in the following table, you can disconnect the modem to prevent incoming signals that could cause the system to power on. Following are the service processor settings. The service processor menus are described in Chapter 7, “Using the Service Processor,” on page 769.
Surveillance
From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. (Surveillance is automatically disabled in LPAR mode.)
Unattended Start
From the service processor System Power Control Menu, disable unattended start mode.
Reboot Policy
From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No 3. Enable supplemental restart policy to No.
Call Out
From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. (Call-out is automatically disabled when the system is booted to operate with multiple partitions (also known as a partitioned system). Be prepared to record code numbers and use those numbers in the course of analyzing a problem. Go to “Step 1020-1” on page 159. 158
406/90
Service
Guide
Step 1020-1 The following steps analyze a failure to load the diagnostic programs. Note: Be prepared to answer questions regarding the operator panel display and to perform certain actions based on displayed POST indicators. 1. Run diagnostics on any partition. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 2. Run diagnostics on the failing partition. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 3. Power off the system. Refer to “Powering Off and Powering On the System” on page 838. 4. Load the standalone diagnostics in service mode to test the full system partition. Refer to Chapter 6, “Using the Online and Standalone Diagnostics,” on page 759. 5. Wait until the diagnostics are loaded or the system appears to stop. If you receive an error code or if the system stops before diagnostics are loaded, find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 6. Run the standalone diagnostics on the entire system. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, call service support for assistance.
Symptom
Action
One or more partitions do not boot.
1. Check service processor error log. If an error is indicated, go to the “Entry MAP” on page 149. 2. Check the Serviceable Action Event (SAE) log, refer to “Working With Serviceable Events” on page 7. If there is a Service Action Event, perform the recommended actions or go to the “Entry MAP” on page 149.
The system stopped and a code is displayed on the operator panel.
Go to the “Entry MAP” on page 149.
The system stops with a prompt to enter a password.
Enter the password. You cannot continue until a correct password has been entered. When you have entered a valid password, go to the beginning of this table and wait for one of the other conditions to occur.
The diagnostic operating instructions are displayed.
Go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems.
The power good LED does not come on or does not stay on, or you have a power problem.
Go to “MAP 1520: Power” on page 192.
The system login prompt is displayed.
You may not have pressed the correct key or you may not have pressed the key soon enough when you were to trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step. Note: Perform the system shutdown procedure before turning off the system. If you are sure you pressed the correct key in a timely manner, go to “Step 1020-2” on page 161.
The system does not respond when the password is entered.
Go to “Step 1020-2” on page 161. Chapter
3.
Maintenance
Analysis
Procedures
159
Symptom
Action
The system stopped. A POST indicator is displayed on the system console and an eight-digit error code is not displayed.
If the POST indicator represents: 1. Memory, do a slow boot. If an error code points to memory, replace that memory book and run slow boot again. 2. Keyboard a. Replace the keyboard cable. b. Replace the keyboard. c. Replace the USB card that the keyboard is attached to. d. Go to “MAP 1542: I/O Problem Isolation” on page 318.3. Network, go to “MAP 1542: I/O Problem Isolation” on page 318. 4. SCSI, go to “MAP 1542: I/O Problem Isolation” on page 318. 5. Speaker a. Replace the operator panel. b. Replace the primary I/O book, location: U1.18-P1-H2. c. Go to “MAP 1542: I/O Problem Isolation” on page 318.
The System Management Services menu is displayed.
Go to “Step 1020-4” on page 162.
All other symptoms.
If you were directed here from the Entry MAP, go to “MAP 1542: I/O Problem Isolation” on page 318. Otherwise, find the symptom in the “Entry MAP” on page 149.
160
406/90
Service
Guide
Step 1020-2 There is a problem with the keyboard. Find the type of keyboard you are using in the following table; then follow the instructions given in the Action column.
Keyboard Type
Action
Type 101 keyboard (U.S.). Identified by the size of the Enter key. The Enter key is in only one horizontal row of keys.
Record error code M0KB D001; then go to “Step 1020-3.”
Type 102 keyboard (W.T.). Identified by the size of the Enter key. The Enter key extends into two horizontal rows.
Record error code M0KB D002; then go to “Step 1020-3.”
Type 106 keyboard. (Identified by the Japanese characters.)
Record error code M0KB D003; then go to “Step 1020-3.”
ASCII terminal keyboard
Go to the documentation for this type of ASCII terminal and continue with problem determination. Step 1020-3 Take the following actions: 1. Find the eight-digit error code in Chapter 5, “Error Code to FRU Index,” on page 413. Note: If the eight-digit error code is not listed in Chapter 5, “Error Code to FRU Index,” look for it in the following: v Any supplemental service manuals for attached devices v The diagnostic problem report screen for additional information v The Service Hints service aid v The CEREADME file (by using the Service Hints service aid)Note: Service aids can be found in Diagnostic Information for Multiple Bus Systems. 2. Perform the action listed. Chapter
3.
Maintenance
Analysis
Procedures
161
Step 1020-4 1. Turn off, then turn on the system unit. 2. When the keyboard indicator (the word keyboard) displays, press the 1 key on a directly attached keyboard or the 1 key on an ASCII terminal or 1 key on the HMC. 3. When the System Management Services menus appear, check the error log for any errors. a. Choose Error Log b. If an error is logged, check the time stamp. c. If the error was logged during the current boot attempt, record it. d. Look up the error in the Chapter 5, “Error Code to FRU Index,” on page 413 and do the listed action. e. If no recent error is logged in the error log, go to “MAP 1542: I/O Problem Isolation” on page 318. 162
406/90
Service
Guide
MAP 1230: Linux Platform Error Problem Resolution Purpose of This MAP Use this MAP to resolve problems reported by SRNs A00-000 to A1F-FFF from a partition running Linux. Step 1230-1 1. The last character of the SRN is bit-encoded as follows: 8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot-swap is supported | Software or Firmware could be the cause Reserved 2. Refer to the last character in the SRN. A 4, 5, 6, or 7 indicates a possible software or firmware problem. Does the last character indicate a possible software or firmware problem? NO Go to “Step 1230-4” on page 164. YES Go to “Step 1230-2.”Step 1230-2 Ask the customer if any software or firmware has been installed recently. Has any software or firmware been installed recently? NO Go to “Step 1230-4” on page 164. YES Go to “Step 1230-3.”Step 1230-3 Suspect the new software or firmware. Check with your support center for any known problems with the new software or firmware. Are there any known problems with the software or firmware? NO Go to “Step 1230-4” on page 164. YES Obtain and follow the procedure to correct the software problem. This completes the repair. Chapter
3.
Maintenance
Analysis
Procedures
163
Step 1230-4 Were any FRUs or location code reported with the SRN? NO Go to “Step 1230-5.” YES Go to “Step 1230-9.”Step 1230-5 Run the diagnostics in problem determination mode on sysplanar0. Were there any FRUs reported with the SRN? NO Go to “Step 1230-6.” YES Go to “Step 1230-9.”Step 1230-6 Did the system display: ?Previous Diagnostic Results - Do you want to review the previously displayed error?? NO Go to “Step 1230-7.” YES You have a pending item in the error log. To see this error, select YES at the prompt. Information from the error log displays in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to “Step 1230-7.”Step 1230-7 Were there any other SRNs that begin with an A00 to A1F reported? NO Go to “Step 1230-8.” YES Go to “Step 1230-1” on page 163 and use the new SRN.Step 1230-8 Attempt to boot the system on slow boot mode. If the system boots, run the diagnostics in problem determination mode on sysplanar0 Were any new error codes or SRNs reported? NO Call your support center. YES Follow the procedure for the new error code or SRN.Step 1230-9 1. Record the physical location codes and FRU numbers. 2. Refer to the last character in the SRN. A 2, 3, 6, or 7 indicates that hot-swap is possible. Does the last character indicate that hot-swap is possible? NO Go to “Step 1230-10.” YES Go to “Step 1230-14” on page 166.Step 1230-10 Note: If necessary, refer to “Powering Off and Powering On the System” on page 838 for information on system shut down and powering the system on and off. 1. If the operating system is running, perform the operating system’s shutdown procedure. 2. Turn off power to the system. 164
406/90
Service
Guide
3. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen need to be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Use the appropriate procedure in Chapter 9, “Removal and Replacement Procedures,” on page 835 to replace any FRUs indicated. 4. Turn on power to the system. 5. Load standalone AIX diagnostics in service mode.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the Task Selection option, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. If the resource associated with your repair action is not displayed on the resource list, select sysplanar0. 12. Run diagnostics on all resources. Was a problem reported? NO The repair is complete. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. YES Go to “Step 1230-11.”Step 1230-11 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. YES Go to “Step 1230-12.”Step 1230-12 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to “Step 1230-13” on page 166. YES The SRN did not identify the failing FRU. Call your support person for assistance. Chapter
3.
Maintenance
Analysis
Procedures
165
Step 1230-13 1. After performing a shutdown of the operating system, turn off power to the system. 2. Remove the new FRU and install the original FRU. 3. Exchange the next FRU in list. 4. Turn on power to the system. 5. Load standalone diagnostics in service mode.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the Task Selection option, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. If the resource associated with your action does not appear on the Resource List, select sysplanar0. 12. Run diagnostics on all resources. Was a problem reported? NO The repair is complete. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. YES Go to “Step 1230-11” on page 165.Step 1230-14 The FRUs can be hot-swapped. If you do not want to use the hot-swap capabilities of an identified hot-swap FRU, go to “Step 1230-10” on page 164.Note: See the “Hot-Pluggable FRUs” on page 842 for a list of hot-swap devices used in your system. For a procedure to replace a hot-plug (swap) PCI adapter go to “PCI Adapters” on page 914. 1. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen must be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Although some components are physically hot-swappable, there are no procedures available to determine if a repair has been successful while the Linux operating system is running. If you desire to verify the repair with diagnostics, you must shutdown the system and run the AIX standalone diagnostics. 2. After replacing a hot-swappable FRU, examine the HMC or the platform error log to determine if the error has reappeared. You may need to wait 30 minutes or longer before the error is written back to the HMC or platform error log. Was a problem reported? NO The repair is completed. Return the system to the customer. YES Go to “Step 1230-15” on page 167. 166
406/90
Service
Guide
Step 1230-15 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. YES Go to “Step 1230-16.”Step 1230-16 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to “Step 1230-17.” YES The SRN did not identify the failing FRU. Call your support person for assistance.Step 1230-17 1. Remove the new FRU and install the original FRU. 2. Exchange the next hot-swappable FRU in the list. If there are more FRUs in the list that are not hot-swappable, go to “Step 1230-10” on page 164 to replace the rest of the FRUs, if required.Note: Although some components are physically hot-swappable, there are no procedures available to determine if a repair has been successful while the Linux operating system is running. If you desire to verify the repair with diagnostics, you must shutdown the system and run the AIX standalone diagnostics. 3. After replacing a hot-swappable FRU, examine the HMC or the platform error log to determine if the error has reappeared. You may need to wait 30 minutes or longer before the error is written back to the HMC or platform error log. Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to “Step 1230-15.” Chapter
3.
Maintenance
Analysis
Procedures
167
MAP 1235: System Array Self-Repair Problem Resolution for Linux Purpose of This MAP Use this MAP to resolve problems reported by SRNS A11-560 to A11-580. Note: The following steps may require that the system be rebooted to invoke Array bit steering, so you may wish to schedule deferred maintenance with the system administrator to arrange a convenient time to reboot their system. Step 1235-1 Was the SRN A11-560? NO Go to “Step 1235-3.” YES Go to “Step 1235-2.”Step 1235-2 Were there any other errors on the resource reporting the array bit steering problem? NO Go to “Step 1235-4.” YES Resolve those errors before proceeding.Step 1235-3 Were there any other errors on procx? NO Go to “Step 1235-4.” YES Resolve those errors before proceeding.Step 1235-4 Schedule deferred Maintenance with the customer. When it is possible, reboot the system to invoke Array Bit steering Go to “Step 1235-5.” Step 1235-5 Reboot the system. To determine if the original error code or SRN does not reproduce, examine the problem Linux partition with the HMC, or by listing errors in the platform error log. 1. If there is no problem on the resource that originally reported the problem, then array bit steering was able to correct the problem. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. 2. If a problem is reported use the SRN to resolve the problem. Go to “Entry MAP” on page 149 with the new symptom. 168
406/90
Service
Guide
MAP 1320: Service Focal Point These procedures define the steps to take when servicing a machine that is equipped with Service Focal Point (SFP). You might also be directed to these procedures if the system is running Linux as its only operating system. Note: If the system is attached to an HMC and Service Focal Point is enabled, the various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table.
Number of Digits in Error Code
Error code
Name Used in this Service Guide
Any
Contains # (pound sign)
Menu Goal
Any
Contains - (hyphen)
SRN
5
Does not contain # or -
SRN
6
Does not contain # or -
Error Code
8
Does not contain # or -
Error Code, Service Reference Code (SRC)
For steps to be taken when beginning service on a machine with SFP, see the “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170. For steps to be taken after service is complete on a machine with SFP, see the “MAP 1322: End of Call MAP for Systems with Service Focal Point” on page 183. Chapter
3.
Maintenance
Analysis
Procedures
169
MAP 1321: Quick Entry MAP for Systems with Service Focal Point Notes: 1. If you are working on a system that is managed by an HMC, the term service action event is synonymous with serviceable event in this MAP. 2. If you are working on a system that is managed by an HMC, some steps in this MAP may ask you to open the details of the service action event and obtain error codes and FRU location codes. Depending on the error code(s), this operation may involve looking at service processor error code details. Be sure you select all listed error codes one at a time. Select details to display all error codes and location codes for the serviceable event that you are working with.Step 1321-1 For future reference, record the error code and the location code (that brought you to this MAP) of the FRU to be replaced. Record any other symptoms if an error code is not available. Does the system have an HMC attached? No Go to “Step 1321-23” on page 173. Yes Go to “Step 1321-2.”Step 1321-2 At the HMC console, start Service Focal Point (SFP) and examine the service action event log for any open service action events. For SFP operating instructions, see “Service Focal Point” on page 4. Does the HMC appear to operate correctly? No Go to “Quick Entry MAP” on page 151, look up the HMC symptom, and follow the directions there. Yes Go to “Step 1321-3.”Step 1321-3 Are there any service action events that are open? No Go to “Step 1321-40” on page 181. Yes Go to “Step 1321-4.”Step 1321-4 Record the list of open service action events. Step 1321-5 From the list of serviceable events recorded in “Step 1321-4,” open the details of each service action event and examine the FRU list and error code(s) for each. Step 1321-6 Are there any serviceable actions events with error codes of the form #xxxxxx in the list of error codes found in “Step 1321-5”? No Go to “Step 1321-7” on page 171. Yes Go to “Step 1321-18” on page 172. 170
406/90
Service
Guide
Step 1321-7 Are there any serviceable events with FRU location codes reported in “Step 1321-5” on page 170? No Go to “Step 1321-14” on page 172. Yes Go to “Step 1321-8.”Step 1321-8 From the list of open service action events with FRUs reported, examine the details of each service action event and record the error code. Step 1321-9 Examine the list of error codes you obtained from “Step 1321-8” and look for any of the form 4xxB xxxx, A0D-34x, or A1D-34x. Step 1321-10 Are there any error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x? No Go to “Step 1321-13.” Yes Go to “Step 1321-11.”Step 1321-11 Do any of the error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 170 that sent you to this MAP? No Go to “Step 1321-13.” Yes Record the error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x that have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 170 that sent you to this MAP. Also, record the service action event associated with each error code. Go to “Step 1321-12.”Step 1321-12 From the list of open service action events with error codes of the form 4xxB xxxx, A0D-34x, or A1D-34x recorded in “Step 1321-11,” choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” on page 172 with this new code. When future steps refer to the error code recorded in “Step 1321-1” on page 170, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Step 1321-13 From the list of open service action events with FRUs reported, choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” on page 172 with this new error code. When future steps refer to the error code recorded in “Step 1321-1” on page 170, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Chapter
3.
Maintenance
Analysis
Procedures
171
Step 1321-14 At the HMC console, open Service Focal Point (SFP) and examine the service action event log for any closed service action events. For SFP operating instructions, refer to “Service Focal Point” on page 4. Step 1321-15 Are there any service action events that are closed? No Go to the table in “Entry MAP” on page 149. Yes Go to “Step 1321-16.”Step 1321-16 Examine the details of the ten most-recently closed service action events. Are there any error codes or symptoms in the ten latest closed service action events that are the same as the error code or symptom recorded in “Step 1321-1” on page 170? No Go to the table in “Entry MAP” on page 149. Yes Go to “Step 1321-17.”Step 1321-17 Examine the FRUs of the same error codes or symptoms. Are there any FRUs in the same error codes or symptoms of the closed service action events left to be replaced? No Go to the table in “Entry MAP” on page 149. Yes Replace the next FRU of same error codes or symptoms of the closed service action events. Go to ?MAP 0410: Repair Checkout? in Diagnostic Information for Multiple Bus Systems.Step 1321-18 From the list of serviceable events with error code(s) of the form #xxxxxx, examine the details of each serviceable event to display the description field of the error code(s) of the form #xxxxxx. Step 1321-19 Double-click on the description field to view the complete text. Step 1321-20 Perform any actions that may be listed in the descriptions of all error codes of the form #xxxxxx before proceeding. Step 1321-21 Did you find a problem? No Make a note of the eight character error code number in the bottom of the description field. Use this error code when you are directed to the “Entry MAP” on page 149 later in this MAP. Go to “Step 1321-7” on page 171. Yes Go to ?MAP 0410: Repair Checkout? in Diagnostic Information for Multiple Bus Systems. 172
406/90
Service
Guide
Step 1321-22 Look for and record any additional error information or software messages on the operator panel or the console (serial TTY, Vterm, or HMC). Choose a running Linux partition (preferably the partition with the problem). Step 1321-23 Is Linux usable in any partition with Linux installed? No Go to “Step 1321-30” on page 178.Yes Go to “Step 1321-24.” Attention: “Step 1321-24” through “Step 1321-36” on page 180 must be performed on a partition running the Linux operating system. Step 1321-24 1. If your system is functional, determine your level of Linux by logging in to the system as the root user and entering the following command: ls -l /var/log/platform If the /var/log/platform file exists, go to substep 3. 2. Examine the Linux system log by entering the following command: cat /var/log/messages |grep RTAS |more Linux run-time RTAS error messages are logged in the messages file under /var/log. The following is an example of the Linux system RTAS error log messages.
Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan begin -------- Aug 27 18:13:41 rasler kernel: RTAS: Location Code: U0.1-P1-C1 Aug 27 18:13:41 rasler kernel: RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL Aug 27 18:13:41 rasler kernel: RTAS: initiator: UNKNOWN target: UNKNOWN Aug 27 18:13:41 rasler kernel: RTAS: Status: predictive new Aug 27 18:13:41 rasler kernel: RTAS: Date/Time: 20020827 18134000 Aug 27 18:13:41 rasler kernel: RTAS: CPU Failure Aug 27 18:13:41 rasler kernel: RTAS: CPU id: 0 Aug 27 18:13:41 rasler kernel: RTAS: Failing element: 0x0000 Aug 27 18:13:41 rasler kernel: RTAS: A reboot of the system may correct the problem Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan end ---------- Go to “Step 1321-25” on page 176. Chapter
3.
Maintenance
Analysis
Procedures
173
3. Record that, for this Linux partition, you performed substep 3 of “Step 1321-24” on page 173 for later steps.
Use the following command to list diagela messages recorded in the Linux system log:cat /var/log/platform |grep diagela |moreLinux run-time diagela error messages are logged in the platform file under /var/log.The following is an example of the Linux system error log diagela messages.Aug 13 09:38:45 larry diagela: 08/13/2003 09:38:44Aug 13 09:38:45 larry diagela: Automatic Error Log Analysis has detected a problem. Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: The Service Request Number(s)/Probable Cause(s) Aug 13 09:38:45 larry diagela: (causes are listed in descending order of probability): Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: 651-880: The CEC or SPCN reported an error. Report the SRN and the following reference and physical location codes to your service provider. Aug 13 09:38:45 larry diagela: Location: n/a FRU: n/a Ref-Code: B1004699 Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: Analysis of Error log sequence number: 3 Aug 29 07:13:04 larry diagela: 08/29/2003 07:13:04 Aug 29 07:13:04 larry diagela: Automatic Error Log Analysis has detected a problem. Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: The Service Request Number(s)/Probable Cause(s) Aug 29 07:13:04 larry diagela: (causes are listed in descending order of probability): Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: 651-880: The CEC or SPCN reported an error. Report the SRN and the following reference and physical location codes to your service provider. Aug 29 07:13:04 larry diagela: Location: U0.1-F4 FRU: 09P5866 Ref-Code: 10117661 Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: Analysis of /var/log/platform sequence number: 24 Sep 4 06:00:55 larry diagela: 09/04/2003 06:00:55 Sep 4 06:00:55 larry diagela: Automatic Error Log Analysis reports the following: Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: 651204 ANALYZING SYSTEM ERROR LOG Sep 4 06:00:55 larry diagela: A loss of redundancy on input power was detected. Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Check for the following: Sep 4 06:00:55 larry diagela: 1. Loose or disconnected power source connections. Sep 4 06:00:55 larry diagela: 2. Loss of the power source. Sep 4 06:00:55 larry diagela: 3. For multiple enclosure systems, loose or Sep 4 06:00:55 larry diagela: disconnected power and/or signal connections Sep 4 06:00:55 larry diagela: between enclosures. Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Supporting data: Sep 4 06:00:55 larry diagela: Ref. Code: 10111520 Sep 4 06:00:55 larry diagela: Location Codes: P1 P2 Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Analysis of /var/log/platform sequence number: 13 174
406/90
Service
Guide
4. Also use the following command to list RTAS messages recorded in the Linux system log: cat /var/log/platform |grep RTAS |more Linux RTAS error messages are logged in the platform file under /var/log. The following is an example of the Linux system error log RTAS messages.
Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event begin -------- Aug 27 12:16:33 larry kernel: RTAS 0: 04440040 000003f8 96008508 19155800 Aug 27 12:16:33 larry kernel: RTAS 1: 20030827 00000001 20000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 2: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 3: 49424d00 55302e31 2d463400 00503034 Aug 27 12:16:33 larry kernel: RTAS 4: 10117661 04a0005d 10110000 00000000 Aug 27 12:16:33 larry kernel: RTAS 5: 00007701 000000e0 00000003 000000e3 Aug 27 12:16:33 larry kernel: RTAS 6: 00000000 01000000 00000000 31303131 Aug 27 12:16:33 larry kernel: RTAS 7: 37363631 20202020 20202020 55302e31 Aug 27 12:16:33 larry kernel: RTAS 8: 2d463420 20202020 20202020 03705a39 Aug 27 12:16:33 larry kernel: RTAS 9: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 10: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 11: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 12: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 13: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 14: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 15: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 16: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 17: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 18: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 19: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 20: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 21: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 22: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 23: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 24: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 25: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 26: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 27: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 28: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 29: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 30: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 31: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 32: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 33: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 34: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 35: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 36: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 37: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 38: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 39: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 40: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 41: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 42: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 43: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 44: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 45: 00000000 00000000 00000000 00000000 Chapter
3.
Maintenance
Analysis
Procedures
175
Aug 27 12:16:33 larry kernel: RTAS 46: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 47: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 48: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 49: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 50: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 51: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 52: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 53: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 54: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 55: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 56: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 57: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 58: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 59: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 60: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 61: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 62: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 63: 00000000 00000000 00000000 00020000 Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event end ---------- Error codes and location codes may appear as RTAS messages. The extended data is also provided in the form of an RTAS message. The extended data contains other error code words that help in isolating the correct FRUs. The start of the extended data is marked, for example, by the line Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event begin --------. The number after the colon is a sequence number that correlates this data any diagela data with the same sequence number. The end of the extended data is marked by the line Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event end ---------- with the same sequence number. Word 13 and word 19 are found in the RTAS messages. For example, to find word 13, first find the error code in the left column of words of the extended data, 10117661. In this example, we find the error code to the right of ?RTAS 4:?. This is also word 11. To get word 13, 10110000, simply count the words left-to-right, beginning at word 11.
Step 1321-25 If you performed substep 2 of “Step 1321-24” on page 173, then record any RTAS messages found in the Linux system log in “Step 1321-24” on page 173. If you performed substep 3 of “Step 1321-24” on page 173, then record any RTAS and diagela messages found in the Linux system log in “Step 1321-24” on page 173, and also record any extended data found in the RTAS messages, especially word 13 and word 19. Ignore all other messages in the Linux system log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-24” on page 173 and “Step 1321-25”for all logical partitions that have Linux installed. 176
406/90
Service
Guide
Step 1321-26 If you performed substep 3 of “Step 1321-24” on page 173 for the current Linux partition, go to “Step 1321-27,” and when asked in “Step 1321-27,” do not record any RTAS messages from “Step 1321-26” for the current Linux partition. Examine the Linux boot (IPL) log by logging in to the system as the root user and entering the following command: cat /var/log/boot.msg /var/log/dmesg 2>/dev/null | grep RTAS |more Linux boot (IPL) error messages are logged into the boot.msg file or the dmesg file under /var/log. The following is an example of the Linux boot error log.
RTAS daemon started RTAS: -------- event-scan begin -------- RTAS: Location Code: U0.1-F3 RTAS: WARNING: (FULLY RECOVERED) type: SENSOR RTAS: initiator: UNKNOWN target: UNKNOWN RTAS: Status: bypassed new RTAS: Date/Time: 20020830 14404000 RTAS: Environment and Power Warning RTAS: EPOW Sensor Value: 0x00000001 RTAS: EPOW caused by fan failure RTAS: -------- event-scan end ---------- Step 1321-27 Record any RTAS messages found in the Linux boot (IPL) log in Step 1321-26. Ignore all other messages in the Linux boot (IPL) log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-26” and “Step 1321-27” for all logical partitions that have Linux installed. Chapter
3.
Maintenance
Analysis
Procedures
177
Step 1321-28 If you performed substep 3 of “Step 1321-24” on page 1731321-24 for the current Linux partition, go to “Step 1321-29,” and when asked in “Step 1321-29,” do not record any additional extended data from “Step 1321-26” on page 177 for the current Linux partition. Examine the extended data in both logs. The following is an example of the Linux extended data.
<3>RTAS daemon started <3>RTAS: -------- event-scan begin -------- <3>RTAS: Location Code: U0.1-P1-C2 <4>RTAS: Log Debug: 04 4b2726fb04a00011702c0014000000000000000000000000f1800001001801d3ffffffff01000000000000004234313820202020383030343236464238454134303030303 030303030303030 <4>RTAS: Log Debug: D25046413405020d0a000001000271400100000033434d502044415441000001000000000000010000f1800001535953204441544100000000000000002002162715010509200216271501050920020637150105095352432044415441702c001400000000000000020018820201d3820000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000280048ea400000000000000000000000000000000000000004350542044415441702cff08000000001c000000702cf0080000000080000000702cf100702cf200702c000400000800702c01040bf2002e702c02040c1fffbf702c0300702c1000702c11040bf2002e702c12040c1fffbf702c1300702ca000702ca108000000000000a03c702ca208000000000000effc702cb000702cb108000000000000a03c702cb208000000000000effc702cc000702cc108000000000000a03c702cc208000000000000effc702c3000702c31080000000000000003702c3208000000000000007b702c8000702c81080000000020e27a39702c820800000000fffeffff702cd000702cd1080000000010004010702cd208000000007777f3ffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff<3>RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL <3>RTAS: initiator: UNKNOWN target: UNKNOWN <3>RTAS: Status: unrecoverable new <3>RTAS: Date/Time: 20020905 15372200 <3>RTAS: CPU Failure <3>RTAS: Internal error (not cache) <3>RTAS: CPU id: 0 <3>RTAS: Failing element: 0x0000 <3>RTAS: -------- event-scan end --------- Step 1321-29 Record any extended data found in the Linux system log in Step 1321-24 or the Linux boot (IPL) log in Step 1321-26. Be sure to record word 13. Note: The line(s) in the Linux extended data that begin with ?<4>RTAS: Log Debug: 04? contain the error code listed in the next 8 hex characters. In the previous example, ?4b27 26fb? is an error code. The error code is also known as word 11. Each 4 bytes after the error code in the Linux extended data is another word (for example, ?04a0 0011? is word 12, and ?702c 0014? is word 13, and so on). If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-28” and “Step 1321-29” for all logical partitions that have Linux installed. Step 1321-30 Were any error codes or checkpoints recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, or 1321-29? No Go to “Step 1321-31” on page 179. Yes Go to the “Quick Entry MAP” on page 151 with each recorded error code or symptom. Perform the indicated actions one at a time for each error code until the problem has been corrected. If all recorded error codes have been processed and the problem has not been corrected, go to “Step 1321-31” on page 179.
178
406/90
Service
Guide
Step 1321-31 If no additional error information is available and the problem has not been corrected, shut down the system. If an HMC is not attached, you need a laptop computer (and cable, part number 62H4857) capable of connecting to serial port 1 on the processor subsystem. Note: The Linux login prompt cannot be seen on a laptop computer connected to serial port 1. 1. If there is no HMC, attach the laptop computer and cable to serial port 1 on the processor subsystem. 2. When OK displays in the operator panel, press a key on the virtual terminal on the laptop computer. The service processor menus are available on the attached laptop computer or on the HMC. 3. Examine the service processor (SP) error log. Record all error codes and messages written to the SP error log. Go to “Step 1321-32.” 4. If the service processor menus are not available, perform the following steps: a. Examine and correct all connections to the service processor. b. Replace the service processor.Note: The service processor might be contained on a separate card or book. Or in some systems, the service processor is built into the service processor/PCI backplane, or into the system backplane. c. If you have not already replaced the system backplane in an earlier step, replace the system backplane. Note: Call service support before replacing the system backplane.Step 1321-32 Boot the system to the system management services (SMS) menus. Record all error codes and messages written to the SMS error log. Step 1321-33 Were any error codes recorded in steps 1321-31 or 1321-32? No Go to “Step 1321-41” on page 181. Yes Go to the “Quick Entry MAP” on page 151 with each error code or symptom you have recorded. Perform the indicated actions, one at a time, until the problem has been corrected. If all recorded error codes have been processed and the problem has not been corrected, go to “Step 1321-41” on page 181.Step 1321-34 Reboot the system and and bring all partitions to the login prompt. If Linux is not usable in all partitions, go to “Step 1321-38” on page 181. Step 1321-35 Use the lscfg command to list all resources assigned to all partitions. Record the adapter and the partition for each resource. Chapter
3.
Maintenance
Analysis
Procedures
179
Step 1321-36 To determine if any device(s) or adapter(s) are missing, compare the list of found resources and partition assignments to the customer’s known configuration. Record the location of any missing devices. Also record any differences in the descriptions or the locations of devices. You may also compare this list of found resources to a prior version of the device tree as follows: Note: Every time that the partition is booted, the update-device-tree command is run and the device tree is stored in the /var/lib/lsvpd/ directory in a file with the file name device-tree-YYYY-MM-DD-HH:MM:SS, where YYYY is the year, MM is the month, DD is the day, and HH, MM, and SS are the hour, minute and second, respectively, of the date of creation. v At the command line, type the following: cd /var/lib/lsvpd/ v At the command line, type the following: lscfg -vpd device-tree-2003-03-31-12:26:31. This displays the device tree created on 03/31/2003 at 12:26:31. The diff command offers a way to compare the output from a current lscfg command to the output from an older lscfg command. If the files names for the current and old device trees are current.out and old.out, respectively, type: diff old.out current.out. Any lines that exist in the old, but not in the current will be listed and preceded by a less-than symbol (<). Any lines that exist in the current, but not in the old will be listed and preceded by a greater-than symbol (>). Lines that are the same in both files are not listed; for example, files that are identical will produce no output from the diff command. If the location or description changes, lines preceded by both < and > will be output. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-35” on page 179 and “Step 1321-36” for all logical partitions that have Linux installed. Step 1321-37 Was the location of one and only one device recorded in “Step 1321-36”? No If you previously answered Yes to “Step 1321-37,” return the system to its original configuration. This ends the procedure. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. If you did not previously answer Yes to “Step 1321-37,” go to “Step 1321-38” on page 181. Yes Perform the following steps one at a time. Power off the system before performing each step. After performing each step, power on the system and go to “Step 1321-34” on page 179. 1. Check all connections from the system to the device. 2. Replace the device (for example, tape or DASD) 3. If applicable, replace the device backplane. 4. Replace the device cable. 5. Replace the adapter. 6. If the adapter resides in an I/O drawer, replace the I/O backplane. 7. If the device adapter resides in the CEC, replace the I/O riser card, or the CEC backplane where the device adapter is located. 8. Call service support. Do not go to “Step 1321-34” on page 179. 180
406/90
Service
Guide
Step 1321-38 Does the system appear to stop or hang before reaching the login prompt or did you record any problems with resources in “Step 1321-36” on page 180?Note: If the system console or VTERM window is always blank, choose NO. If you are sure the console or VTERM is operational and connected correctly, answer the question for this step. No Go to “Step 1321-39.”Yes There may be a problem with an I/O device. Go to “MAP 1542: I/O Problem Isolation” on page 318. In MAP 1542, when instructed to boot the system, boot a full system partition.Step 1321-39 Boot the AIX standalone diagnostics, and run diagnostics in problem determination mode on all resources. Be sure to boot a full system partition. Ensure that AIX diagnostics was run on all known resources. You may need to select each resource individually and run diagnostics on each resource one at a time. Did AIX standalone diagnostics find a problem? No Go to “Step 1321-43” on page 182.Yes Go to the table in “Entry MAP” on page 149 with each error code or SRN you have recorded. Until the problem has been corrected, perform the indicated actions, one at a time, for each SRN or error code not already processed in “Step 1321-37” on page 180. If all recorded SRNs or error codes have been processed and the problem has not been corrected, go to “Step 1321-43” on page 182.Step 1321-40 Does the system have Linux installed on one or more partitions? No Go to “Step 1321-14” on page 172.Yes Go to “Step 1321-22” on page 173.Step 1321-41 Were any location codes recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, 1321-29, 1321-31, or 1321-32? No Go to “Step 1321-34” on page 179.Yes Replace, one at a time, all parts whose location code was recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, 1321-29, 1321-31, and 1321-32 that have not been replaced. Power off the system before replacing a part. After replacing the part, power on the system to check if the problem has been corrected. Go to “Step 1321-42” on page 182 when the problem has been corrected, or all parts in the location codes list have been replaced. Chapter
3.
Maintenance
Analysis
Procedures
181
Step 1321-42 Was the problem corrected in Step 1321-41? No Go to “Step 1321-34” on page 179.Yes Return the system to its original configuration. This ends the procedure. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1321-43 Were any other symptoms recorded in step 1321-1? No Call support.Yes Go to the “Quick Entry MAP” on page 151 with each symptom you have recorded. Perform the indicated actions for all recorded symptoms, one at a time, until the problem has been corrected. If all recorded symptoms have been processed and the problem has not been corrected, call support. 182
406/90
Service
Guide
MAP 1322: End of Call MAP for Systems with Service Focal Point Note: If you are working on a system that not managed by an HMC go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems Step 1322-1 For future reference, record the SRC or symptom and the location code of the FRU you replaced. Step 1322-2 For Service Focal Point operating instructions, refer to “Service Focal Point” on page 4. At the HMC, open Service Focal Point and examine the service action event log for any open service action events. Step 1322-3 Are there any service action events that are open? No If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the System Attention LED” on page 22. This completes the repair, return the system to the customer. Yes Go to “Step 1322-4”.Step 1322-4 Record the list of open service action events. Step 1322-5 From the list of serviceable events recorded in “Step 1322-4,” perform “Step 1322-6” through “Step 1322-33” on page 186 for each open service action event. Step 1322-6 Determine the error class of the serviceable event. Record for future use. Step 1322-7 Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in ?Step 1322-1?? No Go to “Step 1322-8.” Yes Go to “Step 1322-11” on page 184.Step 1322-8 Examine the FRU list of the service action event. Are there any FRUs listed for the service action event? No Go to “Step 1321-11” on page 171. Yes Go to “Step 1321-9” on page 171. Chapter
3.
Maintenance
Analysis
Procedures
183
Step 1322-9 Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in “Step 1322-1” on page 183? No Go to “Step 1322-10.” Yes Go to “Step 1322-11.”Step 1322-10 The FRU list is different. Is the FRU you replaced and recorded in “Step 1322-1” on page 183 in the list of FRUs for this service action event? No Go to “Step 1322-33” on page 186.Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair.Yes Go to “Step 1322-11.”Step 1322-11 Step 1322-12 Is the error code associated with this service action event of the form A11-xxx or A01-xxx? No Go to “Step 1322-17” on page 185. Yes Go to “Step 1322-13.”Step 1322-13 Have you begun a list of “Axx” partitions from prior service action events that you processed in this MAP? No Go to “Step 1322-14.” Yes Go to “Step 1322-15.”Step 1322-14 Step 1322-15 Add the partition list obtained in “Step 1322-11” to the existing list of “Axx” partitions obtained from processing previous service action events in this MAP. 184
406/90
Service
Guide
Step 1322-16 Remove all entries in the list of all partition(s) you recorded in “Step 1322-11” on page 184. If you are referred to the list of partition(s) obtained in “Step 1322-11” on page 184 in future steps, the list is empty. Go to “Step 1322-17.” Step 1322-17 Select and highlight the service action event from the “Error Associated With This Serviceable Event” window. Step 1322-18 Click Close Event. Step 1322-19 Add comments for the serviceable event. Include any unique additional information. Click OK. Step 1322-20 The following steps will add or update FRU information. Step 1322-21 Did you replace, add, or modify a FRU of the open service action event? No Go to “Step 1322-23.” Yes Go to “Step 1322-22.”Step 1322-22 From the FRU list, select a FRU that you need to update. Double-click on the FRU, and update the FRU information. Go to “Step 1322-24.” Step 1322-23 Select the No FRU Replaced for this Serviceable Event option. Step 1322-24 Click OK to close the service action event. Step 1322-25 Is the list of all partition(s) you recorded in “Step 1322-11” on page 184 empty? No Go to “Step 1322-26.” Yes Go to “Step 1322-33” on page 186.Step 1322-26 Does the list of all partition(s) you recorded in “Step 1322-11” on page 184 contain more than one entry? No Go to “Step 1322-33” on page 186. Yes Go to “Step 1322-27” on page 186. Chapter
3.
Maintenance
Analysis
Procedures
185
Step 1322-27 Is the error class recorded in “Step 1322-6” on page 183 AIX? No Go to “Step 1322-33.” Yes Go to “Step 1322-28.”Step 1322-28 Perform the following steps for each entry in the list of all partition(s) you recorded in “Step 1322-11” on page 184, except the partition you were using to debug the original problem. Step 1322-29 From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. Step 1322-30 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection.Step 1322-31 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-32 Have all the partitions in the list of all partition(s) you recorded in “Step 1322-11” on page 184 been processed? No Go to “Step 1322-28” to process the next partition in the list you recorded in “Step 1322-11” on page 184. Yes Go to “Step 1322-33”.Step 1322-33 Have all the serviceable events recorded in “Step 1322-4” on page 183 been processed? No Go to “Step 1322-5” on page 183 and process the next service action event in the list of serviceable events recorded in “Step 1322-4” on page 183. Yes Go to “Step 1322-34” on page 187. 186
406/90
Service
Guide
Step 1322-34 While processing all service action events, were you directed to “Step 1322-14” on page 184? No If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the LED when a System is Attached To an HMC” on page 22. This completes the repair. Return the system to the customer.Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair.Yes Go to “Step 1322-35.”Step 1322-35 Perform the following steps for each entry in the list of “Axx” partitions you began recording in “Step 1322-14” on page 184, except the partition you were using to debug the original problem. Step 1322-36 From the HMC virtual terminal window of a partition in the list of “Axx” partitions, type diag at the AIX command prompt. Step 1322-37 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection.Step 1322-38 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-39 Have all the partitions in the list of “Axx” partitions(s) you began recording in “Step 1322-14” on page 184 been processed? No Go to “Step 1322-35” to process the next partition in the list you recorded in “Step 1322-14” on page 184. Yes If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the LED when a System is Attached To an HMC” on page 22. This completes the repair. Return the system to the customer.Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Chapter
3.
Maintenance
Analysis
Procedures
187
MAP 1420: Recovery Procedures for Hot-Pluggable PCI Adapters This procedure is used to locate problems that might occur when performing a PCI adapter hot-plug operation. Find the symptom in the following table; then follow the instructions given in the Action column.
Recovery Procedures for Hot-Pluggable PCI Adapters
Symptom/Reference Code
Action
Partition will not boot rendering PCI hot-plug manager useless.
Go to “MAP 1421: Partition Will Not Boot Due to Faulty Adapter.”
Partition boots, but PCI adapter is not recognized and slot status is empty.
Go to “MAP 1422: Slot is Empty Even When Populated” on page 189.
MAP 1421: Partition Will Not Boot Due to Faulty Adapter Step 1421-1 The entire system must be shut down and powered off to repair. Does the customer wish to restore the partition by isolating the faulty adapter without powering off the entire system? NO Go to “Step 1421-12” on page 189. YES Go to “Step 1421-2.”Step 1421-2 At the HMC, view the resources assigned to the partition that will not boot. Step 1421-3 Remove the faulty resource from the partition that will not boot. Record the slot location and the adapter type. Step 1421-4 Attempt to boot the partition. Does the partition boot? NO Go to “Step 1421-12” on page 189. YES Go to “Step 1421-5.”Step 1421-5 Shut down the partition. 188
406/90
Service
Guide
Step 1421-6 Is the PCI slot power LED of the location recorded in “Step 1421-3” on page 188 on? NO Go to “Step 1421-7.” YES Go to “Step 1421-12.”Step 1421-7 Are there any empty PCI slots attached to the system that are not assigned to any partitions? NO Go to “Step 1421-12.” YES Go to “Step 1421-8.”Step 1421-8 Assign the available empty and unassigned slot to the partition that will not boot. Step 1421-9 Insert a new adapter of the same type recorded in “Step 1421-3” on page 188 into the slot assigned in “Step 1421-8.” Step 1421-10 Move any external cabling from the adapter in the slot recorded in “Step 1421-3” on page 188 to the adapter in the slot assigned in “Step 1421-8.” Step 1421-11 Attempt to boot the partition. Does the partition boot? NO Go to “Step 1421-12.” YES Partition and all resources are operational. Schedule repair for the slot recorded in “Step 1421-3” on page 188 with the customer. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1421-12 Cannot isolate the adapter without powering off the entire system. Schedule repair for the slot recorded in “Step 1421-3” on page 188 with the customer. This ends the procedure. MAP 1422: Slot is Empty Even When Populated Step 1422-1 Record the I/O drawer and slot location of the slot that is populated with an adapter but shows empty in the PCI hot-plug manager. Also record the adapter type. Chapter
3.
Maintenance
Analysis
Procedures
189
Step 1422-2 Is the PCI slot power LED of the location recorded in “Step 1422-1” on page 189 on? NO Go to “Step 1422-3.” YES Go to “Step 1422-7.”Step 1422-3 Disconnect any external cables from the adapter identified in “Step 1422-1” on page 189. Step 1422-4 Remove the adapter identified in “Step 1422-1” on page 189. Step 1422-5 Follow the procedure “Installing a New Hot-Pluggable PCI Adapter” on page 923 to add a new adapter at the location recorded in “Step 1422-1” on page 189. The new adapter should be the same type as that removed in “Step 1422-1” on page 189. Return to “Step 1422-6” after the new adapter is added. Step 1422-6 Did the PCI hot-plug procedure successfully add a new adapter at the location recorded in “Step 1422-1” on page 189? NO Go to “Step 1422-7.” YES Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1422-7 Shut down the partition that the faulty slot is assigned to. Step 1422-8 Is the PCI slot power LED of the location recorded in “Step 1422-1” on page 189 on? NO Go to “Step 1422-9.” YES Go to “Step 1422-14” on page 191Step 1422-9 Are there any empty PCI slots attached to the system that are not assigned to any partitions? NO Go to “Step 1422-14” on page 191. YES Go to “Step 1422-10.”Step 1422-10 Assign the available empty and unassigned slot to the partition that the faulty slot is assigned to. 190
406/90
Service
Guide
Step 1422-11 Insert a new adapter of the same type recorded in “Step 1422-1” on page 189 into the slot assigned in “Step 1422-10” on page 190. Step 1422-12 Move any external cabling from the adapter in the slot recorded in “Step 1422-1” on page 189 to the adapter in the slot assigned in “Step 1422-10” on page 190. Step 1422-13 Attempt to boot the partition. Does the partition boot?. NO Go to “Step 1422-14.” YES Partition and all resources are operational. Schedule repair with the customer for the slot recorded in “Step 1422-1” on page 189. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1422-14 You cannot repair the adapter without powering off the entire system. Schedule repair with the customer for the slot recorded in “Step 1422-1” on page 189. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
191
MAP 1520: Power This procedure is used to locate power problems in the processor subsystem, I/O subsystem(s) or rack If a problem is detected, this procedure helps you isolate the problem to a failing unit. Observe the following safety notices during service procedures. DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C01 CAUTION: Energy hazard, remove power before servicing.C22 Attention: To identify a FRU, these MAPs refer to FRU Identify LEDs. Information about the FRU Identify LEDs can be found in “FRU Identify LEDs” on page 65. 192
406/90
Service
Guide
Find the symptom in the following table; then follow the instructions given in the Action column.
Power Problems
Symptom/Reference Code
Action
The system will not power on and no error codes are available
Go to “MAP 1521: The System Will Not Power On And No Error Codes Are Available” on page 196.
1011 8700, 1011 8701, 101A 0106, 101B 0106
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
1014 6014, 1014 6814, 1014 B014, 1014 B814, 1015 6014, 1015 6814, 1015 B014, 1015 B814, 1016 6014, 1016 6814, 1016 B014, 1016 B814,1017 6014, 1017 6814, 1017 B014, 1017 B814, 1021 6014, 1021 6814, 1021 B014, 1021 B814, 1022 6014, 1022 6814, 1022 B014, 1022 B814, 1023 6014, 1023 6814, 1023 B014, 1023 B814, 1024 6014, 1024 6814, 1024 B014, 1024 B814,1025 6014, 1025 6814, 1025 B014, 1025 B814
Go to “MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem” on page 259.
1014 6024, 1014 6824, 1014 B024, 1014 B824, 1015 6024, 1015 6824, 1015 B024, 1015 B824, 1016 6024, 1016 6824, 1016 B024, 1016 B824,1017 6024, 1017 6824, 1017 B024, 1017 B824, 1021 6024, 1021 6824, 1021 B024, 1021 B824, 1022 6024, 1022 6824, 1022 B024, 1022 B824, 1023 6024, 1023 6824, 1023 B024, 1023 B824, 1024 6024, 1024 6824, 1024 B024, 1024 B824,1025 6024, 1025 6824, 1025 B024, 1025 B824
Go to “MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem” on page 267.
1014 6034, 1014 6834, 1014 B034, 1014 B834, 1015 6034, 1015 6834, 1015 B034, 1015 B834, 1016 6034, 1016 6834, 1016 B034, 1016 B834,1017 6034, 1017 6834, 1017 B034, 1017 B834, 1021 6034, 1021 6834, 1021 B034, 1021 B834, 1022 6034, 1022 6834, 1022 B034, 1022 B834, 1023 6034, 1023 6834, 1023 B034, 1023 B834, 1024 6034, 1024 6834, 1024 B034, 1024 B834,1025 6034, 1025 6834, 1025 B034, 1025 B834
Go to “MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem” on page 276.
1014 6044, 1014 6844, 1014 B044, 1014 B844, 1015 6044, 1015 6844, 1015 B044, 1015 B844, 1016 6044, 1016 6844, 1016 B044, 1016 B844,1017 6044, 1017 6844, 1017 B044, 1017 B844, 1021 6044, 1021 6844, 1021 B044, 1021 B844, 1022 6044, 1022 6844, 1022 B044, 1022 B844, 1023 6044, 1023 6844, 1023 B044, 1023 B844, 1024 6044, 1024 6844, 1024 B044, 1024 B844,1025 6044, 1025 6844, 1025 B044, 1025 B844
Go to “MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem” on page 284.
1014 6054, 1014 6854, 1014 B054, 1014 B854, 1015 6054, 1015 6854, 1015 B054, 1015 B854, 1016 6054, 1016 6854, 1016 B054, 1016 B854,1017 6054, 1017 6854, 1017 B054, 1017 B854, 1021 6054, 1021 6854, 1021 B054, 1021 B854, 1022 6054, 1022 6854, 1022 B054, 1022 B854, 1023 6054, 1023 6854, 1023 B054, 1023 B854, 1024 6054, 1024 6854, 1024 B054, 1024 B854,1025 6054, 1025 6854, 1025 B054, 1025 B854
Go to “MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem” on page 293. Chapter
3.
Maintenance
Analysis
Procedures
193
Power Problems
Symptom/Reference Code
Action
1014 CD96, 1014 CDA6, 1014 CDB6, 1014 CDC6,1015 CD96, 1015 CDA6, 1015 CDB6, 1015 CDC6,1016 CD96, 1016 CDA6, 1016 CDB6, 1016 CDC6,1017 CD96, 1017 CDA6, 1017 CDB6, 1017 CDC6,1021 CD96, 1021 CDA6, 1021 CDB6, 1021 CDC6,1022 CD96, 1022 CDA6, 1022 CDB6, 1022 CDC6,1023 CD96, 1023 CDA6, 1023 CDB6, 1023 CDC6,1024 CD96, 1024 CDA6, 1024 CDB6, 1024 CDC6,1025 CD96, 1025 CDA6, 1025 CDB6, 1025 CDC6
Go to “MAP 152n: DASD Subsystem Power Problem” on page 304.
101A FE16 for BPC A or101B FE16 for BPC B
Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199
101A 0856 or 101A FE56for BPR 1A 101A 0866 or 101A FE66 for BPR 2A 101A 0876 or 101A FE76 for BPR 3A 101B 0856 or 101B FE56 for BPR 1B 101B 0866 or 101B FE66 for BPR 2B 101B 0876 or 101B FE76 for BPR 3B
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101A FDB6 for BPI A or 101B FDB6 for BPI B
Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201.
101A 0D06 for BPA A or 101B 0D06 for BPA B
Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203.
101A 7A86 or 101A 7B86 for IBF 1A101A 7A96 or 101A 7B96 for IBF 2A101A 7AA6 or 101A 7BA6 for IBF 3A101B 7A86 or 101B 7B86 for IBF 1B101B 7A96 or 101B 7B96 for IBF 2B101B 7AA6 or 101B 7BA6 for IBF 3B
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 1C06, 101B 1C06, 101C 1C06, 1014 1C06, 1015 1C06, 1016 1C06, 1017 1C06, 1021 1C06, 1022 1C06, 1023 1C06
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
101C E006 or 101C E106
Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211.
101A D216 for BPA A or 101B D216 for BPA B
Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212.
101A F806 or 101A F906 for BPC Aor101B F806 or 101B F906 for BPC B
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 194
406/90
Service
Guide
Power Problems
Symptom/Reference Code
Action
1014 xx15, 1014 xx25, 1015 xx15, 1015 xx25, 1016 xx15, 1016 xx25, 1017 xx15, 1017 xx25, 101C xx15, 101C xx25, 101C xx35, 101C xx45, 101C xx55, 101C xx75, 101C xx85, 101C xx95, 101C xxA5, 101A x115, 101A x125, 101A x135, 101A x215,101A x225, 101A x235, 101A x315, 101A x325,101A x335, 101A x415, 101A x425, 101A x435,101A x515, 101A x525, 101A x535, 101A x615,101A x625, 101A x635, 101A x715, 101A x725,101A x735, 101A x825, 101A x835, 101A x925,101A x935, 101A xA25, 101A xA35,101B x115, 101B x125, 101B x135, 101B x215,101B x225, 101B x235, 101B x315, 101B x325,101B x335, 101B x415, 101B x425, 101B x435,101B x515, 101B x525, 101B x535, 101B x615,101B x625, 101B x635, 101B x715, 101B x725,101B x735, 101B x825, 101B x835, 101B x925,101B x935, 101B xA25, 101B xA35,1021 xx15, 1021 xx25, 1022 xx15, 1022 xx25,1023 xx15, 1023 xx25, 1024 xx15, 1024 xx25,1025 xx15, 1025 xx25
Go to “MAP 152m: Cable Problem in Power Subsystem” on page 300.
101C 6014, 101C 6814, 101C B014, 101C B814
Go to “MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem” on page 216.
101C 6114, 101C 6914, 101C B114, 101C B914
Go to “MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem” on page 223.
101C 6124, 101C 6924, 101C B124, 101C B924
Go to “MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem” on page 240.
101C 6134, 101C 6934, 101C B134, 101C B934
Go to “MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem” on page 231.
101C 6144, 101C 6944, 101C B144, 101C B944
Go to “MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem” on page 245.
101C 6514, 101C 6D14, 101C B514, 101C BD14
Go to “MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem” on page 253.
Chapter
3.
Maintenance
Analysis
Procedures
195
MAP 1521: The System Will Not Power On And No Error Codes Are Available Step 1521-1 Ensure that the following are true: v Both power cords are properly connected v The UEPO panel switch is in the ON position v The UEPO BYPASS switches on both BPCs are in the Normal position v The on/off switches located on all the BPRs are in the on (left) positionNote: A switch set to the off position cannot cause this problem, however, any switch found in the off position must be reset to the on position (wait 10 seconds) before proceeding. v The cable from UEPO connector J00 to BPC-A connector J05 and the cable from UEPO connector J01 to BPC-B connector J05 are properly connected without visible sign of damage. v The room temperature is not in excess of the maximum allowed (40 degrees Celsius or 104 degrees Fahrenheit)Note: If the room temperature exceeds the specified maximum, the machine may continually cycle on and off.Are all the above conditions true? NO Correct any problems and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1521-2.”Step 1521-2 Check the LEDs on both sides of the bulk power assembly (BPA). Are all LEDs off on both sides of BPA? NO Go to “Step 1521-3.” YES Go to “Step 1521-9” on page 198Step 1521-3 Check the following LED states on both Bulk Power Controllers (BPCs): v UEPO PWR LED turned on v BPC GOOD LED turned on v All other LEDs are offAre all of the above true? NO Go to “Step 1521-4” on page 197. YES Go to “Step 1521-5” on page 197. 196
406/90
Service
Guide
Step 1521-4 Independent faults are indicated on both sides of the BPA. Each side must be isolated separately. Call for support. This ends the procedure. Step 1521-5 Check the External (Room) EPO Connection if used. If a room emergency power off (EPO) circuit is used, the external room EPO circuit is connected to the J02 connector on the lower edge of the UEPO Switch. Is a cable connected into the connector J02 on the UEPO Switch? NO Go to “Step 1521-6.” YES Go to “Step 1521-7.”Step 1521-6 Is the internal toggle switch on the EPO panel card set to the ’RM EPO BYPASS’ position? NO Set the internal toggle switch to the ’RM EPO BYPASS’ position, and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Exchange the UEPO card assembly. Follow all the steps in the removal and replacement procedure for the “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. Go to “Step 1521-8” on page 198.Step 1521-7 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. Disconnect the cable from UEPO connector J02, and set the toggle switch to the ’RM EPO BYPASS’ position. Does the UEPO CMPLT LED on at least one BPC become lit? NO Exchange the UEPO card assembly. Follow all the steps in the removal and replacement procedure for the “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. Go to “Step 1521-8” on page 198. YES Notify the customer that the room EPO circuit is defective at this connection and requires service. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
197
Step 1521-8 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support. This ends the procedure.Step 1521-9 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the test points on the frame labeled Phase A and Phase B. Is the ac voltage greater than 180 V ac? NO Inform the customer that the line voltage into the BPA is missing or too low. This ends the procedure. YES Independent faults are indicated on both sides of the BPA. Each side must be isolated separately. Call for support. This ends the procedure. 198
406/90
Service
Guide
MAP 1522: UEPO Switch On The BPC Is In The Bypass Position Step 1522-1 Check the UEPO Switch on the BPC on the BPA that is exhibiting the failure. Is the switch in the NORMAL position? NO Set UEPO switch on the BPC to the NORMAL position and go to “Step 1522-2.” YES Go to “Step 1522-2.”Step 1522-2 Press the Service Complete button. Is error code 101A FE16 or 101B FE16 displayed? NO The problem has already been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the BPC on the side that is exhibiting the failure. Follow all the steps in the “Bulk Power Controller (BPC)” on page 845. Go to “Step 1522-3.”Step 1522-3 If not done already, press the service complete button. Is error code 101A FE16 or 101B FE16 displayed? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
199
MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault Step 1523-1 Check the position of the on/off switch on the BPR referenced by the error code that sent you to this MAP. Is the switch in the ON position (to the left)? NO Set the switch to the ON position (to the left), and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1523-2.”Step 1523-2 Exchange the following FRUs one at a time: v BPR referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. v BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849.After each FRU exchanged, is the error code that sent you to this MAP still reported? NO Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Replace the next FRU on the list. 200
406/90
Service
Guide
MAP 1524: An Open Room EPO Switch Has Been Detected From One Side Step 1524-1 Check the unit emergency power off (UEPO) panel. Is a customer room EPO cable plugged into the UEPO panel? NO Go to “Step 1524-3” on page 202. YES Go to “Step 1524-2” on page 202. Chapter
3.
Maintenance
Analysis
Procedures
201
Step 1524-2 1. Set UEPO switch on both BPCs to the BYPASS position. 2. Unplug the customer room EPO cable. 3. Set switch (J4) on the UEPO panel to EPO BYPASS position. 4. Set UEPO switch on both BPCs back to the NORMAL position. 5. Press the service complete button. Is 101A FDB6 or 101B FDB6 generated? NO Go to “Step 1524-4.” YES Go to “Step 1524-3.”Step 1524-3 Exchange the following FRUs one at a time: 1. UEPO card assembly. Follow all steps in the removal and replacement procedures for “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 2. Cable from UEPO to BPC on the side referenced by the error code that sent you to this MAP. 3. BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. After each FRU exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.Step 1524-4 Perform the following actions one at a time: 1. Replace the UEPO card assembly. Follow all steps in the removal and replacement procedures for “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 2. Check with the customer’s site services to have them test the room EPO switch for correct operation. After each action is performed, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Perform the next action on the list. If all actions on the list have been performed, call for support. 202
406/90
Service
Guide
MAP 1525: There Is a 350 Volt Bulk Failure Step 1525-1 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimieter to measure the ac voltage at the following test points on the faceplate of the BOA that is referecned by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 1525-7” on page 205. YES Go to “Step 1525-2.”Step 1525-2 Notes: 1. This problem can be caused by any BPR plugged into the failing BPA. 2. The system can have one or two BPRs, depending on the system configuration. 3. Each BPR on the failing BPA must be checked independently. The following steps check all the BPRs on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. Are there any remaining BPRs to be checked? NO Go to “Step 1525-3” on page 204. YES Go to “Step 1525-4” on page 204. Chapter
3.
Maintenance
Analysis
Procedures
203
Step 1525-3 Notes: 1. This problem can be caused by any BPD plugged into the failing BPA. 2. The system can have up to three BPDs, depending on the system configuration. 3. Each BPD on the failing BPA must be checked independently. The following steps check all the BPDs on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. Are there any BPDs remaining to be checked? NO Exchange the following FRUs one at a time: v BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. v Replace the BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.YES Go to “Step 1525-5” on page 205.Step 1525-4 Note: Be sure you are working on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. FRUs on the operating side of the BPA (with BPC GOOD LED on) must not be disturbed during this procedure. 1. Turn the BPR on/off switch to the off position (to the right). 2. Unplug the BPR to be checked and carefully pull it approximately 25 mm (1 inch) away from the backplane without unplugging any cables. If there is more than one BPR left to be checked, start with the uppermost unchecked BPR and make a note of which BPR you unplugged. Does the BPC GOOD LED on the BPC come on and stay on? NO Replug the BPR and set all the BPR on/off switches on this BPA to the on position (to the left), and continue with “Step 1525-1” on page 203. YES Replace the BPR that is currently unplugged. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. Go to “Step 1525-6” on page 205. 204
406/90
Service
Guide
Step 1525-5 Note: Be sure you are working on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. FRUs on the operating side of the BPA (with BPC GOOD LED on) must not be disturbed during this procedure. 1. Turn all the BPR on/off switches on this BPA to the off position (to the right). 2. Unplug the BPD to be checked and carefully pull it approximately 25 mm (1 inch) away from the backplane without unplugging any cables. If there is more than one BPD left to be checked, start with the uppermost unchecked BPD and make a note of which BPD you unplugged. 3. Set all the BPR on/off switches on this BPA to the on position (to the left). Does the BPC GOOD LED on the BPC come on and stay on? NO Replug the BPD and continue with “Step 1525-3” on page 204. YES Replace the BPD that is currently unplugged. Follow all steps in the removal and replacement procedures for “Bulk Power Distributor (BPD)” on page 847. Go to “Step 1525-6.”Step 1525-6 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES If the error code that directed you to this map was 1011 8710 or 1011 8711, then return to the “SPCN Error Codes” on page 420 table and take the next action defined in the table for these error codes. If the error code that directed you to this map was neither 1011 8710 nor 1011 8711, then call support.Step 1525-7 Inform the customer that the line voltage into the BPA is missing or is too low and must be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. YES Go to “Step 1525-8.”Step 1525-8 Does the customer utility voltage checkout OK? NO The customer must restore utility voltage. YES Exchange the power cord and go to “Step 1525-9.”Step 1525-9 Press the service complete button. Is the error code that sent you to this MAP reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support.
Chapter
3.
Maintenance
Analysis
Procedures
205
MAP 1526: There Is An Integrated Battery Feature (IBF) Failure Step 1526-1 Is the IBF feature installed? NO Replace the BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Bulk Power Regulator (BPR)” on page 844. Go to “Step 1526-7” on page 208. YES Go to “Step 1526-2.”Step 1526-2 Examine the following table to determine the BPR/IBF you are working with, then proceed to Step 1526-3. The remaining steps in this MAP refer to the indicated BPR/IBF:
Table 1.
Error Code
BPR and IBF
101A 7A56 or 101A 7B56
BPR1/IBF1, side A
101A 7A66 or 101A 7B66
BPR2/IBF2, side A
101A 7A76 or 101A 7B76
BPR3/IBF3, side A
101A 7A86 or 101A 7B86
BPR1/IBF1, side A
101A 7A96 or 101A 7B96
BPR2/IBF2, side A
101A 7AA6 or 101A 7BA6
BPR3/IBF3, side A
101B 7A56 or 101B 7B56
BPR1/IBF1, side B
101B 7A66 or 101B 7B66
BPR2/IBF2, side B
101B 7A76 or 101B 7B76
BPR3/IBF3, side B
101B 7A86 or 101B 7B86
BPR1/IBF1, side B
101B 7A96 or 101B 7B96
BPR2/IBF2, side B
101B 7AA6 or 101B 7BA6
BPR3/IBF3, side B Step 1526-3 Check the cable between the BPR and the IBF. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the cable plugged on both sides? NO Plug in the cable and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1526-4” on page 207. 206
406/90
Service
Guide
Step 1526-4 Check the circuit breaker on the IBF referenced in the error code that brought you to this MAP. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the circuit breaker set on? NO Replace the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Integrated Battery Feature (IBF)” on page 851. Go to “Step 1526-7” on page 208. YES Exchange the following FRUs one at a time: v The BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v The IBF that is referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Integrated Battery Feature (IBF)” on page 851. v The cable between the BPR and the IBF referenced in the error code that brought you to this MAP. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems YES Replace the next FRU on the list. If all FRUs in the list have been replaced, call for support.Step 1526-5 Check the circuit breaker on the IBF referenced in the error code that brought you to this MAP. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the circuit breaker set on? NO Go to “Step 1526-6” on page 208 YES Replace the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Integrated Battery Feature (IBF)” on page 851. Go to “Step 1526-7” on page 208. Chapter
3.
Maintenance
Analysis
Procedures
207
Step 1526-6 Set the breaker to the on (down) position. Wait 10 seconds after the LED on the IBF referenced in the error code that brought you to this MAP comes on. Does the circuit breaker remain on? NO Exchange the following FRUs one at a time: v The BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v The IBF that is referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Integrated Battery Feature (IBF)” on page 851. v The cable between the BPR and the IBF referenced in the error code that brought you to this MAP. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs in the list have been replaced, call for support.YES Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1526-7 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support. 208
406/90
Service
Guide
MAP 1527: An Airflow Loss Has Been Detected Step 1527-1 Check the following air inlets and outlets for blockage or obstruction: v Front and rear frame door openings (except for acoustical foam) v Bulk power assembly (BPA) cage – Inlet opening in left side of front BPA, BPA A – Outlet opening in right side of rear BPA, BPB B – Bulk power fan (BPF) cover is present on front BPA and absent on rear BPAv Processor subsystem cage – Inlet opening at top front of processor subassembly – Outlet opening at top rear of processor subassemblyv I/O subsystem – Inlet opening at front of the I/O subassembly – Outlet opening at rear of I/O subassemblyAre all of these openings clear and unobstructed? NO Remove any obstructions and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-2.”Step 1527-2 Do all book positions in the processor subsystem contain books or fillers? NO Fill any open positions with books or fillers, and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-3.”Step 1527-3 Are the air filters clean and free of any obstructions? NO Clean and replace the air filters, then go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-4.”Step 1527-4 Remove the blowers or fans from the affected subsystem. Attention: Do not put your hand into the enclosure. Push the spring-loaded door open and hold it open with one hand. Using a suitably long instrument (a 12-inch ruler or long-handled screwdriver), push gently against the recirculation flaps at the back of each blower or fan enclosure. Are all the recirculation flaps in all enclosures free to swing up? NO The enclosure is not a replaceable FRU. Call your next level of support. YES Reinstall the blower or fans removed earlier and go to “Step 1527-5” on page 210 ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
Chapter
3.
Maintenance
Analysis
Procedures
209
Step 1527-5 Call your next level of support. 210
406/90
Service
Guide
MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault Step 1528-1 Is the room ambient temperature in normal range (less than 35 degrees C/ 95 degrees F)? NO Notify the customer. The customer must bring the room temperature within normal range. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1528-2.”Step 1528-2 Are the system front and rear doors free of obstructions? Are the system air filters clean and free from obstructions? NO Notify the customer. The system must be free of obstructions for proper airflow. Clean or replace system air filters as required. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Exchange the following FRUs one at a time: 1. The processor MCM module 0, at location U1.18-P1-C1. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 2. The processor MCM module 3, at location U1.18-P1-C2, if present. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 3. The processor MCM module 1, at location U1.18-P1-C3. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 4. The processor MCM module 2, at location U1.18-P1-C4. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support. Chapter
3.
Maintenance
Analysis
Procedures
211
MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure Step 1529-1 Check the position of the on/off switch on the BPRs. Is the switch in the ON position (to the left)? NO Set the switch to the ON position (to the left), and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1529-2.”Step 1529-2 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the following test points on the face plate of the BPA that is not referenced by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 1529-3” on page 213. YES Exchange the following FRUs one at a time: 1. BPR 1, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 2. BPR 2, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 3. BPR 3, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 4. BPC on the other side not referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. 5. BPE on the other side not referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.
212
406/90
Service
Guide
Step 1529-3 Inform the customer that the line voltage into the BPA is missing or too low, and needs to be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. This ends the procedure. YES Go to “Step 1529-4.”Step 1529-4 Does the customer utility voltage check out OK? NO The customer must restore utility voltage. This ends the procedure. YES Exchange the power cord, and go to “Step 1529-5.”Step 1529-5 Press the service complete button. Is error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
213
MAP 152a: Loss of ac Power or Phase Missing Step 152a-1 Press the Service complete button. Is the error code that sent you to this MAP still reported? NO This was a transient condition. The condition no longer exists. No service action is required. YES Go to “Step 152a-2.”Step 152a-2 Check all LEDs on the BPA that is not referenced by the error code that sent you to this MAP. Are they OK? NO Defer this repair until the failure on the other BPA can be repaired. This ends the procedure. YES Go to “Step 152a-3.”Step 152a-3 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the following test points on the faceplate of the BPA that is referenced by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 152a-4” on page 215. YES Exchange the following FRUs one at a time: 1. BPR 1, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 2. BPR 2, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 3. BPR 3, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 4. BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. 5. BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. 214
406/90
Service
Guide
After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.Step 152a-4 Inform the customer that the line voltage into the BPA is missing or too low, and must be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. This ends the procedure. YES Go to “Step 152a-5.”Step 152a-5 Does the customer utility voltage check out OK? NO The customer must restore utility voltage. This ends the procedure. YES Exchange the power cord, and go to “Step 152a-6.”Step 152a-6 Press the service complete button. Is error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
215
MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem Step 152b-1 Record the error code and location code(s) that sent you to this MAP. Step 152b-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-4.” No Call for support.Step 152b-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152b-5 Turn on the power. Step 152b-6 Did the system stop with the same error code as recorded in “Step 152b-1”? Yes Go to “Step 152b-12” on page 217. No Go to “Step 152b-7.”Step 152b-7 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152b-8 Reinsert one of the books that is pulled out and listed in step 152b-4. Step 152b-9 Turn on the power. 216
406/90
Service
Guide
Step 152b-10 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-11.”Step 152b-11 Have all the books listed in step 152b-4 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-8” on page 216.Step 152b-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-14.” No Call for support.Step 152b-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152b-15 Turn on the power. Step 152b-16 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-22” on page 219. No Go to “Step 152b-17.”Step 152b-17 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.
Chapter
3.
Maintenance
Analysis
Procedures
217
Step 152b-18 Reinsert one of the books that is pulled out and listed in step “Step 152b-14” on page 217. 218
406/90
Service
Guide
Step 152b-19 Turn on the power. Step 152b-20 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-21.”Step 152b-21 Have all the books listed in “Step 152b-14” on page 217 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-18” on page 218.Step 152b-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-24.” No Call for support.Step 152b-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7 Chapter
3.
Maintenance
Analysis
Procedures
219
Step 152b-25 Turn on the power. Step 152b-26 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-32.” No Go to “Step 152b-27.”Step 152b-27 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152b-28 Reinsert one of the books that is pulled out and listed in “Step 152b-24” on page 219. Step 152b-29 Turn on the power. Step 152b-30 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-31.”Step 152b-31 Have all the books listed in “Step 152b-24” on page 219 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-28.”Step 152b-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-34” on page 221. No Call for support. 220
406/90
Service
Guide
Step 152b-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O Book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152b-35 Turn on the power. Step 152b-36 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-37.” No The book just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152b-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152b-38 Remove the new book that was just installed in “Step 152b-34” and reinstall the original book. Step 152b-39 Have all the books listed in “Step 152b-34” been replaced with new or original books? Yes Go to “Step 152b-40.” No Go to “Step 152b-34.”Step 152b-40 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-42” on page 222. No Call for support. Chapter
3.
Maintenance
Analysis
Procedures
221
Step 152b-42 Attention: Before replacing the system backplane, call for support. Replace the system backplane at location U1.18-P1. Step 152b-43 Turn on the power. Step 152b-44 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-45.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152b-45 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152b-46 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152b-42.” 222
406/90
Service
Guide
MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem Step 152c-1 For future reference, record the error code in the operator panel display. Step 152c-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-4.” No Call for support.Step 152c-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152c-5 Turn on the power. Step 152c-6 Did the system stop with the same error code as recorded in “Step 152c-1”? Yes Go to “Step 152c-12” on page 224. No Go to “Step 152c-7.”Step 152c-7 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152c-8 Reinsert one of the books that is pulled out and listed in “Step 152c-4.” Step 152c-9 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
223
Step 152c-10 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-11.”Step 152c-11 Have all the books listed in “Step 152c-4” on page 223 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-8” on page 223.Step 152c-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-14.” No Call for support.Step 152c-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152c-15 Turn on the power. Step 152c-16 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-22” on page 225. No Go to “Step 152c-17.”Step 152c-17 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. 224
406/90
Service
Guide
Step 152c-18 Reinsert one of the books that is pulled out and listed in “Step 152c-14” on page 224. Step 152c-19 Turn on the power. Step 152c-20 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-21.”Step 152c-21 Have all the books listed in “Step 152c-14” on page 224 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-18.”Step 152c-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-24.” No Call for support.Step 152c-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7 Chapter
3.
Maintenance
Analysis
Procedures
225
Step 152c-25 Turn on the power. Step 152c-26 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-32.” No Go to “Step 152c-27.”Step 152c-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-28 Reinsert one of the books that is pulled out and listed in “Step 152c-24” on page 225. Step 152c-29 Turn on the power. Step 152c-30 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-31.”Step 152c-31 Have all the books listed in “Step 152c-24” on page 225 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-28.”Step 152c-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. 226
406/90
Service
Guide
Step 152c-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-34.” No Call for support.Step 152c-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152c-35 Turn on the power. Step 152c-36 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152c-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-38 Remove the new book that was just installed in “Step 152c-34” and reinstall the original book. Step 152c-39 Have all the books listed in “Step 152c-34” been replaced with new or original books? Yes Go to “Step 152c-40.” No Go to “Step 152c-34.”Step 152c-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
227
Step 152c-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-42.” No Call for support.Step 152c-42 Record the location of each installed L3 module. Attention: L3 modules have a limit of three plug cycles. Before removing any of the L3 modules in the following list, call for support. Remove all of the following modules, if present: v L3 module, at location U1.18-P1-C5 v L3 module, at location U1.18-P1-C6 v L3 module, at location U1.18-P1-C7 v L3 module, at location U1.18-P1-C8 v L3 module, at location U1.18-P1-C9 v L3 module, at location U1.18-P1-C10 v L3 module, at location U1.18-P1-C11 v L3 module, at location U1.18-P1-C12 v L3 module, at location U1.18-P1-C13 v L3 module, at location U1.18-P1-C14 v L3 module, at location U1.18-P1-C15 v L3 module, at location U1.18-P1-C16 v L3 module, at location U1.18-P1-C17 v L3 module, at location U1.18-P1-C18 v L3 module, at location U1.18-P1-C19 v L3 module, at location U1.18-P1-C20 v System clock, at location U1.18-P1-X5Step 152c-43 Turn on the power. Step 152c-44 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-50” on page 229. No Go to “Step 152c-45.”Step 152c-45 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-46 Reinstall one of the modules that is removed and listed in “Step 152c-42.” 228
406/90
Service
Guide
Step 152c-47 Turn on the power. Step 152c-48 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-49.”Step 152c-49 Have all the modules listed in “Step 152c-42” on page 228 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-45” on page 228.Step 152c-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-52.” No Call for support.Step 152c-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152c-53 Turn on the power. Step 152c-54 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-55” on page 230. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
229
Step 152c-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152c-52” on page 229. 230
406/90
Service
Guide
MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem Step 152d-1 Record the error code and the location code(s) that sent you to this MAP. Step 152d-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-4.” No Call for support.Step 152d-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152d-5 Turn on the power. Step 152d-6 Did the system stop with the same error code as recorded in “Step 152d-1”? Yes Go to “Step 152d-12” on page 232. No Go to “Step 152d-7.”Step 152d-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter
3.
Maintenance
Analysis
Procedures
231
Step 152d-8 Reinsert one of the books that is pulled out and listed in “Step 152d-4” on page 231. Step 152d-9 Turn on the power. Step 152d-10 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-11.”Step 152d-11 Have all the books listed in “Step 152d-4” on page 231 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-8.”Step 152d-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-14.” No Call for support.Step 152d-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1 232
406/90
Service
Guide
Step 152d-15 Turn on the power. Step 152d-16 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-22” on page 234. No Go to “Step 152d-17.”Step 152d-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-18 Reinsert one of the books that is pulled out and listed in “Step 152d-14” on page 232. Step 152d-19 Turn on the power. Step 152d-20 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-21.”Step 152d-21 Have all the books listed in “Step 152d-14” on page 232 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-18.” Chapter
3.
Maintenance
Analysis
Procedures
233
Step 152d-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-24.” No Call for support.Step 152d-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7Step 152d-25 Turn on the power. Step 152d-26 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-32” on page 235. No Go to “Step 152d-27.”Step 152d-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-28 Reinsert one of the books that is pulled out and listed in “Step 152d-24.” Step 152d-29 Turn on the power. 234
406/90
Service
Guide
Step 152d-30 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-31.”Step 152d-31 Have all the books listed in “Step 152d-24” on page 234 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-28” on page 234.Step 152d-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-34.” No Call for support.Step 152d-34 Replace the books in the following list, if present, one at a time and in the order listed: v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152d-35 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
235
Step 152d-36 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152d-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-38 Remove the new book that was just installed in “Step 152d-34” on page 235 and reinstall the original book. Step 152d-39 Have all the books listed in “Step 152d-34” on page 235 been replaced with new or original books? Yes Go to “Step 152d-40.” No Go to “Step 152d-34” on page 235.Step 152d-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-42” on page 237. No Call for support. 236
406/90
Service
Guide
Step 152d-42 Record the location of each installed L3 module. Attention: L3 modules have a limit of three plug cycles. Before removing any of the L3 modules in the following list, call for support. Remove all of the following modules if present: v L3 module, at location U1.18-P1-C5 v L3 module, at location U1.18-P1-C6 v L3 module, at location U1.18-P1-C7 v L3 module, at location U1.18-P1-C8 v L3 module, at location U1.18-P1-C9 v L3 module, at location U1.18-P1-C10 v L3 module, at location U1.18-P1-C11 v L3 module, at location U1.18-P1-C12 v L3 module, at location U1.18-P1-C13 v L3 module, at location U1.18-P1-C14 v L3 module, at location U1.18-P1-C15 v L3 module, at location U1.18-P1-C16 v L3 module, at location U1.18-P1-C17 v L3 module, at location U1.18-P1-C18 v L3 module, at location U1.18-P1-C19 v L3 module, at location U1.18-P1-C20Step 152d-43 Turn on the power. Step 152d-44 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-50” on page 238. No Go to “Step 152d-45.”Step 152d-45 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-46 Reinstall one of the L3 modules that is removed and listed in “Step 152d-42.” Step 152d-47 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
237
Step 152d-48 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-49.”Step 152d-49 Have all the modules listed in “Step 152d-42” on page 237 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-45” on page 237.Step 152d-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-52.” No Call for support.Step 152d-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152d-53 Turn on the power. Step 152d-54 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-55” on page 239. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 238
406/90
Service
Guide
Step 152d-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152d-52” on page 238. Chapter
3.
Maintenance
Analysis
Procedures
239
MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem Step 152e-1 Record the error code and the location code(s) that sent you to this MAP. Step 152e-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-4.” No Call for support.Step 152e-4 Pull out the following books about 25 mm (1 inch), if present: v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7Step 152e-5 Turn on the power. Step 152e-6 Did the system stop with the same error code as recorded in “Step 152e-1”? Yes Go to “Step 152e-12” on page 241. No Go to “Step 152e-7.”Step 152e-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-8 Reinsert one of the books that is pulled out and listed in “Step 152e-4.” Step 152e-9 Turn on the power. 240
406/90
Service
Guide
Step 152e-10 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-11.”Step 152e-11 Have all the books listed in “Step 152e-4” on page 240 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-8” on page 240.Step 152e-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-14.” No Call for support.Step 152e-14 Replace the books in the following list, if present, one at a time and in the order listed: v DCA 1, at location U1.18-P1-V1 v DCA 2, at location U1.18-P1-V2 v DCA 3, at location U1.18-P1-V4 v DCA 4, at location U1.18-P1-V5 v DCA 5, at location U1.18-P1-V6 v DCA 6, at location U1.18-P1-V8Step 152e-15 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
241
Step 152e-16 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-17.” No The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152e-17 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-18 Remove the new book that was just installed in “Step 152e-14” on page 241 and reinstall the original book. Step 152e-19 Have all the books listed in “Step 152e-14” on page 241 been replaced with new or original books? Yes Go to “Step 152e-20.” No Go to “Step 152e-14” on page 241.Step 152e-20 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-21 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-22.” No Call for support.Step 152e-22 Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Record the location of each installed MCM module. Attention: MCM modules have a limit of three plug cycles. Before replacing any of the MCMs in the following list, call for support. Record the original location on each MCM. Remove the following MCM modules, if present: v MCM module 0, at location U1.18-P1-C1 v MCM module 3, at location U1.18-P1-C2 v MCM module 1, at location U1.18-P1-C3 v MCM module 2, at location U1.18-P1-C4 242
406/90
Service
Guide
Step 152e-23 Turn on the power. Step 152e-24 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-30” on page 244. No Go to “Step 152e-25.”Step 152e-25 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-26 Attention: Before you remove or replace any MCM or L3 module, stop, read, and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Reinstall one of the MCM modules that is removed and listed in “Step 152e-22” on page 242. Step 152e-27 Turn on the power. Step 152e-28 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes The module just reinserted was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-29.”Step 152e-29 Have all the modules listed in “Step 152e-22” on page 242 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-25.” Chapter
3.
Maintenance
Analysis
Procedures
243
Step 152e-30 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-31 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-32.” No Call for support.Step 152e-32 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152e-33 Turn on the power. Step 152e-34 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-35.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152e-35 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-36 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152e-32.” 244
406/90
Service
Guide
MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem Step 152f-1 Record the error code and the location code(s) that sent you to this MAP. Step 152f-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-4.” No Call for support.Step 152f-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152f-5 Turn on the power. Step 152f-6 Did the system stop with the same error code as recorded in “Step 152f-1”? Yes Go to “Step 152f-12” on page 246. No Go to “Step 152f-7.”Step 152f-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-8 Reinsert one of the books that is pulled out and listed in “Step 152f-4.” Chapter
3.
Maintenance
Analysis
Procedures
245
Step 152f-9 Turn on the power. Step 152f-10 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-11.”Step 152f-11 Have all the books listed in “Step 152f-4” on page 245 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-8” on page 245.Step 152f-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-14.” No Call for support.Step 152f-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152f-15 Turn on the power. 246
406/90
Service
Guide
Step 152f-16 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-22.” No Go to “Step 152f-17.”Step 152f-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-18 Reinsert one of the books that is pulled out and listed in “Step 152f-14” on page 246. Step 152f-19 Turn on the power. Step 152f-20 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-21.”Step 152f-21 Have all the books listed in “Step 152f-14” on page 246 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-18.”Step 152f-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
247
Step 152f-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-24.” No Call for support.Step 152f-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1Step 152f-25 Turn on the power. Step 152f-26 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-32” on page 249. No Go to “Step 152f-27.”Step 152f-27 The book just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-28 Reinsert the book that is pulled out and listed in “Step 152f-24.” Step 152f-29 Turn on the power. Step 152f-30 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-31” on page 249. 248
406/90
Service
Guide
Step 152f-31 Have all the books listed in “Step 152f-24” on page 248 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-28” on page 248.Step 152f-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-34.” No Call for support.Step 152f-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152f-35 Turn on the power. Step 152f-36 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152f-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter
3.
Maintenance
Analysis
Procedures
249
Step 152f-38 Remove the new book that was just installed in “Step 152f-34” on page 249 and reinstall the original book. Step 152f-39 Have all the books listed in “Step 152f-34” on page 249 been replaced with new or original books? Yes Go to “Step 152f-40.” No Go to “Step 152f-34” on page 249.Step 152f-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-42.” No Call for support.Step 152f-42 Remove the system clock card, at location U1.18-P1-X5. Step 152f-43 Turn on the power. Step 152f-44 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-50” on page 251. No Go to “Step 152f-45.”Step 152f-45 The system clock card is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-46 Reinstall the system clock card that you removed in “Step 152f-42.” 250
406/90
Service
Guide
Step 152f-47 Turn on the power. Step 152f-48 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The system clock card just reinserted is defective, replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-49.”Step 152f-49 Have all the modules listed in “Step 152f-42” on page 250 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-45” on page 250.Step 152f-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-52.” No Call for support.Step 152f-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152f-53 Turn on the power. Step 152f-54 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-55” on page 252. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
251
Step 152f-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152f-52” on page 251. 252
406/90
Service
Guide
MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem Step 152g-1 For future reference, record the error code and the location code(s) that sent you to this MAP. Step 152g-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-4.” No Call for support.Step 152g-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152g-5 Turn on the power. Step 152g-6 Did the system stop with the same error code as recorded in “Step 152g-1”? Yes Go to “Step 152g-12” on page 254. No Go to “Step 152g-7.”Step 152g-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-8 Reinsert one of the books that is pulled out and listed in “Step 152g-4.” Step 152g-9 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
253
Step 152g-10 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-11.”Step 152g-11 Have all the books listed in “Step 152g-4” on page 253 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-8” on page 253.Step 152g-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-14.” No Call for support.Step 152g-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152g-15 Turn on the power. 254
406/90
Service
Guide
Step 152g-16 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-22.” No Go to “Step 152g-17.”Step 152g-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-18 Reinsert one of the books that is pulled out and listed in “Step 152g-14” on page 254. Step 152g-19 Turn on the power. Step 152g-20 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-21.”Step 152g-21 Have all the books listed in “Step 152g-14” on page 254 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-18.”Step 152g-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
255
Step 152g-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-24.” No Call for support.Step 152g-24 Pull out the following books about 25 mm (1 inch), if present: v Fourth I/O book, at location U1.18-P1-H1 v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Cap card at U1.18-P1-V3 v Cap card at U1.18-P1-V7Step 152g-25 Turn on the power. Step 152g-26 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-32” on page 257. No Go to “Step 152g-27.”Step 152g-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-28 Reinsert one of the books that is pulled out and listed in “Step 152g-24.” Step 152g-29 Turn on the power. Step 152g-30 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-31” on page 257. 256
406/90
Service
Guide
Step 152g-31 Have all the books listed in “Step 152g-24” on page 256 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-28” on page 256.Step 152g-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-34.” No Call for support.Step 152g-34 Replace the book in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH at U1.18-P1-V1 v DCA 2-RH at U1.18-P1-V2Step 152g-35 Turn on the power. Step 152g-36 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152g-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter
3.
Maintenance
Analysis
Procedures
257
Step 152g-38 Remove the new book that was just installed in “Step 152g-34” on page 257 and reinstall the original book. Step 152g-39 Have all the books listed in “Step 152g-34” on page 257 been replaced with new or original books? Yes Go to “Step 152g-40.” No Go to “Step 152g-34” on page 257.Step 152g-40 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-42.” No Call for support.Step 152g-42 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152g-43 Turn on the power. Step 152g-44 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-45.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152g-45 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-46 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152g-42.” 258
406/90
Service
Guide
MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem Step 152h-1 Record the error code and the location code(s) that sent you to this MAP. Step 152h-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152h-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-4 Are all green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-5.” No Call for support.Step 152h-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152h-6 Turn on the power. Step 152h-7 Did the system stop with the same error code as recorded in “Step 152h-1”? Yes Go to “Step 152h-13” on page 260. No Go to “Step 152h-8” on page 260. Chapter
3.
Maintenance
Analysis
Procedures
259
Step 152h-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-9 Reinsert one of the cards that is removed and listed in “Step 152h-5” on page 259. Step 152h-10 Turn on the power. Step 152h-11 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-12.”Step 152h-12 Have all the cards listed in “Step 152h-5” on page 259 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-9.”Step 152h-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-15” on page 261. No Call for support. 260
406/90
Service
Guide
Step 152h-15 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152h-16 Turn on the power. Step 152h-17 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-23” on page 262. No Go to “Step 152h-18.”Step 152h-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-19 Reinsert one of the cards that is removed and listed in “Step 152h-15.” Step 152h-20 Turn on the power. Step 152h-21 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-22” on page 262. Chapter
3.
Maintenance
Analysis
Procedures
261
Step 152h-22 Have all the cards listed in “Step 152h-15” on page 261 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-19” on page 261.Step 152h-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-25.” No Call for support.Step 152h-25 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152h-26 Turn on the power. Step 152h-27 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-33” on page 263. No Go to “Step 152h-28.”Step 152h-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-29 Reinsert one of the cards that is removed and listed in “Step 152h-25.” 262
406/90
Service
Guide
Step 152h-30 Turn on the power. Step 152h-31 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-32.”Step 152h-32 Have all the cards listed in “Step 152h-25” on page 262 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-29” on page 262.Step 152h-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-35.” No Call for support.Step 152h-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10 Chapter
3.
Maintenance
Analysis
Procedures
263
Step 152h-36 Turn on the power. Step 152h-37 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-43.” No Go to “Step 152h-38.”Step 152h-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-39 Reinsert one of the cards that is removed and listed in “Step 152h-35” on page 263. Step 152h-40 Turn on the power. Step 152h-41 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-42.”Step 152h-42 Have all the cards listed in “Step 152h-35” on page 263 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-39.”Step 152h-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 264
406/90
Service
Guide
Step 152h-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-45.” No Call for support.Step 152h-45 Replace the DCAs in the following list, one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152h-46 Turn on the power. Step 152h-47 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-48.” No The DCA just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152h-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-49 Remove the new DCA that was just installed in “Step 152h-45” and reinstall the original DCA. Step 152h-50 Have all the DCAs listed in “Step 152h-45” been replaced with new or original cards? Yes Go to “Step 152h-51” No Go to “Step 152h-45.”Step 152h-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
265
Step 152h-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-53.” No Call for support.Step 152h-53 Replace the parts in the following list, one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152h-54 Turn on the power. Step 152h-55 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152h-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-57 Have all the cards listed in “Step 152h-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152h-53.” 266
406/90
Service
Guide
MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem Step 152i-1 Record the error code and location code(s) that sent you to this MAP. Step 152i-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152i-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-5.” No Call for support.Step 152i-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152i-6 Turn on the power. Step 152i-7 Did the system stop with the same error code as recorded in “Step 152i-1”? Yes Go to “Step 152i-13” on page 268. No Go to “Step 152i-8” on page 268. Chapter
3.
Maintenance
Analysis
Procedures
267
Step 152i-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-9 Reinsert one of the cards that is removed and listed in “Step 152i-5” on page 267. Step 152i-10 Turn on the power. Step 152i-11 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-12.”Step 152i-12 Have all the cards listed in “Step 152i-5” on page 267 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-9.”Step 152i-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-15” on page 269. No Call for support. 268
406/90
Service
Guide
Step 152i-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152i-16 Turn on the power. Step 152i-17 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-23” on page 270. No Go to “Step 152i-18.”Step 152i-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-19 Reinsert one of the cards that is removed and listed in “Step 152i-15.” Step 152i-20 Turn on the power. Step 152i-21 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-22” on page 270. Chapter
3.
Maintenance
Analysis
Procedures
269
Step 152i-22 Have all the cards listed in “Step 152i-15” on page 269 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-19” on page 269.Step 152i-23 Turn off the power. Examine all the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-25.” No Call for support.Step 152i-25 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152i-26 Turn on the power. Step 152i-27 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-33” on page 271. No Go to “Step 152i-28” on page 271. 270
406/90
Service
Guide
Step 152i-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-29 Reinsert one of the cards that is removed and listed in “Step 152i-25” on page 270. Step 152i-30 Turn on the power. Step 152i-31 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-32.”Step 152i-32 Have all the cards listed in “Step 152i-25” on page 270 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-29.”Step 152i-33 Turn off the power. Examine all the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-35.” No Call for support.Step 152i-35 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10 Chapter
3.
Maintenance
Analysis
Procedures
271
Step 152i-36 Turn on the power. Step 152i-37 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-43.” No Go to “Step 152i-38.”Step 152i-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-39 Reinsert one of the cards that is removed and listed in “Step 152i-35” on page 271. Step 152i-40 Turn on the power. Step 152i-41 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-42.”Step 152i-42 Have all the cards listed in “Step 152i-35” on page 271 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-39.”Step 152i-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 272
406/90
Service
Guide
Step 152i-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-45.” No Call for support.Step 152i-45 Remove the following DASD 4-packs, if present, from the I/O subsystem. Record the DASD locations: v DASD 4-pack at P3 v DASD 4-pack at P4 v DASD 4-pack at P5 v DASD 4-pack at P6Step 152i-46 Turn on the power. Step 152i-47 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-53” on page 274. No Go to “Step 152i-48.”Step 152i-48 One of the DASDs just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-49 Reinsert one of the DASD 4-packs that is removed and listed in “Step 152i-45.” Step 152i-50 Turn on the power. Step 152i-51 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes One of the DASD in the DASD 4-pack just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-52” on page 274. Chapter
3.
Maintenance
Analysis
Procedures
273
Step 152i-52 Have all the cards listed in “Step 152i-45” on page 273 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-49” on page 273.Step 152i-53 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-54 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-55.” No Call for support.Step 152i-55 Replace the books in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152i-56 Turn on the power. Step 152i-57 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-58.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152i-58 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-59 Remove the new card that was just installed in “Step 152i-55” and reinstall the original card. 274
406/90
Service
Guide
Step 152i-60 Have all the cards listed in “Step 152i-55” on page 274 been replaced with new or original cards? Yes Go to “Step 152i-61.” No Go to “Step 152i-55” on page 274.Step 152i-61 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-62 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-63.” No Call for support.Step 152i-63 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152i-64 Turn on the power. Step 152i-65 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-66.” No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152i-66 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-67 Have all the cards listed in “Step 152i-63” been replaced? Yes Call for support. No Go to “Step 152i-63.” Chapter
3.
Maintenance
Analysis
Procedures
275
MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem Step 152j-1 Record the error code and location code(s) that sent you to this MAP. Step 152j-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152j-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-5.” No Call for support.Step 152j-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152j-6 Turn on the power. Step 152j-7 Did the system stop with the same error code as recorded in “Step 152j-1”? Yes Go to “Step 152j-13” on page 277. No Go to “Step 152j-8” on page 277. 276
406/90
Service
Guide
Step 152j-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-9 Reinsert one of the cards that is removed and listed in “Step 152j-5” on page 276. Step 152j-10 Turn on the power. Step 152j-11 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-12.”Step 152j-12 Have all the cards listed in “Step 152j-5” on page 276 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-9.”Step 152j-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-15” on page 278. No Call for support. Chapter
3.
Maintenance
Analysis
Procedures
277
Step 152j-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152j-16 Turn on the power. Step 152j-17 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-23” on page 279. No Go to “Step 152j-18.”Step 152j-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-19 Reinsert one of the cards that is removed and listed in “Step 152j-15.” Step 152j-20 Turn on the power. Step 152j-21 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it.This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-22” on page 279. 278
406/90
Service
Guide
Step 152j-22 Have all the cards listed in “Step 152j-15” on page 278 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-19” on page 278.Step 152j-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-25.” No Call for support.Step 152j-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152j-26 Turn on the power. Step 152j-27 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-33” on page 280. No Go to “Step 152j-28.”Step 152j-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-29 Reinsert one of the cards that is removed and listed in “Step 152j-25.” Chapter
3.
Maintenance
Analysis
Procedures
279
Step 152j-30 Turn on the power. Step 152j-31 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-32.”Step 152j-32 Have all the cards listed in “Step 152j-25” on page 279 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-29” on page 279.Step 152j-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-35.” No Call for support.Step 152j-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152j-36 Turn on the power. 280
406/90
Service
Guide
Step 152j-37 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-43.” No Go to “Step 152j-38.”Step 152j-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-39 Reinsert one of the cards that is removed and listed in “Step 152j-35” on page 280. Step 152j-40 Turn on the power. Step 152j-41 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-42.”Step 152j-42 Have all the cards listed in “Step 152j-35” on page 280 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-39.”Step 152j-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
281
Step 152j-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-45.” No Call for support.Step 152j-45 Replace the cards in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152j-46 Turn on the power. Step 152j-47 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-48.” No The card just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152j-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-49 Remove the new card that was just installed in “Step 152j-45” and reinstall the original card. Step 152j-50 Have all the cards listed in “Step 152j-45” been replaced with new or original cards? Yes Go to “Step 152j-51” No Go to “Step 152j-45.”Step 152j-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 282
406/90
Service
Guide
Step 152j-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-53.” No Call for support.Step 152j-53 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152j-54 Turn on the power. Step 152j-55 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152j-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-57 Have all the cards listed in “Step 152j-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152j-53.” Chapter
3.
Maintenance
Analysis
Procedures
283
MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem Step 152k-1 Record the error code and location code(s) that sent you to this MAP. Step 152k-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152k-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-5.” No Call for support.Step 152k-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152k-6 Turn on the power. Step 152k-7 Did the system stop with the same error code as recorded in “Step 152k-1”? Yes Go to “Step 152k-13” on page 285. No Go to “Step 152k-8” on page 285. 284
406/90
Service
Guide
Step 152k-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-9 Reinsert one of the cards that is removed and listed in “Step 152k-5” on page 284. Step 152k-10 Turn on the power. Step 152k-11 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-12.”Step 152k-12 Have all the cards listed in “Step 152k-5” on page 284 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-9.”Step 152k-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-15” on page 286. No Call for support. Chapter
3.
Maintenance
Analysis
Procedures
285
Step 152k-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152k-16 Turn on the power. Step 152k-17 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-23” on page 287. No Go to “Step 152k-18.”Step 152k-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-19 Reinsert one of the cards that is removed and listed in “Step 152k-15.” Step 152k-20 Turn on the power. Step 152k-21 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-22” on page 287. 286
406/90
Service
Guide
Step 152k-22 Have all the cards listed in “Step 152k-15” on page 286 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-19” on page 286.Step 152k-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-25.” No Call for support.Step 152k-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152k-26 Turn on the power. Step 152k-27 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-33” on page 288. No Go to “Step 152k-28.”Step 152k-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-29 Reinsert one of the cards that is removed and listed in “Step 152k-25.” Step 152k-30 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
287
Step 152k-31 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-32.”Step 152k-32 Have all the cards listed in “Step 152k-25” on page 287 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-29” on page 287.Step 152k-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-35.” No Call for support.Step 152k-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152k-36 Turn on the power. Step 152k-37 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-43” on page 289. No Go to “Step 152k-38” on page 289. 288
406/90
Service
Guide
Step 152k-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-39 Reinsert one of the cards that is removed and listed in “Step 152k-35” on page 288. Step 152k-40 Turn on the power. Step 152k-41 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-42.”Step 152k-42 Have all the cards listed in “Step 152k-35” on page 288 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-39.”Step 152k-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-45.” No Call for support.Step 152k-45 Remove the following DASD 4-packs, if present, from the I/O subsystem. Record the DASD locations: v DASD 4-pack at P3 v DASD 4-pack at P4 v DASD 4-pack at P5 v DASD 4-pack at P6 Chapter
3.
Maintenance
Analysis
Procedures
289
Step 152k-46 Turn on the power. Step 152k-47 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-53.” No Go to “Step 152k-48.”Step 152k-48 One of the DASDs just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-49 Reinsert one of the DASD 4-packs that is removed and listed in “Step 152k-45” on page 289. Step 152k-50 Turn on the power. Step 152k-51 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes One of the DASD in the DASD 4-pack just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-52.”Step 152k-52 Have all the cards listed in “Step 152k-45” on page 289 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-49.”Step 152k-53 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 290
406/90
Service
Guide
Step 152k-54 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-55.” No Call for support.Step 152k-55 Replace the books in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152k-56 Turn on the power. Step 152k-57 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-58.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152k-58 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-59 Remove the new card that was just installed in “Step 152k-55” and reinstall the original card. Step 152k-60 Have all the cards listed in “Step 152k-55” been replaced with new or original cards? Yes Go to “Step 152k-61.” No Go to “Step 152k-55.”Step 152k-61 Examine the the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-62 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-63” on page 292. No Call for support. Chapter
3.
Maintenance
Analysis
Procedures
291
Step 152k-63 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152k-64 Turn on the power. Step 152k-65 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-66.” No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152k-66 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-67 Have all the cards listed in “Step 152k-63” been replaced? Yes Call for support. No Go to “Step 152k-63.” 292
406/90
Service
Guide
MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem Step 152l-1 Record the error code and location code(s) that sent you to this MAP. Step 152l-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152l-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-5.” No Call for support.Step 152l-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152l-6 Turn on the power. Step 152l-7 Did the system stop with the same error code as recorded in “Step 152l-1”? Yes Go to “Step 152l-13” on page 294. No Go to “Step 152l-8” on page 294. Chapter
3.
Maintenance
Analysis
Procedures
293
Step 152l-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-9 Reinsert one of the cards that is removed and listed in “Step 152l-5” on page 293. Step 152l-10 Turn on the power. Step 152l-11 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-12.”Step 152l-12 Have all the cards listed in “Step 152l-5” on page 293 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-9.”Step 152l-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-15.” No Call for support.Step 152l-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10 294
406/90
Service
Guide
Step 152l-16 Turn on the power. Step 152l-17 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-23.” No Go to “Step 152l-18.”Step 152l-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-19 Reinsert one of the cards that is removed and listed in “Step 152l-15” on page 294. Step 152l-20 Turn on the power. Step 152l-21 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-22.”Step 152l-22 Have all the cards listed in “Step 152l-15” on page 294 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-19.”Step 152l-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
295
Step 152l-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-25.” No Call for support.Step 152l-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152l-26 Turn on the power. Step 152l-27 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-33” on page 297. No Go to “Step 152l-28.”Step 152l-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-29 Reinsert one of the cards that is removed and listed in “Step 152l-25.” Step 152l-30 Turn on the power. Step 152l-31 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-32” on page 297. 296
406/90
Service
Guide
Step 152l-32 Have all the cards listed in “Step 152l-25” on page 296 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-29” on page 296.Step 152l-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-35.” No Call for support.Step 152l-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152l-36 Turn on the power. Step 152l-37 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-43” on page 298. No Go to “Step 152l-38.”Step 152l-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-39 Reinsert one of the cards that is removed and listed in “Step 152l-35.” Step 152l-40 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
297
Step 152l-41 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-42.”Step 152l-42 Have all the cards listed in “Step 152l-35” on page 297 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-39” on page 297.Step 152l-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-45.” No Call for support.Step 152l-45 Replace the DCAs in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152l-46 Turn on the power. Step 152l-47 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-48” on page 299. No The DCA just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 298
406/90
Service
Guide
Step 152l-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-49 Remove the new DCA that was just installed in “Step 152l-45” on page 298 and reinstall the original DCA. Step 152l-50 Have all the DCAs listed in “Step 152l-45” on page 298 been replaced with new or original cards? Yes Go to “Step 152l-51” No Go to “Step 152l-45” on page 298.Step 152l-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-53.” No Call for support.Step 152l-53 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152l-54 Turn on the power. Step 152l-55 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152l-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-57 Have all the cards listed in “Step 152l-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152l-53.”
Chapter
3.
Maintenance
Analysis
Procedures
299
MAP 152m: Cable Problem in Power Subsystem Step 152m-1 Record the error code and location code(s) that sent you to this MAP. You should be supplied with one or two pairs of location codes from the action for the error code in the service guide. These location codes are the ends of the cable or cables in question. The fourth nibble of the error code indicates the side of the power subsystem that detected the failure. For example, 101A indicates side A (front), and 101B indicates side B (rear). The following steps apply to the power subsystem side indicated by the error code recorded. For each location code pair: 1. Generate the first FRU location code (from U1.35) to be replaced by taking the location code given for the action of the error code and truncating the connector ID (/Qx). 2. Generate the second FRU location code to be replaced by taking the location code (not from U1.35) given for the action of the error code and truncating the connector ID (/Qx). 3. Generate the third FRU location code to be replaced (the cable) by taking the location code (from U1.35) given for the action of the error code and appending a # (number sign). For example, the service guide for error code 101A x125 lists the location codes U1.35-P1-X3/Q1 and U1.9-V1/Q2. From the error code, you are working on the A (front) side. From the location codes, the FRUs, in order of replacement, (if required) are: a. U1.35-P1-X3 - BPD1 b. U1.9-V1 - DCA1 c. U1.35-P1-X3/Q1# - cable4. If two pairs of location codes (in other words, two cables) were reported by the eror code that sent you to this map, repeat steps 1, 2, and 3 for the other pair of location codes. Assemble the FRU list for both cables as follows: a. Cable from first pair of location codes. b. Cable from second pair of location codes. Refer to “AIX and Physical Location Code Reference Tables” on page 68 and “Power Distribution Cabling” on page 30. Step 152m-2 Press the green start service button on the panel with the UEPO switch. Step 152m-3 From the action column in the ?Error Code to FRU? table entry for the error code recorded in “Step 152m-1,” find the location codes of both ends of the suspect power cable or cables. Step 152m-4 Examine both ends of the power cable or cables, and make sure each end is properly connected and fully seated. Step 152m-5 Is the power cable or cables properly connected and seated fully at both ends? Yes Go to “Step 152m-30” on page 303. No Go to “Step 152m-29” on page 303. 300
406/90
Service
Guide
Step 152m-6 Have you replaced the first FRU in the list of FRUs recorded in “Step 152m-1” on page 300 in a previous service action? Yes Go to “Step 152m-18” on page 302. No Go to “Step 152m-7.”Step 152m-7 Is the power-on LED of the first FRU generated from the location codes recorded in “Step 152m-1” on page 300 off? Yes Go to “Step 152m-8.” No Call for support.Step 152m-8 Replace the first FRU in the list of FRUs recorded in “Step 152m-1” on page 300 with power-on LED off. Press the white service complete button on the panel with the UEPO switch after replacement. Step 152m-9 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Return the system to its original configuration. Go to “Step 152m-11.” No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-10 Press the white service complete button on the panel with the UEPO switch. Step 152m-11 Schedule non-concurrent service with the customer to replace the second FRU in the list of FRUs recorded in “Step 152m-1” on page 300. This ends the procedure. Step 152m-12 Is the power-on LED of the first FRU generated from the location codes recorded in “Step 152m-1” on page 300 off? Yes Go to “Step 152m-13.” No Call for support.Step 152m-13 Reseat and properly connect the power cable ends, with location codes recorded in “Step 152m-1” on page 300. Step 152m-14 Set the power-on switches of all BPRs on the same side with the power cable that was not connected properly to the off (right) position. Chapter
3.
Maintenance
Analysis
Procedures
301
Step 152m-15 Wait 30 seconds and then turn the power-on switches of the BPRs found in “Step 152m-14” on page 301 to the on (left) position. Step 152m-16 Press the white service complete button on the panel with the UEPO switch after replacement. Step 152m-17 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Go to “Step 152m-2” on page 300. No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-18 Note: The following steps will require the system power to be removed. The operating system must be shut down in all partitions before turning off the power.Turn off the power. Turn off the UEPO switch. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Also examine the green ?power good out? LEDs of all installed I/O subsystem DCAs. Step 152m-19 Are all amber logic-power LEDs of all installed processor subsystem DCAs and all green ?logic power out? LEDs of all installed I/O subsystem DCAs off? Yes Go to “Step 152m-20.” No Call for support.Step 152m-20 Replace the second FRU in the list of FRUs recorded in step “Step 152m-1” on page 300. Step 152m-21 Turn on the UEPO switch. Turn on the power. Step 152m-22 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Go to “Step 152m-23.” No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-23 Go to “Step 152m-24.” Step 152m-24 Turn off the power. Turn off the UEPO switch. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Also examine the green ?power good out? LEDs of all installed I/O subsystem DCAs. 302
406/90
Service
Guide
Step 152m-25 Are all amber logic-power LEDs of all installed processor subsystem DCAs and all green ?logic power out? LEDs of all installed I/O subsystem DCAs off? Yes Go to “Step 152m-26.” No Call for support.Step 152m-26 Replace the third FRU in the list of FRUs recorded in step 152m-1. Step 152m-27 Turn on the UEPO switch. Turn on the power. Step 152m-28 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Call for support. No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-29 Is the error code recorded in “Step 152m-1” on page 300 of the form 101A xxxx or 101B xxxx? Yes Go to “Step 152m-12” on page 301. No All system partitions must be shut down and the system must be powered off to correct the power cabling. Schedule nonconcurrent service with the customer to correct the power cabling. This ends this procedure.Step 152m-30 Is the error code recorded in “Step 152m-1” on page 300 of the form 101A xxxx or 101B xxxx? Yes Go to “Step 152m-6” on page 301. No Verify all power subsystem UPIC connections. Go to “Step 152m-31.”Step 152m-31 Are all power subsystem UPIC connections properly seated and correctly connected? Yes Call for support. No All system partitions must be shut down and the system must be powered off to correct the power cabling. Schedule nonconcurrent service with the customer to correct the power cabling. This ends this procedure. Chapter
3.
Maintenance
Analysis
Procedures
303
MAP 152n: DASD Subsystem Power Problem Step 152n-1 Record the error code and location code(s) that sent you to this MAP. Step 152n-2 Press the green start service button on the panel with the UEPO switch. Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152n-3 Examine the last 2 characters of the error code recorded in “Step 152n-1,” and use the following list to determine which DASD backplane to service. v If the last 2 characters of the error code is 96, then the following steps refer to DASD backplane 1 (P3) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is A6, then the following steps refer to DASD backplane 2 (P4) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is B6, then the following steps refer to DASD backplane 3 (P5) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is C6, then the following steps refer to DASD backplane 4 (P6) of I/O subsystem number found in “Step 152n-2.”Step 152n-4 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152n-5 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152n-6” on page 305. No Call for support. 304
406/90
Service
Guide
Step 152n-6 Are there any DASD in positions A8, A9, Aa, or Ab of the DASD backplane found in “Step 152n-2” on page 304 and “Step 152n-3” on page 304? Yes Go to “Step 152n-7.” No Go to “Step 152n-15” on page 306.Step 152n-7 Record the DASD positions in slots A8, A9, Aa, or Ab, then remove any DASD in slots A8, A9, Aa, or Ab. Note: You will be instructed in a later step to reinstall the DASD one at a time. Turn on the power. Step 152n-8 Did the error code recorded in “Step 152n-1” on page 304 reappear? Yes Go to “Step 152n-14.” No One of the DASD just removed is defective. Go to “Step 152n-9.”Step 152n-9 Turn off the power. Make sure all green ?power good out? LEDs of all installed I/O subsystem DCAs are off. Step 152n-10 Reinstall one of the DASD removed in “Step 152n-6.” Step 152n-11 Turn on the power. Step 152n-12 Did the error code recorded in “Step 152n-1” on page 304 reappear? Yes The DASD just reinstalled is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152n-13.”Step 152n-13 Have all DASD that were removed in “Step 152n-6” been reinstalled? Yes The problem has changed. This ends the procedure. No Go to “Step 152n-9” to reinstall the next DASD.Step 152n-14 Turn off the power. Make sure all green ?power good out? LEDs of all installed I/O subsystem DCAs are off. Chapter
3.
Maintenance
Analysis
Procedures
305
Step 152n-15 Replace the books in the following list, one at a time and in the listed order: v DCA 1 at V1 v DCA 2 at V2Step 152n-16 Turn on the power. Step 152n-17 Did the system stop with the same error code as recorded in “Step 152n-1” on page 304? Yes Go to “Step 152n-18.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152n-18 Turn off the power then make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152n-19 Remove the new card that was just installed in “Step 152n-15” and reinstall the original card. Step 152n-20 Have all the cards listed in “Step 152n-15” been replaced with new or original cards?? Yes Go to “Step 152n-21.” No Go to “Step 152n-15.”Step 152n-21 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152n-22 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to Step “Step 152n-23.” No Call for support.Step 152n-23 Replace the DASD backplane. Step 152n-24 Turn on the power. 306
406/90
Service
Guide
Step 152n-25 Did the system stop with the same error code as recorded in “Step 152m-1” on page 300? Yes Call for support. No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
307
MAP 1540: Problem Isolation Procedures These procedures are used to locate problems in the processor subsystem, I/O subsystems, or rack. If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table; then follow the instructions given in the Action column. Attention: To identify a FRU, these MAPs refer to FRU Identify LEDs. For more information about the FRU Identify LEDs see “FRU Identify LEDs” on page 65.
Problem Isolation Procedures
Symptom/Reference Code/Checkpoint
Action
You were sent here by the Diagnostic Information for Multiple Bus Systems book.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
406x 00B7
Go to “MAP 154B: Insufficient Hardware Resources Problem Isolation” on page 365. Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
406x 0EB1, 406x 0EB2, 406x 0EB3, 406x 0EB4, 406x 0EB5, 406x 0EB6, 406x 0EB7, 406x 0EB8, 406x 0EB9, 406x 0EBA, 406x 0EBB, 406x 0EBC
Go to “MAP 1549: Attention Problem Isolation” on page 353.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
406x 0EB0
Go to “MAP 1541: JTAG Problem Isolation” on page 310.
406x 0C03, 4B2x 258A, 4B2x 25C7, 4B2x 25C8, 4B2x 25CF, 4B2x 25D3, 4B2x 25D5, 4B2x 25D7, 4B2x 25D8, 4B2x 25E5, 4B2x 25E94B2x 526B, 4B2x 526C, 4B2x 526D, 4B2x 526E
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
450x 302A, 450x 302C, 450x305A, 450x 305C, 450x 305E, 450x 307A, 450x 307B, 450x 307C, 450x 30BD, 450x 30C3
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
450x 302B, 450x 302D, 450x 305B 450x 305D, 450x 305F, 450x 308A, 450x 308B, 450x 308C, 450x 30C6, 450x 30CC
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 308
406/90
Service
Guide
Problem Isolation Procedures
Symptom/Reference Code/Checkpoint
Action
406x 0C09
Go to “MAP 1546: Memory Book Problem Isolation” on page 345.
406x 0C0A
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406x 0001, 406x 0002, 406x 000A, 406x 000B,406x 000C, 406x 000D, B1xx 4643, B1xx 4645, B1xx 4648
Go to “MAP 154A: I2C Bus Problem Isolation” on page 361.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
E101, E102, E10A, E10B, E111, E120, E121, E122, E130, E131, E132, E133, E134, E135, E138, E139, E13A, E149, E14C, E191, E19A, E19B, E19D, E1A0, E1A1, E1A2, E1A3, E1A4, E1A5, E1A6, E1A7, E1A8, E1A9, E1AA, E1AB, E1AC, E1AD, E1AE, E1AF, E1B1, E1C4, E1C5, E1C6, E1D0, E1D3, E1D4, E1DB, E1F0, E900, E901
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882.
Chapter
3.
Maintenance
Analysis
Procedures
309
MAP 1541: JTAG Problem Isolation Step 1541-1 Record the error code and location code(s) that sent you to this MAP. Step 1541-2 Check the following list to find the location code(s) recorded in “Step 1541-1”: v L3 module at U1.18-P1-C5 v L3 module at U1.18-P1-C11 v L3 module at U1.18-P1-C14 v L3 module at U1.18-P1-C20 v L3 module at U1.18-P1-C9 v L3 module at U1.18-P1-C10 v L3 module at U1.18-P1-C6 v L3 module at U1.18-P1-C12 v L3 module at U1.18-P1-C13 v L3 module at U1.18-P1-C7 v L3 module at U1.18-P1-C15 v L3 module at U1.18-P1-C16 v L3 module at U1.18-P1-C17 v L3 module at U1.18-P1-C18 v L3 module at U1.18-P1-C19 v L3 module at U1.18-P1-C8 v MCM module 0 at U1.18-P1-C1 v MCM module 3 at U1.18-P1-C2 v MCM module 1 at U1.18-P1-C3 v MCM module 2 at U1.18-P1-C4 310
406/90
Service
Guide
Step 1541-3 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 also listed in “Step 1541-2” on page 310? Yes Go to “Step 1541-19” on page 313. No Go to “Step 1541-4.”Step 1541-4 Check the following list to find the location code(s) recorded in “Step 1541-1” on page 310: v Fourth I/O book, at location U1.18-P1-H1 v Primary I/O Book at U1.18-P1-H2 v Secondary I/O Book at U1.18-P1-H3 v Third I/O Book at U1.18-P1-H4 v Memory Book 6 at U1.18-P1-M5 v Memory Book 7 at U1.18-P1-M1Step 1541-5 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 also listed in “Step 1541-4”? Yes Go to “Step 1541-25” on page 314. No Go to “Step 1541-6.”Step 1541-6 Check the following list to find the location code(s) recorded in “Step 1541-1” on page 310: v Memory Book 0 at U1.18-P1-M2 v Memory Book 1 at U1.18-P1-M3 v Memory Book 2 at U1.18-P1-M7 v Memory Book 3 at U1.18-P1-M6 v Memory Book 4 at U1.18-P1-M4 v Memory Book 5 at U1.18-P1-M8Step 1541-7 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 listed in “Step 1541-6”? Yes Go to “Step 1541-31” on page 315. No Go to “Step 1541-8” on page 312. Chapter
3.
Maintenance
Analysis
Procedures
311
Step 1541-8 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-9 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-10.” No Call for support.Step 1541-10 Replace the item recorded in “Step 1541-1” on page 310. Step 1541-11 Turn on the power. Step 1541-12 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-13.” No This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-13 Turn off the power. Step 1541-14 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-15 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-16.” No Call for support.Step 1541-16 Replace the primary I/O book at U1.18-P1-H2. Step 1541-17 Turn on the power. 312
406/90
Service
Guide
Step 1541-18 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-37” on page 316. No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-19 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-20 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-21.” No Call for support.Step 1541-21 Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Attention: MCM and L3 modules have a limit of three plug cycles. Before removing any of the modules in the following list, call for support. Replace the following modules or books, if present, one at a time and in the order listed: v First location code item recorded, if any, “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Primary I/O book at U1.18-P1-H2. v L3 module, at location U1.18-P1-C5 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C11 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C14 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C20 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C9 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C10 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C6 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C12 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C13 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C7 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C15 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C16 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C17 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C18 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C19 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C8 if not recorded in “Step 1541-1” on page 310. v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1541-1” on page 310. v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1541-1” on page 310. v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1541-1” on page 310. v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1541-1” on page 310.
Chapter
3.
Maintenance
Analysis
Procedures
313
Step 1541-22 Turn on the power. Step 1541-23 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-24.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-24 Have all the MCMs and books listed in “Step 1541-21” on page 313 been replaced? Yes Go to “Step 1541-37” on page 316. No Go to “Step 1541-19” on page 313.Step 1541-25 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-26 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-27.” No Call for support.Step 1541-27 Replace the following books, if present, one at a time: v First location code item recorded, if any in “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth I/O book, at location U1.18-P1-H1 if not recorded in “Step 1541-1” on page 310. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1541-1” on page 310. v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 1541-1” on page 310. v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 1541-1” on page 310. v Memory Book 6 at U1.18-P1-M5 if not recorded in “Step 1541-1” on page 310. v Memory Book 7 at U1.18-P1-M1 if not recorded in “Step 1541-1” on page 310.Step 1541-28 Turn on the power. 314
406/90
Service
Guide
Step 1541-29 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-30.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-30 Have all the books listed in “Step 1541-27” on page 314 been replaced? Yes Go to “Step 1541-37” on page 316. No Go to “Step 1541-25” on page 314.Step 1541-31 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-32 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-33.” No Call for support.Step 1541-33 Replace the following books, if present, one at a time: v First location code item recorded in “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Primary I/O book at U1.18-P1-H2. v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1541-1” on page 310. v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1541-1” on page 310. v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1541-1” on page 310. v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1541-1” on page 310. v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1541-1” on page 310. v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1541-1” on page 310Step 1541-34 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
315
Step 1541-35 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-36.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-36 Have all the books listed in “Step 1541-33” on page 315 been replaced? Yes Go to “Step 1541-37.” No Go to “Step 1541-31” on page 315.Step 1541-37 Turn off the power. Step 1541-38 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-39 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-40.” No Call for support.Step 1541-40 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1541-41 Turn on the power. Step 1541-42 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-43” on page 317. No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 316
406/90
Service
Guide
Step 1541-43 Turn off the power. Make sure the amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1541-44 Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
317
MAP 1542: I/O Problem Isolation This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. This I/O problem-determination MAP isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards exhibiting a failure will have been replaced or reseated.Notes: 1. This MAP assumes that a CD-ROM drive is installed in the media subsystem and connected to a SCSI adapter in the first I/O subsystem, and that a Diagnostics CD-ROM is available. 2. If a general-access password or privileged-access password is installed, you are prompted to enter the password before the diagnostic CD-ROM can load. 3. The term POST indicators refers to the device mnemonics (words Memory, Keyboard, Network, SCSI, and Speaker) that appear on the system console during the power-on self-test (POST). 4. The service processor may have recorded one or more symptoms in its error log. It is a good idea to examine that error log before proceeding (see “System Information Menu” on page 781). 5. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If you disable them, make notes of their current settings so that you can restore them easily. The following table lists the settings:
Surveillance
From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. (Operating System Surveillance is disabled in partition mode.)
Unattended Start
From the service processor System Power Control Menu, disable unattended start mode.
Reboot Policy
From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No 3. Enable supplemental restart policy to No.
Call Out
From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. The steps in this MAP will attempt to slow boot the system into service mode diagnostics. The following procedure describes booting a nonpartitioned system with an attached 3151 console. The procedure for booting a partitioned system with an attached HMC is slightly different and is described in “Performing Slow Boot” on page 413. “Performing Slow Boot” describes the procedure for booting online diagnostics (6 key between the Keyboard and Speaker indicators) and the following procedure requires booting standalone diagnostics (5 key between the Keyboard and Speaker indicators). Therefore, on a partitioned system, make sure the partition you selected to boot in the following procedure has the CD-ROM allocated to it, as described in Chapter 6, “Using the Online and Standalone Diagnostics,” on page 759. 318
406/90
Service
Guide
Step 1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. Select slow boot mode (select Disable Fast Boot) on the System Power Control menu from the service processor main menu. 4. Turn on the power. 5. Insert the diagnostic CD-ROM into the CD-ROM drive. Does the CD-ROM appear to operate correctly? NO Go to “Boot Problems” on page 409, and perform steps 1-8 for an internally attached boot device. YES Go to “Step 1542-2.”Step 1542-2 1. When the keyboard indicator is displayed (the word Keyboard), press the 5 key on the firmware console. 2. Enter the appropriate password when you are prompted to do so. Is the Please define system console screen displayed? NO Go to “Step 1542-3.” YES Go to “Step 1542-4” on page 320.Step 1542-3 The system is unable to boot standalone diagnostics. Check the service processor error log and the media subsystem operator panel for additional error codes resulting from the slow boot in “Step 1542-1.” Did the slow boot generate a different error code from the one that originally sent you to MAP 1542? NO It appears you have a processor subsystem problem. Call for support. This ends the procedure. YES Restore fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to “Checkpoints and Error Codes Index” on page 416, and follow the actions for the new error code. Chapter
3.
Maintenance
Analysis
Procedures
319
Step 1542-4 The system stopped with the Please define system console prompt appearing on the system console. Standalone diagnostics can be booted. Perform the following: 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. If the terminal type has not been defined, you must use the Initialize Terminal option of the FUNCTION SELECTION menu to initialize the AIX operating system environment before you can continue with the diagnostics. This is a separate and different operation from selecting the firmware console display. 4. Select Advanced Diagnostic Routines. 5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run diagnostics on all resources. Did running diagnostics produce a different symptom? NO Go to step number 6, below. YES Go to “Entry MAP” on page 149. Use the new symptom.6. Record any devices missing from the list of all adapters and devices. Continue with this MAP. When you have fixed the problem, use this record to verify that all devices appear when you run System Verification.Are there any devices missing from the list of all adapters and devices? NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original configuration. Be sure to select fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES The boot attempts that follow will attempt to isolate any remaining I/O subsystem problems with missing devices. Ignore any codes that may display on the operator panel unless stated otherwise. Go to “Step 1542-5” on page 321. 320
406/90
Service
Guide
Step 1542-5 There are missing devices associated with one or more I/O subsystems.Note: Before continuing, check the cabling from the processor subsystem to I/O subsystem(s) to ensure that the system is cabled correctly. Refer to “Subsystem Positioning and Cabling” on page 36 for valid configurations. Record the current cabling configuration and then continue with the following steps: 1. Turn off the power. 2. At the primary I/O book, disconnect the cable connection at RIO port A0 (U1.18-P1-H2/Q1). 3. Disconnect the cable connection at I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). The RIO cable that was connected to RIO port A0 should now be loose and can be removed. 4. At the primary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H2/Q2) and reconnect it to RIO port A0 (U1.18-P1-H2/Q1). 5. Disconnect the cable connection at I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1). The RIO cable that was connected to RIO port A0 on the primary I/O book should now be loose and can be removed. 6. Verify that processor subsystem RIO port A0 on primary I/O book (U1.18-P1-H2/Q1) is connected to I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). 7. Verify that I/O subsystem 1 left I/O port connector 1 (U1.9-P1/Q1) is connected to I/O subsystem 1 right I/O port connector 0 (U1.9-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-6” on page 322. YES The RIO cable that was removed in step 5 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
321
Step 1542-6 The 406/90 server must have a minimum of one I/O subsystem and one media subsystem attached to the processor subsystem. The following steps reduce the system I/O to the required minimum of one I/O subsystem: 1. Turn off the system power. 2. Verify that processor subsystem RIO port A0 on primary I/O book (U1.18-P1-H2/Q1) is connected to I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). 3. Verify that I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) is connected to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). 4. Verify that I/O subsystem 1 left I/O port connector 1 (U1.9-P1/Q1) is connected to I/O subsystem 1 right I/O port connector 0 (U1.9-P2/Q2). 5. Disconnect cable at processor subsystem RIO port B0 on primary I/O book (U1.18-P1-H2/Q3), if cable attached. 6. Disconnect cable at processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4), if cable attached. 7. Unplug the secondary I/O book (U1.18-P1-H3), if present, and carefully pull it approximately 25 mm (1 inch) away from the backplane without disconnecting any cables 8. Unplug the third I/O book (U1.18-P1-H4), if present, and carefully pull it approximately 25 mm (1 inch) away from the backplane without disconnecting any cables. 9. Connect the previously removed RIO cable from I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). 10. Verify that the power cable is connected between I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. 11. Disconnect power cables from remaining I/O subsystems. All I/O subsystems except the primary I/O subsystem should now be physically disconnected from the processor subsystem. Go to “Step 1542-7” on page 323. 322
406/90
Service
Guide
Step 1542-7 I/O subsystem 1 (U1.9) is the only I/O subsystem connected to the processor subsystem. 1. Turn on the power to boot standalone diagnostics from CD. 2. If the Please define the System Console screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 4. Check that all attached devices and adapters are listed.If the Please define the System Console prompt did not display or all attached devices and adapters are not listed, the problem is in the media subsystem or I/O subsystem 1 (U1.9). Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-8.” YES Go to “Step 1542-21” on page 329.Step 1542-8 Perform the following to deconfigure the media subsystem: 1. Turn off the power. 2. If you have not already done so, set the service processor settings with the instructions in step 5 on page 318. Then return here and continue. 3. Exit the service processor menus, and disconnect the power cable between the I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. 4. If a display adapter with keyboard and mouse connected to a Keyboard/Mouse Attachment (USB) card in I/O subsystem 1 (U1.9) is being used as the console, locate a standalone serial terminal (for example a 3151), and connect to the S1 serial port or use the HMC. 5. Remove the keyboard and mouse, if attached to the I/O subsystem 1 (U1.9). 6. Remove (unplug) the tape drive (if installed) from the media subsystem. 7. Disconnect the diskette drive cable from the diskette drive connector on the media subsystem. 8. Connect the power cable between the I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. The media subsystem operator panel momentarily displays an OK prompt. 9. Set the current drawer location to x.y=1.9. Use this value in future steps until changed in a later step.Go to “Step 1542-9” on page 324. Chapter
3.
Maintenance
Analysis
Procedures
323
Step 1542-9 Are there any adapters in slots 1, 2, 3, 4 or 5 (location codes Ux.y-P1-I1 through I5) on the left side of the I/O subsystem? NO Go to “Step 1542-10.” YES Go to “Step 1542-13.”Step 1542-10 Are there any adapters in slots 6, 7, 8, 9 or 10 (location codes Ux.y-P1-I6 through I10) on the left side of the I/O subsystem? NO Go to “Step 1542-11.” YES Go to “Step 1542-14” on page 325.Step 1542-11 Are there any adapters in slots 1, 2, 3, 4 or 5 (location codes Ux.y-P2-I1 through I5) on the right side of the I/O subsystem? NO Go to “Step 1542-12.” YES Go to “Step 1542-15” on page 325.Step 1542-12 Are there any adapters in slots 6, 7, 8, 9 or 10 (location codes Ux.y-P2-I7 through I10) on the right side of the I/O subsystem? NO Go to “Step 1542-19” on page 328. YES Go to “Step 1542-16” on page 326.Step 1542-13 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4 and 5 (location codes Ux.y-P1-I1 through I5) from the left side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-10.” YES Go to “Step 1542-17” on page 326. 324
406/90
Service
Guide
Step 1542-14 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 6, 7, 8, 9 and 10 (location codes Ux.y-P1-I6 through I10) from the left side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-11” on page 324. YES Go to “Step 1542-17” on page 326.Step 1542-15 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4 and 5 (location codes Ux.y-P2-I1 through I5) from the right side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-12” on page 324. YES Go to “Step 1542-17” on page 326. Chapter
3.
Maintenance
Analysis
Procedures
325
Step 1542-16 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 6, 7, 8, 9 and 10 (location codes Ux.y-P2-I6 through I10) from the right side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-19” on page 328. YES Go to “Step 1542-17.”Step 1542-17 If the Please define the System Console screen displays and all attached devices and adapters are listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the I/O subsystem. 1. Turn off the power. 2. Reinstall one adapter or device that was removed. Use the original adapter cards in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone diagnostics from CD. 4. If the Please define the System Console screen is displayed, follow directions to select the system console. 5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 6. Check that all attached devices and adapters are listed.Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-18” on page 327. YES Reinstall the next adapter or device and return to the beginning of this step. Continue repeating this process until an adapter or device causes the Please define the System Console screen to not display or all attached devices and adapters to not be listed. After installing all of the adapters and the Please define the System Console screen does display and all attached devices and adapters are listed, return the system to its original configuration. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 326
406/90
Service
Guide
Step 1542-18 Replace the adapter you just installed with a new adapter and try to boot standalone diagnostics from CD again. 1. If the Please define the System Console screen is displayed, follow directions to select the system console. 2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 3. Check that all attached devices and adapters are listed.Did the Please define the System Console screen display and are all attached devices and adapters listed? NO The I/O subsystem backplane is defective. Replace the I/O subsystem backplane on the side of the problem adapter (Ux.y-P1 or Ux.y-P2). Go to “Step 1542-20” on page 329. YES The adapter was defective. Go to “Step 1542-20” on page 329. Chapter
3.
Maintenance
Analysis
Procedures
327
Step 1542-19 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Boot device SCSI cable b. Boot device SCSI adapter c. Boot device SCSI backplane d. Boot device e. I/O subsystem backplane where boot device is located f. Other I/O subsystem backplane g. Both I/O subsystem DCAs4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Continue repeating this process until a part causes the Please define the System Console screen to be displayed and all attached devices and adapters to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all attached devices and adapters are not listed, check any external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to “Step 1542-20” on page 329. 328
406/90
Service
Guide
Step 1542-20 1. Turn off the power. 2. The part just replaced fixed the problem. 3. If a display adapter with keyboard, and mouse were used, reinstall the display adapter card, keyboard and mouse. 4. Reconnect the diskette drive cable to the diskette drive connector on the media subsystem. 5. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable. 6. Plug in all adapters that were previously removed but not reinstalled. 7. Reconnect the I/O subsystem power cables that were previously disconnected. Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Step 1542-21 If the boot is successful, the problem is with a remaining I/O subsystem. Turn off the power. Are there any I/O subsystems left to connect to the processor subsystem? NO Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1542-22.”Step 1542-22 Reconnect the next I/O subsystem in the configuration that you recorded earlier in this MAP. Use the following table to determine the next I/O subsystem to reconnect.
I/O Subsystem
Current x.y Value
New x.y Value
Go to:
1
1.9
1.5
“Step 1542-23” on page 330
2
1.5
1.1
“Step 1542-24” on page 331
3
1.1
1.13 (no IBF), 2.9 (with IBF)
“Step 1542-25” on page 332
4
1.13
2.1
“Step 1542-26” on page 333
4
2.9
2.1
“Step 1542-26” on page 333
5
2.1
2.5
“Step 1542-27” on page 334
6
2.5
2.13
“Step 1542-28” on page 335
7
2.13
2.19
“Step 1542-29” on page 336
8
2.19
2.19
“Step 1542-30” on page 336 Note: The I/O subsystems should be cabled as shown in “Subsystem Positioning and Cabling” on page 36. Chapter
3.
Maintenance
Analysis
Procedures
329
Step 1542-23 1. Turn off the power. 2. At the primary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H2/Q3). 3. Disconnect the cable connection at I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). The RIO cable that was connected to RIO port B0 should now be loose and can be removed. 4. At the primary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H2/Q4) and reconnect it to RIO port B0 (U1.18-P1-H2/Q3). 5. Disconnect the cable connection at I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) and reconnect it to I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). 6. Verify that processor subsystem RIO port B0 on primary I/O book (U1.18-P1-H2/Q3) is connected to I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). 7. Verify that I/O subsystem 2 left I/O port connector 1 (U1.5-P1/Q1) is connected to I/O subsystem 2 right I/O port connector 0 (U1.5-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) to processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step number 3 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) to processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 330
406/90
Service
Guide
Step 1542-24 1. Turn off the power. 2. Slide the secondary I/O book (U1.18-P1-H3) all the way into the processor backplane until it is firmly seated and latched. 3. At the secondary I/O book, disconnect the cable connection at RIO port A0 (U1.18-P1-H3/Q4). 4. At the secondary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H3/Q3). 5. At the secondary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H3/Q2). 6. At the secondary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H3/Q1). 7. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H3/Q6). 8. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H3/Q5). 9. At the secondary I/O book, disconnect the cable connection at RIO port C0 (U1.18-P1-H3/Q8). 10. Disconnect the cable connection at I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). The RIO cable that was connected to RIO port C0 should now be loose and can be removed. 11. At the secondary I/O book, disconnect the cable connection at RIO port C1 (U1.18-P1-H3/Q7) and reconnect it to RIO port C0 (U1.18-P1-H3/Q8). 12. Disconnect the cable connection at I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) and reconnect it to I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). 13. Verify that processor subsystem RIO port C0 on secondary I/O book (U1.18-P1-H3/Q8) is connected to I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). 14. Verify that I/O subsystem 3 left I/O port connector 1 (U1.1-P1/Q1) is connected to I/O subsystem 3 right I/O port connector 0 (U1.1-P2/Q2). 15. Turn on the power to boot standalone diagnostics from CD. 16. If the Please define the System Console screen is displayed, follow directions to select the system console. 17. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 18. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H3/Q7). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 10 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H3/Q7). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
331
Step 1542-25 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H3/Q6). 3. Disconnect the cable connection at I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). The RIO cable that was connected to RIO port D0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H3/Q5) and reconnect it to RIO port D0 (U1.18-P1-H3/Q6). 5. Disconnect the cable connection at I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) and reconnect it to I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). 6. Verify that processor subsystem RIO port D0 on secondary I/O book (U1.18-P1-H3/Q6) is connected to I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). 7. Verify that I/O subsystem 4 left I/O port connector 1 (Ux.y-P1/Q1) is connected to I/O subsystem 4 right I/O port connector 0 (Ux.y-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H3/Q5). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H3/Q5). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 332
406/90
Service
Guide
Step 1542-26 1. Turn off the power. 2. At the secondary I/O book disconnect the cable connection at RIO port A0 (U1.18-P1-H3/Q4). 3. Disconnect the cable connection at I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2).The RIO cable that was connected to RIO port A0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H3/Q3) and reconnect it to RIO port A0 (U1.18-P1-H3/Q4). 5. Disconnect the cable connection at I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) and reconnect it to I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2). 6. Verify that processor subsystem RIO port A0 on secondary I/O book (U1.18-P1-H3/Q4) is connected to I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2). 7. Verify that I/O subsystem 5 left I/O port connector 1 (U2.1-P1/Q1) is connected to I/O subsystem 5 right I/O port connector 0 (U2.1-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) to processor subsystem RIO port A1 on secondary I/O book (U1.18-P1-H3/Q3). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) to processor subsystem RIO port A1 on secondary I/O book (U1.18-P1-H3/Q3). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
333
Step 1542-27 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H3/Q2). 3. Disconnect the cable connection at I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). The RIO cable that was connected to RIO port B0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H3/Q1) and reconnect it to RIO port B0 (U1.18-P1-H3/Q2). 5. Disconnect the cable connection at I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) and reconnect it to I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). 6. Verify that processor subsystem RIO port B0 on secondary I/O book (U1.18-P1-H3/Q2) is connected to I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). 7. Verify that I/O subsystem 6 left I/O port connector 1 (U2.5-P1/Q1) is connected to I/O subsystem 6 right I/O port connector 0 (U2.5-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) to processor subsystem RIO port B1 on secondary I/O book (U1.18-P1-H3/Q1). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) to processor subsystem RIO port B1 on secondary I/O book (U1.18-P1-H3/Q1). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 334
406/90
Service
Guide
Step 1542-28 1. Turn off the power. 2. Slide the secondary I/O book (U1.18-P1-H4) all the way into the processor backplane until it is firmly seated and latched. 3. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H4/Q6). 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H4/Q5). 5. At the secondary I/O book, disconnect the cable connection at RIO port C0 (U1.18-P1-H4/Q8). 6. Disconnect the cable connection at I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). The RIO cable that was connected to RIO port C0 should now be loose and can be removed. 7. At the secondary I/O book, disconnect the cable connection at RIO port C1 (U1.18-P1-H4/Q7) and reconnect it to RIO port C0 (U1.18-P1-H4/Q8). 8. Disconnect the cable connection at I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) and reconnect it to I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). 9. Verify that processor subsystem RIO port C0 on secondary I/O book (U1.18-P1-H4/Q8) is connected to I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). 10. Verify that I/O subsystem 7 left I/O port connector 1 (U2.13-P1/Q1) is connected to I/O subsystem 7 right I/O port connector 0 (U2.13-P2/Q2). 11. Turn on the power to boot standalone diagnostics from CD. 12. If the Please define the System Console screen is displayed, follow directions to select the system console. 13. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 14. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H4/Q7). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 6 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H4/Q7). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter
3.
Maintenance
Analysis
Procedures
335
Step 1542-29 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H4/Q6). 3. Disconnect the cable connection at I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). The RIO cable that was connected to RIO port D0 should now be loose and can be removed 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H4/Q5) and reconnect it to RIO port D0 (U1.18-P1-H4/Q6). 5. Disconnect the cable connection at I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) and reconnect it to I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). 6. Verify that processor subsystem RIO port D0 on secondary I/O book (U1.18-P1-H4/Q6) is connected to I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). 7. Verify that I/O subsystem 8 left I/O port connector 1 (U2.19-P1/Q1) is connected to I/O subsystem 8 right I/O port connector 0 (U2.19-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H4/Q5). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H4/Q5). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems.Step 1542-30 This ends the procedure. Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 336
406/90
Service
Guide
MAP 1543: MCM Module Problem Isolation Step 1543-1 Record the error code and the location code(s) that sent you to this MAP. Step 1543-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1543-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1543-4.” No Call for support.Step 1543-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following cards and modules: v Passthru module at U1.18-P1-C2 v Passthru module at U1.18-P1-C3 v MCM module 0 at U1.18-P1-C1 v MCM module 2 at U1.18-P1-C4 v MCM module 1 at U1.18-P1-C3 v MCM module 3 at U1.18-P1-C2 v System clock card, at location U1.18-P1-X5Replace the following books and modules, if present, one at a time: 1. Passthru module at U1.18-P1-C2 if not recorded in “Step 1543-1.” 2. Passthru module at U1.18-P1-C3 if not recorded in “Step 1543-1.” 3. System clock card, at location U1.18-P1-X5 if not recorded in “Step 1543-1.” 4. First location code item recorded, if any, in “Step 1543-1.” 5. Second location code item recorded, if any, in “Step 1543-1.” 6. Third location code item recorded, if any, in “Step 1543-1.” 7. Fourth location code item recorded, if any, in “Step 1543-1.” 8. MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1543-1.” 9. MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1543-1.” 10. MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1543-1.” 11. MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1543-1.”Step 1543-5 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
337
Step 1543-6 Did the system stop with the same error code as recorded in “Step 1543-1” on page 337? Yes Go to “Step 1543-7.” No The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1543-7 Have all the modules listed in “Step 1543-4” on page 337 been replaced? Yes Go to “Step 1543-8.” No Go to “Step 1543-2” on page 337.Step 1543-8 Turn off the power. Step 1543-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1543-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1543-11.” No Call for support.Step 1543-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1543-12 Turn on the power. Step 1543-13 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1543-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1543-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1543-15 Call for support. 338
406/90
Service
Guide
This ends the procedure.
Chapter
3.
Maintenance
Analysis
Procedures
339
MAP 1544: Inner L3 Module Problem Isolation Step 1544-1 Record the error code and the location code(s) that sent you to this MAP. Step 1544-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1544-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1544-4.” No Call for support.Step 1544-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following books and modules: v Memory book 0 at U1.18-P1-M2 v Memory book 1 at U1.18-P1-M3 v Memory book 2 at U1.18-P1-M7 v Memory book 3 at U1.18-P1-M6 v L3 module at U1.18-P1-C6 v L3 module at U1.18-P1-C7 v L3 module at U1.18-P1-C10 v L3 module at U1.18-P1-C11 v L3 module at U1.18-P1-C14 v L3 module at U1.18-P1-C15 v L3 module at U1.18-P1-C18 v L3 module at U1.18-P1-C19Replace the following books and modules, if present, one at a time: 1. First location code item recorded, if any, in “Step 1544-1.” 2. Second location code item recorded, if any, in “Step 1544-1.” 3. Third location code item recorded, if any, in “Step 1544-1.”Step 1544-5 Turn on the power. Step 1544-6 Did the system stop with the same error code as recorded in “Step 1544-1”? Yes Go to “Step 1544-7” on page 341. No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 340
406/90
Service
Guide
Step 1544-7 Have all the books or modules listed in “Step 1544-4” on page 340 been replaced? Yes Go to “Step 1544-8.” No Go to “Step 1544-2” on page 340.Step 1544-8 Turn off the power. Step 1544-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1544-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1544-11.” No Call for support.Step 1544-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1544-12 Turn on the power. Step 1544-13 Did the system stop with the same error code as recorded in “Step 1544-1” on page 340? Yes Go to “Step 1544-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1544-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1544-15 Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
341
MAP 1545: Outer L3 Module Problem Isolation Step 1545-1 Record the error code and the location code(s) that sent you to this MAP. Step 1545-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1545-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1545-4.” No Call for support.Step 1545-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following books and modules: v Memory book 4 at U1.18-P1-M4 v Memory book 5 at U1.18-P1-M8 v Memory book 6 at U1.18-P1-M5 v Memory book 7 at U1.18-P1-M1 v L3 module at U1.18-P1-C5 v L3 module at U1.18-P1-C8 v L3 module at U1.18-P1-C9 v L3 module at U1.18-P1-C12 v L3 module at U1.18-P1-C13 v L3 module at U1.18-P1-C16 v L3 module at U1.18-P1-C17 v L3 module at U1.18-P1-C20Replace the following books and modules, if present, one at a time: 1. First location code item recorded, if any, in “Step 1545-1.” 2. Second location code item recorded, if any, in “Step 1545-1.” 3. Third location code item recorded, if any, in “Step 1545-1.”Step 1545-5 Turn on the power. 342
406/90
Service
Guide
Step 1545-6 Did the system stop with the same error code as recorded in “Step 1545-1” on page 342? Yes Go to “Step 1545-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1545-7 Have all the books or modules listed in “Step 1545-4” on page 342 been replaced? Yes Go to “Step 1545-8.” No Go to “Step 1545-2” on page 342.Step 1545-8 Turn off the power. Step 1545-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1545-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1545-11.” No Call for support.Step 1545-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1545-12 Turn on the power. Step 1545-13 Did the system stop with the same error code as recorded in “Step 1545-1” on page 342? Yes Go to “Step 1545-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1545-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1545-15 Call for support. Chapter
3.
Maintenance
Analysis
Procedures
343
This ends the procedure.
344
406/90
Service
Guide
MAP 1546: Memory Book Problem Isolation Step 1546-1 Record the error code and the location code(s) that sent you to this MAP. Step 1546-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1546-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1546-4.” No Call for support.Step 1546-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1546-1.” v Second location code item recorded, if any, in “Step 1546-1.” v Third location code item recorded, if any, in “Step 1546-1.” v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1546-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1546-1.” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1546-1.” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1546-1.” v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1546-1.” v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1546-1.” v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1546-1.” v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C5 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C6 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C7 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C8 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C9 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C10 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C11 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C12 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C13 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C14 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C15 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C16 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C17 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C18 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C19 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C20 if not recorded in “Step 1546-1.”Step 1546-5 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
345
Step 1546-6 Did the system stop with the same error code as recorded in “Step 1546-1” on page 345? Yes Go to “Step 1546-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1546-7 Have all the books or modules listed in “Step 1546-4” on page 345 been replaced? Yes Go to “Step 1546-8.” No Go to “Step 1546-2” on page 345.Step 1546-8 Turn off the power. Step 1546-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1546-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1546-11.” No Call for support.Step 1546-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1546-12 Turn on the power. Step 1546-13 Did the system stop with the same error code as recorded in “Step 1546-1” on page 345? Yes Go to “Step 1546-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1546-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1546-15 Call for support. 346
406/90
Service
Guide
This ends the procedure.
Chapter
3.
Maintenance
Analysis
Procedures
347
MAP 1547: I/O Book Problem Isolation Step 1547-1 Record the error code and the location code(s) that sent you to this MAP. Step 1547-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1547-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1547-4.” No Call for support.Step 1547-4 Replace the following books, if present, one at a time: v Primary I/O Book at U1.18-P1-H2 v Secondary I/O Book at U1.18-P1-H3 v Third I/O Book at U1.18-P1–H4 v Fourth I/O Book at U1.18-P1-H1Step 1547-5 Turn on the power. Step 1547-6 Did the system stop with the same error code as recorded in “Step 1547-1”? Yes Go to “Step 1547-7.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1547-7 Have all the books or modules listed in “Step 1547-4” been replaced? Yes Go to “Step 1547-8.” No Go to “Step 1547-2.”Step 1547-8 Turn off the power. Step 1547-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1547-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1547-11” on page 349. 348
406/90
Service
Guide
No Call for support.Step 1547-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1547-12 Turn on the power. Step 1547-13 Did the system stop with the same error code as recorded in “Step 1547-1” on page 348? Yes Go to “Step 1547-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1547-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1547-15 Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
349
MAP 1548: Memory and Processor Problem Isolation Step 1548-1 Record the error code and the location code(s) that sent you to this MAP. Step 1548-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1548-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1548-4.” No Call for support.Step 1548-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1548-1.” v Second location code item recorded, if any, in “Step 1548-1.” v Third location code item recorded, if any, in “Step 1548-1.” v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1548-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C6 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C7 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C10 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C11 if not recorded in “Step 1548-1.” v Passthru module at U1.18-P1-C2 if not recorded in “Step 1548-1.” v Passthru module at U1.18-P1-C3 if not recorded in “Step 1548-1.” v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1548-1.” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1548-1.” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1548-1.” v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1548-1.” v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1548-1.” v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1548-1.” v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C5 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C8 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C9 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C12 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C13 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C14 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C15 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C16 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C17 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C18 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C19 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C20 if not recorded in “Step 1548-1.” v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1548-1.” v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1548-1.” 350
406/90
Service
Guide
v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1548-1” on page 350.Step 1548-5 Turn on the power. Step 1548-6 Did the system stop with the same error code as recorded in “Step 1548-1” on page 350? Yes Go to “Step 1548-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1548-7 Have all the books or modules listed in “Step 1548-4” on page 350 been replaced? Yes Go to “Step 1548-8.” No Go to “Step 1548-2” on page 350.Step 1548-8 Turn off the power. Step 1548-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1548-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1548-11.” No Call for support.Step 1548-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Chapter
3.
Maintenance
Analysis
Procedures
351
Step 1548-12 Turn on the power. Step 1548-13 Did the system stop with the same error code as recorded in “Step 1548-1” on page 350? Yes Go to “Step 1548-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1548-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1548-15 Call for support. This ends the procedure. 352
406/90
Service
Guide
MAP 1549: Attention Problem Isolation Step 1549-1 Record the error code and the location code(s) that sent you to this MAP. Step 1549-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1549-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 1549-4.”Step 1549-4 Examine the following table and locate the error code that sent you to this MAP to determine your next step.
Error Code
Action
406x 0EB1
Go to “Step 1549-5” on page 354.
406x 0EB2
Go to “Step 1549-9” on page 355.
406x 0EB3
Go to “Step 1549-13” on page 356.
406x 0EB4
Go to “Step 1549-33” on page 359.
406x 0EB5
Go to “Step 1549-29” on page 358.
406x 0EB6
Go to “Step 1549-33” on page 359.
406x 0EB7
Go to “Step 1549-33” on page 359.
406x 0EB8
Go to “Step 1549-33” on page 359.
406x 0EB9
Go to “Step 1549-33” on page 359.
406x 0EBA
Go to “Step 1549-17” on page 356.
406x 0EBB
Go to “Step 1549-21” on page 357.
406x 0EBC
Go to “Step 1549-25” on page 358. Chapter
3.
Maintenance
Analysis
Procedures
353
Step 1549-5 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1549-1” on page 353. v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1549-1” on page 353. v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1549-1” on page 353. v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1549-1” on page 353.Step 1549-6 Turn on the power. Step 1549-7 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-8.”Step 1549-8 Have all the books or modules listed in “Step 1549-5” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359. 354
406/90
Service
Guide
Step 1549-9 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C6 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C7 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C10 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C11 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C5 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C8 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C9 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C12 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C13 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C14 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C15 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C16 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C17 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C18 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C19 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C20 if not recorded in “Step 1549-1” on page 353.Step 1549-10 Turn on the power. Step 1549-11 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-12.”Step 1549-12 Have all the books listed in “Step 1549-9” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-13” on page 356. Chapter
3.
Maintenance
Analysis
Procedures
355
Step 1549-13 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1549-1” on page 353. v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-14 Turn on the power. Step 1549-15 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-16.”Step 1549-16 Have all the books or modules listed in “Step 1549-13” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-17 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1549-1” on page 353. v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-18 Turn on the power. 356
406/90
Service
Guide
Step 1549-19 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-20.”Step 1549-20 Have all the books or modules listed in “Step 1549-17” on page 356 been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-21 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1549-1” on page 353. v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-22 Turn on the power. Step 1549-23 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-24.”Step 1549-24 Have all the books or modules listed in “Step 1549-21” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359. Chapter
3.
Maintenance
Analysis
Procedures
357
Step 1549-25 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1549-1” on page 353. v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-26 Turn on the power. Step 1549-27 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-28.”Step 1549-28 Have all the books or modules listed in “Step 1549-25” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-29 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 1549-1” on page 353. v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 1549-1” on page 353. v Fourth I/O Book at U1.18-P1-H1 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-30 Turn on the power. 358
406/90
Service
Guide
Step 1549-31 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-32.”Step 1549-32 Have all the books or modules listed in “Step 1549-29” on page 358 been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37.”Step 1549-33 Replace the following books and modules, if present, one at a time: v Replace the Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353.Step 1549-34 Turn on the power. Step 1549-35 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-36.”Step 1549-36 Have all the books or modules listed in “Step 1549-33” been replaced? No Go to “Step 1549-2” on page 353. Yes Call for support. This ends the procedure.Step 1549-37 Turn off the power. Step 1549-38 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter
3.
Maintenance
Analysis
Procedures
359
Step 1549-39 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 1549-40.”Step 1549-40 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1549-41 Turn on the power. Step 1549-42 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-43.”Step 1549-43 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1549-44 Call for support. This ends the procedure. 360
406/90
Service
Guide
MAP 154A: I2C Bus Problem Isolation Step 154A-1 Record the error code, location code(s), word 13, and word 16 that sent you to this MAP. Step 154A-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154A-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-4.”Step 154A-4 Examine the following table and locate the error code that sent you to this MAP to determine your next step.
Error Code
Action
B1xx 4643
Go to “Step 154A-22” on page 364.
B1xx 4645
Go to “Step 154A-9” on page 362.
B1xx 4648, 406x 0001, 406x 0002, 406x 000A, 406x 000B, 406x 000C, 406x 000D
Go to “Step 154A-5.” Step 154A-5 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1.” v Second location code item recorded, if any, in “Step 154A-1.” v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 154A-1.” v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 154A-1.” v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 154A-1.” v Fourth I/O Book at U1.18-P1-H1 if not recorded in “Step 1549-1” on page 353. v System clock card, at U1.18-P1-X5 if not recorded in “Step 154A-1.”Step 154A-6 Turn on the power. Chapter
3.
Maintenance
Analysis
Procedures
361
Step 154A-7 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-8.”Step 154A-8 Have all the books listed in “Step 154A-5” on page 361 been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14.”Step 154A-9 Is the value recorded in “Step 154A-1” on page 361 for word 16 of this error code equal to 00000008 or 00000009? No Go to “Step 154A-5” on page 361. Yes Go to “Step 154A-10.”Step 154A-10 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1” on page 361. v Second location code item recorded, if any, in “Step 154A-1” on page 361. v Primary I/O Book at U1.18-P1-H2 if not recorded, in “Step 154A-1” on page 361. v Capacitor book, at location U1.18-P1-V3 if not recorded in “Step 154A-1” on page 361. v Capacitor book, at location U1.18-P1-V7 if not recorded in “Step 154A-1” on page 361.Step 154A-11 Turn on the power. Step 154A-12 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-12” on page 355.Step 154A-13 Have all the books or modules listed in “Step 154A-10” been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14.”Step 154A-14 Turn off the power. 362
406/90
Service
Guide
Step 154A-15 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154A-16 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-17.”Step 154A-17 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 154A-18 Turn on the power. Step 154A-19 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-20.”Step 154A-20 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 154A-21 Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
363
Step 154A-22 Is the value recorded in “Step 154A-1” on page 361 for word 13 of this error code equal to 0000A218 or 0000A21B? No Go to “Step 154A-5” on page 361. Yes Go to “Step 154A-23.”Step 154A-23 Replace the following cards and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1” on page 361. v Second location code item recorded, if any, in “Step 154A-1” on page 361. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 154A-1” on page 361. v Component on which you are trying to turn on the identify LED at the location you selected.Step 154A-24 Turn on the power. Step 154A-25 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The card or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-26.”Step 154A-26 Have all the cards or modules listed in “Step 154A-23” been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14” on page 362. 364
406/90
Service
Guide
MAP 154B: Insufficient Hardware Resources Problem Isolation Step 154B-1 Record the error code, location code(s), and word 13 that sent you to this MAP. Step 154B-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154B-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154B-4.”Step 154B-4 Attention: Before replacing any parts, examine the processor subsystem drawer to ensure that: v At least one MCM (processor) module is installed. v At least one memory card is installed. v At least one L3 cache module is installed. v The primary I/O book is installed.Refer to the error code recorded in “Step 154B-1.” When using the listings below, note the error code condition before you replace parts that are called out. Replace the following FRUs, if present, one at a time: Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. v First location code item recorded, if any, in “Step 154B-1.” v Second location code item recorded, if any, in “Step 154B-1.” v Third location code item recorded, if any, in “Step 154B-1.”Note: Replace the following FRUs only if they are associated with the word 13 that you recorded earlier. If the value of word 13, recorded in “Step 154B-1,” is 4110 0000 or 4060 0000, replace the following: v Primary I/O book at U1.18-P1-H2 if not recorded in “Step 154B-1.” If the value of word 13, recorded in “Step 154B-1,” is 403F 0000 or 40FF 0000, replace the following: v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 154B-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 154B-1” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 154B-1” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 154B-1.” If the value of word 13, recorded in “Step 154B-1,” is 402F 0000, replace the following: v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 154B-1.” v MCM module 2 at U1.18-P1-C4 if not recorded in“Step 154B-1.” v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 154B-1.” v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 154B-1.”
Chapter
3.
Maintenance
Analysis
Procedures
365
Step 154B-5 Turn on the power. Step 154B-6 Did the system stop with the same error code as recorded in “Step 154B-1” on page 365? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154B-7.”Step 154B-7 Have all the books or modules listed in “Step 154B-4” on page 365 been replaced? No Go to “Step 154B-2” on page 365. Yes Go to “Step 154B-8.”Step 154B-8 Turn off the power. Step 154B-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154B-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-11” on page 362.Step 154B-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18–P1. Step 154B-12 Turn on the power. Step 154B-13 Did the system stop with the same error code as recorded in “Step 154B-1” on page 365? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154B-14.”Step 154B-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. 366
406/90
Service
Guide
Step 154B-15 Call for support. This ends the procedure. Chapter
3.
Maintenance
Analysis
Procedures
367
368
406/90
Service
Guide
Chapter 4. Checkpoints Checkpoints display on the media subsystem operator panel and the virtual terminal while the system is powering on and going through the initial program load (IPL). This chapter explains the IPL flow of the system and provides a table that lists checkpoints that you might see in the media subsystem operator panel and in a virtual terminal window on the HMC. IPL Flow The IPL process starts when ac power is connected to the system. The IPL process has the following phases: v Phase 1: Service Processor Initialization Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the media subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx codes may also be displayed. Service processor menus are available at the end of this phase by striking any key on the console keyboard. v Phase 2: Hardware Initialization by the Service Processor Phase 2 starts when system power-on is initiated by pressing the power on button on the media subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in this phase, indicates the transition to phase 3 is taking place. v Phase 3: System Firmware initialization On a full system partition, at phase 3, a system processor takes over control and continues initializing partition resources. During this phase, checkpoints in the form Exxx are displayed. E105, the last checkpoint in this phase, indicates that control is being passed to the operating system boot program. On a partitioned system, there is a global systemwide initialization phase 3, during which a system processor continues the initialization process. Checkpoints in this phase are of the form Exxx. This global phase 3 ends with a "LPAR..." on the operator panel. As a logical partition begins a partition-initialization phase 3, one of the system processors assigned to that partition continues initialization of resource assigned to that partition. Checkpoints in this phase are also of the form Exxx. This partition phase 3 ends with an E105 displayed on the partition’s virtual operator panel on the HMC, indicating control has been passed to that logical partition’s operating system boot program. For both the global and partition phase 3, location codes may also be displayed on the physical operator panel and the partition’s virtual terminal, respectively. v Phase 4: Operating System Boot When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This phase ends when the operating system login prompt displays on the operating system console. 369
The following table outlines the IPL phases from power-on to the operating system login prompt, matching the phases to corresponding operator-panel checkpoints for a typical partition boot. Also listed are the approximate boot times for each phase, along with their dependencies for each phase. Note: The following table lists the major checkpoints only.
Phase #
Checkpoints on Operator Panel or HMC
Time in Phase (minutes)
Major Contributors to Time in Phase
Phase 1
8xxx => OK
Approx. 1
Phase 2
9xxx => 91FF
3 to 6
Number of I/O drawers
Phase 3
Exxx => E105
1 to 3
Number of bootable adapters
Phase 4
0xxx or 2xxx => blank
2 to 30+
Number of SSA drivesNumber of SCSI drivesNumber of async sessionsNumber of processorsNumber of adaptersAmount of memory
370
406/90
Service
Guide
Checkpoints enable users and service personnel to know what the server is doing, with some detail, as it initializes. These checkpoints are not intended to be error indicators, but in some cases a server could hang at one of the checkpoints without displaying an 8-character error code. It is for these hang conditions, only, that any action should be taken with respect to checkpoints. The most appropriate action is included with each checkpoint. Before taking actions listed with a checkpoint, check for additional symptoms in the service processor error log. See the “System Information Menu” on page 781 for information on how to access the service processor error log. Note: Go to “MAP 1540: Problem Isolation Procedures” on page 308 for any of the following checkpoint hang conditions: v A four-digit code in the range of 8xxx through EFFF not listed in the checkpoint tables. v A four-digit code is in the checkpoint tables, but does not contain a repair action or FRU listing. v All of the FRUs listed in the repair action have been replaced and the problem has not been corrected.Checkpoints listed in the following tables are in the format 8xxx, 9xxx, Axxx, Bxxx, or Exxx, where x is any hexadecimal digit from 1-9 or A-F. If your system hangs with a checkpoint displayed that begins with anything other than 8, 9, A, B, or E, go to the Diagnostic Information for Multiple Bus Systems. Use the following table to determine where to find the checkpoint.
Symptom
Action
You have a code that is of the form 8xxx, 9xxx, Axxx, or Bxxx.
Go to “Service Processor Checkpoints” on page 372.
You have a code of the form Exxx.
Go to “Firmware Checkpoints” on page 379.
You have a four-character code that is not of the form 8xxx, 9xxx, Axxx, Bxxx, or Exxx.
See the Diagnostic Information for Multiple Bus Systems. Chapter
4.
Checkpoints
371
Service Processor Checkpoints Service processor checkpoints are in the range 8xxx to Bxxx. The message OK indicates successful service processor testing and initialization. Firmware checkpoints are listed in “Firmware Checkpoints” on page 379. Note: A spinning (or twirling) slash displays in the upper-right corner of the operator panel display while some checkpoints are being displayed. If the slash stops spinning, a hang condition is indicated.
Checkpoint
Description
Action/ Possible Failing FRU
8000
Test of the service processor DRAM
See note 1 on page 377.
8004
Verify base code checksum
See note 1 on page 377.
8008
Verify base code mirror checksum
See note 1 on page 377.
8009
Set NVRAM configuration in TITAN
See note 1 on page 377.
800F
Start base code
See note 1 on page 377.
8010
Start supervisor in base code
See note 1 on page 377.
8012
Initialize hardware interrupts
See note 1 on page 377.
8014
Allocate and initialize area in DRAM
See note 1 on page 377.
8016
Initialize debugger and build VPD
See note 1 on page 377.
8018
Initialize service processor interfaces
See note 1 on page 377.
801A
Initialize external FLASH load
See note 1 on page 377.
801B
Initialize and test ISA hub chip
See note 1 on page 377.
801C
Initialize and test SUPER I/O chip
See note 1 on page 377.
801F
Initialize diskette, verify VDASD and start operational loader
See note 1 on page 377.
809x
Related to recovery mode system firmware update from diskette
See note 3 on page 377.
8092
Diskette initialization during recovery mode system firmware update
See note 3 on page 377.
8098
Firmware image being read during recovery mode system firmware update
See note 3 on page 377.
809C
Firmware image being written during recovery mode system firmware update
See note 3 on page 377.
809F
Diskette error during recovery mode system firmware update
See note 3 on page 377.
8100
Start operational loader
See note 1 on page 377.
81xx
Load code into DRAM from VDASD (where xx = number of loads)
See note 1 on page 377.
81FE
Load service processor operating system
See note 1 on page 377.
81FF
Start service processor operating system
See note 1 on page 377.
9000
Primary service processor task initializing
See note 1 on page 377.
9009
System processor IPL has started
See note 1 on page 377.
9010
Primary service processor thread waiting for response
See note 1 on page 377.
9020
Service processor state set to pre-standby
See note 1 on page 377.
9021
Service processor state set to standby
See note 1 on page 377. 372
406/90
Service
Guide
Checkpoint
Description
Action/ Possible Failing FRU
9022
Service processor state set to IPL
See note 1 on page 377.
9023
Service processor state set to runtime
See note 1 on page 377.
9024
Service processor state set to terminate
See note 1 on page 377.
9025
Service processor state set to DPO
See note 1 on page 377.
9030
Waiting for secondary thread acknowledgement
See note 1 on page 377.
9033
I/O base initialization
See note 1 on page 377.
9034
I2C initialization
See note 1 on page 377.
9035
ISA initialization
See note 1 on page 377.
9036
Mailbox initialization
See note1 on page 377.
9037
SLLA initialization (VPD)
See note 1 on page 377.
9038
NVRAM initialization
See note 1 on page 377.
9039
Serial port initialization
See note 1 on page 377.
903A
Serial I/O initialization
See note 1 on page 377.
903B
SPCN initialization
See note 1 on page 377.
9051
End of reset/reload operation: service processor dump has completed
See note 1 on page 377.
9055
Start menus thread
See note 1 on page 377.
9060
Checking/configuring modem on serial port 0
See note 4 on page 377.
9061
Checking/configuring modem on serial port 1
See note 4 on page 377.
906A
Create mailbox processing thread
See note 1 on page 377.
906B
Create menu interface thread
See note 1 on page 377.
906C
Create general thread
See note 1 on page 377.
906D
Create SPCN thread
See note 1 on page 377.
9070
Create thread to control LEDs on I/O subsystems
See note 1 on page 377.
9071
Initialize thread to control LEDs on I/O subsystems
See note 1 on page 377.
9080
Start VPD collection program
See note 1 on page 377.
9081
Start SC VPD data collection
See note 1 on page 377.
9082
I2C/SPCN VPD data collection
See note 1 on page 377.
9083
End SC VPD data collection
See note 1 on page 377.
9084
End I2C/SPCN VPD data collection
See note 1 on page 377.
90FD
Waiting for IBIST to complete
See note 1 on page 377.
9101
Setup load source
See note 1 on page 377.
9102
Open load source
See note 1 on page 377.
9103
Verify open status of load source
See note 1 on page 377.
9104
Waiting for SPCN VPD collection to be completed
See note 1 on page 377.
9105
Load IPL LIDs
See note 1 on page 377. Chapter
4.
Checkpoints
373
Checkpoint
Description
Action/ Possible Failing FRU
9106
Initialize MOPs
See note 1 on page 377.
9107
System flush
See note 1 on page 377.
9108
Processor scan interface BATs and LBIST
See note 1 on page 377.
9109
ABIST
See note 1 on page 377.
910A
Non-processor scan interface BATs and LBIST
See note 1 on page 377.
910B
Wire test
See note 1 on page 377.
910C
Initialize main storage (system memory)
See note 1 on page 377.
910D
Configure processors
See note 1 on page 377.
910E
Configure and initialize I/O
See note 1 on page 377.
910F
Test and initialize main storage (system memory)
See note 1 on page 377.
9110
Free IPL LIDs
See note 1 on page 377.
9111
Load run-time LIDS
See note 1 on page 377.
9112
Load dump LIDs
See note 1 on page 377.
9113
Enable attention
See note 1 on page 377.
9114
Enable function 22
See note 1 on page 377.
9115
Load system IPL program
See note 1 on page 377.
9116
Load system data areas (NACA/PACA)
See note 1 on page 377.
9117
Start system IPL program
See note 1 on page 377.
9119
Waiting for VPD collection to complete
See note 1 on page 377.
911E
Message passing waiting
See note 1 on page 377.
911F
Message passing starting
See note 1 on page 377.
9184
Firmware image being loaded into service processor’s control storage during firmware update.
1. Remove, then reapply power to the system. If the service processor enters recovery mode (indicated by A1FD 0000 in the display), attempt to update the firmware again in recovery mode. If the service processor comes up to the OK prompt, try again to update the firmware using the service processor menus. 2. Call service support.
91C4
Flash update pending, either a firmware update or hypervisor dump.
This checkpoint may be displayed during the firmware update process. The checkpoint may be displayed for a long period of time during the process. If process does not complete, the system may be hung. See note 1 on page 377.
91FF
Control being handed to system processor from service processor
See note 1 on page 377.
9200
Scan interface BATs
See note 1 on page 377.
9300
Logic BIST
See note 1 on page 377.
9301
Scanned ABIST
See note 1 on page 377. 374
406/90
Service
Guide
Checkpoint
Description
Action/ Possible Failing FRU
9302
Dedicated ABIST
See note 1 on page 377.
9303
Dedicated ABIST array fuse repair calculation
See note 1 on page 377.
9380
Built-in-self-test (BIST)
See note 2 on page 377.
9400
Service processor is requesting system flush
See note 1 on page 377.
9410
Service processor is issuing request to start instruction execution
See note 1 on page 377.
9411
Service processor is issuing request to stop instruction execution
See note 1 on page 377.
9420
Service processor is issuing request to start system clocks
See note 1 on page 377.
9421
Service processor is issuing request to stop system clocks
See note 1 on page 377.
94B0
Wire test faulty driver status refcode
See note 2 on page 377.
94B1
Wire test shorted net status refcode
See note 2 on page 377.
94B2
Wire test elastic interface test
See note 2 on page 377.
94BB
Wire test starting
See note 2 on page 377.
9501
IPL diagnostic initialization
See note 2 on page 377.
9502
IPL diagnostic L3 cache march test
See note 2 on page 377.
9503
IPL diagnostic L3 connections test
See note 2 on page 377.
9504
IPL diagnostic L2 cache march test
See note 2 on page 377.
9505
IPL diagnostic RIO wrap test
See note 2 on page 377.
9506
IPL diagnostic cleanup
See note 2 on page 377.
9507
IPL diagnostic test
See note 2 on page 377.
9508
IPL diagnostic main storage march test
See note 2 on page 377.
9509
IPL diagnostic main storage connections test
See note 2 on page 377.
950A
IPL diagnostic elastic interface slack test
See note 2 on page 377.
950B
IPL diagnostic fast initialization to zeros
See note 2 on page 377.
96C2
IPL MOPs processor configuration
See note 2 on page 377.
96C3
IPL MOPs main storage size
See note 2 on page 377.
96C4
IPL MOPs main storage configuration
See note 2 on page 377.
96C6
IPL MOPs I/O configuration
See note 2 on page 377.
96C7
IPL MOPs: enable chip machine checks
See note 2 on page 377.
96E1
Initialize run-time PRD objects and memory
See note 2 on page 377.
96E2
Run-time initialization: enable attention handling
See note 2 on page 377.
96EE
Firmware update module corrupted
See note 3 on page 377. Chapter
4.
Checkpoints
375
Checkpoint
Description
Action/ Possible Failing FRU
99FD
Service processor receiving firmware update module
See note 3 on page 377.Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity.Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump.
99FF
Service processor writing firmware update module
See note 3 on page 377.Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity.Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump.
A800
Start Service Processor dump process
See note 1 on page 377.
A801
Start dump to NVRAM
See note 1 on page 377.
A802
Start dump to debug port
See note 1 on page 377.
A803
NVRAM not usable
See note 1 on page 377.
A804
NVRAM dump done
See note 1 on page 377.
A805
Start dump to flash
See note 1 on page 377.
A806
Flash dump area done
See note 1 on page 377.
A807
Flash area not usable
See note 1 on page 377.
A808
Flash error log to debug port
See note 1 on page 377.
A809
Flash erase start
See note 1 on page 377.
A80A
Flash erase end
See note 1 on page 377.
A80B
Reserved; not used
See note 1 on page 377.
A80C
Reserved; not used
See note 1 on page 377.
A80D
Reserved; not used
See note 1 on page 377.
A80E
Reserved; not used
See note 1 on page 377.
A80F
Service processor dump done
See note 1 on page 377. 376
406/90
Service
Guide
Checkpoint
Description
Action/ Possible Failing FRU
A810-A8FF
Scan log dump in progress
Informational message. (See “Scan Log Dump Policy” on page 776.) The last two characters of the checkpoints will change as the scan log dump progresses. If the last two characters do not change after five minutes, the service processor is hung and must be reset. Follow the procedure outlined in Note 1 following this table. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: v Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel, ORv Attempt to reboot.
B0F5
Delayed power-off sequence has begun
See note 1.
B0FA
Delayed power-off acknowledged
See note 1.
B0FF
Power-off requested program executing
See note 1.
Notes:
1. If the system fails to progress after this checkpoint is put in the display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If not, or the update does not fix the problem, replace the primary I/O book, location: U1.18-P1-H2. d. If this procedure does not fix the problem, call service support.2. If the system fails to progress after this checkpoint is put in the operator display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. Reboot the system in slow mode. (Slow/fast IPL is set using a service processor menu.) If the hang repeats, check with service support to see if there is a system firmware update that fixes this problem. c. If not, or the update does not fix the problem, call service support.3. Recovery mode flash update has failed. Do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel and try the recovery mode flash update again. b. If the system hangs again, make a new set of firmware update diskettes, using new, blank diskettes. Retry the flash update with the new diskettes. c. If the same hang occurs with the new diskettes, replace the following FRUs one at a time in this order: 1) Diskette drive 2) Primary I/O book, location: U1.18-P1-H2.d. If this procedure does not fix the problem, call service support.4. The service processor was unable to configure the modem(s) attached to the serial port(s) on the primary I/O book (a problem with the modem attached to the HMC is not indicated). Do the following: a. Check the modem, modem cable, and serial port for loose or unconnected cables. Reset the service processor using the pinhole reset switch on the system’s operator panel. Chapter
4.
Checkpoints
377
b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If there is not, or the update does not fix the problem, replace the following FRUs, one at a time, in the following order: 1) Modem cable 2) Modemd. If this procedure does not fix the problem, call service support. 378
406/90
Service
Guide
Firmware Checkpoints Firmware uses checkpoints (progress codes) in the range of Exxx to EFFF. These checkpoints occur during system startup and can be useful in diagnosing certain problems. Service processor checkpoints are listed in “Service Processor Checkpoints” on page 372. If you have a checkpoint with no location code associated with it, see “Determining Location Code” on page 415. If a checkpoint has a location code associated with a FRU replacement, see “AIX and Physical Location Code Reference Tables” on page 68 for physical location information. If you replace FRUs and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless otherwise indicated in the tables. Note: If you receive a four-digit code in the range of E1xx to EFFF that is not listed in the following table, go to “MAP 1542: I/O Problem Isolation” on page 318.
Table 2. Firmware Checkpoints
Checkpoint
Description
Action/ Possible Failing FRU
E101
Create RTAS node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E102
Load/Init RTAS
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E105
Transfer control to Operating System (normal boot).
See “Boot Problems” on page 409.
E10A
Load RTAS device tree
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E10B
Set RTAS device properties
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E111
GOOD CRC - jump to composite image
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E120
Initialize I/O and early memory block
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E121
Initialize service processor
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E122
RTAS sensor setup complete
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E130
Build device tree
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E131
Build device tree
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E132
Build device tree, just before setting up root node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E133
Build device tree, just before setting up CPUs
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E134
Create memory node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E135
Configuring memory nodes
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E138
Create options node
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
4.
Checkpoints
379
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E139
Create node aliases and system aliases
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E13A
Create packages node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E140
Operating system load
See “Boot Problems” on page 409.
E145
Early debug code activated; firmware is halted
Press the g key on the serial interface.
E149
Create boot mgr node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E14C
Create terminal-emulator node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E14D
Loading boot image
See “Boot Problems” on page 409.
E150
Create host (primary) PCI controller node
If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318.
E151
Probing primary PCI bus
If a location code is specified, replace the adapter at that location code. If replacing the adapter does not resolve the problem, replace the I/O backplane in which the adapter is plugged. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318.
E152
Probing for adapter FCODE, evaluate if present
Follow the repair action listed for checkpoint E151.
E153
End adapter FCODE, probe/evaluate
Follow the repair action listed for checkpoint E151.
E154
Create PCI bridge node
Follow the repair action listed for checkpoint E151.
E155
Probing PCI bridge secondary bus
Follow the repair action listed for checkpoint E151.
E156
Create PCI Ethernet node
If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318.
E158
System firmware waiting for virtual terminal to be opened on HMC for SMS menu to be displayed
Open a virtual terminal window on the HMC.
E15B
Transferring control to operating system (service mode boot)
See “Boot Problems” on page 409.
E15F
Adapter VPD probe
Follow the repair action listed for checkpoint E151.
E170
Start of PCI Bus Probe
Follow the repair action listed for checkpoint E151.
E172
First pass of PCI device probe
Follow the repair action listed for checkpoint E151. 380
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E174
Establishing host connection
If the system is not connected to an active network, or if the target server is inaccessible (which can result from incorrect IP parameters being specified), the system still attempts to boot and because time-out durations are necessarily long to accommodate retries, the system may appear to hang. Restart the system and get to the SMS utilities menu. In the multi-boot menu, verify: v Is the intended boot device correctly specified in the boot list? v Are the IP parameters correct? v Look at the server configuration for this client. v Attempt to ping the target server using the ping utility in SMS.
E175
BootP request
Follow the repair actions for checkpoint E174.
E176
TFTP file transfer
Follow the repair actions for checkpoint E174.
E177
Transfer failure due to TFTP error condition
See “Boot Problems” on page 409.
E17B
Processor clock speed measurement
Call service support.
E183
Service Processor POST
1. Replace the service processor, location: U1.18-P1-H2. 2. Go to “MAP 1541: JTAG Problem Isolation” on page 310.
E18D
A system reset request has been sent to the service processor
1. Check for system firmware updates. 2. Call service support.
E191
Create ISA reserved node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E199
Evaluating the RIO interface speed set by the user in the service processor menu
1. Check for system firmware updates. Apply if available. 2. Call service support.
E19A
NVRAM auto-boot? variable not found - assume FALSE
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E19B
NVRAM menu? variable not found - assume FALSE
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E19D
Create NVRAM node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E19E
Real-time clock (RTC) initialization
Refer to error code 28030xxx in “Firmware/POST Error Codes” on page 488.
E1A0
User requested boot to SMS menus via keyboard entry
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
4.
Checkpoints
381
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E1A1
User requested boot to open firmware prompt via keyboard entry
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A2
User requested boot using default service mode boot list via keyboard entry
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A3
User requested boot using customized service mode boot list via keyboard entry
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A4
User requested boot to SMS menus via HMC or CSP command
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A5
User requested boot to open firmware prompt via HMC or CSP command
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A6
User requested boot using default service mode boot list via HMC or CSP command
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A7
User requested boot using customized service mode boot list via HMC or CSP command
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A8
System booting to open firmware prompt
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1A9
System booting to SMS menus
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AA
System NVRAM settings during boot
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AB
System booting using default service mode boot list
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AC
System booting using customized service mode boot list
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AD
System booting to operating system
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AE
System booted to multiboot menu via NVRAM settings
1. This checkpoint is posted on the operator panel when system firmware has entered the boot devices menu in SMS because the multiboot flag was turned on. The firmware is waiting for user input. If the firmware console is not open, the boot devices menu will not be seen, and the system will appear to be hung even though it is not. System firmware will only progress past this point when the required input is provided. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1AF
System booted to utilities menu via NVRAM settings
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1B1
Create serial node
Go to “MAP 1540: Problem Isolation Procedures” on page 308. 382
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E1B2
XOFF received - waiting for XON
1. The system firmware is waiting for a virtual terminal to be opened on the HMC. Open a virtual terminal. 2. If a virtual terminal is open, the user might have entered a CTRL-S key sequence to stop the scrolling of data off of the screen. If this is the case, enter a CTRL-Q key sequence to resume scrolling. 3. Check the serial connection between the HMC and the managed system. 4. Reboot the HMC. 5. There may be a hardware problem with the HMC. Refer to the Hardware Management Console Installation and Operations Guide. 6. There may be a hardware problem with the service processor in the managed system. Check the service action event log in Service Focal Point for error codes that indicate a problem with the serial ports on the service processor. The service processor is in the primary I/O book at the U1.18-P1-H2 location. Take the appropriate actions based on the error codes that are found.
E1B3
XON received
This checkpoint will flash by too quickly to be seen on the operator panel. It may be recorded in the progress indicators log, which is accessed using the service processor menus. If a partition hangs on this checkpoint, complete the following: 1. Check for platform firmware updates. Apply if available. 2. Call service support.
E1B6
Probe for (ISA) keyboard
Replace the primary I/O book, Location: U1.18-P1-H2.
E1BD
Probe for (ISA) mouse
Replace the primary I/O book, Location: U1.18-P1-H2.
E1C4
Build ISA timer chip node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1C5
Create ISA interrupt controller (pic) node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1C6
Create DMA node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1D0
Create PCI SCSI node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1D3
Create SCSI block device node (SD)
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
4.
Checkpoints
383
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E1D4
Create SCSI byte device node (ST)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1DB
Create floppy controller (FDC) node
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1DC
Dynamic console selection.
1. If a console is attached, but nothing is displayed on it, go to ?All display problems? in the Entry MAP. 2. If selection screens can be seen on the terminals and the appropriate key on the input device is pressed within 60 seconds but there is no response to the keystroke: a. If selecting the console with a keyboard attached to a USB card, replace the keyboard, then replace the USB card. b. If selecting the console with an ASCII terminal, suspect the terminal. Use the problem determination procedures for the terminal.Note: Terminal setting should be: v 9600 baud v no parity v 8 data bits v 1 stop bit
E1F0
Start O.B.E.
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E1F1
Begin selftest sequence on boot devices.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1F2
Power-on password prompt
The prompt should be visible on the firmware console (usually a VTERM on the HMC). If a console is attached but nothing is displayed on it, go to the “Entry MAP” on page 149 with the symptom ?All display problems.?
E1F3
Privileged-access password prompt
The prompt should be visible on the firmware console (usually a VTERM on the HMC). If a console is attached but nothing is displayed on it, go to the “Entry MAP” on page 149 with the symptom ?All display problems.?
E1F4
End self-test sequence on boot devices; begin SMS.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1F5
Build boot device list
Go to “MAP 1542: I/O Problem Isolation” on page 318. 384
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E1F6
Determine boot device sequence
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1F7
No boot image located
Go to “Boot Problems” on page 409.
E1F8
Partition (system firmware) is waiting for a virtual terminal to be opened on HMC.
Open a virtual terminal for the partition on the HMC. If the partition hangs at this checkpoint after a virtual terminal is opened on the HMC: 1. Replace the service processor/RIO book, location: U1.18-P1-H2 2. Call service support.
E1FA
Building boot device list for SCSI adapters. (The device ID and the device LUN being scanned code are also displayed.)
1. If a location code is available, follow the actions listed for error code 21A0 00xx. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1FB
Scan SCSI bus for attached devices
1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1542: I/O Problem Isolation” on page 318.
E1FC
Building boot device list for SSA adapters. The location code of the SSA adapter being scanned is also displayed.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1FD
The operator panel alternates between the code E1FD and another Exxx code, where Exxx is the point at which the error occurred.
1. If a location code is available, follow the actions listed for error code 21A0 00xx. 2. If no location code is available, go to “MAP 1542: I/O Problem Isolation” on page 318.
E1FE
Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.)
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E1FF
Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.)
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E20F
System firmware has exited to open firmware prompt.
1. Replace the primary I/O book, Location: U1.18-P1-H2. 2. Call service support.
E440
Validate NVRAM, initialize partitions as needed
1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support.
E441
Generate /options node NVRAM configuration variable properties
1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support.
E442
Validate NVRAM partitions
1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support. Chapter
4.
Checkpoints
385
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E443
Generate NVRAM configuration variable dictionary words
Suspect a system firmware problem if problem persists. If a system firmware update is available, install it.
E500
Begin I/O drawer configuration
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E501
Initialize I/O data structures
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E502
Set up I/O data structure pointers
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E503
Initialize location code data structures
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E504
Initialize memory map data structures
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E505
Enable the time base for all RIO hubs
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E506
Reset time base of processors
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E507
I/O hub assignment
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E508
Begin initializing I/O hubs
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E509
Begin configuring EADS chips in I/O subsystems
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E50A
Check for missing I/O interface chips
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E50B
Save I/O hub information
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 386
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E50C
Copy all of the memory ranges
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E50D
Copy all of the I/O data
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E50E
Copy all of the I/O hub information
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E50F
Clean up the error registers
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E51F
End of I/O configuration
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E520
Set up the I/O architecture registers
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E521
Set up interrupts for the current I/O hub
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E522
Set up miscellaneous registers for the current I/O hub
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E523
Initialize data structures and enable all I/O hub ports
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E524
Determine I/O configuration of current I/O hub
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter
4.
Checkpoints
387
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E525
Set up the ETE of the current I/O hub
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E526
Reassign the I/O subsystems to the nearest I/O hub
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E527
Enable the memory map and I/O map for the current I/O hub port
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E528
Begin search for the next available I/O drawer from a current RIO port
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. 388
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E529
Enable the control port and driver of the current RIO hub
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to.
E52A
Search for the next available I/O drawer from a current RIO port
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. Chapter
4.
Checkpoints
389
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E52B
The RIO cable connected to the RIO port has been detected
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to.
E52F
End of configuration of the RIO hub
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E530
Checking I/O hub ports on current I/O hub
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E531
Get I/O hub node ID
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E532
I/O cable detected; enable the I/O hub port
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. 390
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E533
Set up the I/O hub routing table
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E534
Get the node ID of the node on the other end of the I/O cable
1. This error code may indicate that the I/O (RIO) cables are not properly connected to the I/O subsystems. Check the RIO cabling; refer to “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. 2. Replace the RIO cable specified by the location code. 3. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 6. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E535
Valid node ID detected at other end of the I/O cable
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E536
I/O subsystem detected at other end of the I/O cable
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter
4.
Checkpoints
391
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E537
Configuring the I/O subsystem
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E538
Detected the I/O hub at other end of the I/O cable
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E539
Enabling the I/O hub ports
1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card.
E53A
Set up the RIO port speed of the exit port of the I/O subsystem connected to the RIO hub port
1. Replace the RIO cable specified by the location code. 2. Replace the I/O book that the failing drawer is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the RIO cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E53B
End of looking for next node port
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E53C
End of looking for next RIO hub port
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E53D
Disable the anynode mode
1. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 2. Replace the I/O subsytem backplane that the RIO cable specified by the location code is attached to. 392
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E540
Begin initialization of I/O subsystem
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E541
Initialize the routing table
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 4. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E544
Enable the I/O ports
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E545
Set up the device routing address for this new node ID
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E546
Set up the I/O interface chip new node ID
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E547
Set up the I/O interface chip routing table
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter
4.
Checkpoints
393
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E548
Disable any node mode
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E549
Adjust I/O hub table
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E54A
Configure the I/O drawer
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card.
E54B
Update the I/O drawer routing table of RIO ports
1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 394
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E550
Begin to initialize I/O interface chip
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E551
Set up the I/O speed
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E552
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E553
Set up the I/O interface chip base address
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E554
Set up more I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E555
Set up the I/O interface chip I/O link
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter
4.
Checkpoints
395
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E556
Set up and initialize I/O interface chips under current I/O interface chip
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E55F
End of I/O drawer initialization
1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E560
Begin initializing the I/O subsystem
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E561
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E562
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E563
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 396
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E564
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E565
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E566
Enable device arbitration on the I/O interface chip
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E567
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E568
Set up the external interrupt vector register
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E569
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter
4.
Checkpoints
397
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E56A
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E56B
Set up I/O interface chip registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E56C
Set up I/O interface chip PCI bus
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E56D
Identify devices on current bus
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E56E
Begin initializing the service processor control chip
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E56F
End initializing the service processor control chip
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 398
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E570
Initialize the PCI-ISA bridge chip
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E571
Initialize the integrated Ethernet (if present)
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E572
Initialize the integrated SCSI
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E57F
End of I/O interface chip initialization
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E580
Begin initializing the EADS module
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E581
Read the PCI bus configuration header
1. This error code may indicate that the I/O (RIO) cables are not properly connected to the I/O subsystems. Check the RIO cabling; refer to “I/O Book Connector Locations” on page 19and “Subsystem Positioning and Cabling” on page 36. 2. Replace the RIO cable specified by the location code. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter
4.
Checkpoints
399
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E582
EADS has been detected
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E583
Running BIST on the current EADS
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E584
Checking function of current EADS
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E585
Valid function of current EADS detected
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E586
Set up EADS function
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E587
Set up EADS BIST
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 400
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E588
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E589
Set up EADS function
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E58A
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E58B
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E58C
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E58D
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. Chapter
4.
Checkpoints
401
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E58E
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E58F
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E590
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E591
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E592
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E593
Set up EADS function registers
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 402
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E594
Begin probing slots
1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to.
E595
Detected valid adapter in slot
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card.
E596
Read the EADS revision ID and class code
1. Replace the RIO cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem planar to which the RIO cable in Step 1 is attached.
E59E
End of walking bus of current EADS
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card.
E59F
End of EADS initialization
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card.
E5E0
Initialize system processor array
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E1
Reserved
E5E2
Initialization of the hypervisor, phase 1
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E3
Not in quick-restart: load and set up open firmware code in memory
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter
4.
Checkpoints
403
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E5E4
Activate slave processors
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E5
Initialize the hypervisor, phase 2
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E6
Set L2 and processor status
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E7
Save I/O hub information for RTAS and open firmware
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E8
Set up registers
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5E9
Reset the quick restart path
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5EE
Hang state in main control code
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
E5EF
End of low-level firmware
1. Replace the I/O drawer subsystem backplane specified by the location code. 2. Power down, then power up the system using the UEPO switch. 3. Call service support.
E5FE
Get the system’s licensed memory quantity from the service processor
1. Disconnect and then reconnect AC power to the system using the UEPO switch. 2. Verify that the CUoD capacity card is installed and properly seated. 3. Call service support.
E600
SSA PCI adapter open firmware has run successfully.
Replace the adapter.
E601
SSA PCI adapter BIST has started but failed to complete after 4 seconds.
Replace the adapter.
E602
SSA PCI open firmware has started.
Replace the adapter.
E603
SSA PCI adapter BIST has completed with an error.
Replace the adapter.
E604
SSA PCI adapter BIST and subsequent POSTs have completed successfully.
Replace the adapter.
E605
SSA PCI adapter BIST has completed successfully but subsequent POSTS have failed.
Replace the adapter. 404
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E60E
SSA PCI open firmware about to exit (no stack corruption).
Replace the adapter.
E60F
SSA PCI open firmware has run unsuccessfully.
Replace the adapter.
E6FF
SSA PCI open firmware about to exit (with stack corruption).
Replace the adapter.
E700
Create system firmware VPD
System firmware may be corrupted. Reinstall system firmware.
E701
Create memory VPD.
1. Check for system firmware updates. 2. Call for support.
E702
Create processor card VPD.
1. Check for system firmware updates. 2. Call for support.
E703
Create operator panel VPD.
1. Check for system firmware updates. 2. Call for support.
E704
Create processor subsystem VPD.
1. Check for system firmware updates. 2. Call for support.
E705
Create clock card VPD.
1. Check for system firmware updates. 2. Call for support.
E706
Create RIO extender card VPD.
1. Check for system firmware updates. 2. Call for support.
E707
Create system VPD.
1. Check for system firmware updates. 2. Call for support.
E708
Create RIO to PCI adapter VPD.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E709
Create VPD via SPCN.
1. Check for system firmware updates. 2. Call service support.
E70A
Create service processor VPD.
1. Check for system firmware updates. 2. Call for support.
E70B
Create L3 module VPD
1. Check for system firmware updates. 2. Call service support.
E70C
Create I/O backplane VPD (I2C problem)
Go to “MAP 1542: I/O Problem Isolation” on page 318.
E70D
Create capacitor book VPD
1. Check for system firmware updates. 2. Call service support.
E70F
Create CEC VPD
1. Check for system firmware updates. 2. Call service support.
E7EE
Delaying while waiting for power subsystem VPD collection to be completed; informational only
1. Check for system firmware updates. 2. Call service support. Chapter
4.
Checkpoints
405
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E7FF
Successful end of VPD creation
Informational message. If the system hangs here, go to “MAP 1541: JTAG Problem Isolation” on page 310.
E800
RTAS initialization problem
1. Check for system firmware updates. 2. Call service support.
E841
Initializing RTAS/open firmware device tree interface
1. Check for system firmware updates. 2. Call service support.
E842
Initializing RTAS/service processor interface
1. Check for system firmware updates. 2. Replace the primary I/O bookLocation: U1.18-P1-H2. 3. Call service support.
E843
Initializing RTAS/sensor interface
1. Check for system firmware updates. 2. Call service support.
E844
Initializing RTAS/time-of-day clock interface
1. Check for system firmware updates. 2. Replace the primary I/O bookLocation: U1.18-P1-H2. 3. Call service support.
E845
Initializing interface/sensor access
1. Check for system firmware updates. 2. Call service support.
E846
Initializing interface/sensors/led_table access
1. Check for system firmware updates. 2. Call service support.
E84F
Initializing RTAS/operator panel interface
1. Check for system firmware updates. 2. Call service support.
E850
Initializing RTAS/cache interface
1. Check for system firmware updates. 2. Call service support.
E851
Initializing RTAS/hardware access
1. Check for system firmware updates. 2. Call service support.
E852
Initializing RTAS/PCI bus interface
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. Call service support.
E853
Start cable ID request process
1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support. 406
406/90
Service
Guide
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E854
Start collecting cable IDs
1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support.
E855
Start setting RIO link speed
1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support.
E860
Initializing RTAS/power subsystem interface
1. Check for system firmware updates. 2. Call service support.
E861
Initializing RTAS/PCI bus interface
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318 3. Call service support.
E870
Initializing RTAS/logs interface
1. Check for system firmware updates. 2. Call service support.
E871
Initializing RTAS/EEH handling interface
1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. Call service support.
E872
Initializing RTAS/error inject interface
1. Check for system firmware updates. 2. Call service support.
E873
Initializing RTAS/error handling interface
1. Check for system firmware updates. 2. Call service support.
E880
Initializing RTAS/debug interface
1. Check for system firmware updates. 2. Call service support.
E881
Initializing RTAS/utility interface
1. Check for system firmware updates. 2. Call service support.
E882
Initializing RTAS/softpatch register interface
1. Check for system firmware updates. 2. Call service support.
E890
Initializing RTAS/hot-plug interface
1. Check for system firmware updates. 2. Call service support.
E8C1
Initializing interface/client memory access
1. Check for system firmware updates. 2. Call service support. Chapter
4.
Checkpoints
407
Table 2. Firmware Checkpoints (continued)
Checkpoint
Description
Action/ Possible Failing FRU
E8D1
Initializing interface/special memory access
1. Check for system firmware updates. 2. Call service support.
E900
The system firmware is in the recoverable non-maskable interrupt code
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
E901
The system firmware is in the recoverable non-maskable interrupt code
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
EAA1
Probe PCI-PCI bridge bus
Follow the repair actions for checkpoint E151.
408
406/90
Service
Guide
Boot Problems Attention: If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot while other partitions boot successfully and run the operating system successfully. This procedure also addresses the problem in which a system does not boot to a full system partition. All of the system’s resources are available to the diagnostics; the same general procedure applies, with the exception of moving resources between partitions. Attention: It is the customer’s responsibility to move devices between partitions. If a device must be moved to another partition to run standalone diagnostics, contact the customer or system administrator. (If the CD-ROM must be moved to another partition, all SCSI devices connected to that SCSI adapter must be moved because moves are done at the slot level, not at the device level.) Both partitions must be rebooted if a device is moved between partitions. Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device. This is particularly true for tape and network boot attempts. If the checkpoint/code is displayed for an extended time, there may be a problem loading the boot image from the device. If booting from CD-ROM or tape, watch for activity on the drive’s LED indicator. A blinking LED indicates that the loading of either the boot image, or additional information required by the operating system being booted, is still being loaded. For network boot attempts, refer to checkpoint E174. If the checkpoint is displayed for an extended time, there may be a problem with the integrity of the boot image. This procedure assumes that a CD-ROM drive is connected to the system, or that the diagnostics can be run from a NIM (Network Installation Management) server. 1. Look at the service action event-error log in the Service Focal Point on the HMC (see “Service Focal Point” on page 4 for instructions on accessing the service action event log). If there are open entries that affect devices in the boot path of the partition, perform the actions to correct those errors. If there are open entries that indicate problems with I/O cabling, perform the actions to correct those problems. Try to reboot the partition. If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated, then reactivated before attempting to reboot. If a 20EE xxxx error code is reported, the partition is already deactivated and in the error state. Reboot by activating the partition. If the system is running in full system partition, the power must be cycled before the system can be rebooted. If the reboot is still not successful, continue to step 2. 2. Boot to the SMS main menu as follows: v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition and select Boot to SMS, then activate the partition. v If you are rebooting the system in full system partition, start the service processor menus, select option 2, System Power Control Menu, then select option 7, Boot Mode Menu, then select option 1, Boot to SMS menu. Enter 98 to return to the system power control menu, then select option 4, Power-on System. Enter y to confirm.At the SMS main menu, select Multiboot. a. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list: 1) Remove all removable media from devices in the boot list from which you do not want to boot. 2) If attempting to boot from a network, go to step 3 on page 410. If attempting to boot from a disk drive or CD-ROM, go to step 4 on page 410.
Chapter
4.
Checkpoints
409
b. If the intended boot device is not correctly listed in the boot sequence, add it to the boot sequence using the SMS menus. If the device can be added to the boot sequence, reboot the partition. If the intended boot device cannot be added to the boot list, ask the customer or system administrator to verify that the device you are trying to boot from is assigned to the correct partition. If it is, go to step 4.3. If attempting to boot from the network: a. Verify that the IP parameters are correct. b. Attempt to ?ping? the target server using the SMS ping utility. If the ping is not successful, have the network administrator verify the server configuration for this client. c. Check with the network administrator to ensure that the network is up. Also ask the network administrator to verify the settings on the server from which you are trying to boot. d. Check the network cabling to the adapter. e. Restart the partition and retry the boot operation.4. Try to boot and run standalone diagnostics against the devices in the partition, particularly against the intended boot device. Standalone diagnostics can be run from CD-ROM or a NIM server. To boot standalone diagnostics, follow the detailed procedures in “Loading the Standalone Diagnostics from CD-ROM Using an HMC” on page 764 or “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. Attention: Standalone diagnostics may be run on systems configured for either full system partition or partition standby. When attempting to boot diagnostics on a partition from partition standby, the device from which you are booting standalone diagnostics must be made available to the partition that is not able to boot, if it is not already in that partition. Contact the customer or system administrator if a device must be moved between partitions in order to boot standalone diagnostics. If devices are moved between partitions, both partitions must be rebooted. For more information, see “Standalone Diagnostic Operation” on page 764. If standalone diagnostics boot successfully: a. Go to the Task Selection menu, and select Display Configuration and Resource List. If the intended boot device is not listed, ask the customer or system administrator to verify that it is assigned to the correct partition. If it is, and does not appear in the NEW RESOURCE list, go to ?MAP 0290: Missing Resource Problem Resolution? in the Diagnostic Information for Multiple Bus Systems. b. Run diagnostics on the device you want to boot from. If an SRN, not an 8-digit error code, is reported, go to the Diagnostic Information for Multiple Bus Systems. c. If the diagnostics are successful, it may be necessary to perform an operating system-specific recovery process, or reinstall the operating system.If standalone diagnostics do not boot successfully from CD-ROM: a. Verify that the CD-ROM drive is assigned to the correct partition. b. Check the SCSI cabling to the CD-ROM drive. Check the power connector on the CD-ROM. c. It is possible that another SCSI device on the same bus is causing the problem. Do the following: 1) Disconnect the power and signal cables from all other SCSI devices that are attached to the SCSI adapter to which the CD-ROM is attached. 2) Try to boot standalone diagnostics again. 3) If you are still unable to load standalone diagnostics, go to step 5 on page 411. 4) If standalone diagnostics load, one of the disconnected devices is causing the problem. Reconnect the devices one at a time and retry the boot operation until the problem recurs, and replace the device that caused the problem. Go to ?MAP 410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.If standalone diagnostics do not boot successfully from a NIM server: a. Verify that the network adapter is assigned to the correct partition. 410
406/90
Service
Guide
b. Check the network settings and flag settings, as detailed in “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. c. Follow the procedure outlined in step 3 on page 410.5. Go to “MAP 1542: I/O Problem Isolation” on page 318.
Chapter
4.
Checkpoints
411
412
406/90
Service
Guide
Chapter 5. Error Code to FRU Index The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed first. Use this index to help you decide which FRUs to replace when servicing the system. Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call service support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Performing Slow Boot On this system, the error code displayed in the operator panel or in the service action event (SAE) log may not indicate the root cause of the problem. To fully analyze all of the available information, perform the following steps before doing a hardware repair or replacement: 1. Record the 8-character error code (and location code if present) in operator panel. 2. Do a slow-mode boot into the SMS main menu. (A fast-mode boot skips much of the built-in diagnostic testing.) A slow-mode boot may yield a new 8-character error code on the operator panel or new errors in the service processor error log. If a new error code is reported, record it for use in subsequent steps. Slow mode boot is selected through the service processor menus. The procedure is different for systems without an HMC attached versus systems with an HMC attached. v Non-partitioned systems: a. Go to the system power control menu from the service processor main menu. b. Select option 7, the boot mode menu. c. On the boot mode menu, select option 1. This automatically enables Boot to SMS menus. d. Enter 98 to go back to the system power control menu. e. Look at the state of the fast boot enable, option 6. If it is enabled, select option 6 to change it to disabled (disabling fast system boot automatically enables slow mode boot). If it is already disabled, it may have been disabled by the service processor because the previous boot failed. In either case, boot the system by continuing to substep 2f. f. Select option 4 to power on the system. Enter y when prompted. The system will power up and begin the boot process. g. When the system stops at the SMS main menu, use the white power button on the operator panel to power down the system. At this point, the service processor menus should be available to examine the service processor error logs. v Partitioned systems: a. From the HMC, select Partition Manager. b. Highlight the processor subsystem name. c. Right-click the mouse and select Open Terminal Window. d. From the Service Processor Menu on the VTERM select Option 2 System Power Control Menu. 413
e. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. f. Select Option 98 to exit the system power control menu. g. Use the HMC to power up the system to the SMS menus in a full system partition.3. When the system has stopped at the SMS menus, use the HMC to power off the system. At this point, the service processor menus should be available to examine the service processor error log.Confirming Initial Error Code The service processor may have recorded one or more symptoms in its error log. If the system has attempted to boot since an error terminated normal operation, there may be specific fault information in the service processor error log. Examine this error log before proceeding (see “System Information Menu” on page 781). In the service processor error log, the most recent error is at the top of the list and the oldest error is at the bottom. It is also important to look at the time stamp associated with each error. (The time stamps in the service processor error log reflect coordinated universal time (UTC), which is also referred to as Greenwich mean time.) On this system, the error code displayed in the operator panel may not be indicative of the root cause of the problem. This is also true of the most recent error in the service processor error log. Attempt to find the error code that originally sent you to the “Checkpoints and Error Codes Index” on page 416 table. 1. If you cannot find the error code in the service processor error log, start with the error code at the bottom of the list. Proceed to step 3. 2. If you can find the error code in the service processor error log, observe the time stamps and look for a group of failures that happened within minutes prior to this error code. In this group, start at the error code with the earliest time stamp. Proceed to step 3. 3. Examine the list of error codes you obtained from steps 1 or 2, and look for any of the form 4xxB xxxx. Can you find any error codes of the form 4xxB xxxx? No Proceed to step 6. Yes Proceed to step 4.4. Do any of the error codes of the form 4xxB xxxx have the same first two characters on the left as the first two characters on the left of the error code that sent you to the MAP? No Proceed to step 6. Yes Proceed to step 5.5. Adjust the order of the list of error codes you obtained from steps 1 or 2 by placing the error codes of the form 4xxB xxxx with the same first two characters as the error code that sent you to this MAP on the bottom of the list. Proceed to step 6. starting with the codes at the bottom of the adjusted list of error codes. 6. To get a list of possible FRUs, select the detail screen for each error code. Record the listed location codes and their order. Note: If the error code that results from this procedure is 406x 0EA0, 406x 0EA1, 406x 0EA2, or 406x 0EA4, do not go back to the “Checkpoints and Error Codes Index” on page 416 with this error code. Look for a hardware error in the service processor error log of the form 450x yyyy, 460x yyyy, or 4B2x yyyy that occurred just prior to the 406x 0EAy error. Use this hardware error code instead when returning to the “Checkpoints and Error Codes Index” on page 416.Return to the step in the “Checkpoints and Error Codes Index” on page 416 table immediately after the step that sent you to this ?Confirming Initial Error Code? procedure to perform the listed repair actions. 414
406/90
Service
Guide
If the repair actions for an error code are ineffective, continue to work upward through this group of error codes (which is forward in time from the earliest time stamp to the latest) until either the problem is fixed, you reach the error code that was first reported, or you are at the top of the list.Four-Character Checkpoints Four-character checkpoints in the form 8xxx, 9xxx, Axxx, Bxxx, and Exxx are listed in “Firmware Checkpoints” on page 379. If you have a four-character code that begins with any character other than 8, 9, A, B, or E, refer to the Diagnostic Information for Multiple Bus Systems. Replacing the Media Subsystem Operator Panel Licensed programs frequently rely on system information that is stored in the VPD module on the media subsystem’s operator panel. If the MAPs indicate that the operator panel should be replaced, the pluggable VPD module must be swapped. If the old module is defective, or cannot be recovered, contact service support. Replacing the Network Adapter If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the server can be updated. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or server that accesses this system is updated. Determining Location Code If you have an error code in the form 2xxx xxxx or Exxx, and no location code associated with it, the location code may possibly be found in the following places, depending on the state of the system: v For systems with AIX installed, if the system will boot into AIX, the location code will be reported when error log analysis is run under AIX diagnostics. v If the system will boot into the system management services (SMS) menus, the SMS error log will contain the location code associated with the error code. v If the system halted when the error occurred, the error code and location can be found in the LCD progress indicator log, which can be accessed using the service processor menus. See “System Information Menu” on page 781, then select Read Progress Indicator From Last System Boot. Find the error code that directed you to this note in the list, and the location code is part of the entry in the log. Attention: To identify a FRU, these MAPs refer to FRU identify LEDs. For more Information about the FRU identify LEDs see “FRU Identify LEDs” on page 65. Chapter
5.
Error
Code
to
FRU
Index
415
Checkpoints and Error Codes Index The following table is an index to the checkpoints or error codes that you may receive to help diagnose a failure. Locate the code you received, and follow the instructions to determine your next step.
8-Digit Error Codeor4-Digit Checkpoint
What You Should Do
0000 xxxx
Operator Panel Error Codes Go to “Operator Panel Error Codes” on page 419, and follow the instructions in the operator panel error code table.
1xxx xxxx
System Power Control Network (SPCN) Error Codes 1. There may be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step2 below. YES Ask the customer if the system can be powered off, then go to step 1b.b. Can the system be powered off? NO Go to step3 below. YES Power off the system, then Go to step2 below.2. Confirm this is the first error code that caused the failure, as described in “Confirming Initial Error Code” on page 414. 3. Did you come to this step from step 2? NO Go to “SPCN Error Codes” on page 420 with the error codes you already have and follow the instructions in the SPCN error code table. YES Go to “SPCN Error Codes” on page 420 with the error code(s) from step 2 and follow the instructions in the SPCN error code table.
2xxx xxxx
Firmware Error Codes Go to “Firmware/POST Error Codes” on page 488, and follow the instructions in the firmware error code table. 416
406/90
Service
Guide
8-Digit Error Codeor4-Digit Checkpoint
What You Should Do
4xxx xxxx
Service Processor Error Codes 1. Is the system powered up? NO Go to step 2 below. YES Go to step 4 below.2. Find the error code in the service processor error log and check the value of word 13. Go to step 3 below. 3. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310.4. Is the system running Linux? NO Go to step 6 below. YES Go to step 5 below.5. If the system is not functional, go to step 7 below. If the system is functional, do the following: Note: If you have already recorded the value of word 13 in a previous step, continue to answer the question about the word 13 in this susbstep below. Find the value of word 13 in the Linux system log entry. For detailed instructions on finding word 13 in a Linux system log entry, refer to the actions under error code B1xx 4699 in this service guide. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310.6. If the system is not functional, go to step 7. If the system is functional, do the following: Examine the AIX error log entry. Find the value of word 13. For detailed instructions on finding word 13 in a AIX error log entry, refer to the actions under error code B1xx 4699 in this service guide. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310.
4xxx xxxx (Cont.)
Service Processor Error Codes 7. There might be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step 8 below. YES Ask the customer if the system can be powered off, then go to step 7b.b. Can the system be powered off? NO Go to step 10 below. YES Power off the system, then go to step 8. 8. Perform slow boot procedure as described in “Performing Slow Boot” on page 413. 9. Confirm this is the first error code that caused the failure as described in “Confirming Initial Error Code” on page 414. 10. Did you come to this step from step 9? NO Go to “Service Processor Error Codes” on page 516 with the error codes you already have and follow the instructions in the service processor error codes table. YES Go to “Service Processor Error Codes” on page 516 with the error codes from step 9 above and follow the instructions in the service processor error codes table.
Chapter
5.
Error
Code
to
FRU
Index
417
8-Digit Error Codeor4-Digit Checkpoint
What You Should Do
8xxx 9xxx Axxx Bxxx
Service Processor Checkpoints Go to “Service Processor Checkpoints” on page 372 for more information on these checkpoints.
A1FD 000x
The system has entered recovery mode. Go to “Recovery Mode” on page 808.
B006 xxxx B1xx xxxx
Common Firmware Error Codes 1. There may be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step 2 below. YES Ask the customer if the system can be powered off, then go to step 1b below.b. Can the system be powered off? NO Go to step 3 below. YES Power off the system, then go to step 2 below.2. Confirm this is the first error code that caused the failure as described in “Confirming Initial Error Code” on page 414. 3. Did you come to this step from step 2. NO Go to “Common Firmware Error Codes” on page 737 with the error codes you already have and follow the instructions in the Common Firmware error code table. YES Go to “Common Firmware Error Codes” on page 737 with the error code(s) from step 2 and follow the instructions in the Common Firmware error code table.
BBxx xxxx BCxx xxxx
pSeries High Performance Switch Error Codes. Go to 45H/W4 Switch Planning, Installation, and Service Guide, order number GA22-7954.
Exxx
Firmware Checkpoints Go to “Firmware Checkpoints” on page 379 for more information on these checkpoints.
Mxxx xxxx
Problem Determination-Generated Error Codes Go to “Problem Determination-Generated Error Codes” on page 757, and follow the instructions in the error code table.
418
406/90
Service
Guide
Operator Panel Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
Error Code
Description
Action/ Possible Failing FRU
0000 1111
At power on, failure of power-good signal from power supply
1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2
0000 1112
At power off, failure of power-good signal from power supply
1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2
0000 1113
While powered on, failure of power-good signal from power supply
1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2
0000 1114
Operator panel and service processor have been reset after main power disturbance
Informational message
0000 2222
Operator panel not communicating with service processor
1. If system cannot be powered off, reset the service processor (via the pinhole reset or by disconnecting then reconnecting power). 2. Primary I/O book,Location: U1.18-P1-H2
0000 3333
SPCN communications error
Primary I/O book,Location: U1.18-P1-H2
0000 3334
I2C repeater error
Primary I/O book,Location: U1.18-P1-H2
0000 BBBB
Time-of-day battery malfunction
Time-of-day battery Location: U1.18-P1-H2-V1 Chapter
5.
Error
Code
to
FRU
Index
419
SPCN Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the operating system error log, be aware that an error may require an additional 15 minutes to be propagated to the operating system error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
Error Code
Description
Action/ Possible Failing FRU
1010 xxxx
An undefined error has occurred in the power subsystem.
Call service support.
1011 00AC
Last power-down was the result of loss of AC.
Informational message
1011 00CA
Thermal calibration in progress
Has this code been displayed longer than 15 minutes? YES Reboot the system. The code will be cleared when the system successfully reboots. NO Wait until this code has been displayed for 15 minutes to allow thermal calibration to complete. The system will not IPL while thermal calibration is in progress.
1011 3100
I2C Controller
Primary I/O bookLocation: U1.18-P1-H2
1011 3101
1. I2C bus 1 fault 2. Operator panel 3. Operator panel cable
1. Primary I/O bookLocation: U1.18-P1-H2 2. Operator panelLocation: U1.17-L1 3. Operator panel cableLocation: U1.17-L1/Q1#
1011 3102
I2C bus 2 fault
Primary I/O bookLocation: U1.18-P1-H2
1011 3104
I2C bus 4 fault
Primary I/O bookLocation: U1.18-P1-H2
1011 3106
I2C bus 6 fault
Primary I/O bookLocation: U1.18-P1-H2
1011 3113
RIO Bus Adapter
Primary I/O bookLocation: U1.18-P1-H2 420
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1011 3114
PCI Backplane
Was the primary I/O book at location U1.18-P1-H2 changed prior to the most recent reboot of the system? YES This error code may be due to down-level system power control network firmware on the new primary I/O book. The system will automatically update the system power control network firmware to the correct level. This error code should be cleared after the next reboot. NO Replace the primary I/O book, Location: U1.18-P1-H2
1011 3115
Operator Panel
1. Operator panel,location: U1.17-L1 2. Primary I/O book, Location: U1.18-P1-H2
1011 3118
SPCN
Primary I/O bookLocation: U1.18-P1-H2
1011 34A2
SPCN
Primary I/O bookLocation: U1.18-P1-H2
1011 8400
VPD Configuration
Primary I/O bookLocation: U1.18-P1-H2
1011 8401
VPD access problem
1. System VPD module Location: U1.17-L1-N1 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8402
VPD access problem
Primary I/O bookLocation: U1.18-P1-H2
1011 8403
VPD level mismatch
1. System VPD module Location: U1.17-L1-N1 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8404
MCM 2 VPD mismatch
1. MCM 2 and VPD card,Locations: U1.18-P1-C4 and U1.18-P1-H2.4 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8405
MCM 1 VPD mismatch
1. MCM 1 and VPD card,Locations: U1.18-P1-C3 and U1.18-P1-H2.5 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8406
MCM 3 VPD mismatch
1. MCM 3 and VPD card,Locations: U1.18-P1-C2 and U1.18-P1-H2.6 2. Primary I/O bookLocation: U1.18-P1-H2 Chapter
5.
Error
Code
to
FRU
Index
421
Error Code
Description
Action/ Possible Failing FRU
1011 8408
MCM 0 VPD mismatch
1. MCM 0 and VPD card,locations: U1.18-P1-C1 and U1.18-P1-H2.3 2. Primary I/O book, location: U1.18-P1-H2
1011 8409
No valid processor VPD cards found on primary I/O book
1. Verify that the processor VPD cards are properly installed and in the correct slots on the primary I/O book for the MCMs that are installed. 2. Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
1011 840A
VPD +5 stuck high
Primary I/O bookLocation: U1.18-P1-H2
1011 840B
VPD 5 V power-on failure
Primary I/O bookLocation: U1.18-P1-H2
1011 840D
SPCN Configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 840E
SPCN default configuration loaded
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 840F
SPCN Configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8413
Invalid MCM 0 VPD card or MCM
1. Location: U1.18-P1-H2.3 and U1.18-P1-C1See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8414
Invalid MCM 2 VPD card or MCM
1. Location: U1.18-P1-H2.4 and U1.18-P1-C4See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8415
Invalid MCM 1 VPD card or MCM
1. Location: U1.18-P1-H2.5 and U1.18-P1-C3See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8416
Invalid MCM 3 VPD card or MCM
1. Location: U1.18-P1-H2.6 and U1.18-P1-C2See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8423
No MCM 0 VPD card or MCM
1. Location: U1.18-P1-H2.3 and U1.18-P1-C1See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 422
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1011 8424
No MCM 2 VPD card or MCM
1. Location: U1.18-P1-H2.4 and U1.18-P1-C4See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8425
No MCM 1 VPD card or MCM
1. Location: U1.18-P1-H2.5 and U1.18-P1-C3See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8426
No MCM 3 VPD card or MCM
1. Location: U1.18-P1-H2.6 and U1.18-P1-C2See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8430
SPCN configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8431
SPCN configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8440
SPCN configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8441
SPCN configuration mismatch
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1011 8468
Capacitor card fault
1. If fewer than 3 DCAs are installed: a. If no capacitor card is installed in U1.18-P1-V3 or U1.18-P1-V7, install a card in U1.18-P1-V3. b. If a capacitor card is present in either U1.18-P1-V3 or U1.18-P1-V7, reseat it. If the problem is not fixed, replace it.2. If 3 or more DCAs are installed: a. If no capacitor card is installed in U1.18-P1-V3, one is required. b. If a capacitor card is installed in U1.18-P1-V3, reseat it. If the problem is not fixed, replace it.3. If the same error code reoccurs, replace the primary I/O book, location: U1.18-P1-H2. Chapter
5.
Error
Code
to
FRU
Index
423
Error Code
Description
Action/ Possible Failing FRU
1011 8469
Capacitor card fault
1. If no capacitor card is installed in U1.18-P1-V7, install one. The capacitor card is required. 2. If a capacitor card is present in U1.18-P1-V7, reseat it. If the problem is not fixed, replace the capacitor card. 3. If the same error code reoccurs, replace the primary I/O book, location: U1.18-P1-H2.
1011 8700
BPA A ac loss
Go to “MAP 1520: Power” on page 192.
1011 8701
BPA B ac loss
Go to “MAP 1520: Power” on page 192.
1011 8710
BPC A communication failure
1. Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 2. BPC A cable to primary I/O bookLocation: U1.18-P1-H2/Q8# 3. BPC A controllerLocation: U1.35-P1-X4 4. Primary I/O bookLocation: U1.18-P1-H2
1011 8711
BPC B communication failure
1. Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 2. BPC B cable to primary I/O bookLocation: U1.18-P1-H2/Q9# 3. BPC B controllerLocation: U1.35-P2-X4 4. Primary I/O bookLocation: U1.18-P1-H2
1011 8722
BPC A and B are not at standby.
1. BPC A controllerLocation: U1.35-P1-X4 2. BPC B controllerLocation: U1.35-P2-X4
1011 8731
BPC failure
1. Ensure that the red switch on the front of the EPO panel is in the ON position. If there is no external EPO cable, also ensure that the internal toggle switch to the ROOM EPO BYPASS on the bottom of the EPO panel is set to the left before proceeding. 2. BPC A controllerLocation: U1.35-P1-X4and BPC-B controller,Location: U1.35-P2-X4 3. Primary I/O bookLocation: U1.18-P1-H2
1011 8750
BPC download failure on BPA A
1. BPC A controller,Location: U1.35-P1-X4 2. Primary I/O book, Location: U1.18-P1-H2 424
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1011 8751
BPC download failure on BPA B
1. BPC-B controller,Location: U1.35-P2-X4 2. Primary I/O book, Location: U1.18-P1-H2
1011 875A
Informational message.
No power subsystem FRU code update is required.
1011 875B
Informational message.
The power subsystem FRU code update has begun.
1011 875C
Informational message.
The power subsystem FRU code update has ended.
1011 8800
BPC A cable missing
1. BPC A cable to primary I/O bookLocation: U1.18-P1-H2/Q8# 2. BPC A controllerLocation: U1.35-P1-X4 3. Primary I/O bookLocation: U1.18-P1-H2
1011 8801
BPC B cable missing
1. BPC B cable to primary I/O bookLocation: U1.18-P1-H2/Q9# 2. BPC B controllerLocation: U1.35-P2-X4 3. Primary I/O bookLocation: U1.18-P1-H2
1011 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1011 xxxx
SPCN configuration error
1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2
1014 1B17
Problem with fan 1 in first I/O subsystem in primary rack
1. DCA-1Location: U1.9-V1 2. Fan 1Location: U1.9-F1
1014 1B27
Problem with fan 2 in first I/O subsystem in primary rack
1. DCA-2Location: U1.9-V2 2. Fan 2Location: U1.9-F2
1014 1C06
Loss of airflow in first I/O subsystemlocation: U1.9
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1014 6014
Overcurrent or overvoltage problem with 3.3V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6024
Overcurrent or overvoltage problem with 5.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6034
Overcurrent or overvoltage problem with 2.5V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6044
Overcurrent or overvoltage problem with 12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
425
Error Code
Description
Action/ Possible Failing FRU
1014 6054
Overcurrent or overvoltage problem with -12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6814
Overcurrent or overvoltage problem with 3.3V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6824
Overcurrent or overvoltage problem with 5.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6834
Overcurrent or overvoltage problem with 2.5V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6844
Overcurrent or overvoltage problem with 12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 6854
Overcurrent or overvoltage problem with -12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 7017
Problem with fan 3 in first I/O subsystem in primary rack
1. DCA-1Location: U1.9-V1 2. Fan 3Location: U1.9-F3
1014 7027
Problem with fan 4 in first I/O subsystem in primary rack
1. DCA-2Location: U1.9-V2 2. Fan 4Location: U1.9-F4
1014 7117
Problem with fan 1 and fan 3 in first I/O subsystem in primary rack
1. DCA-1Location: U1.9-V1 2. Fans 1 and 3Location: U1.9-F1 and U1.9-F3
1014 7127
Problem with fan 2 and fan 4 in first I/O subsystem in primary rack
1. DCA-2Location: U1.9-V2 2. Fans 2 and 4Location: U1.9-F2 and U1.9-F4
1014 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1014 B014
Overcurrent or overvoltage problem with 3.3V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B024
Overcurrent or overvoltage problem with 5.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B034
Overcurrent or overvoltage problem with 2.5V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B044
Overcurrent or overvoltage problem with 12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B054
Overcurrent or overvoltage problem with -12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B814
Overcurrent or overvoltage problem with 3.3V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B824
Overcurrent or overvoltage problem with 5.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B834
Overcurrent or overvoltage problem with 2.5V in first I/O subsystem
Go to “MAP 1520: Power” on page 192. 426
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1014 B844
Overcurrent or overvoltage problem with 12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 B854
Overcurrent or overvoltage problem with -12.0V in first I/O subsystem
Go to “MAP 1520: Power” on page 192.
1014 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support.
1014 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.9Action: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.9-P1.
1014 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.9Actions: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.9-P2.
1014 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1014 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
427
Error Code
Description
Action/ Possible Failing FRU
1014 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1014 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1014 F4x6
Hardware configuration does not support N-mode power.
Call Service Support
1014 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem, location: U1.9.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1014 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem, location: U1.9.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1014 xx11
First I/O subsystem in primary rack, DCA-1
U1.9-V1
1014 xx13
Seeprom fault on DCA-1 in first I/O subsystem in primary rack
U1.9-V1
1014 xx15
First I/O subsystem in primary rack, DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.9-V1/Q1 and U1.35-P2-X3/Q1 2. U1.9-V1/Q2 and U1.35-P1-X3/Q1
1014 xx17
Problem with fan 1 and fan 3 in first I/O subsystem in primary rack
1. DCA-1Location U1.9–V1 2. Fan 1,Location U1.9–F1 3. Fan 3,Location U1.9–F3
1014 xx21
First I/O subsystem in primary rack, DCA-2
U1.9-V2
1014 xx23
Seeprom fault on DCA-2 in first I/O subsystem in primary rack
U1.9-V2
1014 xx25
First I/O subsystem in primary rack, DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.9-V2/Q1 and U1.35-P2-X3/Q2 2. U1.9-V2/Q2 and U1.35-P1-X3/Q2
1014 xx27
Problem with fan 2 and fan 4 in first I/O subsystem in primary rack
1. DCA-2Location U1.9–V2 2. Fan 2,Location U1.9–F2 3. Fan 4,Location U1.9–F4 428
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1014 xx31
First I/O subsystem in primary rack, I/O subsystem backplane 1 fault
U1.9-P1
1014 xx33
Seeprom fault on I/O subsystem backplane 1 in first I/O subsystem in primary rack
U1.9-P1
1014 xx41
First I/O subsystem in primary rack, I/O subsystem backplane 2 fault
U1.9-P2
1014 xx43
Seeprom fault on I/O subsystem backplane 2 in first I/O subsystem in primary rack
U1.9-P2
1014 xx51
First I/O subsystem in primary rack, DASD backplane 1 fault
U1.9-P3
1014 xx53
Seeprom fault on DASD backplane 1 in first I/O subsystem in primary rack
U1.9-P3
1014 xx61
First I/O subsystem in primary rack, DASD backplane 2 fault
U1.9-P4
1014 xx63
Seeprom fault on DASD backplane 2 in first I/O subsystem in primary rack
U1.9-P4
1014 xx71
First I/O subsystem in primary rack, DASD backplane 3 fault
U1.9-P5
1014 xx73
Seeprom fault on DASD backplane 3 in first I/O subsystem in primary rack
U1.9-P5
1014 xx81
First I/O subsystem in primary rack, DASD backplane 4 fault
U1.9-P6
1014 xx83
Seeprom fault on DASD backplane 4 in first I/O subsystem in primary rack
U1.9-P6
1015 1B17
Problem with fan 1 in second I/O subsystem in primary rack.
1. DCA-1Location: U1.5-V1 2. Fan 1Location: U1.5-F1
1015 1B27
Problem with fan 2 in second I/O subsystem in primary rack.
1. DCA-1Location: U1.5-V2 2. Fan 1Location: U1.5-F2
1015 1C06
Loss of airflow in second I/O subsystem, location: U1.5
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1015 6814
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 6824
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 6834
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 6844
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 6854
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
429
Error Code
Description
Action/ Possible Failing FRU
1015 7017
Problem with fan 3 in second I/O subsystem in primary rack
1. DCA-1Location: U1.5-V1 2. Fan 3Location: U1.5-F3
1015 7027
Problem with fan 4 in second I/O subsystem in primary rack
1. DCA-2Location: U1.5-V2 2. Fan 4Location: U1.5-F4
1015 7117
Problem with fan 1 and fan 3 in second I/O subsystem in primary rack
1. DCA-1Location: U1.5-V1 2. Fans 1 and 3Location: U1.5-F1 and U1.5-F3
1015 7127
Problem with fan 2 and fan 4 in second I/O subsystem in primary rack
1. DCA-2Location: U1.5-V2 2. Fans 2 and 4Location: U1.5-F2 and U1.5-F4
1015 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1015 B014
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B024
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B034
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B044
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B054
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B814
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B824
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B834
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B844
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 B854
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem
Go to “MAP 1520: Power” on page 192.
1015 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 430
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1015 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.5 Action: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.5-P1.
1015 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.5-P2.
1015 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1015 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1015 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1015 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1015 F4x6
Hardware configuration does not support N-mode power.
Call Service Support Chapter
5.
Error
Code
to
FRU
Index
431
Error Code
Description
Action/ Possible Failing FRU
1015 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem, location: U1.5.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1015 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem, location: U1.5.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1015 xx11
Second I/O subsystem in primary rack, DCA-1
U1.5-V1
1015 xx13
Seeprom fault on DCA-1 in second I/O subsystem in primary rack
U1.5-V1
1015 xx15
Second I/O subsystem in primary rack, DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.5-V1/Q1 and U1.35-P2-X3/Q3 2. U1.5-V1/Q2 and U1.35-P1-X3/Q3
1015 xx17
Problem with fan 1 and fan 3 in second I/O subsystem in primary rack
1. DCA-1Location U1.5–V1 2. Fan 1,Location U1.5–F1 3. Fan 3,Location U1.5–F3
1015 xx21
Second I/O subsystem in primary rack, DCA-2
U1.5-V2
1015 xx23
Seeprom fault on DCA-2 in second I/O subsystem in primary rack
U1.5-V2
1015 xx25
Second I/O subsystem in primary rack, DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.5-V2/Q1 and U1.35-P2-X3/Q4 2. U1.5-V2/Q2 and U1.35-P1-X3/Q4
1015 xx27
Problem with fan 2 and fan 4 in second I/O subsystem in primary rack
1. DCA-2Location U1.5–V2 2. Fan 2,Location U1.5–F2 3. Fan 4,Location U1.5–F4
1015 xx31
Second I/O subsystem in primary rack, I/O subsystem backplane 1 fault
U1.5-P1
1015 xx33
Seeprom fault on I/O subsystem backplane 1 in second I/O subsystem in primary rack
U1.5-P1
1015 xx41
Second I/O subsystem in primary rack, I/O subsystem backplane 2 fault
U1.5-P2 432
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1015 xx43
Seeprom fault on I/O subsystem backplane 2 in second I/O subsystem in primary rack
U1.5-P2
1015 xx51
Second I/O subsystem in primary rack, DASD backplane 1 fault
U1.5-P3
1015 xx53
Seeprom fault on DASD backplane 1 in second I/O subsystem in primary rack
U1.5-P3
1015 xx61
Second I/O subsystem in primary rack, DASD backplane 2 fault
U1.5-P4
1015 xx63
Seeprom fault on DASD backplane 2 in second I/O subsystem in primary rack
U1.5-P4
1015 xx71
Second I/O subsystem in primary rack, DASD backplane 3 fault
U1.5-P5
1015 xx73
Seeprom fault on DASD backplane 3 in second I/O subsystem in primary rack
U1.5-P5
1015 xx81
Second I/O subsystem in primary rack, DASD backplane 4 fault
U1.5-P6
1015 xx83
Seeprom fault on DASD backplane 4 in second I/O subsystem in primary rack
U1.5-P6
1016 1B17
Problem with fan 1 in third I/O subsystem in primary rack
1. DCA-1Location: U1.1-V1 2. Fan 1Location: U1.1-F1
1016 1B27
Problem with fan 2 in third I/O subsystem in primary rack
1. DCA-2Location: U1.1-V2 2. Fan 2Location: U1.1-F2
1016 1C06
Loss of airflow in third I/O subsystem, location: U1.1
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1016 6014
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6024
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6034
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6044
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6054
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6814
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6824
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6834
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 6844
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
433
Error Code
Description
Action/ Possible Failing FRU
1016 6854
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 7017
Problem with fan 3 in third I/O subsystem in primary rack
1. DCA-1Location: U1.1-V1 2. Fan 3Location: U1.1-F3
1016 7027
Problem with fan 4 in third I/O subsystem in primary rack
1. DCA-2Location: U1.1-V2 2. Fan 4Location: U1.1-F4
1016 7117
Problem with fan 1 and fan 3 in third I/O subsystem in primary rack
1. DCA-1Location: U1.1-V1 2. Fans 1 and 3Location: U1.1-F1 and U1.1-F3
1016 7127
Problem with fan 2 and fan 4 in third I/O subsystem in primary rack
1. DCA-2Location: U1.1-V2 2. Fans 2 and 4Location: U1.1-F2 and U1.1-F4
1016 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1016 B014
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B024
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B034
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B044
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B054
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B814
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B824
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B834
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B844
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192.
1016 B854
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem
Go to “MAP 1520: Power” on page 192. 434
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1016 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support.
1016 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.1-P1.
1016 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.1-P2.
1016 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1016 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1016 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1016 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
435
Error Code
Description
Action/ Possible Failing FRU
1016 F4x6
Hardware configuration does not support N-mode power.
Call Service Support
1016 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the third I/O subsystem, location: U1.1.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1016 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the third I/O subsystem, location: U1.1.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1016 xx11
third I/O subsystem in primary rack, DCA-1
U1.1-V1
1016 xx13
Seeprom fault on DCA-1 in third I/O subsystem in primary rack
U1.1-V1
1016 xx15
Third I/O subsystem in primary rack, DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.1-V1/Q1 and U1.35-P2-X3/Q5 2. U1.1-V1/Q2 and U1.35-P1-X3/Q5
1016 xx17
Problem with fan 1 and fan 3 in third I/O subsystem in primary rack
1. DCA-1Location U1.1–V1 2. Fan 1,Location U1.1–F1 3. Fan 3,Location U1.1–F3
1016 xx21
Third I/O subsystem in primary rack, DCA-2
U1.1-V2
1016 xx23
Seeprom fault on DCA-2 in third I/O subsystem in primary rack
U1.1-V2
1016 xx25
Third I/O subsystem in primary rack, DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.1-V2/Q1 and U1.35-P2-X3/Q6 2. U1.1-V2/Q2 and U1.35-P1-X3/Q6
1016 xx27
Problem with fan 2 and fan 4 in third I/O subsystem in primary rack
1. DCA-2Location U1.1–V2 2. Fan 2,Location U1.1F2 3. Fan 4,Location U1.1–F4
1016 xx31
Third I/O subsystem in primary rack, I/O subsystem backplane 1 fault
U1.1-P1
1016 xx33
Seeprom fault on I/O subsystem backplane 1 in third I/O subsystem in primary rack
U1.1-P1 436
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1016 xx41
Third I/O subsystem in primary rack, I/O subsystem backplane 2 fault
U1.1-P2
1016 xx43
Seeprom fault on I/O subsystem backplane 2 in third I/O subsystem in primary rack
U1.1-P2
1016 xx51
Third I/O subsystem in primary rack, DASD backplane 1 fault
U1.1-P3
1016 xx53
Seeprom fault on DASD backplane 1 in third I/O subsystem in primary rack
U1.1-P3
1016 xx61
Third I/O subsystem in primary rack, DASD backplane 2 fault
U1.1-P4
1016 xx63
Seeprom fault on DASD backplane 2 in third I/O subsystem in primary rack
U1.1-P4
1016 xx71
Third I/O subsystem in primary rack, DASD backplane 3 fault
U1.1-P5
1016 xx73
Seeprom fault on DASD backplane 3 in third I/O subsystem in primary rack
U1.1-P5
1016 xx81
Third I/O subsystem in primary rack, DASD backplane 4 fault
U1.1-P6
1016 xx83
Seeprom fault on DASD backplane 4 in third I/O subsystem in primary rack
U1.1-P6
1017 1B17
Problem with fan 1 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-1Location: U1.13-V1 2. Fan 1Location: U1.13-F1
1017 1B27
Problem with fan 2 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-2Location: U1.13-V2 2. Fan 2Location: U1.13-F2
1017 1C06
Loss of airflow in fourth I/O subsystem, no IBFlocation: U1.13
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1017 6014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6824
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6834
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
437
Error Code
Description
Action/ Possible Failing FRU
1017 6844
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 6854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 7017
Problem with fan 3 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-1Location: U1.13-V1 2. Fan 3Location: U1.13-F3
1017 7027
Problem with fan 4 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-2Location: U1.13-V2 2. Fan 4Location: U1.13-F4
1017 7117
Problem with fan 1 and fan 3 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-1Location: U1.13-V1 2. Fans 1 and 3Location: U1.13-F1 and U1.13-F3
1017 7127
Problem with fan 2 and fan 4 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-2Location: U1.13-V2 2. Fans 2 and 4Location: U1.13-F2 and U1.13-F4
1017 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1017 B014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B824
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B834
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B844
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192.
1017 B854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF
Go to “MAP 1520: Power” on page 192. 438
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1017 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support.
1017 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.13-P1.
1017 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.13-P2.Go to “MAP 1520: Power” on page 192.
1017 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1017 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1017 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
439
Error Code
Description
Action/ Possible Failing FRU
1017 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1017 F4x6
Hardware configuration does not support N-mode power.
Call Service Support
1017 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem, location: U1.13.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1017 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem, location: U1.13.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1017 xx11
Fourth I/O subsystem in primary rack (no IBF), DCA-1
U1.13-V1
1017 xx13
Seeprom fault on DCA-1 in fourth I/O subsystem (no IBF) in primary rack
U1.13-V1
1017 xx15
Fourth I/O subsystem in primary rack (no IBF)
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V1/Q1 and U1.35-P2-X3/Q7 2. U1.13-V1/Q2 and U1.35-P1-X3/Q7If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V1/Q1 and U1.35-P2-X2/Q9 2. U1.13-V1/Q2 and U1.35-P1-X2/Q9
1017 xx17
Problem with fan 1 and fan 3 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-1Location U1.13-V1 2. Fan 1,Location U1.13-F1 3. Fan 3,Location U1.13-F3
1017 xx21
Fourth I/O subsystem in primary rack (no IBF), DCA-2
U1.13-V2
1017 xx23
Seeprom fault on DCA-2 in fourth I/O subsystem (no IBF) in primary rack
U1.13-V2 440
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1017 xx25
Fourth I/O subsystem in primary rack no IBFs
If there is no DCA in U1.18-P1-V8, primary rack, no IBFs, go to “MAP 1520: Power” on page 192with these pairs of location codes: 1. U1.13-V2/Q1 and U1.35-P2-X3/Q8 2. U1.13-V2/Q2 and U1.35-P1-X3/Q8If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V2/Q1 and U1.35-P2-X2/Q10 2. U1.13-V2/Q2 and U1.35-P1-X2/Q10
1017 xx27
Problem with fan 2 and fan 4 in fourth I/O subsystem (no IBF installed) in primary rack
1. DCA-2Location U1.13-V2 2. Fan 2,Location U1.13-F2 3. Fan 4,Location U1.13-F4
1017 xx31
Fourth I/O subsystem in primary rack, I/O subsystem backplane 1 fault
U1.13-P1
1017 xx33
Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P1
1017 xx41
Fourth I/O subsystem in primary rack, I/O subsystem backplane 2 fault
U1.13-P2
1017 xx43
Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P2
1017 xx51
Fourth I/O subsystem in primary rack, DASD backplane 1 fault
U1.13-P3
1017 xx53
Seeprom fault on DASD backplane 1 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P3
1017 xx61
Fourth I/O subsystem in primary rack, DASD backplane 2 fault
U1.13-P4
1017 xx63
Seeprom fault on DASD backplane 2 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P4
1017 xx71
Fourth I/O subsystem in primary rack, DASD backplane 3 fault
U1.13-P5
1017 xx73
Seeprom fault on DASD backplane 3 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P5
1017 xx81
Fourth I/O subsystem in primary rack, DASD backplane 4 fault
U1.13-P6
1017 xx83
Seeprom fault on DASD backplane 4 in fourth I/O subsystem (no IBF) in primary rack
U1.13-P6
101A 0106
BPA A ac loss in a frame that contains only High Performance Switches (no CECs).
Go to “MAP 1520: Power” on page 192.
101A 0856
BPR 1A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101A 0866
BPR 2A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. Chapter
5.
Error
Code
to
FRU
Index
441
Error Code
Description
Action/ Possible Failing FRU
101A 0876
BPR 3A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101A 0D06
350 volt bulk failure, BPA-A
Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203.
101A 1C06
Cage airflow missing (temperature problem)
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
101A 3F06
Invalid I/O subsystem configurations
1. Verify that the I/O subsystems are properly cabled. Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 2. Call service support.
101A 7A56
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7A66
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7A76
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7A86
IBF 1A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7A96
IBF 2A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7AA6
IBF 3A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7B56
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7B66
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7B76
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7B86
IBF 1A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7B96
IBF 2A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A 7BA6
IBF 3A failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101A CE16
BPC-BPC communication failure
1. Check the cable between the two BPCs; reseat and look for bent pins. 2. Replace BPC-A, location: U1.35-P1-X4 3. Replace BPC-B, location: U1.35-P2-X4.
101A D216
BPA-BPA communication failure
Go to “MAP 1520: Power” on page 192 for isolation. 442
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101A D616
Problem with external UPS: utility failure
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A D716
Problem with external UPS: battery low
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A D756
Informational message about the IBF.
Informational message.
101A D766
Informational message about the IBF.
Informational message.
101A D776
Informational message about the IBF.
Informational message.
101A D816
External UPS: bypass active (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A D916
External UPS: power restored (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A DA16
External UPS: installed (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A DB16
Problem with external UPS: UPS not available
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A DC16
External UPS: battery not low (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A DC56
Informational message about the IBF.
Informational message. Chapter
5.
Error
Code
to
FRU
Index
443
Error Code
Description
Action/ Possible Failing FRU
101A DC66
Informational message about the IBF.
Informational message.
101A DC76
Informational message about the IBF.
Informational message.
101A DD16
External UPS: bypass ended (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A DF16
Problem with external UPS: UPS failure
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101A E106
Critical logic over temperature
Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211.
101A E206
Ambient room temperature too high
1. The ambient room temperature has reached a warning level (30 C) or a critical level (36 C). Ask the customer to correct the room temperature problem. 2. Call service support.
101A F4x6
Hardware configuration does not support N-mode power
Call service support.
101A F806
BPC A: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101A F906
BPC A: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101A F806
BPC A: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101A F906
BPC A: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101A FA06
Problem with EPO switch detected by BPC-A
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101A FDB6
BPC A: open room EPO switch has been detected
Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201.
101A FE06
UEPO switch on BPC A is in the bypass position
Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199.
101A FE16
UEPO switch on BPC A is in the bypass position
Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199.
101A FE56
BPR 1A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101A FE66
BPR 2A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 444
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101A FE76
BPR 3A communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101A FFx6
Cabling error on the BPC or a BPD that is not isolated
There is a cabling problem on a BPC or BPD that cannot be isolated.The x is decoded as follows:If x=1, the problem is on the BPC, location: U1.35-P1-X4If x=2, the problem is on BPD-1, location: U1.35-P1-X3 If x=3, the problem is on BPD-2, location: U1.35-P1-X2If x=4, the problem is on BPD-3, location: U1.35-P1-X1 1. Check the UPIC cabling on the BPC or BPD specified by decoding the x as shown above. Correct any cabling problems that are found. 2. If no problems are found on the A side of the BPE, check the UPIC cabling on the BPC or BPD in the same position on the B side of the BPE for cabling errors. Correct and problems that are found. 3. If both sides of the BPE are cabled correctly, call service support.
101A x115
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q1# to U1.18-X1/Q1#
101A x125
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q1# to U1.9-V1/Q2#
101A x135
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q1# to U2.1-V1/Q1#
101A x215
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q2# to U1.18-X2/Q1#
101A x225
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q2# to U1.9-V2/Q2#
101A x235
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q2# to U2.1-V2/Q1#
101A x315
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q3# to U1.18-X3/Q1#
101A x325
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q3# to U1.5-V1/Q2#
101A x335
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q3# to U2.5-V1/Q1# Chapter
5.
Error
Code
to
FRU
Index
445
Error Code
Description
Action/ Possible Failing FRU
101A x415
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q4# to U1.18-X4/Q1#
101A x425
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q4# to U1.5-V2/Q2#
101A x435
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q4# to U2.5-V2/Q1#
101A x515
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q5# to U1.18-P1-V1/Q1#
101A x525
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q5# to U1.1-V1/Q2#
101A x535
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q5# to U2.9-V1/Q1#
101A x615
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q6# to U1.18-P1-V2/Q1#
101A x625
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q6# to U1.1-V2/Q2#
101A x635
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q6# to U2.9-V2/Q1#
101A x715
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q7# to U1.18-P1-V4/Q1#
101A x725
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X3/Q7# to U1.13-V1/Q2#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q7# to U1.18-P1-V8/Q1#
101A x735
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q7# to U2.13-V1/Q1#
101A x825
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q8# to U1.13-V2/Q2#If there is a 6th DCA in U1.18-P1-V8, this error code will not be reported.
101A x835
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q8# to U2.13-V2/Q1# 446
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101A x925
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q9# to U1.18-P1-V5/Q1#
101A x935
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q9# to U2.19-V1/Q1# If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q9# to U1.13-V1/Q1#
101A x945
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P2-X1/Q9# to U2.19-V1/Q1#
101A xA25
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q10# to U1.18-P1-V6/Q1#
101A xA35
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q10# to U2.19-V2/Q1#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q10# to U1.13-V2/Q1#
101A xA45
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X1/Q10# to U2.19-V2/Q1#
101A xx0x
BPA A-FRU not isolated
U1.35
101A xx11
BPA-A / BPC
U1.35-P1-X4
101A xx13
Seeprom (VPD) fault on BPC-A
U1.35-P1-X4
101A xx17
Fan fault on BPC-A (BPA-A fan)
1. BPE-A side fan Location: U1.35-P1-F1 2. BPC-A Location: U1.35-P1-X4
101A xx21
BPA-A / BPD-1
U1.35-P1-X3
101A xx22
BPC-A to BPD-1 communications fault
1. BPD-1 Location: U1.35-P1-X3 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx23
Seeprom (VPD) fault on BPD-1A
U1.35-P1-X3
101A xx31
BPA-A / BPD-2
U1.35-P1-X2 Chapter
5.
Error
Code
to
FRU
Index
447
Error Code
Description
Action/ Possible Failing FRU
101A xx32
BPC-A to BPD-2 communications fault
1. BPD-2 Location: U1.35-P1-X2 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx33
Seeprom (VPD) fault on BPD-2A
U1.35-P1-X2
101A xx41
BPA-A / BPD-3
U1.35-P1-X1
101A xx42
BPC-A to BPD-3 communications fault
1. BPD-3 Location: U1.35-P1-X1 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx43
Seeprom (VPD) fault on BPD-3A
U1.35-P1-X1
101A xx51
BPA-A / BPR-1
U1.35-P1-V1
101A xx52
BPC-A to BPR-1 communications fault
1. BPR-1 Location: U1.35-P1-V1 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx53
Seeprom (VPD) fault on BPR-1A
U1.35-P1-V1
101A xx61
BPA-A / BPR-2
U1.35-P1-V2
101A xx62
BPC-A to BPR-2 communications fault
1. BPR-2 Location: U1.35-P1-V2 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx63
Seeprom (VPD) fault on BPR-2A
U1.35-P1-V2
101A xx71
BPA-A / BPR-3
U1.35-P1-V3
101A xx72
BPC-A to BPR-3 communications fault
1. BPR-3 Location: U1.35-P1-V3 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1
101A xx73
Seeprom (VPD) fault on BPR-3A
U1.35-P1-V3
101A xx81
BPA-A / IBF 1
U1.13-P1-V1
101A xx83
Seeprom (VPD) fault on IBF-1A
U1.13-P1-V1
101A xx91
BPA-A / IBF 2
U1.15-P1-V2
101A xx93
Seeprom (VPD) fault on IBF-2A
U1.15-P1-V2
101A xxA1
BPA-A / IBF 3
U2.17-P1-V5
101A xxA3
Seeprom (VPD) fault on IBF-3A
U2.17-P1-V5 448
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101A xxB1
BPA-A / EPO_SW
1. Replace the UEPO switch,Location: U1-V1 2. Check the cable from BPC-A to the UEPO switch, Location U1-V1/Q1#
101B 0106
BPA B ac loss in a frame that contains only High Performance Switches (no CECs).
Go to “MAP 1520: Power” on page 192.
101B 0856
BPR 1B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101B 0866
BPR 2B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101B 0876
BPR 3B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101B 0D06
350 volt bulk failure, BPA-B
Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203.
101B 1C06
Cage airflow missing (temperature problem)
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
101B 3F06
Invalid I/O subsystem configuration
1. Verify that the I/O subsystems are properly cabled. Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 2. Call service support.
101B 7A56
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7A66
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7A76
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7A86
IBF 1B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7A96
IBF 2B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7AA6
IBF 3B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7B56
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7B66
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7B76
IBF failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7B86
IBF 1B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B 7B96
IBF 2B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. Chapter
5.
Error
Code
to
FRU
Index
449
Error Code
Description
Action/ Possible Failing FRU
101B 7BA6
IBF 3B failure
Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206.
101B CE16
BPC-BPC communication failure
1. Check the cable between the two BPCs; reseat and look for bent pins. 2. Replace BPC-A, location: U1.35-P1-X4 3. Replace BPC-B, location: U1.35-P2-X4.
101B D216
BPA-BPA communication failure
Go to “MAP 1520: Power” on page 192 for isolation.
101B D616
Problem with external UPS: utility failure
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B D716
Problem with external UPS: battery low
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B D716
Informational message about the IBF.
Informational message.
101B D766
Informational message about the IBF.
Informational message.
101B D776
Informational message about the IBF.
Informational message.
101B D816
External UPS: bypass active (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B D916
External UPS: power restored (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B DA16
External UPS: installed (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B DB16
Problem with external UPS: UPS not available
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 450
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101B DC16
External UPS: battery not low (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B DC56
Informational message about the IBF.
Informational message.
101B DC66
Informational message about the IBF.
Informational message.
101B DC76
Informational message about the IBF.
Informational message.
101B DD16
External UPS: bypass ended (informational message)
Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B DF16
Problem with external UPS: UPS failure
1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support.
101B E106
Critical logic over temperature
Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211.
101B E206
Ambient room temperature too high
1. The ambient room temperature has reached a warning level or a critical level. Ask the customer to correct the room temperature problem. 2. Call service support.
101B F4x6
Hardware configuration does not support N-mode power
Call service support.
101B F806
BPC B: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101B F906
BPC B: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101B FA06
BPC B: loss of ac power or phase missing
Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214.
101B FDB6
BPC B: open room EPO switch has been detected
Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201.
101B FE06
UEPO switch on BPC B is in the bypass position
Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199.
101B FE16
UEPO switch on BPC B is in the bypass position
Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199.
101B FE56
BPR 1B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. Chapter
5.
Error
Code
to
FRU
Index
451
Error Code
Description
Action/ Possible Failing FRU
101B FE66
BPR 2B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101B FE76
BPR 3B communications fault
Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200.
101B FFx6
Cabling error on the BPC or a BPD that is not isolated
There is a cabling problem on a BPC or BPD that cannot be isolated.The x is decoded as follows:If x=1, the problem is on the BPC, location: U1.35-P2-X4If x=2, the problem is on BPD-1, location: U1.35-P2-X3 If x=3, the problem is on BPD-2, location: U1.35-P2-X2If x=4, the problem is on BPD-3, location: U1.35-P2-X1 1. Check the UPIC cabling on the BPC or BPD specified by decoding the x as shown above. Correct any cabling problems that are found. 2. If no problems are found on the A side of the BPE, check the UPIC cabling on the BPC or BPD in the same position on the B side of the BPE for cabling errors. Correct and problems that are found. 3. If both sides of the BPE are cabled correctly, call service support.
101B x115
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q1# to U1.18-X1/Q2#
101B x125
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q1# to U1.9-V1/Q1#
101B x135
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q1# to U2.1-V1/Q2#
101B x215
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q2# to U1.18-X2/Q2#
101B x225
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q2# to U1.9-V2/Q1#
101B x235
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q2# to U2.1-V2/Q2#
101B x315
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q3# to U1.18-X3/Q2#
101B x325
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q3# to U1.5-V1/Q1# 452
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101B x335
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q3# to U2.5-V1/Q2#
101B x415
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q4# to U1.18-X4/Q2#
101B x425
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q4# to U1.5-V2/Q1#
101B x435
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q4# to U2.5-V2/Q2#
101B x515
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q5# to U1.18-P1-V1/Q2#
101B x525
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q5# to U1.1-V1/Q1#
101B x535
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q5# to U2.9-V1/Q2#
101B x615
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q6# to U1.18-P1-V2/Q2#
101B x625
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q6# to U1.1-V2/Q1#
101B x635
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q6# to U2.9-V2/Q2#
101B x715
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q7# to U1.18-P1-V4/Q2#
101B x725
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q7# to U1.13-V1/Q1#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q7# to U1.18-P1-V8/Q2#
101B x735
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q7# to U2.13-V1/Q2#
101B x825
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q8# to U1.13-V2/Q1# If there is a 6th DCA in U1.18-P1-V8, this error code will not be reported. Chapter
5.
Error
Code
to
FRU
Index
453
Error Code
Description
Action/ Possible Failing FRU
101B x835
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q8# to U2.13-V2/Q2#
101B x925
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q9# to U1.18-P1-V5/Q2#
101B x935
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q9# to U2.19-V1/Q2# If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q9# to U1.13-V1/Q2#
101B x945
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X1/Q9# to U2.19-V1/Q2#
101B xA25
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q10# to U1.18-P1-V6/Q2#
101B xA35
Power cable fault
If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X2/Q10# to U2.19-V2/Q2# If there is a 6th DCA in U1.18-P1-V8, go to“MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q10# to U1.13-V2/Q2#
101B xA45
Power cable fault
Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X1/Q10# to U2.19-V2/Q2#
101B xx0x
BPA B-FRU not isolated
U1.35
101B xx11
BPA-B / BPC
U1.35-P2-X4
101B xx13
Seeprom (VPD) fault on BPC-B
U1.35-P2-X4
101B xx17
Fan fault on BPC-B (BPA-B fan)
1. BPE-B side fan Location: U1.35-P2-F1 2. BPC-B Location: U1.35-P2-X4
101B xx21
BPA-B / BPD-1
U1.35-P2-X3
101B xx22
BPC-B to BPD-1 communications fault
1. BPD-1 Location: U1.35-P2-X3 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx23
Seeprom (VPD) fault on BPD-1B
U1.35-P2-X3
101B xx31
BPA-B / BPD-2
U1.35-P2-X2 454
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101B xx32
BPC-B to BPD-2 communications fault
1. BPD-2 Location: U1.35-P2-X2 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx33
Seeprom (VPD) fault on BPD-2B
U1.35-P2-X2
101B xx41
BPA-B / BPD-3
U1.35-P2-X1
101B xx42
BPC-B to BPD-3 communications fault
1. BPD-3 Location: U1.35-P2-X1 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx43
Seeprom (VPD) fault on BPD-3B
U1.35-P2-X1
101B xx51
BPA-B / BPR-1
U1.35-P2-V1
101B xx52
BPC-B to BPR-1 communications fault
1. BPR-1 Location: U1.35-P2-V1 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx53
Seeprom (VPD) fault on BPR-1B
U1.35-P2-V1
101B xx61
BPA-B / BPR-2
U1.35-P2-V2
101B xx62
BPC-B to BPR-2 communications fault
1. BPR-2 Location: U1.35-P2-V2 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx63
Seeprom (VPD) fault on BPR-2B
U1.35-P2-V2
101B xx71
BPA-B / BPR-3
U1.35-P2-V3
101B xx72
BPC-B to BPR-3 communications fault
1. BPR-3 Location: U1.35-P2-V3 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2
101B xx73
Seeprom (VPD) fault on BPR-3B
U1.35-P2-V3
101B xx81
BPA-B / IBF 1
U1.13-P2-V3
101B xx83
Seeprom (VPD) fault on IBF-1B
U1.13-P2-V3
101B xx91
BPA-B / IBF 2
U1.15-P2-V4
101B xx93
Seeprom (VPD) fault on IBF-2B
U1.15-P2-V4
101B xxA1
BPA-B / IBF 3
U2.17-P2-V6
101B xxA3
Seeprom (VPD) fault on IBF-3B
U2.17-P2-V6 Chapter
5.
Error
Code
to
FRU
Index
455
Error Code
Description
Action/ Possible Failing FRU
101B xxB1
BPA-B / EPO_SW
1. Replace the UEPO switch,Location: U1-V1 2. Check the cable from BPC-B to the UEPO switch, Location U1-V1/Q2#
101C 0FB6
MCM 0 running over temperature
1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support.
101C 0FC6
MCM 1 running over temperature
1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support.
101C 0FD6
MCM 2 running over temperature
1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support.
101C 0FE6
MCM 3 running over temperature
1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support.
101C 1C06
Loss of airflow in processor subsystem resulting in over-temperature
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209 with location U1.18. 456
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101C 5Ax6
A DCA FRU whose firmware is too down-level to function with the power subsystem has been installed in the CEC. For the DCA to function, the power subsystem controller must download code to it. (The x indicates which DCA has the down-level firmware.)
If the system is powered up: 1. Wait approximately 30 minutes, then press the green start-of-service button followed by the white end-of-service button. 2. Look at the service action event log in Service Focal Point. Verify that the 101C 5Ax6 error code did not reoccur; if it did not, the update was successful. If the 101C 5Ax6 error code reoccurs, call service support. If the system is powered down: a. Wait approximately 30 minutes. b. Look at the service action event log in Service Focal Point. Verify that the 101C 5Ax6 error code did not reoccur after 30 minutes. If it did not, the update was successful. If the 101C 5Ax6 error code reoccurs, call service support.
101C 6014
Overcurrent or overvoltage problem with 2.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6114
Overcurrent or overvoltage problem with 1.8V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6124
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6134
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6144
Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator)
Go to “MAP 1520: Power” on page 192.
101C 6514
Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby)
Go to “MAP 1520: Power” on page 192.
101C 6814
Overcurrent or overvoltage problem with 2.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6914
Overcurrent or overvoltage problem with 1.8V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6924
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6934
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C 6944
Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator)
Go to “MAP 1520: Power” on page 192.
101C 6D14
Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby)
Go to “MAP 1520: Power” on page 192.
101C 7E06
No MCMs detected OR the actual number of MCMs does not match the expected number of MCMs
1. Verify that the processor MCMs are plugged in a valid configuration. 2. Verify that the processor MCMs are seated properly. 3. Verify the processor VPD cards in the primary I/O book are of the proper number and are positioned and seated correctly. 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
457
Error Code
Description
Action/ Possible Failing FRU
101C 7EB6
MCM 0 running over temperature
1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support.
101C 7EC6
MCM 1 running over temperature
1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support.
101C 7ED6
MCM 2 running over temperature
1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support.
101C 7EE6
MCM 3 running over temperature
1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support.
101C 7F06
No MCMs detected OR the actual number of MCMs does not match the expected number of MCMs
Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 1. Verify that the processor MCMs are plugged in a valid configuration. 2. Verify that the processor MCMs are seated properly. 3. Verify the processor VPD cards in the primary I/O book are of the proper number and are positioned and seated correctly. 4. Call service support.
101C B014
Overcurrent or overvoltage problem with 2.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B114
Overcurrent or overvoltage problem with 1.8V in processor subsystem
Go to “MAP 1520: Power” on page 192. 458
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101C B124
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B134
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B144
Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator)
Go to “MAP 1520: Power” on page 192.
101C B544
Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby)
Go to “MAP 1520: Power” on page 192.
101C B814
Overcurrent or overvoltage problem with 2.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B914
Overcurrent or overvoltage problem with 1.8V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B924
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B934
Overcurrent or overvoltage problem with 1.5V in processor subsystem
Go to “MAP 1520: Power” on page 192.
101C B944
Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator)
Go to “MAP 1520: Power” on page 192.
101C BD44
Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby)
Go to “MAP 1520: Power” on page 192.
101C E006
Processor over-temperature detected
Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211.
101C E106
Processor over-temperature detected
Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211.
101C E1B6
MCM 0 running over temperature
1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support.
101C E1C6
MCM 1 running over temperature
1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
459
Error Code
Description
Action/ Possible Failing FRU
101C E1D6
MCM 2 running over temperature
1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support.
101C E1E6
MCM 3 running over temperature
1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support.
101C F206
Error not isolated; VPD on BPR may be damaged
Call service support.
101C F306
The power subsystem does not recognize the processor subsystem type or processor subsystem configuration
Call service support.
101C F406
Hardware configuration cannot support N-mode power
Call service support.
101C F606
The power subsystem does not recognize the processor subsystem type or processor subsystem configuration
Call service support.
101C F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 460
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101C F536
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F546
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F556
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F566
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F576
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F586
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. Chapter
5.
Error
Code
to
FRU
Index
461
Error Code
Description
Action/ Possible Failing FRU
101C F596
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C F5A6
An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
101C xx11
Processor subsystem / DCA-1
U1.18-P1-V1
101C xx13
Seeprom (VPD) fault on processor subsystem DCA-1
U1.18-P1-V1
101C xx15
Processor subsystem / DCA-1 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V1/Q1 and U1.35-P1-X4/Q5 2. U1.18-P1-V1/Q2 and U1.35-P2-X4/Q5
101C xx21
Processor subsystem / DCA-2
U1.18-P1-V2
101C xx23
Seeprom (VPD) fault on processor subsystem DCA-2
U1.18-P1-V2
101C xx25
Processor subsystem / DCA-2 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V2/Q1 and U1.35-P1-X4/Q6 2. U1.18-P1-V2/Q2 and U1.35-P2-X4/Q6
101C xx31
Processor subsystem / DCA-3
U1.18-P1-V4
101C xx33
Seeprom (VPD) fault on processor subsystem DCA -3
U1.18-P1-V4
101C xx35
Processor subsystem / DCA-3 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V4/Q1 and U1.35-P1-X4/Q7 2. U1.18-P1-V4/Q2 and U1.35-P2-X4/Q7
101C xx41
Processor subsystem / DCA-4
U1.18-P1-V5
101C xx43
Seeprom (VPD) fault on processor subsystem DCA-4
U1.18-P1-V5
101C xx45
Processor subsystem / DCA-4 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V5/Q1 and U1.35-P1-X3/Q9 2. U1.18-P1-V5/Q2 and U1.35-P2-X3/Q9
101C xx51
Processor subsystem / DCA-5
U1.18-P1-V6
101C xx53
Seeprom (VPD) fault on processor subsystem DCA-5
U1.18-P1-V6 462
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
101C xx55
Processor subsystem / DCA-5 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V6/Q1 and U1.35-P1-X3/Q10 2. U1.18-P1-V6/Q2 and U1.35-P2-X3/Q10
101C xx61
Processor subsystem / DCA-6 (if installed)
U1.18-P1-V8
101C xx63
Seeprom (VPD) fault on processor subsystem DCA-6
U1.18-P1-V8
101C xx65
Processor subsystem/DCA-6 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V8/Q1 and U1.35-P1-X3/Q7 2. U1.18-P1-V8/Q2 and U1.35-P2-X3/Q7
101C xx71
Processor subsystem / MDA-1
U1.18-X1
101C xx73
Seeprom (VPD) fault on processor subsystem MDA -1
U1.18-X1
101C xx75
Processor subsystem / MDA-1 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X1/Q1 and U1.35-P1-X4/Q1 2. U1.18-X1/Q2 and U1.35-P2-X4/Q1
101C xx77
Fan fault on processor subsystem fan 1
1. Fan 1 controllerLocation: U1.18-X1 2. Fan 1 Location: U1.18-F1
101C xx81
Processor subsystem / MDA-2
U1.18-X2
101C xx83
Seeprom (VPD) fault on processor subsystem MDA-2
U1.18-X2
101C xx85
Processor subsystem / MDA-2 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X2/Q1 and U1.35-P1-X4/Q2 2. U1.18-X2/Q2 and U1.35-P2-X4/Q2
101C xx87
Fan fault on processor subsystem fan 2
1. Fan 2 controllerLocation: U1.18-X2 2. Fan 2 Location: U1.18-F2
101C xx91
Processor subsystem / MDA-3
U1.18-X3
101C xx93
Seeprom (VPD) fault on processor subsystem MDA-3
U1.18-X3
101C xx95
Processor subsystem / MDA-3 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X3/Q1 and U1.35-P1-X4/Q3 2. U1.18-X3/Q2 and U1.35-P2-X4/Q3
101C xx97
Fan fault on processor subsystem fan 3
1. Fan 3 controllerLocation: U1.18-X3 2. Fan 3 Location: U1.18-F3
101C xxA1
Processor subsystem / MDA-4
U1.18-X4 Chapter
5.
Error
Code
to
FRU
Index
463
Error Code
Description
Action/ Possible Failing FRU
101C xxA3
Seeprom (VPD) fault on processor subsystem MDA-4
U1.18-X4
101C xxA5
Processor subsystem / MDA-4 cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X4/Q1 and U1.35-P1-X4/Q4 2. U1.18-X4/Q2 and U1.35-P2-X4/Q4
101C xxA7
Fan fault on processor subsystem fan 4
1. Fan 4 controllerLocation: U1.18-X4 2. Fan 4 Location: U1.18-F4
101F xxxx
Power fault on high performance switch
Go to the 45H/W4 Switch Planning, Installation, and Service Guide.
1021 1B17
Problem with fan 1 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-1Location: U2.1-V1 2. Fan 1Location: U2.1-F1
1021 1B27
Problem with fan 2 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-2Location: U2.1-V2 2. Fan 2Location: U2.1-F2
1021 1C06
Loss of airflow in first I/O subsystem in second rack, location: U2.1
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1021 6014
Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6024
Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6034
Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6044
Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6054
Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6814
Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6824
Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6834
Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6844
Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 6854
Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 7017
Problem with fan 3 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-1Location: U2.1-V1 2. Fan 3Location: U2.1-F3 464
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1021 7027
Problem with fan 4 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-2Location: U2.1-V2 2. Fan 4Location: U2.1-F4
1021 7117
Problem with fan 1 and fan 3 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-1Location: U2.1-V1 2. Fans 1 and 3Location: U2.1-F1 and U2.1-F3
1021 7127
Problem with fan 2 and fan 4 in first I/O subsystem in second rack (I/O subsystem 5)
1. DCA-2Location: U2.1-V2 2. Fans 2 and 4Location: U2.1-F2 and U2.1-F4
1021 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1021 B014
Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B024
Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B034
Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B044
Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B054
Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B814
Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B824
Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B834
Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B844
Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 B854
Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1021 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. Chapter
5.
Error
Code
to
FRU
Index
465
Error Code
Description
Action/ Possible Failing FRU
1021 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.1-P1.
1021 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.1-P2.Go to “MAP 1520: Power” on page 192.
1021 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1021 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1021 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1021 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1021 F4x6
Hardware configuration does not support N-mode power.
Call Service Support 466
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1021 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem in the secondary rack, location: U2.1.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1021 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem in the secondary rack, location: U2.1.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1021 xx11
First I/O subsystem in second rack, DCA-1
U2.1-V1
1021 xx13
Seeprom fault on DCA-1 in first I/O subsystem secondary rack
U2.1-V1
1021 xx15
First I/O subsystem in second rack, DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.1-V1/Q1 and U1.35-P2-X2/Q1 2. U2.1-V1/Q2 and U1.35-P1-X2/Q1
1021 xx17
Problem with fan 1 and fan 3 in first I/O subsystem in second rack
1. DCA-1Location U2.1–V1 2. Fan 1,Location U2.1–F1 3. Fan 3,Location U2.1–F3
1021 xx21
First I/O subsystem in second rack, DCA-2
U2.1-V2
1021 xx23
Seeprom fault on DCA-2 in first I/O subsystem secondary rack
U2.1-V2
1021 xx25
First I/O subsystem in second rack, DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.1-V2/Q1 and U1.35-P2-X2/Q2 2. U2.1-V2/Q2 and U1.35-P1-X2/Q2
1021 xx27
Problem with fan 2 and fan 4 in first I/O subsystem in second rack
1. DCA-2Location U2.1–V2 2. Fan 2,Location U2.1–F2 3. Fan 4,Location U2.1–F4
1021 xx31
First I/O subsystem in secondary rack I/O subsystem backplane 1 fault
U2.1-P1
1021 xx33
Seeprom fault on I/O subsystem backplane 1 in first I/O subsystem in secondary rack
U2.1-P1
1021 xx41
First I/O subsystem in secondary rack I/O subsystem backplane 2 fault
U2.1-P2 Chapter
5.
Error
Code
to
FRU
Index
467
Error Code
Description
Action/ Possible Failing FRU
1021 xx43
Seeprom fault on I/O subsystem backplane 2 in first I/O subsystem in secondary rack
U2.1-P2
1021 xx51
First I/O subsystem in secondary rack DASD backplane 1 fault
U2.1-P3
1021 xx53
Seeprom fault on DASD backplane 1 in first I/O subsystem in secondary rack
U2.1-P3
1021 xx61
First I/O subsystem in secondary rack DASD backplane 2 fault
U2.1-P4
1021 xx63
Seeprom fault on DASD backplane 2 in first I/O subsystem in secondary rack
U2.1-P4
1021 xx71
First I/O subsystem in secondary rack DASD backplane 3 fault
U2.1-P5
1021 xx73
Seeprom fault on DASD backplane 3 in first I/O subsystem in secondary rack
U2.1-P5
1021 xx81
First I/O subsystem in secondary rack DASD backplane 4 fault
U2.1-P6
1021 xx83
Seeprom fault on DASD backplane 4 in first I/O subsystem in secondary rack
U2.1-P6
1022 1B17
Problem with fan 1 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-1Location: U2.5-V1 2. Fan 1Location: U2.5-F1
1022 1B27
Problem with fan 2 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-2Location: U2.5-V2 2. Fan 2Location: U2.5-F2
1022 1C06
Loss of airflow in second I/O subsystem in second rack (I/O subsystem 6), location: U2.5
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1022 6014
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6024
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6034
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6044
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6054
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6814
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192. 468
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1022 6824
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6834
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6844
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 6854
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 7017
Problem with fan 3 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-1Location: U2.5-V1 2. Fan 3Location: U2.5-F3
1022 7027
Problem with fan 4 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-2Location: U2.5-V2 2. Fan 4Location: U2.5-F4
1022 7117
Problem with fan 1 and fan 3 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-1Location: U2.5-V1 2. Fans 1 and 3Location: U2.5-F1 and U2.5-F3
1022 7127
Problem with fan 2 and fan 4 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-2Location: U2.5-V2 2. Fans 2 and 4Location: U2.5-F2 and U2.5-F4
1022 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1022 B014
Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 B024
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 B034
Overcurrent or overvoltage problem with 2.5V in 6th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1022 B044
Overcurrent or overvoltage problem with 12.0V in6th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1022 B054
Overcurrent or overvoltage problem with -12.0V in 6th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1022 B814
Overcurrent or overvoltage problem with 3.3V in 6th I/O subsystem
Go to “MAP 1520: Power” on page 192.
1022 B824
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
469
Error Code
Description
Action/ Possible Failing FRU
1022 B834
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 B844
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 B854
Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6)
Go to “MAP 1520: Power” on page 192.
1022 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support.
1022 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.5-P1.
1022 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.5-P2. 470
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1022 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1022 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1022 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1022 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1022 F4x6
Hardware configuration does not support N-mode power.
Call Service Support
1022 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in second I/O subsystem in second rack (I/O subsystem 6), location: U2.5.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1022 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem in the secondary rack, location: U2.5.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1022 xx11
Second I/O subsystem in second rack (I/O subsystem 6), DCA-1
U2.5-V1
1022 xx13
Seeprom fault on DCA-1 in second I/O subsystem in second rack (I/O subsystem 6)
U2.5-V1
1022 xx15
Second I/O subsystem in second rack (I/O subsystem 6), DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.5-V1/Q1 and U1.35-P2-X2/Q3 2. U2.5-V1/Q2 and U1.35-P1-X2/Q3
1022 xx17
Problem with fan 1 and fan 3 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-1Location U2.5–V1 2. Fan 1,Location U2.5–F1 3. Fan 3,Location U2.5–F3
1022 xx21
Second I/O subsystem in second rack (I/O subsystem 6), DCA-2
U2.5-V2
1022 xx23
Seeprom fault on DCA-2 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-V2
1022 xx25
Second I/O subsystem in second rack (I/O subsystem 6), DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.5-V2/Q1 and U1.35-P2-X2/Q4 2. U2.5-V2/Q2 and U1.35-P1-X2/Q4 Chapter
5.
Error
Code
to
FRU
Index
471
Error Code
Description
Action/ Possible Failing FRU
1022 xx27
Problem with fan 2 and fan 4 in second I/O subsystem in second rack (I/O subsystem 6)
1. DCA-2Location U2.5–V2 2. Fan 2,Location U2.5–F2 3. Fan 4,Location U2.5–F4
1022 xx31
Second I/O subsystem in secondary rack (I/O subsystem 6) I/O subsystem backplane 1 fault
U2.5-P1
1022 xx33
Seeprom fault on I/O subsystem backplane 1 in second I/O subsystem in secondary rack (I/O subsystem 6)Broken 12C Bus
U2.5-P1
1022 xx41
Second I/O subsystem in secondary rack (I/O subsystem 6) I/O subsystem backplane 2 fault
U2.5-P2
1022 xx43
Seeprom fault on I/O subsystem backplane 2 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-P2
1022 xx51
Second I/O subsystem in secondary rack (I/O subsystem 6) DASD backplane 1 fault
U2.5-P3
1022 xx53
Seeprom fault on DASD backplane 1 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-P3
1022 xx61
Second I/O subsystem in secondary rack (I/O subsystem 6), DASD backplane 2 fault
U2.5-P4
1022 xx63
Seeprom fault on DASD backplane 2 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-P4
1022 xx71
Second I/O subsystem in secondary rack (I/O subsystem 6), DASD backplane 3 fault
U2.5-P5
1022 xx73
Seeprom fault on DASD backplane 3 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-P5
1022 xx81
Second I/O subsystem in secondary rack (I/O subsystem 6) DASD backplane 4 fault
U2.5-P6
1022 xx83
Seeprom fault on DASD backplane 4 in second I/O subsystem in secondary rack (I/O subsystem 6)
U2.5-P6
1023 1B17
Problem with fan 1 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-1Location: U2.9-V1 2. Fan 1Location: U2.9-F1
1023 1B27
Problem with fan 2 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-2Location: U2.9-V2 2. Fan 2Location: U2.9-F2
1023 1C06
Loss of airflow in fourth I/O subsystem, with IBFlocation: U2.9
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 472
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1023 6014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6824
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6834
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6844
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 6854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 7017
Problem with fan 3 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-1Location: U2.9-V1 2. Fan 3Location: U2.9-F3
1023 7027
Problem with fan 4 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-2Location: U2.9-V2 2. Fan 4Location: U2.9-F4
1023 7117
Problem with fan 1 and fan 3 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-1Location: U2.9-V1 2. Fans 1 and 3Location: U2.9-F1 and U2.9-F3
1023 7127
Problem with fan 2 and fan 4 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-2Location: U2.9-V2 2. Fans 2 and 4Location: U2.9-F2 and U2.9-F4
1023 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message. Chapter
5.
Error
Code
to
FRU
Index
473
Error Code
Description
Action/ Possible Failing FRU
1023 B014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B824
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B834
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B844
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 B854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack)
Go to “MAP 1520: Power” on page 192.
1023 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 474
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1023 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.9Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.9-P1.
1023 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.9Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.9-P2.
1023 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1023 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1023 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1023 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1023 F4x6
Hardware configuration does not support N-mode power.
Call Service Support Chapter
5.
Error
Code
to
FRU
Index
475
Error Code
Description
Action/ Possible Failing FRU
1023 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth subsystem (with IBF, location: U2.9.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1023 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem (with IBF), location: U2.9.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1023 xx11
Fourth I/O subsystem moves to second rack (with IBF), DCA-1
U2.9-V1
1023 xx13
Seeprom fault on DCA-1 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-V1
1023 xx15
Fourth I/O subsystem moves to second rack (with IBF), DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.9-V1/Q1 and U1.35-P2-X2/Q5 2. U2.9-V1/Q2 and U1.35-P1-X2/Q5
1023 xx17
Problem with fan 1 and fan 3 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-1Location U2.9–V1 2. Fan 1,Location U2.9–F1 3. Fan 3,Location U2.9–F3
1023 xx21
Fourth I/O subsystem moves to second rack (with IBF), DCA-2
U2.9-V2
1023 xx23
Seeprom fault on DCA-2 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-V2
1023 xx25
Fourth I/O subsystem moves to second rack (with IBF), DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.9-V2/Q1 and U1.35-P2-X2/Q6 2. U2.9-V2/Q2 and U1.35-P1-X2/Q6
1023 xx27
Problem with fan 2 and fan 4 in fourth I/O subsystem (with IBF, moves to second rack)
1. DCA-2Location U2.9–V2 2. Fan 2,Location U2.9–F2 3. Fan 4,Location U2.9–F4
1023 xx31
Fourth I/O subsystem (with IBF) in secondary rack I/O subsystem backplane 1 fault
U2.9-P1 476
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1023 xx33
Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P1
1023 xx41
Fourth I/O subsystem (with IBF) in secondary rack I/O subsystem backplane 2 fault
U2.9-P2
1023 xx43
Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P2
1023 xx51
Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 1 fault
U2.9-P3
1023 xx53
Seeprom fault on DASD backplane 1 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P3
1023 xx61
Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 2 fault
U2.9-P4
1023 xx63
Seeprom fault on DASD backplane 2 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P4
1023 xx71
Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 3 fault
U2.9-P5
1023 xx73
Seeprom fault on DASD backplane 3 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P5
1023 xx81
Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 4 fault
U2.9-P6
1023 xx83
Seeprom fault on DASD backplane 4 in fourth I/O subsystem (with IBF) in secondary rack
U2.9-P6
1024 1B17
Problem with fan 1 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-1Location: U2.13-V1 2. Fan 1Location: U2.13-F1
1024 1B27
Problem with fan 2 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-2Location: U2.13-V2 2. Fan 2Location: U2.13-F2
1024 1C06
Loss of airflow in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1024 6014
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6024
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6034
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
477
Error Code
Description
Action/ Possible Failing FRU
1024 6044
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6054
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6814
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6824
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6834
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6844
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 6854
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 7017
Problem with fan 3 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-1Location: U2.13-V1 2. Fan 3Location: U2.13-F3
1024 7027
Problem with fan 4 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-2Location: U2.13-V2 2. Fan 4Location: U2.13-F4
1024 7117
Problem with fan 1 and fan 3 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-1Location: U2.13-V1 2. Fans 1 and 3Location: U2.13-F1 and U2.13-F3
1024 7127
Problem with fan 2 and fan 4 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-2Location: U2.13-V2 2. Fans 2 and 4Location: U2.13-F2 and U2.13-F4
1024 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1024 B014
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192. 478
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1024 B024
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B034
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B044
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B054
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B814
Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B824
Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B834
Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B844
Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 B854
Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. Chapter
5.
Error
Code
to
FRU
Index
479
Error Code
Description
Action/ Possible Failing FRU
1024 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.13-P1.
1024 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.13-P2.
1024 CD96
A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13
Go to “MAP 1520: Power” on page 192.
1024 F4x6
Hardware configuration does not support N-mode power.
Call Service Support 480
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1024 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1024 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1024 xx11
Third I/O subsystem in second rack (I/O subsystem 7), DCA-1
U2.13-V1
1024 xx13
Seeprom fault on DCA-1 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-V1
1024 xx15
Third I/O subsystem in second rack (I/O subsystem 7), DCA-1, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.13-V1/Q1 and U1.35-P2-X2/Q7 2. U2.13-V1/Q2 and U1.35-P1-X2/Q7
1024 xx17
Problem with fan 1 and fan 3 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-1Location U2.13–V1 2. Fan 1,Location U2.13–F1 3. Fan 3,Location U2.13–F3
1024 xx21
Third I/O subsystem in second rack (I/O subsystem 7), DCA-2
U2.13-V2
1024 xx23
Seeprom fault on DCA-2 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-V2
1024 xx25
Third I/O subsystem in second rack (I/O subsystem 7), DCA-2, cable fault
Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.13-V2/Q1 and U1.35-P2-X2/Q8 2. U2.13-V2/Q2 and U1.35-P1-X2/Q8
1024 xx27
Problem with fan 2 and fan 4 in third I/O subsystem in second rack (I/O subsystem 7)
1. DCA-2Location U2.13–V2 2. Fan 2,Location U2.13–F2 3. Fan 4,Location U2.13–F4
1024 xx31
Third I/O subsystem in second rack (I/O subsystem 7) I/O subsystem backplane 1 fault
U2.13-P1 Chapter
5.
Error
Code
to
FRU
Index
481
Error Code
Description
Action/ Possible Failing FRU
1024 xx33
Seeprom fault on I/O subsystem backplane 1 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P1
1024 xx41
Third I/O subsystem in second rack (I/O subsystem 7) I/O subsystem backplane 2 fault
U2.13-P2
1024 xx43
Seeprom fault on I/O subsystem backplane 2 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P2
1024 xx51
Third I/O subsystem in second rack (I/O subsystem 7) DASD backplane 1 fault
U2.13-P3
1024 xx53
Seeprom fault on DASD backplane 1 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P3
1024 xx61
Third I/O subsystem in second rack (I/O subsystem 7), DASD backplane 2 fault
U2.13-P4
1024 xx63
Seeprom fault on DASD backplane 2 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P4
1024 xx71
Third I/O subsystem in second rack (I/O subsystem 7), DASD backplane 3 fault
U2.13-P5
1024 xx73
Seeprom fault on DASD backplane 3 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P5
1024 xx81
Third I/O subsystem in second rack (I/O subsystem 7) DASD backplane 4 fault
U2.13-P6
1024 xx83
Seeprom fault on DASD backplane 4 in third I/O subsystem in second rack (I/O subsystem 7)
U2.13-P6
1025 1B17
Problem with fan 1 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-1Location: U2.19-V1 2. Fan 1Location: U2.19-F1
1025 1B27
Problem with fan 2 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-2Location: U2.19-V2 2. Fan 2Location: U2.19-F2
1025 1C06
Loss of airflow in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
1025 6014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192. 482
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1025 6054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6824
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6834
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6844
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 6854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 7017
Problem with fan 3 in fourth I/O subsystem in second I/O subsystem in second rack (I/O subsystem 8)
1. DCA-1Location: U2.19-V1 2. Fan 3Location: U2.19-F3
1025 7027
Problem with fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-2Location: U2.19-V2 2. Fan 4Location: U2.19-F4
1025 7117
Problem with fan 1 and fan 3 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-1Location: U2.19-V1 2. Fans 1 and 3Location: U2.19-F1 and U2.19-F3
1025 7127
Problem with fan 2 and fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-2Location: U2.19-V2 2. Fans 2 and 4Location: U2.19-F2 and U2.19-F4
1025 8A00
The SPCN firmware has detected a change in the drawer configuration
Informational message.
1025 B014
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B024
Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B034
Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B044
Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192. Chapter
5.
Error
Code
to
FRU
Index
483
Error Code
Description
Action/ Possible Failing FRU
1025 B054
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B814
Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B824
Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B834
Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B844
Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 B854
Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8)
Go to “MAP 1520: Power” on page 192.
1025 BF06
Unable to resolve the chassis VPD in an I/O subsystem
The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support.
1025 CC36
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.19Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.19-P1. 484
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1025 CC46
Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.19Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.19-P2.
1025 CD96
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1025 CDA6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1025 CDB6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1025 CDC6
A power short has been detected in the DASD subsystem in an I/O drawer.
Go to “MAP 1520: Power” on page 192.
1025 F4x6
Hardware configuration does not support N-mode power.
Call Service Support
1025 F516
An N+1 boundary (DCA or fan) has dropped to N-mode status in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1025 F526
An N+1 boundary (DCA or fan) has dropped to N-mode status in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19.
This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced.
1025 xx11
Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-1
U2.19-V1
1025 xx13
Seeprom fault on DCA-1 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-V1 Chapter
5.
Error
Code
to
FRU
Index
485
Error Code
Description
Action/ Possible Failing FRU
1025 xx15
Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-1, cable fault
If there is no DCA in U1.18-P1-V8, second rack (I/O go to “MAP 1520: Power” on page 192 subsystem 8) with these pairs of location codes: 1. U2.19-V1/Q1 and U1.35-P2-X2/Q9 2. U2.19-V1/Q2 and U1.35-P1-X2/Q9If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.19-V1/Q1 and U1.35-P2-X1/Q9 2. U2.19-V1/Q2 and U1.35-P1-X1/Q9
1025 xx17
Problem with fan 1 and fan 3 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-1Location U2.19–V1 2. Fan 1,Location U2.19–F1 3. Fan 3,Location U2.19–F3
1025 xx21
Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-2
U2.19-V2
1025 xx23
Seeprom fault on DCA-2 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-V2
1025 xx25
Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-2, cable fault
If there is no DCA in U1.18-P1-V8, second rack (I/O go to “MAP 1520: Power” on page 192 subsystem 8), DCA-2, with these pairs of location codes: 1. U2.19-V2/Q1 and U1.35-P2-X2/Q10 2. U2.19-V2/Q2 and U1.35-P1-X2/Q10If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.19-V2/Q1 and U1.35-P2-X1/Q10 2. U2.19-V2/Q2 and U1.35-P1-X1/Q10
1025 xx27
Problem with fan 2 and fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8)
1. DCA-2Location U2.19–V2 2. Fan 2,Location U2.19–F2 3. Fan 4,Location U2.19–F4
1025 xx31
Fourth I/O subsystem in second rack (I/O subsystem 8) I/O subsystem backplane 1 fault
U2.19-P1
1025 xx33
Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P1
1025 xx41
Fourth I/O subsystem in second rack (I/O subsystem 8) I/O subsystem backplane 2 fault
U2.19-P2
1025 xx43
Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P2 486
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
1025 xx51
Fourth I/O subsystem in second rack (I/O subsystem 8) DASD backplane 1 fault
U2.19-P3
1025 xx53
Seeprom fault on DASD backplane 1 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P3
1025 xx61
Fourth I/O subsystem in second rack (I/O subsystem 8), DASD backplane 2 fault
U2.19-P4
1025 xx63
Seeprom fault on DASD backplane 2 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P4
1025 xx71
Fourth I/O subsystem in second rack (I/O subsystem 8), DASD backplane 3 fault
U2.19-P5
1025 xx73
Seeprom fault on DASD backplane 3 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P5
1025 xx81
Fourth I/O subsystem in second rack (I/O subsystem 8) DASD backplane 4 fault
U2.19-P6
1025 xx83
Seeprom fault on DASD backplane 4 in fourth I/O subsystem in second rack (I/O subsystem 8)
U2.19-P6
1RRU 1C06
Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209.
SPCNtoA
Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212.
SPCNtoB
Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212.
Chapter
5.
Error
Code
to
FRU
Index
487
Firmware/POST Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call service support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the operating system error log, remember that an error may require an additional 15 minutes to be propagated to the operating system error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
Error Code
Description
Action/ Possible Failing FRU
2011 00xy
I/O subsystem power supply failure
1. Replace power supply “y” in I/O subsystem “x”. If y=_, then the power supply is: y = power supply 3 = Both power supplies (I/O subsystem) 2 = Reserved 1 = Reserved See “AIX and Physical Location Code Tables” in Chapter 1, for location codes. 2. Check ac power source to I/O subsystem “x”. See “AIX and Physical Location Code Tables” in Chapter 1, for location codes.
2022 0000
The I/O bus is running slowly
1. Check cabling. 2. Call second level of support. 488
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
203w 0xyz
Remote I/O (RIO) configuration warning. This code indicates that a RIO configuration was detected that does not result in complete loops. The system continues to boot, however performance or recovery capability may be reduced. In some cases, the problem may result in missing I/O. Remote I/O subsystems are connected in loops. Use the letters w, y, and z in the error code to isolate the location of the open loop. Check RIO cables for loose connections, and verify power on the I/O subsystems by the presence of fan motion. The following numbers may be used to help isolate the failing component. v w = loop number, this is either 0 or 1 in a system with 2 RIO loops. v x = Always 0. v y is either 1 or 2 on a system with 2 loops. y indicates the nearest associated RIO port number on the processor subsystem according to the following table: y = Nearest RIO Port 1 = 0 2 = 1 3 = 2 4 = 3 v z = B, indicates a missing return line from the I/O subsystem to the processor subsystem. v z = C, indicates a missing link between two I/O subsystems. v z = E, indicates an I/O subsystem was found connected to RIO port 3 with no return to the processor subsystem, and no I/O subsystem was found connected to RIO port 2. In this case, the I/O connected to RIO port 3 is removed from the configuration since the cause of the error and the proper location of the I/O cannot be determined.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
20A8 0000
Insufficient information to boot.
Verify the IP address in the SMS menu.
20A8 0001
Client IP address is already in use by other network device
Verify the IP address in the SMS menu.
20A8 0002
Cannot get gateway IP address
Refer to Table 2 on page 379 using code E174.
20A8 0003
Cannot get server hardware address
Refer to Table 2 on page 379 using code E174.
20A8 0004
BootP failed
Refer to Table 2 on page 379 using code E175.
20A8 0005
File transmission (TFTP) failed.
Check network connection, try again.
20A8 0006
Boot image too large
Contact your next level of support for assistance. Chapter
5.
Error
Code
to
FRU
Index
489
Error Code
Description
Action/ Possible Failing FRU
20D0 0001
I/O subsystem has changed locations
This is an informational message. The I/O subsystem at the location code associated with this messages was found at another location code during a prior boot of the system. If an I/O subsystem has not been moved, complete the following steps: 1. Boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If a cabling problem is found, correct it. 2. If no cabling problem is found, call your next level of support.
20D0 000F
Self-test failed on device, no error and/or location code information available
Refer to “Determining Location Code” on page 415 to see if a location code is available for the device the error was reported against. 1. Replace device indicated by the location code (if available). 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
20D0 0010
Selftest failed on device, cannot locate package
Contact your next level of support for assistance.
20D0 0011
Firmware RTAS attempt to allocate memory failed.
Contact your next level of support for assistance.
20D0 0800
Firmware update failure: partition does not have service authority
Using the HMC, grant service authority to that partition. Reboot the partition. Retry the flash update.
20D0 0801
System firmware update failure: more than one partition running
Shut down all other partitions, then retry the firmware update.
20D0 0997
Operating system terminated with error string
Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757.
20D0 0A01
Error initializing RTAS NVRAM
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A02
Error initializing RTAS debugging token
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A03
Error initializing interface/service processor access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 490
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20D0 0A04
Error initializing interface/ operator panel access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A05
Error initializing hardware access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A06
Error initializing interface/PCI access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A07
Error initializing interface/operating system access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A08
Error initializing interface/ client_interface access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A09
Error initializing interface/NUMA access
1. Check for system firmware updates. Apply update if available. 2. Check for system firmware updates. Apply update if available.
20D0 0A0A
Error initializing interface/interrupt access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A0B
Error initializing interface/OFDT access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A0C
Error initializing interface/sensor access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A0D
Error initializing interface/sensors/led_table access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A0E
Error initializing interface/timebase access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter
5.
Error
Code
to
FRU
Index
491
Error Code
Description
Action/ Possible Failing FRU
20D0 0A0F
Error initializing interface/time-of-day access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A10
Error initializing interface/cache access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/power access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/hot-plug access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/log access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/EEH access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/error injection access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/handling access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing interface/utility access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0A1
Error initializing register softpatch access
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D0 0998
Operating system terminated with error string
Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757. 492
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20D0 0999
Operating system terminated with error string
Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757.
20D0 9001
Subsystem information not found in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9002
Subsystem machine type and model information not found in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9003
Subsystem serial number information not found in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9004
Subsystem machine type and model field is empty in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. Chapter
5.
Error
Code
to
FRU
Index
493
Error Code
Description
Action/ Possible Failing FRU
20D0 9005
Subsystem serial number field is empty in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9006
Duplicate subsystem serial number found in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9007
Invalid subsystem machine type and model information in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D0 9008
Invalid subsystem serial number found in VPD
1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support.
20D1 0001
Failed to send state-change message to service processor
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0002
Failed to disable heartbeat alarm
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 494
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20D1 0003
Failed to send boot status in LPAR partition
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0004
Failed to locate service processor device tree node
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0005
Failed to reboot system in LPAR mode
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0006
Failed to reboot a partition in LPAR mode
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0007
Failed to set PCI read/write permissions array
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0008
Failed to set PCI read/write permissions array for slot
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0009
Missing or invalid subsystem serial number
1. Check for system firmware updates. Apply update if available. 2. If a location code is shown with this error code, replace the part at that location. 3. Contact your next level of support for assistance.Note: The location code identifies the failing subsystem.
20D1 000A
Failed to send boot failed message to service processor
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 000B
Failed to reset PCI read/write permissions array
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 000C
Failed to send LMB-TABLE-INFO mailbox repair
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter
5.
Error
Code
to
FRU
Index
495
Error Code
Description
Action/ Possible Failing FRU
20D1 000D
Hypervisor function to get time-of-day failed
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 000F
Hypervisor call to reset PHB or to check for drawer existence failed
1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance.
20D1 0010
Stack underflow from fiber channel adapter
1. Check for microcode updates for the fiber channel adapter. Apply update if available. 2. Check for system firmware updates. Apply update if available. 3. Contact your next level of support for assistance.
20D1 0011
A recovery operation on an internal I/O control chip failed
1. If the system is partitioned, perform a hard reset on the partition that owns the drawer specified by the location code reported with the error. Reboot the partition. If the error reoccurs, replace the I/O planar specified by the location code. If the system is running in full system partition mode, or an HMC is not attached to the system, reboot the system. If the error reoccurs, replace the I/O planar specified by the location code. 2. Call service support.
20D1 0012
Attention: One of the I/O subsystems has been disconnected from the system
This indicates that an I/O subsystem that was previously in the configuration was not found during the current boot. If an I/O subsystem has been removed, this code is an informational message. If this message occurs during a system boot when the system’s drawer configuration has not changed, complete the following steps: 1. If not already booted, boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If a cabling problem is found, correct it. 2. Check for system firmware updates. Apply update if available. 3. Contact your next level of support for assistance.
20E0 0001
Privileged-access password entry error.
The password has been entered incorrectly. Try installing the password again. 496
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20E0 0004
Battery drained or needs replacement
1. Replace the battery,Location: U1.18-P1-H2-V1 2. If problem persists, replace the primary I/O book, Location: U1.18-P1-H2
20E0 0009
Invalid password entered - system locked
The password has been entered incorrectly 3 times. Turn off, then turn on the system unit, then enter the password correctly.
20EE 0003
IP parameter requires 3 dots ?.?
Enter valid IP parameter. Example: 000.000.000.000
20EE 0004
Invalid IP parameter
Enter valid IP parameter. Example: 000.000.000.000
20EE 0005
Invalid IP parameter (>255)
Enter valid IP parameter. Example: 255.192.002.000
20EE 0007
Keyboard not found
1. Plug in keyboard. 2. Replace the USB card the keyboard is plugged into.
20EE 0008
No configurable adapters found in the system by the remote IPL menu in the SMS utilities
This error occurs when the remote IPL menu in the SMS utilities cannot locate any LAN adapters that are supported by the remote IPL function. If a supported device is installed: 1. Replace the adapter. (See notes on page 413.) 2. Replace the I/O subsystem backplane in the subsystem(s) containing the device/adapter.
20EE 0009
Unable to communicate with the service processor
Replace the primary I/O book, Location: U1.18-P1-H2
20EE 000A
The image size on the boot or installation device exceeds the system firmware memory limitations.
1. Check for system firmware updates. Apply if available. 2. Call service support.
20EE 000B
The system was not able to find an operating system on the devices in the boot list.
Go to “Boot Problems” on page 409. Chapter
5.
Error
Code
to
FRU
Index
497
Error Code
Description
Action/ Possible Failing FRU
20EE 000C
Pointer to the operating system found in non-volatile storage.
Values normally found in non-volatile storage that point to the location of an operating system were not found. This can happen for two reasons: v Either your operating system doesn’t support storing the values orv An event occurred that caused the system to lose non-volatile storage information (drainage or replacement of the battery).If you are running AIX, reconstruct this information by running the bootlist command and specifying the device that the operating system is installed on. See the AIX documentation for the syntax and usage of the bootlist command. To boot the operating system so that the above-mentioned values can be reconstructed, power the system down and power it back on again to cause the system to look for the operating system in the devices contained in the custom boot list or in the default boot list, depending on the condition of the system. If this is not successful, use the SMS menus to modify the boot sequence (also known as the boot list) to include devices that are known to contain a copy of the operating system. For example, select a hard disk known to have a copy of the operating system as the first and only device in the boot sequence and attempt to boot again. If the problem persists, replace the primary I/O book, Location: U1.18-P1-H2
20EE 000E
Unable to execute the lpar-mgr open firmware method
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support.
20EE 000F
Unable to find the lpar-mgr package node
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support.
20EE 0010
Unable to execute the root vpd method
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 498
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20EE 0011
Unable to execute the root-lpar-vpd vpd method
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support.
20EE 0012
Unable to execute the mem-dimm-vpd method
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support.
20EE 0013
Unable to allocate memory for root-lpar-vpd vpd
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support.
20EE 0014
Unable to allocate memory for root-lpar-vpd location codes
Verify that the system is in LPAR mode, and try again. Otherwise, the system firmware may be damaged.
20EE 0020
The boot-device environment variable exceeds the allowed character limit
If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support.
20EE 0021
The boot-device environment variable contained more than five entries
If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support. Chapter
5.
Error
Code
to
FRU
Index
499
Error Code
Description
Action/ Possible Failing FRU
20EE 0022
The boot-device environment variable contained an entry that exceeded 255 characters in length
If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support.
20EE 0100
Fcode resident on an I/O adapter is not supported on this system
Informational message. The adapter’s Fcode driver is not supported on this system. IBM may produce a compatible driver in the future, but does not guarantee one. Additional information may be available from service support. This error code only affects boot-time operation, not the adapter’s functionality under the operating system.
20FB 0010
Failed to find keyword offset of SPCN configuration table in NVRAM.
Call service support.
20FB 0020
Invalid SPCN configuration table flag before SPCN table update.
Call service support.
20FC 0020
Failed to retrieve power, package, and/or cooling VPD via SPCN for the corresponding subsystem. Service processor mailbox failed to get VPD
1. Check RIO cable connection. 2. Check for system firmware updates. 3. Replace the part specified by the location code.
20FC 0021
Failed to retrieve power, package, and/or cooling VPD via SPCN for the subsystem. Service processor mailbox failed to get power, packaging, and cooling device VPD
1. Check RIO cable connection. 2. Check for system firmware updates. 3. Replace the part specified by the location code.
20FC 0030
Damaged power, packaging, and/or cooling device VPD format in SPCN configuration table
1. Check for system firmware updates. 2. Replace the part specified by the location code.
20FF 0001
Problem with VPD
1. Replace the part indicated by the location code. 2. If no location code is shown with the error code, call service support.
20FF0010
Invalid VPD (defect ROM structure for PCI)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0020
Invalid VPD (checksum mismatch)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 500
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
20FF0021
Invalid VPD (ID tag length is more than 256 bytes)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0022
Invalid VPD (84/90 tag length is more than 2KB - large resource tag)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0023
Invalid VPD (buffer length is more than 2KB
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0024
Invalid VPD (84/90 tag length is more than 2KB - vendor-defined tag)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0026
Invalid VPD (tag is not recognized)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0030
Invalid VPD (VPD data bad)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0060
Invalid VPD (VPD length is zero)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0070
Invalid VPD (cannot find the beginning of the keyword)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0072
Invalid VPD (cannot find the beginning of the keyword)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0073
Invalid VPD (cannot find the beginning of the keyword)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF0090
Invalid VPD (empty VPD data from the service processor mailbox command)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. Chapter
5.
Error
Code
to
FRU
Index
501
Error Code
Description
Action/ Possible Failing FRU
20FF00A0
Invalid VPD (getvpd mailbox command failed)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF00B0
Invalid VPD (the FG keyword is not found for a fabric card)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF00B1
Invalid VPD (cannot find RM keyword)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF00C0
Invalid VPD (missing base address of the RIO-PCI bridge)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF00D0
Invalid VPD (machine type field is invalid)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
20FF00F0
Invalid VPD (the rack number and the unit number are not found in the SCF partition in NVRAM)
1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support.
2100 0xxx
SCSI controller fault
Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device.
21A0 0001
SCSI DASD: test unit ready failed; hardware error
Before replacing any system components: 1. Ensure that the controller and each device on the SCSI bus is assigned a unique SCSI ID. 2. Ensure that the SCSI bus is properly terminated. 3. Ensure that the SCSI signal and power cables are securely connected and not damaged.The location code information is required to identify the ID of SCSI device failures as well as to indicate the location of the controller to which the device is attached. Check the system error logs to determine the location code information associated with the error code. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 502
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
21A0 0002
SCSI DASD: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the media (if a removable media device). 2. Replace the SCSI device.
21A0 0003
SCSI DASD: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21A0 0004
SCSI DASD: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21E0 0001
SCSI tape: test unit ready failed; hardware error
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller.
21E0 0002
SCSI tape: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the media (if a removable media device). 2. Replace the SCSI device.
21E0 0003
SCSI tape: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21E0 0004
SCSI tape: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21ED 0001
SCSI C: test unit ready failed; hardware error
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller.
21ED 0002
SCSI changer: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media. Chapter
5.
Error
Code
to
FRU
Index
503
Error Code
Description
Action/ Possible Failing FRU
21ED 0003
SCSI changer: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21ED 0004
SCSI changer: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21EE 0001
Undetermined SCSI device: test unit ready failed; hardware error
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. If the missing SCSI devices are connected to the same backplane, replace the SCSI backplane. 4. Replace the SCSI controller.
21EE 0002
Undetermined SCSI device: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media.
21EE 0003
Undetermined SCSI device: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21EE 0004
Undetermined SCSI device: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21F0 0001
SCSI CD-ROM: test unit ready failed; hardware error
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller.
21F0 0002
SCSI CD-ROM: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media.
21F0 0003
SCSI CD-ROM: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 504
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
21F0 0004
SCSI CD-ROM: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device.
21F2 0001
SCSI read/write optical device: test unit ready failed; hardware error
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller.
21F2 0002
SCSI read/write optical device: test unit ready failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media.
21F2 0003
SCSI read/write optical drive: send diagnostic failed; sense data available
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI drive.
21F2 0004
SCSI read/write optical drive: send diagnostic failed - devofl command
Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI drive.
2200 0001
PCI Ethernet BNC/RJ-45 or PCI Ethernet AUI/RJ-45 Adapter Internal Wrap Test failure
Replace the Adapter. See “Determining Location Code” on page 415 for location code information related to this error.
2200 1001
10/100 Mbps Ethernet PCI Adapter Internal Wrap Test failure
Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error.
2200 1002
10/100 Mbps Ethernet PCI Adapter failure
Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error.
2201 0001
PCI Auto LANstreamer Token Ring Adapter Adapter failed to complete hardware initialization.
Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error.
2201 1001
PCI Token Ring Adapter Adapter failed to complete hardware initialization.
Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error.
20EE 0013
Unable to allocate memory for root-lpar-vpd vpd
1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
505
Error Code
Description
Action/ Possible Failing FRU
25A0 0001
L2 Cache controller failure
Replace the MCM module specified by the location code.
25A1 0001
L2 Cache SRAM failure
Replace the MCM module specified by the location code.
25A8 0xxx
NVRAM problems
Errors reported against NVRAM can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. With the exception of the 25A8 0000 error, these errors are warnings that the NVRAM data content had to be reestablished and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, any system customization (for example, boot device list) information has been lost, the system may need to be reconfigured. If the error is persistent, replace the primary I/O book, Location: U1.18-P1-H2
25A8 0000
Initialization failed, device test failed
Refer to Action under error code 25A8 0xxx
25A8 0100
NVRAM data validation check failed.
Turn off, turn on system unit and retry the operation before replacing any system component. Refer to Action under error code 25A8 0xxx
25A8 0201
Unable to expand target partition - Saving configuration variable.
Refer to Action under error code 25A8 0xxx
25A8 0202
Unable to expand target partition - Writing error log entry.
Refer to Action under error code 25A8 0xxx
25A8 0203
Unable to expand target partition - Writing VPD data.
Refer to Action under error code 25A8 0xxx
25A8 0210
Setenv/$Setenv parameter error - name contains a null character.
Refer to Action under error code 25A8 0xxx
25A8 0211
Setenv/$Setenv parameter error - value contains a null character.
Refer to Action under error code 25A8 0xxx
25A8 0215
NVRAM RIO-USR-select variable not found
Complete the following: 1. Check for platform firmware updates. Apply if available. 2. Call service support. 506
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
25A8 0998
NVRAMRC script evaluation error - command line execution error Action: Execution of a command line within the nvram configuration variable “nvramrc” (script) resulted in a “throw” being executed. This script can be modified by the system firmware SMS utilities, the operating system, PCI adapter rom code or utility, or an operator (via the open firmware script editing command nvedit). It may not be possible to resolve the problem without a detailed analysis of the nvram script, the current system configuration, and device tree contents. 1. The problem can be caused by a SCSI adapter, whose SCSI bus ID has been changed from the default setting, no longer appearing in the system. This can be caused either by removing a SCSI adapter, or by a problem with a SCSI adapter. a. Select option 5, Change SCSI Settings, on the SMS main menu. Then on the SCSI utilities menu, select option 2, Change SCSI Id. 1) Verify the list of SCSI controllers/adapters. If the list is not correct, suspect a problem with the adapter(s) installed but not listed. 2) Select one of the adapters listed and enter the SCSI id that is currently displayed for that adapter, then exit from this menu. This causes the information in NVRAM to be updated. 3) Restart the system.b. If the problem persists, boot the operating system and verify the SCSI bus IDs of any installed/available SCSI controllers (change as necessary), and restart the system.2. Contact your support representative for further assistance.
25A8 0999
NVRAMRC script evaluation error - stack unbalanced on completion
This is a firmware debug environment error. There is no user action or FRU replacement for this error. If this error persists, call service support.
25A8 0A00
Unable to retrieve NVRAM LPAR table information
Refer to the actions for error code 25A8 0xxx.
25B0 0001
Memory book failure
Replace the memory book specified by the location code.
25B0 0004
Memory book is not supported
Replace the unsupported memory book(s) specified by the location code.
25Cy y001
Memory book is not supported
Replace unsupported memory book(s) specified by the location code.
25Cy y002
Memory book failure
Replace the memory book(s) specified by the location code.
2601 0000
Boot failed because the RIO cables, or the power subsystem cables, are not routed correctly to the I/O subsystems
1. Check both ends of the RIO cable specified in the location code that appears with this error code. 2. Determine which I/O subsystem is at the end of the RIO cable specified by the location code shown with this error code. Check both ends of the power cables going to both DCAs in that I/O subsystem.For information about I/O cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. Note: The port location code identifies the I/O book number. Chapter
5.
Error
Code
to
FRU
Index
507
Error Code
Description
Action/ Possible Failing FRU
2601 0001
Boot failed because the I/O subsystem has not been programmed correctly.
For information about I/O book cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. 1. Ensure that power cables are correctly connected to drawers. 2. Firmware, including power code, may have to be flashed on both sides of the I/O subsystem. Note: The location code identifies the I/O subsystem that has the problem 3. Contact the next level of support.
2601 0002
RIO cable error or DCA power cable error detected on an I/O subsystem
1. Check both ends of the RIO cable specified in the location code that appears with this error code. 2. Determine which I/O subsystem is at the end of the RIO cable specified by the location code shown with this error code. Check the power cables going to both DCAs in that I/O subsystem.For information about I/O cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. Note: The port location code identifies the I/O book number.
2602 0001
Invalid PCI adapter vendor ID
1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.)
2602 0002
Invalid PCI adapter device ID
1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane. (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.) 508
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
2602 0007
PCI device I/O or memory space request could not be satisfied
1. If the location code identifies a slot: a. Reseat the adapter and reboot the system. b. Replace the adapter. c. Check for system firmware updates. Apply if available. d. Replace the I/O backplane in which the adapter is installed.2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O backplane.3. Call service support.
2602 0008
PCI device Fcode evaluation error
1. If the location code identifies a slot: a. Check for adapter firmware updates. Apply if available. b. Replace the adapter. c. Check for system firmware updates. Apply if available.2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O planar.3. Call service support.
2602 0010
PCI probe error, bridge in freeze state If the location code identifies a slot or PCI adapter: 1. Check for system firmware updates. Apply if update available. 2. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 3. Use the hot plug service aid to reseat the adapter specified by the location code. See “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921. If reseating the adapter fixes the problem, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems manual. If the problem is not resolved, continue with 4. 4. Use the hot plug task to move the adapter to another slot (behind a different PCI bridge). The “I/O Subsystem PCI and PCI-X PHB and PCI and PCI-X Slot Locations” on page 27 identifies the PCI bridges and the associated slots. See “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921 when removing the adapter, and “Installing a New Hot-Pluggable PCI Adapter” on page 923 when installing the adapter in the new slot. If the adapter is successfully reconfigured in the new slot (behind a different PCI bridge), the slot in which the adapter was originally plugged is bad. Replace the I/O backplane assembly that contains the slot in which the adapter was originally installed, location: Ux.y-Pz. If the adapter does not reconfigure successfully in the new slot, replace the adapter. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. If the location code does not identify a slot or PCI adapter or no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter
5.
Error
Code
to
FRU
Index
509
Error Code
Description
Action/ Possible Failing FRU
2602 0011
PCI probe error, bridge is not usable
If the location code identifies a slot: 1. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.)If the location code identifies an I/O subsystem backplane: 1. Check for system firmware updates. Apply if update is available. 2. Replace I/O subsystem backplane.
2602 0012
PCI device run-time error, bridge in freeze state
If the location code identifies a slot: 1. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413).If the location code identifies an I/O subsystem backplane: 1. Check for system firmware updates. Apply if update is available. 2. Replace I/O subsystem backplane. 510
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
2602 0014
A PCI adapter is not properly seated, or an invalid type of PCI adapter is installed in the system.
If the location code identifies a slot: 1. Check the PCI adapter specified by the location code for proper seating. 2. Verify that the PCI adapter specified by the location code is supported on this system. 3. Replace the adapter.If the location code identifies an I/O planar: 1. Check for system firmware updates. Apply the update if one is available. 2. Replace the I/O planar.
2602 0Fxy
PCI bridge chip failure
Replace the I/O planar specified by the location code. The I/IO planar may be in the CEC or in an external I/O drawer.
2680 0Axy
MX-PCI Bridge Bist Failure
The problem is associated with bridge ?y? in I/O subsystem ?x?. Replace the I/O board associated with x x = I/O subsystem 1=0 2=1 3=2 4=3 5=4 6=5 See “Determining Location Code” on page 415 or location code information related to this error.
2680 0Cxx
Machine Check occurred
1. Replace the part identified by the location code associated with this error code. 2. If the location code associated with this error code identifies an adapter, and replacing the adapter does not correct the problem, replace the I/O subsystem backplane.See “Determining Location Code” on page 415 for location code information related to this error.
2680 0Dxx
Machine check occurred; unable to isolate to a single device
Go to “MAP 1542: I/O Problem Isolation” on page 318.
27A0 0001
Error obtaining external RIO cable IDs
1. Check the SPCN and RIO cabling going to the I/O subsystem specified by the location code. 2. Check for system firmware updates. 3. Replace the I/O subsystem planar specified by the location code.
2803 0xxx
Real-time-clock (RTC) errors Chapter
5.
Error
Code
to
FRU
Index
511
Error Code
Description
Action/ Possible Failing FRU
2803 0001
RTC not updating - RTC initialization required
Errors reported against the real-time clock can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. These errors are warnings that the RTC data content must be re-established and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, the time and date information has been lost. To set/restore the time and date, use the operating system facility. If the error is persistent, replace the primary I/O book, Location: U1.18-P1-H2
2803 0002
Bad time/date values
1. Set time and date. 2. Refer to Action under error code 2803 0001.
2803 0003
RTC not updating - not correctable
Replace the primary I/O book, Location: U1.18-P1-H2
2803 0004
RTC operating mode parameters (for example, data mode) changed
1. Set time and date. 2. Refer to Action under error code 2803 0001.
2803 0005
RTC battery error
Replace the primary I/O book, Location: U1.18-P1-H2 Note: Passwords, time, and date must be reset.
2803 0006
Processor frequency measurement error
Replace the clock card, location: U1.18-P1-X5.
2803 0010
The real-time-clock cannot be protected in the LPAR environment.
1. Check for system firmware updates. Apply if available. 2. Contact your next level of support.
2803 F003
Real-time-clock not updating - not correctable (detected by hypervisor)
Replace the primary I/O book, location: U1.18-P1-H2.
2900 0002
ISA keyboard/mouse controller failed self-test
Replace the primary I/O book, Location: U1.18-P1-H2
29A0 0003
ISA keyboard not present/detected
Replace the primary I/O book, Location: U1.18-P1-H2
29A0 0004
ISA keyboard stuck key detected
Replace the primary I/O book, Location: U1.18-P1-H2
29B0 0004
ISA mouse not present/detected
Replace the primary I/O book, Location: U1.18-P1-H2
2B20 0011
Processor is manually disabled.
Use the service processor menus to re-enable the processor and reboot the system.
2B20 0012
This is an informational code that is generated when the cpu_deallocate command is used to take a processor off-line.
There is no action required. To bring the processor back on-line, or back into the system or partition configuration, reboot the operating system. 512
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
2B20 0022
Processor is deconfigured by the system.
The processor has previously failed BIST or POST. Replace the processor MCM indicated by the location code associated with the error code in the error log.
2B20 0031
Processor is failing POST
Replace the processor MCM indicated by the location code associated with the error code in the error log.
2B20 0042
Unknown processor card
Remove the unknown/unsupported processor MCM.
2B20 8880
Unrecoverable processor subsystem error
If Linux is the only operating system on the failing system, call service support. If AIX is available, and the system is powered off, replace the FRU indicated in the service processor error log entry. If AIX is available and the system has been rebooted, run online AIX diagnostics and look for an AIX error log entry in which the description says cache or cache controller error. Replace the FRUs listed in this entry.
2B20 8881
Software error
Call service support.
2B20 8882
Software error
Call service support.
2B20 8883
Software error
Call service support.
2B20 8884
Unrecoverable processor subsystem error
If Linux is the only operating system on the failing system, call service support. If AIX is available, and the system is powered off, replace the FRU indicated in the service processor error log entry. If AIX is available and the system has been rebooted, run online AIX diagnostics and look for an AIX error log entry in which the description says cache or cache controller error. Replace the FRUs listed in this entry.
2B20 8885
Unrecoverable internal hardware error
If Linux is the only operating system on the failing system, call service support. If Aix is available, run online AIX diagnostics.Look for an AIX error log entry in which the description says I/O bus time out, access, or other error, or I/O bridge/device internal error. Replace the FRUs listed in this entry.
2B20 8886
Recoverable internal error while resetting a partition
Informational message; no action required. If a partition is reset, either from the operating system or the HMC, and a machine check occurs once the partition has started but not completed the reset operation, the normal reset operation is aborted and the partition is terminated without an operating system dump. When the partition is brought back up, examine the operating system error log to determine the cause of the machine check. Chapter
5.
Error
Code
to
FRU
Index
513
Error Code
Description
Action/ Possible Failing FRU
2B20 8888
Operating system termination request received
Informational message.
2B2x 00EE
An exception error has been reported on a global processor
The type of interrupt that caused the exception is specified by the x as follows: 0: Unknown interrupt 1: System reset interrupt (SRI) 2. Machine check interrupt (MCI) 3. Data storage interrupt (DSI) 4. Instruction storage interrupt (ISI) 5. External interrupt (EXI) 6. Alignment interrupt (ALI) 7. Program interrupt (SRI) 8. Floating unavailable interrupt (FUI)1. Check for system firmware updates. Apply if available. 2. Call service support.
2BA0 0000
Service processor POST failure
Replace the primary I/O book, Location: U1.18-P1-H2
2BA0 0012
Service processor reports self test failure
Replace the primary I/O book, Location: U1.18-P1-H2
2BA0 0013
Service processor reports bad NVRAM CRC
Refer to the actions for 25A8 0xxx.
2BA0 0017
Service processor reports bad or low battery.
Refer to the actions for 25A8 0xxx.
2BA0 0021
Open firmware mailbox call to activate attention LED failed
1. Check for system firmware updates. Apply if available. 2. Call service support.
2BA0 0041
Service processor VPD is damaged.
Replace the primary I/O book, Location: U1.18-P1-H2
2BA0 0071
VPD data is damaged for MCM 0
Replace the MCM and VPD card
2BA0 0073
VPD data is damaged for MCM 2
Replace the MCM and VPD card
2BA0 0080
Service processor hardware reset request failure
1. Check for system firmware updates. Apply if available. 2. Call service support.
2Cmm mhhh
A processor’s timebase enable signal is stuck enabled
1. Replace the primary I/O book, location: U1.18-P1-H2. 2. hhh indicates the failing processor. Replace the MCM that contains the failing processor according to the following table:v hhh = 0 - 7: U1.18-P1-C1 v hhh = 8-15: U1.18-P1-C3 v hhh = 16-23: U1.18-P1-C4 v hhh = 24-31: U1.18-P1-C2 514
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
2Dmm mhhh
A processor’s timebase enable signal is stuck disabled
1. Replace the primary I/O book, location: U1.18-P1-H2. 2. hhh indicates the failing processor. Replace the MCM that contains the failing processor according to the following table:v hhh = 0 - 7: U1.18-P1-C1 v hhh = 8-15: U1.18-P1-C3 v hhh = 16-23: U1.18-P1-C4 v hhh = 24-31: U1.18-P1-C2
Chapter
5.
Error
Code
to
FRU
Index
515
Service Processor Error Codes Attention: Follow the procedure defined in “Checkpoints and Error Codes Index” on page 416. If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Note: The MCM and its associated VPD card are a single FRU, and must be replaced as a set.
Error Code
Description
Action/ Possible Failing FRU
4064 0001
Processor subsystem tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0002
Processor subsystem tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0003
Processor subsystem tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4064 0004
Processor subsystem tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4064 0005
Processor subsystem tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4064 0006
Processor subsystem tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4064 0007
Processor subsystem tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
4064 000A
Processor subsystem tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 000B
Processor subsystem tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 000C
Processor subsystem tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 000D
Processor subsystem tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4064 0010
Processor subsystem tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2 516
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0011
Processor subsystem tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4064 0012
Processor subsystem tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4064 0013
Processor subsystem tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4064 0014
Processor subsystem tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4064 0015
Processor subsystem tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4064 0016
Processor subsystem tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4064 0017
Processor subsystem tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4064 00B0
Processor subsystem tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4064 00B1
Processor subsystem tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4064 00B3
Processor subsystem tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4064 00B4
Processor subsystem tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4064 00B5
Processor subsystem tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4064 00B6
Processor subsystem tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4064 00B7
Processor subsystem tests detected Insufficient hardware resources to continue IPL after processor subsystem testing and initialization completed
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
517
Error Code
Description
Action/ Possible Failing FRU
4064 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4064 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 518
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4064 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
519
Error Code
Description
Action/ Possible Failing FRU
4064 0B00
Processor subsystem tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4064 0B01
Processor subsystem tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4064 0B02
Processor subsystem tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4064 0B03
Processor subsystem tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4064 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4064 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4064 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 520
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4064 0B20
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B21
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B22
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B23
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B24
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
521
Error Code
Description
Action/ Possible Failing FRU
4064 0B25
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B26
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B27
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4064 0B30
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4064 0B31
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4064 0B32
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4064 0B33
Processor subsystem tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 522
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4064 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4064 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4064 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4064 0BF0
Processor subsystem tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4064 0BF1
Processor subsystem tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 Chapter
5.
Error
Code
to
FRU
Index
523
Error Code
Description
Action/ Possible Failing FRU
4064 0BF2
Processor subsystem tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4064 0BF3
Processor subsystem tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4064 0BF4
Processor subsystem tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4064 0BF5
Processor subsystem tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4064 0BF6
Processor subsystem tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4
4064 0BF7
Processor subsystem tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4064 0BF8
Processor subsystem tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 524
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0BF9
Processor subsystem tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4064 0BFA
Processor subsystem tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4064 0BFB
Processor subsystem tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4064 0BFC
Processor subsystem tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4064 0BFD
Processor subsystem tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4064 0C03
Processor subsystem tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4064 0C09
Processor subsystem tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345.
4064 0C0A
Processor subsystem tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4064 0C10
Processor subsystem tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4064 0C11
Processor subsystem tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4064 0C12
Processor subsystem tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4064 0C13
Processor subsystem tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4064 0CA5
Processor subsystem tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured. Chapter
5.
Error
Code
to
FRU
Index
525
Error Code
Description
Action/ Possible Failing FRU
4064 0CA6
Processor subsystem tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4064 0CA9
Processor subsystem tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4064 0D00
Processor subsystem tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4064 0D01
Processor subsystem tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4064 0D02
Processor subsystem tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4064 0D03
Processor subsystem tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4064 0D04
Processor subsystem tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4064 0D05
Processor subsystem tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4064 0D06
Processor subsystem tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4064 0D07
Processor subsystem tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4064 0D08
Processor subsystem tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4064 0D09
Processor subsystem tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4064 0D0A
Processor subsystem tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4064 0D0B
Processor subsystem tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4064 0D0C
Processor subsystem tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4064 0D0D
Processor subsystem tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4064 0D0E
Processor subsystem tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1
4064 0D0F
Processor subsystem tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 526
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4064 0EA0
Processor subsystem tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4064 0EA1
Processor subsystem tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
4064 0EA2
Processor subsystem tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
4064 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
4064 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4064 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4064 0EAF
Processor subsystem tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4064 0EB0
Processor subsystem tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
527
Error Code
Description
Action/ Possible Failing FRU
4064 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4064 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4064 0EE1
Processor subsystem tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4064 0EE2
Processor subsystem tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4064 0EE5
Processor subsystem tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4064 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4064 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4065 0001
Display alter tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0002
Display alter tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0003
Display alter tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4065 0004
Display alter tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4065 0005
Display alter tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 528
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0006
Display alter tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4065 0007
Display alter tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective.?) 2. Primary I/O bookLocation: U1.18-P1-H2
4065 000A
Display alter tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 000B
Display alter tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 000C
Display alter tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 000D
Display alter tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4065 0010
Display alter tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
4065 0011
Display alter tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4065 0012
Display alter tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4065 0013
Display alter tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4065 0014
Display alter tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4065 0015
Display alter tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4065 0016
Display alter tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4065 0017
Display alter tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4065 00B0
Display alter tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4065 00B1
Display alter tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
529
Error Code
Description
Action/ Possible Failing FRU
4065 00B3
Display alter tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: 2U1.18-P1-C1 and U1.18-P1-H2.3
4065 00B4
Display alter tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4065 00B5
Display alter tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4065 00B6
Display alter tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4065 00B7
Display alter tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 530
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4065 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
531
Error Code
Description
Action/ Possible Failing FRU
4065 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4065 0B00
Display alter tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4065 0B01
Display alter tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4065 0B02
Display alter tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4065 0B03
Display alter tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 532
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4065 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4065 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4065 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4065 0B20
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B21
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B22
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
533
Error Code
Description
Action/ Possible Failing FRU
4065 0B23
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B24
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B25
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B26
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B27
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4065 0B30
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4065 0B31
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 534
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0B32
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4065 0B33
Display alter tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4065 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4065 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4065 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4065 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
535
Error Code
Description
Action/ Possible Failing FRU
4065 0BF0
Display alter tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4065 0BF1
Display alter tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
4065 0BF2
Display alter tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4065 0BF3
Display alter tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4065 0BF4
Display alter tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4065 0BF5
Display alter tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4065 0BF6
Display alter tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 536
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0BF7
Display alter tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4065 0BF8
Display alter tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
4065 0BF9
Display alter tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4065 0BFA
Display alter tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4065 0BFB
Display alter tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4065 0BFC
Display alter tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4065 0BFD
Display alter tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4065 0C03
Display alter tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4065 0C09
Display alter tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
537
Error Code
Description
Action/ Possible Failing FRU
4065 0C0A
Display alter tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4065 0C10
Display alter tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4065 0C11
Display alter tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4065 0C12
Display alter tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4065 0C13
Display alter tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4065 0CA5
Display alter tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
4065 0CA6
Display alter tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4065 0CA9
Display alter tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4065 0D00
Display alter tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4065 0D01
Display alter tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4065 0D02
Display alter tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4065 0D03
Display alter tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4065 0D04
Display alter tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4065 0D05
Display alter tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4065 0D06
Display alter tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4065 0D07
Display alter tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4065 0D08
Display alter tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4065 0D09
Display alter tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4065 0D0A
Display alter tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4065 0D0B
Display alter tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4065 0D0C
Display alter tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4065 0D0D
Display alter tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4065 0D0E
Display alter tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 538
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4065 0D0F
Display alter tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
4065 0EA0
Display alter tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4065 0EA1
Display alter tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
4065 0EA2
Display alter tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
4065 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
4065 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4065 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4065 0EAF
Display alter tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4065 0EB0
Display alter tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
539
Error Code
Description
Action/ Possible Failing FRU
4065 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4065 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4065 0EE1
Display alter tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4065 0EE2
Display alter tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4065 0EE5
Display alter tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4065 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4065 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4066 0001
Scan interface basic assurance tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0002
Scan interface basic assurance tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0003
Scan interface basic assurance tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4066 0004
Scan interface basic assurance tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4066 0005
Scan interface basic assurance tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 540
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0006
Scan interface basic assurance tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4066 0007
Scan interface basic assurance tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
4066 000A
Scan interface basic assurance tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 000B
Scan interface basic assurance tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 000C
Scan interface basic assurance tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 000D
Scan interface basic assurance tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4066 0010
Scan interface basic assurance tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
4066 0011
Scan interface basic assurance tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4066 0012
Scan interface basic assurance tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4066 0013
Scan interface basic assurance tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4066 0014
Scan interface basic assurance tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4066 0015
Scan interface basic assurance tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4066 0016
Scan interface basic assurance tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4066 0017
Scan interface basic assurance tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4066 00B0
Possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4066 00B1
Possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
541
Error Code
Description
Action/ Possible Failing FRU
4066 00B3
MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4066 00B4
MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4066 00B5
MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4066 00B6
MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4066 00B7
Hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 542
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4066 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
543
Error Code
Description
Action/ Possible Failing FRU
4066 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4066 0B00
Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4066 0B01
Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4066 0B02
Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4066 0B03
Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 544
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4066 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4066 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4066 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4066 0B20
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B21
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B22
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
545
Error Code
Description
Action/ Possible Failing FRU
4066 0B23
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B24
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B25
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B26
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B27
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4066 0B30
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4066 0B31
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 546
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0B32
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4066 0B33
Scan interface basic assurance tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4066 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4066 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4066 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4066 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
547
Error Code
Description
Action/ Possible Failing FRU
4066 0BF0
Scan interface basic assurance tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4066 0BF1
Scan interface basic assurance tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
4066 0BF2
Scan interface basic assurance tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4066 0BF3
Scan interface basic assurance tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4066 0BF4
Scan interface basic assurance tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4066 0BF5
Scan interface basic assurance tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4066 0BF6
Scan interface basic assurance tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 548
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0BF7
Scan interface basic assurance tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4066 0BF8
Scan interface basic assurance tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
4066 0BF9
Scan interface basic assurance tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4066 0BFA
Scan interface basic assurance tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4066 0BFB
Scan interface basic assurance tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4066 0BFC
Scan interface basic assurance tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4066 0BFD
Scan interface basic assurance tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4066 0C03
Scan interface basic assurance tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4066 0C09
Scan interface basic assurance tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
549
Error Code
Description
Action/ Possible Failing FRU
4066 0C0A
Scan interface basic assurance tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4066 0C10
Mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4066 0C11
Mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4066 0C12
Mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4066 0C13
Mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4066 0CA5
Scan interface basic assurance tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
4066 0CA6
Scan interface basic assurance tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4066 0CA9
Scan interface basic assurance tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4066 0D00
Scan interface basic assurance tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4066 0D01
Scan interface basic assurance tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4066 0D02
Scan interface basic assurance tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4066 0D03
Scan interface basic assurance tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4066 0D04
Scan interface basic assurance tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4066 0D05
Scan interface basic assurance tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4066 0D06
Scan interface basic assurance tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4066 0D07
Scan interface basic assurance tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4066 0D08
Scan interface basic assurance tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4066 0D09
Scan interface basic assurance tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4066 0D0A
Scan interface basic assurance tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4066 0D0B
Scan interface basic assurance tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4066 0D0C
Scan interface basic assurance tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4066 0D0D
Scan interface basic assurance tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4066 0D0E
Scan interface basic assurance tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 550
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4066 0D0F
Scan interface basic assurance tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
4066 0EA0
Scan interface basic assurance tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4066 0EA1
Scan interface basic assurance tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
4066 0EA2
Scan interface basic assurance tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
4066 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
4066 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4066 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4066 0EAF
Scan interface basic assurance tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4066 0EB0
Scan interface basic assurance tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
551
Error Code
Description
Action/ Possible Failing FRU
4066 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4066 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4066 0EE1
Scan interface basic assurance tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4066 0EE2
Scan interface basic assurance tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4066 0EE5
Scan interface basic assurance tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4066 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4066 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4067 0001
Bus tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0002
Bus tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0003
Bus tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4067 0004
Bus tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 552
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0005
Bus tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4067 0006
Bus tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4067 0007
Bus tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
4067 000A
Bus tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 000B
Bus tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 000C
Bus tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 000D
Bus tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4067 0010
Bus tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
4067 0011
Bus tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4067 0012
Bus tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4067 0013
Bus tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4067 0014
Bus tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4067 0015
Bus tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4067 0016
Bus tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4067 0017
Bus tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4067 00B0
Bus tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
553
Error Code
Description
Action/ Possible Failing FRU
4067 00B1
Bus tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4067 00B3
Bus tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4067 00B4
Bus tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4067 00B5
Bus tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4067 00B6
Bus tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4067 00B7
Bus tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 554
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4067 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
555
Error Code
Description
Action/ Possible Failing FRU
4067 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4067 0B00
Bus tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4067 0B01
Bus tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4067 0B02
Bus tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4067 0B03
Bus tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 556
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4067 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4067 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4067 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4067 0B20
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B21
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B22
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
557
Error Code
Description
Action/ Possible Failing FRU
4067 0B23
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B24
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B25
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B26
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B27
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4067 0B30
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4067 0B31
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 558
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0B32
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4067 0B33
Bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4067 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4067 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4067 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4067 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
559
Error Code
Description
Action/ Possible Failing FRU
4067 0BF0
Bus tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4067 0BF1
Bus tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
4067 0BF2
Bus tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4067 0BF3
Bus tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4067 0BF4
Bus tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4067 0BF5
Bus tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4067 0BF6
Bus tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 560
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0BF7
Bus tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4067 0BF8
Bus tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
4067 0BF9
Bus tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4067 0BFA
Bus tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4067 0BFB
Bus tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4067 0BFC
Bus tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4067 0BFD
Bus tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4067 0C03
Bus tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4067 0C09
Bus tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
561
Error Code
Description
Action/ Possible Failing FRU
4067 0C0A
Bus tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4067 0C10
Bus tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4067 0C11
Bus tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4067 0C12
Bus tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4067 0C13
Bus tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4067 0CA5
Bus tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
4067 0CA6
Bus tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4067 0CA9
Bus tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4067 0D00
Bus tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4067 0D01
Bus tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4067 0D02
Bus tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4067 0D03
Bus tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4067 0D04
Bus tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4067 0D05
Bus tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4067 0D06
Bus tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4067 0D07
Bus tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4067 0D08
Bus tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4067 0D09
Bus tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4067 0D0A
Bus tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4067 0D0B
Bus tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4067 0D0C
Bus tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4067 0D0D
Bus tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4067 0D0E
Bus tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 562
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4067 0D0F
Bus tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
4067 0EA0
Bus tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4067 0EA1
Bus tests detected fault in processor subsystem code on main processor
1. Check for operating system update. 2. Call service support.
4067 0EA2
Bus tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
4067 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
4067 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4067 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4067 0EAF
Bus tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4067 0EB0
Bus tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
563
Error Code
Description
Action/ Possible Failing FRU
4067 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4067 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4067 0EE1
Bus tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4067 0EE2
Bus tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4067 0EE5
Bus tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4067 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4067 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4068 0001
Other bus tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0002
Other bus tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0003
Other bus tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4068 0004
Other bus tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4068 0005
Other bus tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 564
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0006
Other bus tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4068 0007
Other bus tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
4068 000A
Other bus tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 000B
Other bus tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 000C
Other bus tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 000D
Other bus tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4068 0010
Other bus tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
4068 0011
Other bus tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4068 0012
Other bus tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4068 0013
Other bus tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4068 0014
Other bus tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4068 0015
Other bus tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4068 0016
Other bus tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4068 0017
Other bus tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4068 00B0
Other bus tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4068 00B1
Other bus tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
565
Error Code
Description
Action/ Possible Failing FRU
4068 00B3
Other bus tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4068 00B4
Other bus tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4068 00B5
Other bus tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4068 00B6
Other bus tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4068 00B7
Other bus tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 566
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4068 0A05
Reboot message (run-time array gard failure) for Processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
567
Error Code
Description
Action/ Possible Failing FRU
4068 0A06
Reboot message (run-time array gard failure) for Processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4068 0B00
Other bus tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4068 0B01
Other bus tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4068 0B02
Other bus tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4068 0B03
Other bus tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 568
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4068 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4068 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4068 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4068 0B20
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B21
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B22
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
569
Error Code
Description
Action/ Possible Failing FRU
4068 0B23
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B24
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B25
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B26
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B27
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4068 0B30
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4068 0B31
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 570
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0B32
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4068 0B33
Other bus tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4068 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4068 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4068 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4068 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
571
Error Code
Description
Action/ Possible Failing FRU
4068 0BF0
Other bus tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4068 0BF1
Other bus tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
4068 0BF2
Other bus tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4068 0BF3
Other bus tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4068 0BF4
Other bus tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4068 0BF5
Other bus tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4068 0BF6
Other bus tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 572
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0BF7
Other bus tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4068 0BF8
Other bus tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
4068 0BF9
Other bus tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4068 0BFA
Other bus tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4068 0BFB
Other bus tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4068 0BFC
Other bus tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4068 0BFD
Other bus tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4068 0C03
Other bus tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4068 0C09
Other bus tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
573
Error Code
Description
Action/ Possible Failing FRU
4068 0C0A
Other bus tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4068 0C10
Other bus tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4068 0C11
Other bus tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4068 0C12
Other bus tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4068 0C13
Other bus tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4068 0CA5
Other bus tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
4068 0CA6
Other bus tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4068 0CA9
Other bus tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4068 0D00
Other bus tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4068 0D01
Other bus tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4068 0D02
Other bus tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4068 0D03
Other bus tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4068 0D04
Other bus tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4068 0D05
Other bus tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4068 0D06
Other bus tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4068 0D07
Other bus tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4068 0D08
Other bus tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4068 0D09
Other bus tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4068 0D0A
Other bus tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4068 0D0B
Other bus tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4068 0D0C
Other bus tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4068 0D0D
Other bus tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4068 0D0E
Other bus tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 574
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4068 0D0F
Other bus tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
4068 0EA0
Other bus tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4068 0EA1
Other bus tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
4068 0EA2
Other bus tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
4068 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
4068 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4068 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4068 0EAF
Other bus tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4068 0EB0
Other bus tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
575
Error Code
Description
Action/ Possible Failing FRU
4068 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4068 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4068 0EE1
Other bus tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4068 0EE2
Other bus tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4068 0EE5
Other bus tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4068 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4068 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4069 0001
Memory tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0002
Memory tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0003
Memory tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4069 0004
Memory tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4069 0005
Memory tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 576
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0006
Memory tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4069 0007
Memory tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
4069 000A
Memory tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 000B
Memory tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 000C
Memory tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 000D
Memory tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
4069 0010
Memory tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
4069 0011
Memory tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
4069 0012
Memory tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
4069 0013
Memory tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
4069 0014
Memory tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
4069 0015
Memory tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
4069 0016
Memory tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
4069 0017
Memory tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
4069 00B0
Memory tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
4069 00B1
Memory tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
577
Error Code
Description
Action/ Possible Failing FRU
4069 00B3
Memory tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4069 00B4
Memory tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4069 00B5
Memory tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4069 00B6
Memory tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4069 00B7
Memory tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 578
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4069 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
579
Error Code
Description
Action/ Possible Failing FRU
4069 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
4069 0B00
Memory tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4069 0B01
Memory tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4069 0B02
Memory tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
4069 0B03
Memory tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 580
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4069 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4069 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4069 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
4069 0B20
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B21
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B22
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
581
Error Code
Description
Action/ Possible Failing FRU
4069 0B23
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B24
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B25
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B26
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B27
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
4069 0B30
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4069 0B31
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 582
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0B32
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4069 0B33
Memory tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
4069 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4069 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4069 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
4069 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
583
Error Code
Description
Action/ Possible Failing FRU
4069 0BF0
Memory tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
4069 0BF1
Memory tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
4069 0BF2
Memory tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
4069 0BF3
Memory tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
4069 0BF4
Memory tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
4069 0BF5
Memory tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
4069 0BF6
Memory tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 584
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0BF7
Memory tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
4069 0BF8
Memory tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
4069 0BF9
Memory tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
4069 0BFA
Memory tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
4069 0BFB
Memory tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
4069 0BFC
Memory tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
4069 0BFD
Memory tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
4069 0C03
Memory tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
4069 0C09
Memory tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
585
Error Code
Description
Action/ Possible Failing FRU
4069 0C0A
Memory tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
4069 0C10
Memory tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
4069 0C11
Memory tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
4069 0C12
Memory tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
4069 0C13
Memory tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
4069 0CA5
Memory tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
4069 0CA6
Memory tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
4069 0CA9
Memory tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
4069 0D00
Memory tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
4069 0D01
Memory tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
4069 0D02
Memory tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
4069 0D03
Memory tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
4069 0D04
Memory tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
4069 0D05
Memory tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
4069 0D06
Memory tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
4069 0D07
Memory tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
4069 0D08
Memory tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
4069 0D09
Memory tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
4069 0D0A
Memory tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
4069 0D0B
Memory tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
4069 0D0C
Memory tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
4069 0D0D
Memory tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
4069 0D0E
Memory tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 586
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0D0F
Memory tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
4069 0EA0
Memory tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
4069 0EA1
Memory tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
4069 0EA2
Memory tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support. Chapter
5.
Error
Code
to
FRU
Index
587
Error Code
Description
Action/ Possible Failing FRU
4069 0EA4
Description: The service processor firmware detected a hardware problem but was unable to isolate the source.Action: Examine word 13 and word 16 of the extended error log entry. (Instructions for finding word 13 and word 16 in the service processor error log entry can be found in the action column for error code B1xx 4699. To find word 16 in the AIX error log entry, go forward 20 bytes after ?40?. If word 13 is 9250xxxx and word 16 is A30A000, do the following: 1. Access the service processor menus. Select option 2, ?System Power Control Menu?, then select option 6, ?Enable/Disable Fast System Boot?. Note: If necessary, change the setting of option 6 to ?Currently Enabled?. 2. Enter ?98? to go back to the ?System Power Control Menu?. 3. Select option 3, ?Reboot/Restart Policy Setup Menu?. 4. Select option 1 in the ?Reboot/Restart Policy Setup Menu? to set the number of reboot attempts to 0. 5. Enter ?98? to exit the ?Reboot/Restart Policy Setup Menu?. 6. Power up the system using either the ?System Power Control Menu? or the white power button. Note: The system should fail to boot. 7. Look for entries in the service processor error log that were generated during the latest boot. In particular, look for an error code that has a format of 4xxx yyyy whose word 13 value is 7xxxxxxx. Replace the FRU indicated by this error code. 8. Go back to the ?System Power Control Menu? 9. Select option 6. This disables fast boot and automatically enables slow boot. 10. Power up the system using either the ?System Power Control Menu? or the white power button. 11. Look again for a 4069 0EA4 error. If one reoccurs, repeat steps 1 through 10 until the problem has been resolved or all FRUs have been replaced. 12. Call service support if all of the potential FRUs have been replaced and the problem has not been resolved.
4069 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
4069 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
4069 0EAF
Memory tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
4069 0EB0
Memory tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. 588
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4069 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
4069 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
4069 0EE1
Memory tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
4069 0EE2
Memory tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
4069 0EE5
Memory tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
4069 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4069 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406A 0001
Self tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0002
Self tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0003
Self tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406A 0004
Self tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 Chapter
5.
Error
Code
to
FRU
Index
589
Error Code
Description
Action/ Possible Failing FRU
406A 0005
Self tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406A 0006
Self tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406A 0007
Self tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406A 000A
Self tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 000B
Self tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 000C
Self tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 000D
Self tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406A 0010
Self tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406A 0011
Self tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406A 0012
Self tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406A 0013
Self tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406A 0014
Self tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406A 0015
Self tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406A 0016
Self tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
406A 0017
Self tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406A 00B0
Self tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 590
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 00B1
Self tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406A 00B3
Self tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406A 00B4
Self tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406A 00B5
Self tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406A 00B6
Self tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406A 00B7
Self tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
591
Error Code
Description
Action/ Possible Failing FRU
406A 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406A 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 592
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406A 0B00
Self-tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406A 0B01
Self-tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406A 0B02
Self-tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406A 0B03
Self-tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
593
Error Code
Description
Action/ Possible Failing FRU
406A 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406A 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406A 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406A 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406A 0B20
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B21
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B22
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 594
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 0B23
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B24
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B25
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B26
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B27
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406A 0B30
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406A 0B31
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
595
Error Code
Description
Action/ Possible Failing FRU
406A 0B32
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406A 0B33
Self-tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406A 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406A 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406A 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406A 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 596
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 0BF0
Self-tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406A 0BF1
Self-tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
406A 0BF2
Self-tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406A 0BF3
Self-tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406A 0BF4
Self-tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406A 0BF5
Self-tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
406A 0BF6
Self-tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 Chapter
5.
Error
Code
to
FRU
Index
597
Error Code
Description
Action/ Possible Failing FRU
406A 0BF7
Self-tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406A 0BF8
Self-tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
406A 0BF9
Self-tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406A 0BFA
Self-tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406A 0BFB
Self-tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406A 0BFC
Self-tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
406A 0BFD
Self-tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406A 0C03
Self-tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406A 0C09
Self-tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 598
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 0C0A
Self-tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406A 0C10
Self-tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406A 0C11
Self-tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406A 0C12
Self-tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406A 0C13
Self-tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406A 0CA5
Self-tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
406A 0CA6
Self-tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406A 0CA9
Self-tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406A 0D00
Self-tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406A 0D01
Self-tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406A 0D02
Self-tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406A 0D03
Self-tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406A 0D04
Self-tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406A 0D05
Self-tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406A 0D06
Self-tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406A 0D07
Self-tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406A 0D08
Self-tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406A 0D09
Self-tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
406A 0D0A
Self-tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406A 0D0B
Self-tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406A 0D0C
Self-tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406A 0D0D
Self-tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406A 0D0E
Self-tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 Chapter
5.
Error
Code
to
FRU
Index
599
Error Code
Description
Action/ Possible Failing FRU
406A 0D0F
Self-tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
406A 0EA0
Self-tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406A 0EA1
Self-tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406A 0EA2
Self-tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406A 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406A 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406A 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406A 0EAF
Self-tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406A 0EB0
Self-tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. 600
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406A 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406A 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406A 0EE1
Self-tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406A 0EE2
Self-tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406A 0EE5
Self-tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406A 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406A 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406B 0001
Wire tests detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0002
Wire tests detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0003
Wire tests detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406B 0004
Wire tests detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406B 0005
Wire tests detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 Chapter
5.
Error
Code
to
FRU
Index
601
Error Code
Description
Action/ Possible Failing FRU
406B 0006
Wire tests detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406B 0007
Wire tests detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406B 000A
Wire tests detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 000B
Wire tests detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 000C
Wire tests detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 000D
Wire tests detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406B 0010
Wire tests detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406B 0011
Wire tests detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406B 0012
Wire tests detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406B 0013
Wire tests detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406B 0014
Wire tests detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406B 0015
Wire tests detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406B 0016
Wire tests detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
406B 0017
Wire tests detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406B 00B0
Wire tests detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406B 00B1
Wire tests detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 602
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 00B3
Wire tests detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406B 00B4
Wire tests detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406B 00B5
Wire tests detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406B 00B6
Wire tests detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406B 00B7
Wire tests detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
603
Error Code
Description
Action/ Possible Failing FRU
406B 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406B 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 604
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406B 0B00
Wire tests detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406B 0B01
Wire tests detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406B 0B02
Wire tests detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406B 0B03
Wire tests detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
605
Error Code
Description
Action/ Possible Failing FRU
406B 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406B 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406B 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406B 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406B 0B20
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B21
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B22
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 606
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 0B23
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B24
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B25
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B26
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B27
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406B 0B30
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406B 0B31
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
607
Error Code
Description
Action/ Possible Failing FRU
406B 0B32
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406B 0B33
Wire tests detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406B 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406B 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406B 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406B 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 608
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 0BF0
Wire tests detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406B 0BF1
Wire tests detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
406B 0BF2
Wire tests detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406B 0BF3
Wire tests detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406B 0BF4
Wire tests detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406B 0BF5
Wire tests detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
406B 0BF6
Wire tests detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 Chapter
5.
Error
Code
to
FRU
Index
609
Error Code
Description
Action/ Possible Failing FRU
406B 0BF7
Wire tests detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406B 0BF8
Wire tests detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
406B 0BF9
Wire tests detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406B 0BFA
Wire tests detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406B 0BFB
Wire tests detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406B 0BFC
Wire tests detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
406B 0BFD
Wire tests detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406B 0C03
Wire tests detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406B 0C09
Wire tests detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 610
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 0C0A
Wire tests detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406B 0C10
Wire tests detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406B 0C11
Wire tests detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406B 0C12
Wire tests detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406B 0C13
Wire tests detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406B 0CA5
Wire tests detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
406B 0CA6
Wire tests detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406B 0CA9
Wire tests detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406B 0D00
Wire tests detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406B 0D01
Wire tests detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406B 0D02
Wire tests detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406B 0D03
Wire tests detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406B 0D04
Wire tests detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406B 0D05
Wire tests detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406B 0D06
Wire tests detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406B 0D07
Wire tests detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406B 0D08
Wire tests detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406B 0D09
Wire tests detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
406B 0D0A
Wire tests detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406B 0D0B
Wire tests detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406B 0D0C
Wire tests detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406B 0D0D
Wire tests detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406B 0D0E
Wire tests detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 Chapter
5.
Error
Code
to
FRU
Index
611
Error Code
Description
Action/ Possible Failing FRU
406B 0D0F
Wire tests detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
406B 0EA0
Wire tests detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406B 0EA1
Wire tests detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406B 0EA2
Wire tests detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406B 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406B 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406B 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406B 0EAF
Wire tests detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406B 0EB0
Wire tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. 612
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406B 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406B 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406B 0EE1
Wire tests detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406B 0EE2
Wire tests detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406B 0EE5
Wire tests detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406B 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406B 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406C 0001
IPL detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0002
IPL detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0003
IPL detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406C 0004
IPL detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406C 0005
IPL detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 Chapter
5.
Error
Code
to
FRU
Index
613
Error Code
Description
Action/ Possible Failing FRU
406C 0006
IPL detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406C 0007
IPL detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406C 000A
IPL detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 000B
IPL detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 000C
IPL detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 000D
IPL detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406C 0010
IPL detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406C 0011
IPL detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406C 0012
IPL detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406C 0013
IPL detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406C 0014
IPL detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406C 0015
IPL detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406C 0016
IPL detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
406C 0017
IPL detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406C 00B0
IPL detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406C 00B1
IPL detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 614
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 00B3
IPL detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406C 00B4
IPL detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406C 00B5
IPL detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406C 00B6
IPL detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406C 00B7
IPL detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
615
Error Code
Description
Action/ Possible Failing FRU
406C 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406C 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 616
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406C 0B00
IPL detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406C 0B01
IPL detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406C 0B02
IPL detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406C 0B03
IPL detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
617
Error Code
Description
Action/ Possible Failing FRU
406C 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406C 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406C 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406C 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406C 0B20
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B21
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B22
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 618
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 0B23
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B24
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B25
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B26
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B27
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406C 0B30
IPL detected fault in I/O book
I/O book (slot 3 RIO port 0) Location: U1.18-P1-H4/Q1 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
619
Error Code
Description
Action/ Possible Failing FRU
406C 0B31
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406C 0B32
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406C 0B33
IPL detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406C 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406C 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406C 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 620
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406C 0BF0
IPL detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406C 0BF1
IPL detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
406C 0BF2
IPL detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406C 0BF3
IPL detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406C 0BF4
IPL detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406C 0BF5
IPL detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 Chapter
5.
Error
Code
to
FRU
Index
621
Error Code
Description
Action/ Possible Failing FRU
406C 0BF6
IPL detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4
406C 0BF7
IPL detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406C 0BF8
IPL detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
406C 0BF9
IPL detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406C 0BFA
IPL detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406C 0BFB
IPL detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406C 0BFC
IPL detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 622
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 0BFD
IPL detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406C 0C03
IPL detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406C 0C09
IPL detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345.
406C 0C0A
IPL detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406C 0C10
IPL detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406C 0C11
IPL detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406C 0C12
IPL detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406C 0C13
IPL detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406C 0CA5
IPL detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
406C 0CA6
IPL detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406C 0CA9
IPL detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406C 0D00
IPL detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406C 0D01
IPL detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406C 0D02
IPL detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406C 0D03
IPL detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406C 0D04
IPL detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406C 0D05
IPL detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406C 0D06
IPL detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406C 0D07
IPL detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406C 0D08
IPL detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406C 0D09
IPL detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 Chapter
5.
Error
Code
to
FRU
Index
623
Error Code
Description
Action/ Possible Failing FRU
406C 0D0A
IPL detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406C 0D0B
IPL detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406C 0D0C
IPL detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406C 0D0D
IPL detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406C 0D0E
IPL detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1
406C 0D0F
IPL detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
406C 0EA0
IPL detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406C 0EA1
IPL detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406C 0EA2
IPL detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406C 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406C 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406C 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406C 0EAF
IPL detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406C 0EB0
IPL tests detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. 624
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406C 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406C 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406C 0EE1
IPL detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406C 0EE2
IPL detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406C 0EE5
IPL detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406C 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406C 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
625
Error Code
Description
Action/ Possible Failing FRU
406D 0001
Attention handler detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0002
Attention handler detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0003
Attention handler detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406D 0004
Attention handler detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406D 0005
Attention handler detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406D 0006
Attention handler detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406D 0007
Attention handler detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406D 000A
Attention handler detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 000B
Attention handler detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 000C
Attention handler detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 000D
Attention handler detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406D 0010
Attention handler detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406D 0011
Attention handler detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406D 0012
Attention handler detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406D 0013
Attention handler detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406D 0014
Attention handler detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406D 0015
Attention handler detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406D 0016
Attention handler detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5 626
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0017
Attention handler detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406D 00B0
Attention handler detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406D 00B1
Attention handler detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406D 00B3
Attention handler detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406D 00B4
Attention handler detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406D 00B5
Attention handler detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406D 00B6
Attention handler detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406D 00B7
Attention handler detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
627
Error Code
Description
Action/ Possible Failing FRU
406D 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406D 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 628
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406D 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
629
Error Code
Description
Action/ Possible Failing FRU
406D 0B00
Attention handler detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406D 0B01
Attention handler detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406D 0B02
Attention handler detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406D 0B03
Attention handler detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406D 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406D 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406D 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 630
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406D 0B20
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B21
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B22
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B23
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B24
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
631
Error Code
Description
Action/ Possible Failing FRU
406D 0B25
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B26
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B27
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406D 0B30
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406D 0B31
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406D 0B32
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406D 0B33
Attention handler detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 632
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406D 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406D 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406D 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406D 0BF0
Attention handler detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406D 0BF1
Attention handler detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 Chapter
5.
Error
Code
to
FRU
Index
633
Error Code
Description
Action/ Possible Failing FRU
406D 0BF2
Attention handler detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406D 0BF3
Attention handler detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406D 0BF4
Attention handler detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406D 0BF5
Attention handler detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
406D 0BF6
Attention handler detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4
406D 0BF7
Attention handler detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406D 0BF8
Attention handler detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 634
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0BF9
Attention handler detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406D 0BFA
Attention handler detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406D 0BFB
Attention handler detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406D 0BFC
Attention handler detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
406D 0BFD
Attention handler detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406D 0C03
Attention handler detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406D 0C09
Attention handler detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345.
406D 0C0A
Attention handler detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406D 0C10
Attention handler detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406D 0C11
Attention handler detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406D 0C12
Attention handler detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406D 0C13
Attention handler detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406D 0CA5
Attention handler detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured. Chapter
5.
Error
Code
to
FRU
Index
635
Error Code
Description
Action/ Possible Failing FRU
406D 0CA6
Attention handler detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406D 0CA9
Attention handler detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406D 0D00
Attention handler detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406D 0D01
Attention handler detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406D 0D02
Attention handler detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406D 0D03
Attention handler detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406D 0D04
Attention handler detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406D 0D05
Attention handler detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406D 0D06
Attention handler detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406D 0D07
Attention handler detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406D 0D08
Attention handler detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406D 0D09
Attention handler detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
406D 0D0A
Attention handler detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406D 0D0B
Attention handler detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406D 0D0C
Attention handler detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406D 0D0D
Attention handler detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406D 0D0E
Attention handler detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1
406D 0D0F
Attention handler detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 636
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406D 0EA0
Attention handler detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406D 0EA1
Attention handler detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406D 0EA2
Attention handler detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406D 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406D 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406D 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406D 0EAF
Attention handler detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406D 0EB0
Attention handler detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
637
Error Code
Description
Action/ Possible Failing FRU
406D 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406D 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406D 0EE1
Attention handler detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406D 0EE2
Attention handler detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406D 0EE5
Attention handler detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406D 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406D 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406E 0001
Scan control routineIPL detecteds detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0002
Scan control routines detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0003
Scan control routines detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406E 0004
Scan control routines detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406E 0005
Scan control routines detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406E 0006
Scan control routines detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 638
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0007
Scan control routines detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406E 000A
Scan control routines detected fault in primary I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 000B
Scan control routines detected fault in secondary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 000C
Scan control routines detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 000D
Scan control routines detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406E 0010
Scan control routines detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406E 0011
Scan control routines detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406E 0012
Scan control routines detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406E 0013
Scan control routines detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406E 0014
Scan control routines detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406E 0015
Scan control routines detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406E 0016
Scan control routines detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
406E 0017
Scan control routines detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406E 00B0
Scan control routines detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406E 00B1
Scan control routines detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
639
Error Code
Description
Action/ Possible Failing FRU
406E 00B3
Scan control routines detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406E 00B4
Scan control routines detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406E 00B5
Scan control routines detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406E 00B6
Scan control routines detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406E 00B7
Scan control routines detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 640
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406E 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
641
Error Code
Description
Action/ Possible Failing FRU
406E 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406E 0B00
Scan control routines detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406E 0B01
Scan control routines detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406E 0B02
Scan control routines detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406E 0B03
Scan control routines detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 642
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406E 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406E 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406E 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406E 0B20
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B21
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B22
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
643
Error Code
Description
Action/ Possible Failing FRU
406E 0B23
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B24
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B25
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B26
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B27
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406E 0B30
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406E 0B31
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 644
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0B32
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406E 0B33
Scan control routines detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406E 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406E 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406E 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406E 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
645
Error Code
Description
Action/ Possible Failing FRU
406E 0BF0
Scan control routines detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406E 0BF1
Scan control routines detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
406E 0BF2
Scan control routines detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406E 0BF3
Scan control routines detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406E 0BF4
Scan control routines detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406E 0BF5
Scan control routines detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
406E 0BF6
Scan control routines detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 646
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0BF7
Scan control routines detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406E 0BF8
Scan control routines detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
406E 0BF9
Scan control routines detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406E 0BFA
Scan control routines detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406E 0BFB
Scan control routines detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406E 0BFC
Scan control routines detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
406E 0BFD
Scan control routines detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406E 0C03
Scan control routines detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406E 0C09
Scan control routines detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
647
Error Code
Description
Action/ Possible Failing FRU
406E 0C0A
Scan control routines detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406E 0C10
Scan control routines detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406E 0C11
Scan control routines detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406E 0C12
Scan control routines detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406E 0C13
Scan control routines detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406E 0CA5
Scan control routines detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
406E 0CA6
Scan control routines detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406E 0CA9
Scan control routines detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406E 0D00
Scan control routines detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406E 0D01
Scan control routines detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406E 0D02
Scan control routines detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406E 0D03
Scan control routines detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406E 0D04
Scan control routines detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406E 0D05
Scan control routines detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406E 0D06
Scan control routines detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406E 0D07
Scan control routines detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406E 0D08
Scan control routines detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406E 0D09
Scan control routines detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
406E 0D0A
Scan control routines detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406E 0D0B
Scan control routines detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406E 0D0C
Scan control routines detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406E 0D0D
Scan control routines detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406E 0D0E
Scan control routines detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 648
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406E 0D0F
Scan control routines detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
406E 0EA0
Scan control routines detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406E 0EA1
Scan control routines detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406E 0EA2
Scan control routines detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406E 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406E 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406E 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406E 0EAF
Scan control routines detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406E 0EB0
Scan control routines detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
649
Error Code
Description
Action/ Possible Failing FRU
406E 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406E 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406E 0EE1
Scan control routines detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406E 0EE2
Scan control routines detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406E 0EE5
Scan control routines detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406E 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406E 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406F 0001
Error code generation/logging detected clock card assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0002
Error code generation/logging detected backplane assembly failure
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0003
Error code generation/logging detected fault in MCM module 0
MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406F 0004
Error code generation/logging detected fault in MCM module 2
MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406F 0005
Error code generation/logging detected fault in MCM module 1
MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 650
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0006
Error code generation/logging detected fault in MCM module 3
MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406F 0007
Error code generation/logging detected fault in L3 VPD card
1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2
406F 000A
Error code generation/logging detected fault in I/O book 1 (slot 0)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 000B
Error code generation/logging detected fault in primary I/O book (slot 1)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 000C
Error code generation/logging detected fault in I/O book 3 (slot 2)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 000D
Error code generation/logging detected fault in I/O book 4 (slot 3)
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 000E
CUoD capacity card
1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys.
406F 0010
Error code generation/logging detected fault in Memory book (slot 0)
Memory book (slot 0) Location: U1.18-P1-M2
406F 0011
Error code generation/logging detected fault in Memory book (slot 1)
Memory book (slot 1) Location: U1.18-P1-M3
406F 0012
Error code generation/logging detected fault in Memory book (slot 2)
Memory book (slot 2) Location: U1.18-P1-M7
406F 0013
Error code generation/logging detected fault in Memory book (slot 3)
Memory book (slot 3)Location: U1.18-P1-M6
406F 0014
Error code generation/logging detected fault in Memory book (slot 4)
Memory book (slot 4)Location: U1.18-P1-M4
406F 0015
Error code generation/logging detected fault in Memory book (slot 5)
Memory book (slot 5)Location: U1.18-P1-M8
406F 0016
Error code generation/logging detected fault in Memory book (slot 6)
Memory book (slot 6)Location: U1.18-P1-M5
406F 0017
Error code generation/logging detected fault in Memory book (slot 7)
Memory book (slot 7)Location: U1.18-P1-M1
406F 00B0
Error code generation/logging detected possible problem with bus controller or CSP code
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support.
406F 00B1
Error code generation/logging detected possible problem with bus controller
1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter
5.
Error
Code
to
FRU
Index
651
Error Code
Description
Action/ Possible Failing FRU
406F 00B3
Error code generation/logging detected MCM 0/VPD problem
1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406F 00B4
Error code generation/logging detected MCM 2/VPD problem
1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406F 00B5
Error code generation/logging detected MCM 1/VPD problem
1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406F 00B6
Error code generation/logging detected MCM 3/VPD problem
1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406F 00B7
Error code generation/logging detected hardware problem in the processor subsystem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0A03
Reboot message (run-time array gard failure) for processor 0
Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 652
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0A04
Reboot message (run-time array gard failure) for processor 2
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406F 0A05
Reboot message (run-time array gard failure) for processor 1
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter
5.
Error
Code
to
FRU
Index
653
Error Code
Description
Action/ Possible Failing FRU
406F 0A06
Reboot message (run-time array gard failure) for processor 3
Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN.
406F 0B00
Error code generation/logging detected fault in I/O book (slot 0 RIO port 0)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406F 0B01
Error code generation/logging detected fault in I/O book (slot 0 RIO port 1)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406F 0B02
Error code generation/logging detected fault in I/O book (slot 0 RIO port 2)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support.
406F 0B03
Error code generation/logging detected fault in I/O book (slot 0 RIO port 3)
1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 654
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0B10
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406F 0B11
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406F 0B12
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406F 0B13
Processor subsystem tests detected fault in SNI adapter
1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support.
406F 0B20
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B21
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B22
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
655
Error Code
Description
Action/ Possible Failing FRU
406F 0B23
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B24
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B25
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B26
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B27
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support.
406F 0B30
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406F 0B31
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 656
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0B32
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406F 0B33
Error code generation/logging detected fault in I/O book
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support.
406F 0B34
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406F 0B35
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406F 0B36
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support.
406F 0B37
Error code generation/logging detected fault in I/O book or SNI adapter
1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter
5.
Error
Code
to
FRU
Index
657
Error Code
Description
Action/ Possible Failing FRU
406F 0BF0
Error code generation/logging detected fault in I/O hub chip 0 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2
406F 0BF1
Error code generation/logging detected fault in I/O hub chip 1 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2
406F 0BF2
Error code generation/logging detected fault in I/O hub chip 2 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1
406F 0BF3
Error code generation/logging detected fault in I/O hub chip 3 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1
406F 0BF4
Error code generation/logging detected fault in I/O hub chip 4 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3
406F 0BF5
Error code generation/logging detected fault in I/O hub chip 5 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3
406F 0BF6
Error code generation/logging detected fault in I/O hub chip 6 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 658
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0BF7
Error code generation/logging detected fault in I/O hub chip 7 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4
406F 0BF8
Error code generation/logging detected fault in I/O hub chip 8 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1
406F 0BF9
Error code generation/logging detected fault in I/O hub chip 9 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1
406F 0BFA
Error code generation/logging detected fault in I/O hub chip 10 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3
406F 0BFB
Error code generation/logging detected fault in I/O hub chip 11 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3
406F 0BFC
Error code generation/logging detected fault in I/O hub chip 12 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4.
406F 0BFD
Error code generation/logging detected fault in I/O hub chip 13 (port x)
1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4
406F 0C03
Error code generation/logging detected fault in all MCMs
Go to “MAP 1543: MCM Module Problem Isolation” on page 337.
406F 0C09
Error code generation/logging detected fault in all memory books
Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter
5.
Error
Code
to
FRU
Index
659
Error Code
Description
Action/ Possible Failing FRU
406F 0C0A
Error code generation/logging detected fault in all I/O books
Go to “MAP 1547: I/O Book Problem Isolation” on page 348.
406F 0C10
Error code generation/logging detected mismatch between MCM 0 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3
406F 0C11
Error code generation/logging detected mismatch between MCM 2 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4
406F 0C12
Error code generation/logging detected mismatch between MCM 1 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5
406F 0C13
Error code generation/logging detected mismatch between MCM 3 and VPD card
Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6
406F 0CA5
Error code generation/logging detected invalid processor subsystem configuration
Verify that MCMs, memory books, and L3 modules are correctly configured.
406F 0CA6
Error code generation/logging detected no usable main storage
Reboot the system in slow mode to isolate the failure.
406F 0CA9
Error code generation/logging detected configuration error with processor memory
Reboot the system in slow mode to isolate the failure.
406F 0D00
Error code generation/logging detected fault in L3 cache module 0
L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1
406F 0D01
Error code generation/logging detected fault in L3 cache module 1
L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1
406F 0D02
Error code generation/logging detected fault in L3 cache module 2
L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1
406F 0D03
Error code generation/logging detected fault in L3 cache module 3
L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1
406F 0D04
Error code generation/logging detected fault in L3 cache module 4
L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1
406F 0D05
Error code generation/logging detected fault in L3 cache module 5
L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1
406F 0D06
Error code generation/logging detected fault in L3 cache module 6
L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1
406F 0D07
Error code generation/logging detected fault in L3 cache module 7
L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1
406F 0D08
Error code generation/logging detected fault in L3 cache module 8
L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1
406F 0D09
Error code generation/logging detected fault in L3 cache module 9
L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1
406F 0D0A
Error code generation/logging detected fault in L3 cache module 10
L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1
406F 0D0B
Error code generation/logging detected fault in L3 cache module 11
L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1
406F 0D0C
Error code generation/logging detected fault in L3 cache module 12
L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1
406F 0D0D
Error code generation/logging detected fault in L3 cache module 13
L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1
406F 0D0E
Error code generation/logging detected fault in L3 cache module 14
L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 660
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
406F 0D0F
Error code generation/logging detected fault in L3 cache module 15
L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1
406F 0EA0
Error code generation/logging detected fault in MOPS service processor code
1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support.
406F 0EA1
Error code generation/logging detected fault in processor subsystem code on main processor
1. Check for operating system updates. 2. Call service support.
406F 0EA2
Error code generation/logging detected fault in PRD code on service processor
1. Check for system firmware updates. 2. Call service support.
406F 0EA4
The service processor firmware detected a hardware problem but was unable to isolate the source.
Call service support.
406F 0EA5
PRD detected a problem in the FNM code
1. Check for system firmware updates. 2. Call service support.
406F 0EAA
PRD: FNM is handling this error
1. Check for system firmware updates. 2. Call service support.
406F 0EAF
Error code generation/logging detected unknown service processor code and system type
1. Check for system firmware updates. 2. Call service support.
406F 0EB0
Error code generation/logging detected JTAG problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB1
Attention line 0 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB2
Attention line 1 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB3
Attention line 2 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB4
Attention line 3 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB5
Attention line 4 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB6
Attention line 5 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter
5.
Error
Code
to
FRU
Index
661
Error Code
Description
Action/ Possible Failing FRU
406F 0EB7
Attention line 6 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB8
Attention line 7 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EB9
Attention line 8 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EBA
Attention line 9 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EBB
Attention line 10 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EBC
Attention line 11 stuck fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
406F 0EBF
Memory error
1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error.
406F 0EE1
Error code generation/logging detected fault in service processor code (invalid MRU to mapping)
1. Check for system firmware updates. 2. Call service support.
406F 0EE2
Error code generation/logging detected fault in service processor firmware fault
1. Check for system firmware updates. 2. Call service support.
406F 0EE5
Error code generation/logging detected firmware error; no callout available
1. Check for system firmware updates. 2. Call service support.
406F 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
406F 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4503 25D9
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1
4503 25E7
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 662
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 25E8
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1
4503 25F1
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1
4503 25F9
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1
4503 269A
This error code indicates that the memory repeat gard function has deconfigured an L3 cache module.
Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1
4503 302A
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
663
Error Code
Description
Action/ Possible Failing FRU
4503 302B
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 302C
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 664
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 302D
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 305A
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
665
Error Code
Description
Action/ Possible Failing FRU
4503 305B
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 305C
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 666
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 305D
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 305E
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
667
Error Code
Description
Action/ Possible Failing FRU
4503 305F
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 307A
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 668
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 307B
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 307C
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
669
Error Code
Description
Action/ Possible Failing FRU
4503 307D
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 307E
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 670
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 308A
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 308B
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
671
Error Code
Description
Action/ Possible Failing FRU
4503 308C
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 308D
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 672
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 308E
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 30BD
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
673
Error Code
Description
Action/ Possible Failing FRU
4503 30C0
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 30C1
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 674
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 30C3
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 30C6
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
675
Error Code
Description
Action/ Possible Failing FRU
4503 30C9
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4503 30CA
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 676
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4503 30CC
This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book.
Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code.
4504 248B
Memory subsystem and processor subsystem tests detected fault in operator panel
Operator panel Location: U1.17-L1
4504 25C0
Memory subsystem and processor subsystem tests detected fault in system backplane
System backplane Location: U1.18-P1
4504 25C1
Memory subsystem and processor subsystem tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4504 25D9
Memory subsystem and processor subsystem tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 25F1
Memory subsystem and processor subsystem tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4504 28B0
Memory subsystem and processor subsystem tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4504 28B1
Memory subsystem and processor subsystem tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4504 302A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
677
Error Code
Description
Action/ Possible Failing FRU
4504 302B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342 to isolate the failing component.
4504 302C
Memory subsystem and processor subsystem tests detected fault in 32 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340 to isolate the failing component.
4504 302D
Memory subsystem and processor subsystem tests detected fault in 32 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342 to isolate the failing component.
4504 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 305C
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 305D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 305E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 305F
Memory subsystem and processor subsystem tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 678
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4504 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 30C0
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 30C1
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4504 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 30C9
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 30CA
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4504 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 248B
Memory subsystem and display alter tests detected fault in operator panel
Operator panel Location: U1.17-L1
4505 25C0
Memory subsystem and display alter tests detected fault in system backplane
System backplane Location: U1.18-P1
4505 25C1
Memory subsystem and display alter tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4505 25D9
Memory subsystem and display alter tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 25F1
Memory subsystem and display alter tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4505 28B0
Memory subsystem and display alter tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
679
Error Code
Description
Action/ Possible Failing FRU
4505 28B1
Memory subsystem and display alter tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4505 302A
Memory subsystem and display alter tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4505 302B
Memory subsystem and display alter tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4505 302C
Memory subsystem and display alter tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4505 302D
Memory subsystem and display alter tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4505 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 305C
Memory subsystem and display alter tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 305D
Memory subsystem and display alter tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 305E
Memory subsystem and display alter tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 305F
Memory subsystem and display alter tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 680
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4505 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4505 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4505 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 248B
Memory subsystem and scan interface basic assurance tests detected fault in operator panel
Operator panel Location: U1.17-L1
4506 25C0
Memory subsystem and scan interface basic assurance tests detected fault in system backplane
System backplane Location: U1.18-P1
4506 25C1
Memory subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4506 25D9
Memory subsystem and scan interface basic assurance tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 25F1
Memory subsystem and scan interface basic assurance tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4506 28B0
Memory subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4506 28B1
Memory subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4506 302A
Memory subsystem and scan interface basic assurance tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4506 302B
Memory subsystem and scan interface basic assurance tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
681
Error Code
Description
Action/ Possible Failing FRU
4506 302C
Memory subsystem and scan interface basic assurance tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4506 302D
Memory subsystem and scan interface basic assurance tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4506 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 305C
Memory subsystem and scan interface basic assurance tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 305D
Memory subsystem and scan interface basic assurance tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 305E
Memory subsystem and scan interface basic assurance tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 305F
Memory subsystem and scan interface basic assurance tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
45064 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 682
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4506 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4506 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4506 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 248B
Memory subsystem and bus tests detected fault in operator panel
Operator panel Location: U1.17-L1
4507 25C0
Memory subsystem and bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4507 25C1
Memory subsystem and bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4507 25D9
Memory subsystem and bus tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 25F1
Memory subsystem and bus tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4507 28B0
Memory subsystem and bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4507 28B1
Memory subsystem and bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4507 302A
Memory subsystem and bus tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4507 302B
Memory subsystem and bus tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4507 302C
Memory subsystem and bus tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4507 302D
Memory subsystem and bus tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4507 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter
5.
Error
Code
to
FRU
Index
683
Error Code
Description
Action/ Possible Failing FRU
4507 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 305C
Memory subsystem and bus tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 305D
Memory subsystem and bus tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 305E
Memory subsystem and bus tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 305F
Memory subsystem and bus tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4507 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4507 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 248B
Memory subsystem and other bus tests detected fault in operator panel
Operator panel Location: U1.17-L1 684
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4508 25C0
Memory subsystem and other bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4508 25C1
Memory subsystem and other bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4508 25D9
Memory subsystem and other bus tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 25F1
Memory subsystem and other bus tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4508 28B0
Memory subsystem and other bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4508 28B1
Memory subsystem and other bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4508 302A
Memory subsystem and other bus tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4508 302B
Memory subsystem and other bus tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4508 302C
Memory subsystem and other bus tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4508 302D
Memory subsystem and other bus tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4508 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 305C
Memory subsystem and other bus tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 305D
Memory subsystem and other bus tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 305E
Memory subsystem and other bus tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 305F
Memory subsystem and other bus tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. Chapter
5.
Error
Code
to
FRU
Index
685
Error Code
Description
Action/ Possible Failing FRU
4508 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
45084 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4508 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4508 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 248B
Memory subsystem and memory tests detected fault in operator panel
Operator panel Location: U1.17-L1
4509 25C0
Memory subsystem and memory tests detected fault in system backplane
System backplane Location: U1.18-P1
4509 25C1
Memory subsystem and memory tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4509 25D9
Memory subsystem and memory tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4509 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4509 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 686
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4509 25F1
Memory subsystem and memory tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4509 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4509 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
4509 28B0
Memory subsystem and memory tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4509 28B1
Memory subsystem and memory tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4509 302A
Memory subsystem and memory tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4509 302B
Memory subsystem and memory tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4509 302C
Memory subsystem and memory tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4509 302D
Memory subsystem and memory tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4509 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 305C
Memory subsystem and memory tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 305D
Memory subsystem and memory tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 305E
Memory subsystem and memory tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 305F
Memory subsystem and memory tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter
5.
Error
Code
to
FRU
Index
687
Error Code
Description
Action/ Possible Failing FRU
4509 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
4509 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4509 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 248B
Memory subsystem and self-tests detected fault in operator panel
Operator panel Location: U1.17-L1
450A 25C0
Memory subsystem and self-tests detected fault in system backplane
System backplane Location: U1.18-P1
450A 25C1
Memory subsystem and self-tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450A 25D9
Memory subsystem and self-tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 25F1
Memory subsystem and self-tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450A 28B0
Memory subsystem and self-tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450A 28B1
Memory subsystem and self-tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
450A 302A
Memory subsystem and self-tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 688
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450A 302B
Memory subsystem and self-tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450A 302C
Memory subsystem and self-tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450A 302D
Memory subsystem and self-tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450A 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 305C
Memory subsystem and self-tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 305D
Memory subsystem and self-tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 305E
Memory subsystem and self-tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 305F
Memory subsystem and self-tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter
5.
Error
Code
to
FRU
Index
689
Error Code
Description
Action/ Possible Failing FRU
450A 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450A 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450A 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 248B
Memory subsystem and wire tests detected fault in operator panel
Operator panel Location: U1.17-L1
450B 25C0
Memory subsystem and wire tests detected fault in system backplane
System backplane Location: U1.18-P1
450B 25C1
Memory subsystem and wire tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450B 25D9
Memory subsystem and wire tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 25F1
Memory subsystem and wire tests detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450B 28B0
Memory subsystem and wire tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450B 28B1
Memory subsystem and wire tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
450B 302A
Memory subsystem and wire tests detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450B 302B
Memory subsystem and wire tests detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450B 302C
Memory subsystem and wire tests detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450B 302D
Memory subsystem and wire tests detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450B 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 690
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450B 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 305C
Memory subsystem and wire tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 305D
Memory subsystem and wire tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 305E
Memory subsystem and wire tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 305F
Memory subsystem and wire tests detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450B 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450B 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 248B
Memory subsystem and IPL detected fault in operator panel
Operator panel Location: U1.17-L1 Chapter
5.
Error
Code
to
FRU
Index
691
Error Code
Description
Action/ Possible Failing FRU
450C 25C0
Memory subsystem and IPL detected fault in system backplane
System backplane Location: U1.18-P1
450C 25C1
Memory subsystem and IPL detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450C 25D9
Memory subsystem and IPL detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 25F1
Memory subsystem and IPL detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450C 28B0
Memory subsystem and IPL detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450C 28B1
Memory subsystem and IPL detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
450C 302A
Memory subsystem and IPL detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450C 302B
Memory subsystem and IPL detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450C 302C
Memory subsystem and IPL detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450C 302D
Memory subsystem and IPL detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450C 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 305C
Memory subsystem and IPL detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 305D
Memory subsystem and IPL detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 305E
Memory subsystem and IPL detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 305F
Memory subsystem and IPL detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 692
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450C 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450C 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450C 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 248B
Memory subsystem and attention handler detected fault in operator panel
Operator panel Location: U1.17-L1
450D 25C0
Memory subsystem and attention handler detected fault in system backplane
System backplane Location: U1.18-P1
450D 25C1
Memory subsystem and attention handler detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450D 25D9
Memory subsystem and attention handler detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450D 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450D 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450D 25F1
Memory subsystem and attention handler detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. Chapter
5.
Error
Code
to
FRU
Index
693
Error Code
Description
Action/ Possible Failing FRU
450D 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450D 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450D 28B0
Memory subsystem and attention handler detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450D 28B1
Memory subsystem and attention handler detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
450D 302A
Memory subsystem and attention handler detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450D 302B
Memory subsystem and attention handler detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450D 302C
Memory subsystem and attention handler detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450D 302D
Memory subsystem and attention handler detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450D 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 305C
Memory subsystem and attention handler detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 305D
Memory subsystem and attention handler detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 305E
Memory subsystem and attention handler detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 305F
Memory subsystem and attention handler detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 694
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450D 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450D 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450D 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 248B
Memory subsystem and scan control routines detected fault in operator panel
Operator panel Location: U1.17-L1
450E 25C0
Memory subsystem and scan control routines detected fault in system backplane
System backplane Location: U1.18-P1
450E 25C1
Memory subsystem and scan control routines detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450E 25D9
Memory subsystem and scan control routines detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 25F1
Memory subsystem and scan control routines detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450E 28B0
Memory subsystem and scan control routines detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450E 28B1
Memory subsystem and scan control routines detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
695
Error Code
Description
Action/ Possible Failing FRU
450E 302A
Memory subsystem and scan control routines detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450E 302B
Memory subsystem and scan control routines detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450E 302C
Memory subsystem and scan control routines detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450E 302D
Memory subsystem and scan control routines detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450E 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 305C
Memory subsystem and scan control routines detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 305D
Memory subsystem and scan control routines detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 305E
Memory subsystem and scan control routines detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 305F
Memory subsystem and scan control routines detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 696
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450E 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450E 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450E 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 248B
Memory subsystem and error code generation/logging detected fault in operator panel
Operator panel Location: U1.17-L1
450F 25C0
Memory subsystem and error code generation/logging detected fault in system backplane
System backplane Location: U1.18-P1
450F 25C1
Memory subsystem and error code generation/logging detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
450F 25D9
Memory subsystem and error code generation/logging detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 25E7
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 25E8
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 25F1
Memory subsystem and error code generation/logging detected fault in L3 VPD card used with 1.3 GHz MCM
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 25F9
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 269A
Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs
Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.
450F 28B0
Memory subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
450F 28B1
Memory subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
450F 302A
Memory subsystem and error code generation/logging detected fault in 8 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450F 302B
Memory subsystem and error code generation/logging detected fault in 8 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
697
Error Code
Description
Action/ Possible Failing FRU
450F 302C
Memory subsystem and error code generation/logging detected fault in 32 GB memory book (inner)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450F 302D
Memory subsystem and error code generation/logging detected fault in 32 GB memory book (outer)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
450F 305A
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner)
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 305B
Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer)
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 305C
Memory subsystem and error code generation/logging detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 305D
Memory subsystem and error code generation/logging detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 305E
Memory subsystem and error code generation/logging detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 305F
Memory subsystem and error code generation/logging detected fault in 16 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 307A
Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 307B
Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 307C
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 307D
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 307E
Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 308A
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 308B
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 308C
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 308D
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 308E
Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 30BD
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 698
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
450F 30C3
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book
Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.
450F 30C6
Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
450F 30CC
Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book
Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.
4604 248B
I/O subsystem and processor subsystem tests detected fault in operator panel
Operator panel Location: U1.17-L1
4604 25C0
I/O subsystem and processor subsystem tests detected fault in system backplane
System backplane Location: U1.18-P1
4604 25C1
I/O subsystem and processor subsystem tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4604 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4604 2738
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4604 273C
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4604 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4604 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4604 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4604 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4604 28B0
I/O subsystem and processor subsystem tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4604 28B1
I/O subsystem and processor subsystem tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
699
Error Code
Description
Action/ Possible Failing FRU
4604 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4604 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4604 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4604 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4605 248B
I/O subsystem and display alter tests detected fault in operator panel
Operator panel Location: U1.17-L1
4605 25C0
I/O subsystem and display alter tests detected fault in system backplane
System backplane Location: U1.18-P1
4605 25C1
I/O subsystem and display alter tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4605 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4605 2738
I/O subsystem and display alter tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4605 273C
I/O subsystem and display alter tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4605 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4605 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code. 700
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4605 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4605 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4605 28B0
I/O subsystem and display alter tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4605 28B1
I/O subsystem and display alter tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4605 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4605 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4605 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4605 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4606 248B
I/O subsystem and scan interface basic assurance tests detected fault in operator panel
Operator panel Location: U1.17-L1
4606 25C0
I/O subsystem and scan interface basic assurance tests detected fault in system backplane
System backplane Location: U1.18-P1 Chapter
5.
Error
Code
to
FRU
Index
701
Error Code
Description
Action/ Possible Failing FRU
4606 25C1
I/O subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4606 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4606 2738
I/O subsystem and scan interface basic assurance tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4606 273C
I/O subsystem and scan interface basic assurance tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4606 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4606 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4606 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4606 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4606 28B0
I/O subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4606 28B1
I/O subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4606 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4606 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 702
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4606 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4606 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4607 248B
I/O subsystem and bus tests detected fault in operator panel
Operator panel Location: U1.17-L1
4607 25C0
I/O subsystem and bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4607 25C1
I/O subsystem and bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4607 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4607 2738
I/O subsystem and bus tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4607 273C
I/O subsystem and bus tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4607 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4607 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4607 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4607 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4607 28B0
I/O subsystem and bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
703
Error Code
Description
Action/ Possible Failing FRU
4607 28B1
I/O subsystem and bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4607 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4607 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4607 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4607 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4608 248B
I/O subsystem and other bus tests detected fault in operator panel
Operator panel Location: U1.17-L1
4608 25C0
I/O subsystem and other bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4608 25C1
I/O subsystem and other bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4608 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4608 2738
I/O subsystem and other bus tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4608 273C
I/O subsystem and other bus tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4608 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4608 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code. 704
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4608 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4608 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4608 28B0
I/O subsystem and other bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4608 28B1
I/O subsystem and other bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4608 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4608 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4608 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4608 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4609 248B
I/O subsystem and memory tests detected fault in operator panel
Operator panel Location: U1.17-L1
4609 25C0
I/O subsystem and memory tests detected fault in system backplane
System backplane Location: U1.18-P1
4609 25C1
I/O subsystem and memory tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5 Chapter
5.
Error
Code
to
FRU
Index
705
Error Code
Description
Action/ Possible Failing FRU
4609 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4609 2738
I/O subsystem and memory tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4609 273C
I/O subsystem and memory tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4609 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
4609 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
4609 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4609 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
4609 28B0
I/O subsystem and memory tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4609 28B1
I/O subsystem and memory tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4609 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4609 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 706
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4609 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4609 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460A 248B
I/O subsystem and self-tests detected fault in operator panel
Operator panel Location: U1.17-L1
460A 25C0
I/O subsystem and self-tests detected fault in system backplane
System backplane Location: U1.18-P1
460A 25C1
I/O subsystem and self-tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
460A 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460A 2738
I/O subsystem and self-tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460A 273C
I/O subsystem and self-tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460A 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460A 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460A 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460A 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460A 28B0
I/O subsystem and self-tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
707
Error Code
Description
Action/ Possible Failing FRU
460A 28B1
I/O subsystem and self-tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460A 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460A 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460A 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460A 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460B 248B
I/O subsystem and wire tests detected fault in operator panel
Operator panel Location: U1.17-L1
460B 25C0
I/O subsystem and wire tests detected fault in system backplane
System backplane Location: U1.18-P1
460B 25C1
I/O subsystem and wire tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
460B 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460B 2738
I/O subsystem and wire tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460B 273C
I/O subsystem and wire tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460B 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460B 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code. 708
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
460B 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460B 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460B 28B0
I/O subsystem and wire tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
460B 28B1
I/O subsystem and wire tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460B 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460B 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460B 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460B 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460C 248B
I/O subsystem and IPL detected fault in operator panel
Operator panel Location: U1.17-L1
460C 25C0
I/O subsystem and IPL detected fault in system backplane
System backplane Location: U1.18-P1
460C 25C1
I/O subsystem and IPL detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5 Chapter
5.
Error
Code
to
FRU
Index
709
Error Code
Description
Action/ Possible Failing FRU
460C 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460C 2738
I/O subsystem and IPL detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460C 273C
I/O subsystem and IPL detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460C 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460C 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460C 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460C 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460C 28B0
I/O subsystem and IPL detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
460C 28B1
I/O subsystem and IPL detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460C 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460C 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 710
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
460C 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460C 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460D 248B
I/O subsystem and attention handler detected fault in operator panel
Operator panel Location: U1.17-L1
460D 25C0
I/O subsystem and attention handler detected fault in system backplane
System backplane Location: U1.18-P1
460D 25C1
I/O subsystem and attention handler detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
460D 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460D 2738
I/O subsystem and attention handler detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460D 273C
I/O subsystem and attention handler detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460D 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460D 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460D 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460D 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460D 28B0
I/O subsystem and attention handler detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code Chapter
5.
Error
Code
to
FRU
Index
711
Error Code
Description
Action/ Possible Failing FRU
460D 28B1
I/O subsystem and attention handler detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460D 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460D 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460D 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460D 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460E 248B
I/O subsystem and scan control routines detected fault in operator panel
Operator panel Location: U1.17-L1
460E 25C0
I/O subsystem and scan control routines detected fault in system backplane
System backplane Location: U1.18-P1
460E 25C1
I/O subsystem and scan control routines detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
460E 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460E 2738
I/O subsystem and scan control routines detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460E 273C
I/O subsystem and scan control routines detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460E 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460E 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code. 712
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
460E 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460E 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460E 28B0
I/O subsystem and scan control routines detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
460E 28B1
I/O subsystem and scan control routines detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460E 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460E 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460E 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460E 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460F 248B
I/O subsystem and error code generation/logging detected fault in operator panel
Operator panel Location: U1.17-L1 Chapter
5.
Error
Code
to
FRU
Index
713
Error Code
Description
Action/ Possible Failing FRU
460F 25C0
I/O subsystem and error code generation/logging detected fault in system backplane
System backplane Location: U1.18-P1
460F 25C1
I/O subsystem and error code generation/logging detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
460F 25E6
SMA 4-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460F 2738
I/O subsystem and error code generation/logging detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460F 273C
I/O subsystem and error code generation/logging detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460F 273D
I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0
Primary I/O book - Slot 0 Location: U1.18-P1-H2
460F 273F
SMA 2-port switch adapter fault
Replace the I/O book (switch adapter) specified by the location code.
460F 2776
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460F 2806
I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4
1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code.
460F 28B0
I/O subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
460F 28B1
I/O subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
460F 525E
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 714
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
460F 525F
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460F 527A
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
460F 527C
CUoD capacity card fault
1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys.
4B23 258A
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 2. Chapter
5.
Error
Code
to
FRU
Index
715
Error Code
Description
Action/ Possible Failing FRU
4B23 25C7
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 25C8
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 716
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B23 25CF
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 25D3
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
717
Error Code
Description
Action/ Possible Failing FRU
4B23 25D5
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 25D7
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 718
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B23 25D8
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 25E5
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
719
Error Code
Description
Action/ Possible Failing FRU
4B23 25E9
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 526B
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 720
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B23 526C
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B23 526D
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter
5.
Error
Code
to
FRU
Index
721
Error Code
Description
Action/ Possible Failing FRU
4B23 526E
The processor repeat gard function has deconfigured a processor.
Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code.
4B24 248B
Processor subsystem detected fault in operator panel
Operator panel Location: U1.17-L1
4B24 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25C0
Processor subsystem detected fault in system backplane
System backplane Location: U1.18-P1
4B24 25C1
Processor subsystem detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B24 25C7
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25C8
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25D3
Processor subsystem detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25D5
Processor subsystem detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 722
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B24 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 28B0
Processor subsystem detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B24 28B1
Processor subsystem detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B24 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B24 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 248B
Processor subsystem and display alter tests detected fault in operator panel
Operator panel Location: U1.17-L1
4B25 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25C0
Processor subsystem and display alter tests detected fault in system backplane
System backplane Location: U1.18-P1
4B25 25C1
Processor subsystem and display alter tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B25 25C7
Processor subsystem and display alter tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25C8
Processor subsystem and display alter tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25D3
Processor subsystem and display alter tests detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25D5
Processor subsystem and display alter tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
723
Error Code
Description
Action/ Possible Failing FRU
4B25 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 28B0
Processor subsystem and display alter tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B25 28B1
Processor subsystem and display alter tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B25 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B25 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 248B
Processor subsystem and scan interface basic assurance tests detected fault in operator panel
Operator panel Location: U1.17-L1
4B26 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25C0
Processor subsystem and scan interface basic assurance tests detected fault in system backplane
System backplane Location: U1.18-P1
4B26 25C1
Processor subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B26 25C7
Processor subsystem and scan interface basic assurance tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25C8
Processor subsystem and scan interface basic assurance tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25D3
Processor subsystem and scan interface basic assurance tests detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 724
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B26 25D5
Processor subsystem and scan interface basic assurance tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 25E9
Processor subsystem detected fault in 84-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 28B0
Processor subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B26 28B1
Processor subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B26 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B26 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 248B
Processor subsystem and bus tests detected fault in operator panel
Operator panel Location: U1.17-L1
4B27 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25C0
Processor subsystem and bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4B27 25C1
Processor subsystem and bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B27 25C7
Processor subsystem and bus tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25C8
Processor subsystem and bus tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
725
Error Code
Description
Action/ Possible Failing FRU
4B27 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25D3
Processor subsystem and bus tests detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25D5
Processor subsystem and bus tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 28B0
Processor subsystem and bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B27 28B1
Processor subsystem and bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B27 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B27 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 248B
Processor subsystem and other bus tests detected fault in operator panel
Operator panel Location: U1.17-L1
4B28 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25C0
Processor subsystem and other bus tests detected fault in system backplane
System backplane Location: U1.18-P1
4B28 25C1
Processor subsystem and other bus tests detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B28 25C7
Processor subsystem and other bus tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 726
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B28 25C8
Processor subsystem and other bus tests detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25D3
Processor subsystem and other bus tests detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25D5
Processor subsystem and other bus tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 28B0
Processor subsystem and other bus tests detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B28 28B1
Processor subsystem and other bus tests detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B28 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B28 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 248B
Processor subsystem and memory test detected fault in operator panel
Operator panel Location: U1.17-L1
4B29 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25C0
Processor subsystem and memory test detected fault in system backplane
System backplane Location: U1.18-P1
4B29 25C1
Processor subsystem and memory test detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5 Chapter
5.
Error
Code
to
FRU
Index
727
Error Code
Description
Action/ Possible Failing FRU
4B29 25C7
Processor subsystem and memory test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25C8
Processor subsystem and memory test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25D3
Processor subsystem and memory test detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25D5
Processor subsystem and memory test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 28B0
Processor subsystem and memory test detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B29 28B1
Processor subsystem and memory test detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B29 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B29 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 248B
Processor subsystem and self test detected fault in operator panel
Operator panel Location: U1.17-L1
4B2A 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 728
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B2A 25C0
Processor subsystem and self test detected fault in system backplane
System backplane Location: U1.18-P1
4B2A 25C1
Processor subsystem and self test detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2A 25C7
Processor subsystem and self test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25C8
Processor subsystem and self test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25D3
Processor subsystem and self test detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25D5
Processor subsystem and self test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 28B0
Processor subsystem and self test detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2A 28B1
Processor subsystem and self test detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2A 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2A 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 248B
Processor subsystem and wire test detected fault in operator panel
Operator panel Location: U1.17-L1 Chapter
5.
Error
Code
to
FRU
Index
729
Error Code
Description
Action/ Possible Failing FRU
4B2B 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25C0
Processor subsystem and wire test detected fault in system backplane
System backplane Location: U1.18-P1
4B2B 25C1
Processor subsystem and wire test detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2B 25C7
Processor subsystem and wire test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25C8
Processor subsystem and wire test detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25D3
Processor subsystem and wire test detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25D5
Processor subsystem and wire test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 28B0
Processor subsystem and wire test detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2B 28B1
Processor subsystem and wire test detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2B 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2B 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 730
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B2C 248B
Processor subsystem and IPL detected fault in operator panel
Operator panel Location: U1.17-L1
4B2C 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25C0
Processor subsystem and IPL detected fault in system backplane
System backplane Location: U1.18-P1
4B2C 25C1
Processor subsystem and IPL detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2C 25C7
Processor subsystem and IPL detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25C8
Processor subsystem and IPL detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25D3
Processor subsystem and IPL detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25D5
Processor subsystem and IPL detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 28B0
Processor subsystem and IPL detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2C 28B1
Processor subsystem and IPL detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2C 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2C 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
731
Error Code
Description
Action/ Possible Failing FRU
4B2C 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 248B
Processor subsystem and attention handler detected fault in operator panel
Operator panel Location: U1.17-L1
4B2D 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25C0
Processor subsystem and attention handler detected fault in system backplane
System backplane Location: U1.18-P1
4B2D 25C1
Processor subsystem and attention handler detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2D 25C7
Processor subsystem and attention handler detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25C8
Processor subsystem and attention handler detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25D3
Processor subsystem and attention handler detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25D5
Processor subsystem and attention handler detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 28B0
Processor subsystem and attention handler detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2D 28B1
Processor subsystem and attention handler detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2D 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 732
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B2D 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2D 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 248B
Processor subsystem and scan control routines detected fault in operator panel
Operator panel Location: U1.17-L1
4B2E 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25C0
Processor subsystem and scan control routines detected fault in system backplane
System backplane Location: U1.18-P1
4B2E 25C1
Processor subsystem and scan control routines detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2E 25C7
Processor subsystem and scan control routines detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25C8
Processor subsystem and scan control routines detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25D3
Processor subsystem and scan control routines detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25D5
Processor subsystem and scan control routines detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 28B0
Processor subsystem and scan control routines detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2E 28B1
Processor subsystem and scan control routines detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2E 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter
5.
Error
Code
to
FRU
Index
733
Error Code
Description
Action/ Possible Failing FRU
4B2E 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2E 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 248B
Processor subsystem and error code generation/logging detected fault in operator panel
Operator panel Location: U1.17-L1
4B2F 258A
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25C0
Processor subsystem and error code generation/logging detected fault in system backplane
System backplane Location: U1.18-P1
4B2F 25C1
Processor subsystem and error code generation/logging detected fault in 1.0 GHz clock card
Clock card Location: U1.18-P1-X5
4B2F 25C7
Processor subsystem and error code generation/logging detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25C8
Processor subsystem and error code generation/logging detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25CF
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25D3
Processor subsystem and error code generation/logging detected fault in 8-way processor MCM with VPD card
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25D5
Processor subsystem and error code generation/logging detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way)
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25D7
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25D8
Fault in 4-way processor MCM detected
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25E5
Processor subsystem detected fault in 4-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 25E9
Processor subsystem detected fault in 8-way processor MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 734
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
4B2F 28B0
Processor subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane
I/O subsystem PCI backplane at location specified in error code
4B2F 28B1
Processor subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane
I/O subsystem DASD backplane at location specified in error code
4B2F 526B
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 526C
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 526D
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
4B2F 526E
Fault detected in 4-way or 8-way CUoD MCM
Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component.
Chapter
5.
Error
Code
to
FRU
Index
735
System Firmware Update Messages
Error Code
Description
Action/ Possible Failing FRU
A1FD 0000
Firmware update recovery mode entered
Informational message
A1FD 0001
Service processor flash recovery
Insert Diskette #1
A1FD 0002
Service processor flash recovery
Insert Diskette #2
A1FD 0003
Service processor flash recovery
Insert Diskette #3
A1FD 0004
Service processor flash recovery
Insert Diskette #4
A1FD 000n
Service processor flash recovery
Insert Diskette #n
Capacity Upgrade on Demand (CUoD) Messages
Error Code
Description
Action/ Possible Failing FRU
A100 C2AC
Waiting for user to accept CUoD license agreement
User must click to accept license agreement. If the user declines, the IPL will be halted.
Scan Dump Messages
Error Code
Description
Action/ Possible Failing FRU
A100 3000
Successful completion of scan dump
Informational message; may or may not be seen during a scan log dump.
A1FF 3000
Scan dump has been disabled
Informational message; may or may not be seen during a scan log dump.
736
406/90
Service
Guide
Common Firmware Error Codes Attention: Follow the procedure defined in “Checkpoints and Error Codes Index” on page 416. If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.
Error Code
Description
Action/ Possible Failing FRU
B006 1403
Missing service processor resource
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B006 1404
Service processor firmware fault
1. Check for system firmware updates. 2. Primary I/O book Location: U1.18-P1-H2
B006 1405
Service processor firmware fault
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B006 1406
Service processor program error
1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2
B006 1407
Service processor resource allocation problem
1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2
B006 1408
Service processor program error
1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2
B006 1409
Service processor program error
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B10F 1000
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1002
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1005
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1007
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1008
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 100E
Service processor boot flash damaged
Primary I/O bookLocation: U1.18-P1-H2
B10F 100F
Service processor base flash damaged
Primary I/O bookLocation: U1.18-P1-H2 Chapter
5.
Error
Code
to
FRU
Index
737
Error Code
Description
Action/ Possible Failing FRU
B10F 1300
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1370
Service processor flash update failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1380
Service processor program failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1381
Service processor flash error
Primary I/O bookLocation: U1.18-P1-H2
B10F 1384
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1387
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1400
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1401
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1670
Service processor flash update failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1672
Service processor firmware update invalid
Call second level of support
B10F 1675
Service processor memory failure
Primary I/O bookLocation: U1.18-P1-H2
B10F 1681
Service processor firmware update invalid
Call second level of support
B10F 1682
Service processor firmware update invalid
Call second level of support
B10F 1683
Service processor firmware update invalid
Call second level of support
B1xx 0013
An attempt was made to flash (update) the firmware from AIX using the wrong image for this system. No flashing took place; the firmware level was not changed.
Retry the firmware update from AIX using the correct flash image.
B1xx 0577
Capacity upgrade on demand (CUoD) correctable surveillance error
Informational message; no action required.
B1xx 4600
Service processor failure Action: This code may be informational, or it may indicate a failure in the service processor. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19E4600, and the word 13 value is 230Dxxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13, replace the primary I/O book, U1.18-P1-H2. 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4601
System processor failure
Reboot the system in slow mode to isolate the failure. 738
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
B1xx 4602
Hardware failure
1. Was this error code logged shortly after the system crashed? NO Go to 3. YES Go to 22. Examine the service processor error log. Look for a 4xxx yyyy error that calls out FRU(s) in the CEC whose timestamp just precedes the B1xx 4602 error. If a 4xxx yyyy error occurred, replace the FRU indicated by the location code. If there is no 4xxx yyyy error whose timestamp just precedes the B1xx 4602 error, continue to 3. 3. a. Reboot the system in slow mode to the SMS menus. Refer to “Performing Slow Boot” on page 413. b. Examine the service processor error logs. Look for a 4xxx yyyy error that occurred during the slow boot that calls out FRU(s) in the CEC. v If a 4xxx yyyy error occurred during the slow boot, replace the FRU(s) indicated by the location code(s). v If a 4xxx yyyy error did not occur during the slow boot, replace the primary I/O book, location: U1.18-P1-H2.
B1xx 4603
Service processor firmware damaged
1. Reflash system firmware. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4604
Firmware failure Action: This code may be informational, or it may indicate a failure in the firmware. Before taking any action, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B1A14604, and the word 13 value is 2304xxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13: 1. Check for system firmware updates. Apply if available. 2. Call service support.
B1xx 4606
Firmware failure
1. Check for system firmware updates. 2. Operator panelLocation: U1.17-L1 Chapter
5.
Error
Code
to
FRU
Index
739
Error Code
Description
Action/ Possible Failing FRU
B1xx 4608
Service processor early termination
1. Reset the system by removing, then reapplying power to the processor subsystem. 2. Reflash system firmware. 3. Primary I/O book Location: U1.18-P1-H2
B1xx 460A
Time-of-day clock has been reset
1. Set time-of-day clock 2. Replace Primary I/O book Location: U1.18-P1-H2
B1xx 460B
NVRAM/Time-of-day battery
Replace battery Location: U1.18-P1-H2-V1
B1xx 4611
Service processor failure
1. Reboot the system in slow mode to isolate the problem in the processor subsystem. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4612
Service processor firmware failure
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4613
Service processor failure
Primary I/O bookLocation: U1.18-P1-H2
B1xx 4614
Service processor failure related to processor subsystem
Reboot the system in slow mode to isolate the problem in the processor subsystem.
B1xx 4620
Modem configuration failure
1. Check modem configuration files. 2. Check for system firmware updates. 3. Modem and modem cable. 4. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4621
Modem disconnection failure
1. Check modem configuration files. 2. Check for system firmware updates. 3. Modem and modem cable. 4. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4622
Service processor failure Action: This code may be informational, or it may indicate a failure in the service processor. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19F4622, and the word 13 value is 230Axxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13: 1. Operator panel, location: U1.17-L1 2. Check for system firmware updates. Apply if available. 3. Primary I/O book, location: U1.18-P1-H2. 740
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
B1xx 4633
System power control network (SPCN) failure Action: This error code may be informational, or it may indicate a failure in the SPCN firmware update. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, see error code “Error Code B1xx 4699” on page 747. If the error code (word 11 value) is B1394633, and the word 13 value is 2309xxxx, this is an informational message. No action is required by the customer or service representative. All of the following values of word 13 (or bytes 68 and 69) indicate a failure of an SPCN firmware update: A704: The SPCN code update has been interrupted; the SPCN firmware update terminates. This is logged in the AIX error log as a firmware event. Action: Stop all partitions. Use the HMC to power down the system. Power on the system and restart all partitions; the SPCN code update will restart and run in the background. A70E: SPCN (system power control network) flash update was successful. Actions: This is an informational message; no action is required by the customer or service representative. It indicates that the SPCN code flash update was successful. A710: SPCN command ?RLI? failed; this indicates a loss of communication with the SPCN controller. This is logged in the AIX error log as a firmware event. Action: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by putting the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system . Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A711: SPCN command ?DDB? failed; this indicates a loss of communication with the SPCN controller. This is logged in the AIX error log as a firmware event. Actions: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by placing the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system. Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A712: SPCN command ?QFS? failed. This is logged in the AIX error log as a firmware event. Actions: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by placing the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system. Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A713: No power subsystem code LID was found in the firmware image; the FRU firmware update terminates. This is logged in the AIX error log as a sysplanar0 error. Actions: Contact service support. A715: A bad power subsystem code LID was found in the firmware image; the FRU firmware update terminates. This is logged in the AIX error log as a sysplanar0 error. Actions: Contact service support.Error code B1xx 4633 continues on to the next row. Chapter
5.
Error
Code
to
FRU
Index
741
Error Code
Description
Action/ Possible Failing FRU
B1xx 4633Cont’d
Continued from previous row. A716: The power subsystem FRU code update was interrupted due to the system being shut down or turned off because of a critical power problem; the FRU firmware update terminates. This is logged into the AIX error log as a firmware event. Actions: Resolve the problem that caused the power shutdown. Power on and reboot at least one partition; the power subsystem code update will run in the background.A701 A702 A703 A705 A706 A707 For these values of word 13, do the following: 1. Power off the system. 2. Collect the service processor error log data for the B1xx 4633 error. From the main service processor menu, enter 3 to access the System Information menu. Select option 3, Service Processor Error Logs. 3. Collect the SPCN trace data. From the main service processor menu, enter 3 to access the System Information menu. Select option 9, the Power Control Network Utilities Menu. Then select option 4, Collect and Display the SPCN Trace Data. 4. Power on the system.Error code B1xx 4633 continues on to the next row.
B1xx 4633Cont’d
Continued from previous row. 1. Wait 10 minutes after the system has reached the AIX prompt. Examine the AIX error log for new B1xx 4633 errors with the word 13 values (in bytes 68 and 69 in the error log entry). If there are no B1xx 4633 errors with matching time stamps, the SPCN down load was successful. If the download failed again (there is another B1xx 4633 error in the AIX error log with the correct time stamp), do the following: a. Power down the system. b. Collect the service processor error log data for the B1xx 4633 error (see 2). c. Collect the SPCN trace data (see 3). d. Contact service support. For all other values of word 13, or bytes 68 and 69, do the following: 1) Check for system firmware updates. Apply if available. 2) Call service support.
B1xx 4634
SPCN Loop Fault
1. Check SPCN cabling. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318.
B1xx 4635
AC input was removed, then restored.
Informational message. Look at the service processor error log for additional error codes relating to loss of ac power to the system.
B1xx 4636
System is running on battery back-up power
Informational message.
B1xx 4643
Problem when turning on the identify LED of a FRU
1. Use the location code that was provided with the original error code to isolate and replace the failing device. 2. If a problem persists, go to “MAP 154A: I2C Bus Problem Isolation” on page 361. 742
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
B1xx 4644
I2C connection failure
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4645
I2C (non-processor) VPD system fault
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
B1xx 4646
Vital product data (VPD) acquisition failure
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4647
Machine type and model fields missing or invalid in system VPD
The machine type and model fields must be rewritten. Call service support.
B1xx 4648
Processor subsystem backplane VPD or power source problem
Go to “MAP 1540: Problem Isolation Procedures” on page 308.
B1xx 4650
System processor failure (all processors reset).
Reboot the system in slow mode to isolate the problem.
B1xx 4651
CPU GARD VPD on MCM 0
U1.18-P1-C1 and U1.18-P1-H2.3
B1xx 4652
CPU GARD VPD on MCM 1
U1.18-P1-C3 and U1.18-P1-H2.5
B1xx 4653
CPU GARD VPD on MCM 2
U1.18-P1-C4 and U1.18-P1-H2.4
B1xx 4654
CPU GARD VPD on MCM 3
U1.18-P1-C2 and U1.18-P1-H2.6
B1xx 4660
Memory subsystem failure (cannot isolate)
Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code.
B1xx 4661
Memory book VPD failure
Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code.
B1xx 4662
Memory book VPD failure
Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code.
B1xx 4681
JTAG scan system (cabling)
1. Check JTAG cabling. 2. Call service support
B1xx 4682
Problem with VPD on operator panel
1. Replace the operator panel,Location: U1.17-L1Do not swap the old VPD module onto the new op panel. Call service support for instructions on programming the new VPD module. 2. Call service support. Chapter
5.
Error
Code
to
FRU
Index
743
Error Code
Description
Action/ Possible Failing FRU
B1xx 4683
Capacity Upgrade on Demand (CUoD) card fault Action: This error code may be informational, or it may indicate a failure of the CUoD capacity card. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, see error code “Error Code B1xx 4699” on page 747. All of the following values of word 13 (or bytes 68 and 69) indicate that a surveillance error occurred in the capacity upgrade on demand (CUoD) firmware or hardware that was corrected. No action is required by the customer or service representative.C157 C1A0 C1A1 C1A2 C1A3 C201 For all other values of word 13 (or bytes 68 and 69): 1. Replace the CUoD capacity card, location: U1.18-P1-H2.2. See also “Capacity Upgrade on Demand (CUoD) VPD Module” on page 902. If the value of the first two bytes of word 13 in the detailed service processor error log entry, or the value of bytes 68 and 69 in the AIX error log entry, is E196, this indicates the CUoD capacity card installed in the system was previously used in another system. CUoD capacity cards cannot be moved between systems; replace the CUoD capacity card with the one for this system. For detailed instructions on finding word 13 in the service processor error log entry, and bytes 68 and 69 in the AIX error log entry, refer to the action column for error code B1xx 4699.
B1xx 4690
Operating system surveillance time out (AIX to service processor interface failure) Action: 1. This error code indicates that the operating system terminated early (which usually implies an operating system crash). This error code may appear in the service processor error log by itself. However, in the AIX error log, there should be another error which points to the cause of the operating system crash. Use the other error as the starting point for your service action. 2. The other possibility is that the operating system was not found during a prior boot attempt. To determine if this occurred, do the following: Look at the AIX error log entry containing B1xx4690. This will be a ?SCAN_ERROR_CHRP? error with an identifier of BFEC0425. In the detail data, find the string ?B1xx4690? (If present, it will be at byte 60 of the detail data.) Then go forward 8 bytes after the ?B1? to byte 68 and look at bytes 68 and 69. If the values of bytes 68 and 69 are A2B0, this indicates that the firmware was unable to find a bootable device in the boot list that is set in the SMS menus. If the system is up, the boot list problem has been corrected and the B1xx 4690 can be treated as an informational message with no actions required. 3. Call service support. 744
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
B1xx 4691
Description: System firmware to service processor interface failure. (System firmware surveillance time out) Action: This code may be informational, or it may indicate a system firmware to service processor interface failure. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19F4691and the word 13 value is 230Axxxx, this is an informational message. No action is required by the customer or service representative. 1. Check for system firmware updates. 2. Go to the service processor main menu and select System Information Menu. Then select Read Progress Indicators From Last System Boot. Begin your repair action with the error code or checkpoint immediately preceding B1xx 4691. If a location code displays with the error code or checkpoint, replace the part at that location. If changing that part does not fix the problem, or no location code is specified, and you have an 8-character error code, go to the “Checkpoints and Error Codes Index” on page 416. If changing that part does not fix the problem, or no location code is specified, and you have a 4-character checkpoint, go to “Firmware Checkpoints” on page 379. 3. If the problem is not resolved, call the second level support.
B1xx 4692
Operator panel task interface failure
1. Check for system firmware updates. 2. Call service support
B1xx 4693
Service processor firmware failure
1. Check for system firmware updates. 2. Call service support
B1xx 4694
System processor firmware failure
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4695
System power control network (SPCN) to service processor interface failure
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 4696
System firmware interface failure
1. Check for system firmware updates. 2. Call service support. Chapter
5.
Error
Code
to
FRU
Index
745
Error Code
Description
Action/ Possible Failing FRU
B1xx 4698
Service processor firmware error This is usually an indication that the interposer plug count has not been set after an MES upgrade or FRU replacement of an MCM or L3 cache module. It can also indicate that the plug count values were not revalidated after the primary I/O book was replaced. In both cases, it is only an informational message. If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 4698. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, look at the AIX error log entry containing B1xx 4698. This will be a ?SCAN_ERROR_CHRP? error with an identifier of BFE4C025. In the detail data, find the string ?B1xx 4698? (if present, it will be at byte 60 of the detail data). Go forward 8 bytes after the ?B1? to byte 68 and examine bytes 68 and 69. If the value is E10B or E10C, do the following: v If the primary I/O book was just replaced, go to the service processor menus and select ?System Information Menu,? then select ?MCM/L3 Interposer Plug Count Menu.? At the 0> prompt, enter 50 to commit the values and write them to the VPD. Fully exit the menu before powering on the system. v If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select ?System Information Menu,? then select ?MCM/L3 Interposer Plug Count Menu.? Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter 7 as the new plug count (to account for the number of times the module was plugged during the manufacturing process). Fully exit the menu before powering on the system.For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following: 1. Check for system firmware updates. 2. Reset the service processor by activating the pinhole reset switch on the operator panel. 3. Call service support.
B1xx 4699
Service processor firmware error or informational message.
See “Error Code B1xx 4699” on page 747.
B1xx 469A
System firmware interface
1. Check for system firmware updates. 2. Call service support.
B1xx 469B
Firmware to I/O interface
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 469C
Firmware to I/O interface
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 469D
Service processor firmware
1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2
B1xx 469E
Service processor firmware failure
1. Check for system firmware updates. 2. Reboot the system in slow mode to isolate a problem in the processor subsystem. 3. Primary I/O bookLocation: U1.18-P1-H2 746
406/90
Service
Guide
Error Code
Description
Action/ Possible Failing FRU
B1xx 469F
Service processor failure
1. Check for system firmware updates. 2. Reboot the system in slow mode to isolate a problem in the processor subsystem. 3. Primary I/O bookLocation: U1.18-P1-H2
B1xx 8FF0
Informational message
See B1xx 8FF0 in “Error Code B1xx 8FF0” on page 752.
B1xx A2D0
Write operation to the LPAR data area in NVRAM failed
1. Using the HMC, perform a soft reset on the partition that reported this error. 2. Using the HMC, perform a hard reset on the partition that reported this error. 3. If the attempts to reset the partition are not successful, shut down all partitions, then remove and reapply AC power to the system. 4. Reboot the system.
B1FD 0013
A firmware update has failed because the update image is invalid. The update was stopped before any changes were made to the currently installed firmware
Reset the service processor by activating the pinhole reset switch on the operator panel. Flash a valid firmware update image for the system type and model.
B1FD 0015
Too many firmware update diskettes were inserted (or the update image was faulty, or not the right image for the system)
1. Essure that the firmware update image is correct for the system type and model. 2. Replace the update diskettes with a new set of firmware update diskettes.
B1FD 0016
Firmware update diskette read error
1. The firmware diskette may be faulty. Try another diskette. 2. Diskette drive Location: U1.17- D1. 3. Primary I/O book Location: U1.18-P1-H2.
B1FD 001A
Service processor recovery mode update fault
Reset the service processor by activating the pinhole reset switch on the operator panel. Attempt to flash a valid firmware update image for the system type and model again.
B1FD 001E
Service processor recovery mode update fault
Wrong firmware update diskette. Insert the correct update diskette.
B1FD 001F
Firmware update diskette bad
Replace the update diskette.
Error
Code B1xx 4699 The following error code was moved out of the Common Firmware Error Code table because of the size of the description column.
Error Code
Description:
Action/ Possible Failing FRU:
B1xx 4699
Service processor firmware error or informational message.
For complete information regarding this error code, read through this entire section. This is usually an indication of a problem in the communication path between the HMC and the service processor. It may only be an informational message. Chapter
5.
Error
Code
to
FRU
Index
747
If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 4699. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, examine the AIX error log entry containing B1xx 4699. This is a ?SCAN_ERROR_CHRP? error with an identifier of BFE4C025. In the detail data, find the string ?B1xx 4699?. (If present, it will be at byte 60 of the detail data.) Go forward 8 bytes after the ?B1? to byte 68 and examine bytes 68 and 69. If the system is running Linux, examine the Linux system log. The line(s) in the extended data that begin with <4>RTAS: Log Debug: 04? contain the error code in the next 8 hex characters. (This error code is also known as word 11). Each 4 bytes after the error code in the Linux extended data is another word. The 4 bytes after the error code are word 12 and the next four bytes are word 13. An example of the Linux extended data, and finding words 11, 12, and 13, is shown in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, “Step 1321-28” on page 178, and “Step 1321-29” on page 178. Perform the following actions based on the values of bytes 68 and 69 listed below from the AIX error log entry, or on the first two bytes of word 13 from the service processor error log entry: 2200: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2209: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2306: No MCM is detected in the first position (U1.18-P1-C1); an MCM is required in the first position for the system to boot. Actions: 1. If an MCM is not plugged into the first position (U1.18-P1-C1), plug an MCM into position one. 2. If an MCM is plugged into the first position (U1.18-P1-C1), follow the removal and replacement procedures and reseat the MCM. If reseating the MCM does not fix the problem, replace the MCM.2310: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2311: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2315: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.240A: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.240B: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.
748
406/90
Service
Guide
24FF: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2504: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2512: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2513: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2524: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2526: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.35EF: The scan dump code was not able to successfully complete the request to close the connection to the load source.Actions: This is an informational message. No action is required by the customer or service representative.9900: Memory allocation failure during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9901: State sequence error found during CRC query during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9902: Mailbox LID data failure during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9903: State sequence error found after a firmware update LID was received during firmware update. Chapter
5.
Error
Code
to
FRU
Index
749
Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9904: Error while flashing an update LID during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9905: CRC mismatch after a firmware update LID was received during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9906: Software problem during firmware update from operating system. Actions: 1. Check for a system firmware update that is later than the one that caused this error. Apply if available. 2. Call service support.9920: Error occurred during model upgrade from a 406/70 (7040 model 671) to a 406/90 (7040 model 681). Actions: 1. Verify that the system being upgraded is a 406/70 (7040 model 671). This error will occur if the system has already been upgraded. 2. The special model upgrade diskette might be damaged. Obtain another model upgrade diskette. 3. The diskette drive in the media subsystem might be damaged. Replace it. 4. Check for system firmware updates. Apply if available. 5. Call service support.A205: Machine type and model fields are not valid in the VPD module. Obtain a new operator panel. Do not swap the old VPD module onto the new operator panel. Call service support for instructions on how to write the machine type and model into the new VPD module. A20B: Error requesting trace buffer for service processor. Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available.A218: Unknown return code detected. Actions: Check for system firmware updates. Apply the updates if they are available. A21A: Error allocating an internal service processor. 750
406/90
Service
Guide
Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available.A800: This indicates an HMC/service processor initialization failure. Actions: 1. Check for system firmware updates. 2. Replace the primary I/O book, location: U1.18-P1-H2.A801: This indicates an HMC wrap failure. Actions: Replace the primary I/O book, location: U1.18-P1-H2. A806: This indicates a loss of the surveillance heartbeat between the HMC and the service processor. Actions: 1. Make sure that the HMC is booted and operational. 2. Check the serial cables that go from the HMC to the primary I/O book, location: U1.18-P1-H2.If there are no other error codes or indications of a problem, the A806 (loss of surveillance heartbeat) was a temporary condition and has been resolved; the B1xx4699 code is then an informational message only. If the problem persists: 1. Run diagnostics on the serial port on the HMC. 2. Replace the primary I/O book, location U1.18-P1-H2. Values of A009 and A719 of bytes 68 and 69 in the AIX error log entry or the first two bytes of word 13 in the service processor error log entry are also informational entries: A009: The system received a power-off request at runtime from the HMC, or the white button on the operator panel was used to power down the system at runtime. A719: Primary power failed; the system switched to battery backup power. The following values of bytes 68 and 69, or the first half of word 13, indicate an internal error occurred in the capacity upgrade on demand (CUoD) firmware or hardware that was corrected. No action is required by the customer or service representative. C101, C103, C104, C109, C10B, C10C, C10D, C10E, C10F, C110, C112, C113, C114, C115, C116, C117, C118, C11A, C11F, C120, C121, C122, C123, C124, C125, C126, C127, C128, C129, C12A, C12B, C12C, C12D, C12E, C12F, C130, C131, C132, C133, C134, C135, C136, C138, C13B, C13C, C140, C141, C142, C143, C144, C145, C146, C147, C149, C14A, C14B, C14C, C14D, C14E, C14F, C150, C151, C152, C153, C154, C155, C156, C158, C159 C11B: This value indicates that the CUoD capacity card (location code U1.18-P1-H2.2) needs to be initialized or activated. If the customer is entitled to processor activations, go to step 9 in the “Capacity Upgrade on Demand (CUoD) VPD Module” on page 902. If the customer is not entitled to customer activations, do the following: 1. Power down the system. Wait until OK is in the operator panel in the media drawer indicating that the service processor is in standby. 2. From the service processor main menu, enter code 88484 to access the hidden menu. 3. Press enter again at ?Authorized Use Only? screen. 4. When the CUoD menu appears, the status of ?Test and Run? should be Available. 5. Select option 3 to enter an activation code. 6. Enter the null activation code of 34 zeros: 0000000000000000000000000000000000 7. A success return code should be returned and the Test and Run Status will now be Complete. 8. Now exit out back to the service processor main menu. The system is ready to be powered up.For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following: 1. Check for system firmware updates. Apply if available. Chapter
5.
Error
Code
to
FRU
Index
751
2. Reset the service processor by activating the pinhole reset switch on the operator panel. 3. Call service support.Error Code B1xx 8FF0 The following error code was moved out of the Common Firmware Error Code table because of the size of the description column.
Error Code
Description:
Action/ Possible Failing FRU:
B1xx 8FF0
Informational message
No action required. (The data in these informational log entries may be used by support personnel to gather additional debug information.) For complete information regarding this error code, read through this entire section. If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 8FF0. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, do the following: v Examine the AIX error log entry containing B1xx 8FF0. This is a FIRMWARE_EVENT error with an identifier of A6D1BD62. v In the detail data, find the string B1xx 8FF0. (If present, it will be at byte 60 of the detail data.) v Move forward 8 bytes to bytes 68 and 69; this is the value of word 13 in the service processor error log entry.If the system is running Linux, examine the Linux system log. The line(s) in the extended data that begin with <4>RTAS: Log Debug: 04 contain the error code in the next 8 hex characters. (This error code is also known as word 11). Each 4 bytes after the error code in the Linux extended data are another word. The 4 bytes after the error code are word 12 and the next four bytes are word 13. An example of the Linux extended data, and finding words 11, 12, and 13, is shown in MAP 1321, step 1321-28, and step 1321-29, in this service guide. CoD Transactions If the value of the first two bytes of word 13 in the detailed service processor error log entry, or the value of bytes 68 and 69 in the AIX error log entry is C1A4, this indicates a capacity on demand (CoD) transaction has taken place. For more detail about CUoD and CoD transactions, see the Hardware Management Console Operations Guide, order number SA23-1251. If a CoD transaction has taken place, and more detail about the transaction is required, do the following: v Record the 36 bytes in the AIX error log entry starting with B1xx 8FF0, grouping 4 bytes at a time so that the resulting words match the service processor error log’s detail data. For example, B1xx 8FF0 is the value of word 11, the next 4 bytes are word 12, the next 4 are word 13, etc, until word 19 has a value.There are five parameters of interest in the CoD log entry: 1. Word 16 value (first 2 bytes): the type of transaction 2. Word 17 value (first 2 bytes): the change in number of processors due to this transaction 3. Word 17 value (second 2 bytes): the resulting number of processor due to this transaction 4. Word 18 value (first 2 bytes): the on/off CoD processor days remaining 5. Word 18 value (second 2 bytes): the hours remaining in the current on/off CoD resource periodTo find these values in the service processor error log entry, or in the AIX error log entry, do the following: 752
406/90
Service
Guide
Type of Transaction and Resource Type Examine the first two bytes of word 16 recorded above. The value of the first byte of word 16 denotes the type of CoD transaction; the value of the second byte of word 16 is always 01. The Change in Number of Processors Due to This transaction, and the Resulting Number of Processor Due to This transaction Examine the first two bytes of word 17 recorded above. The value in the first two bytes of word 17 is the change in the number of CoD processors due to this transaction. The value in the third and fourth bytes of word 17 is the number of CoD processors resulting from this transaction. The On/Off CoD Processor Days Remaining, and the Hours Remaining in the Current On/Off CoD Resource Period Examine the first two bytes of word 18 recorded above. The value in the first two bytes of word 18 is the change in the number of CoD processors days remaining. The value in the third and fourth bytes of word 18 is the number of hours remaining in the current on/off CoD resource period. Note: The hours remaining counter starts at 24 hours, 0 minutes. As soon as it decrements to 23 hours, 59 minutes, 59 seconds, it will display on the HMC as 23 hours. The same is true as the counter gets close to zero: as the counter rolls over from 1 to zero hours, zero is displayed on the HMC for the next hour.Permanent activation code entered An informational log entry is made each time a permanent activation code is entered. An informational log entry is made each time a permanent activation code is entered. v Word 16 value (first 2 bytes): 0100 v Word 17 value (first 2 bytes): The number of processors activated with this transaction. v Word 17 value (second 2 bytes): The total number of activated processors of this type. This includes those previously activated plus those activated with this transaction. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Trial CoD activation An informational log entry is made each time a trial CoD period is started. v Word 16 value (first 2 bytes): 0200 v Word 17 value (first 2 bytes): The number of processors activated with this transaction. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Trial CoD cancellation An informational log entry is made each time a trial CoD period is cancelled. v Word 16 value (first 2 bytes): 0300 v Word 17 value (first 2 bytes): The number of processors originally requested. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.On/off activation code entered An informational log entry is made each time an on/off CoD activation code is entered. v Word 16 value (first 2 bytes): 0400 v Word 17 value (first 2 bytes): The number of processors days purchased. Chapter
5.
Error
Code
to
FRU
Index
753
v Word 17 value (second 2 bytes): The total number of activated processors of this type. This includes those previously activated plus those activated with this transaction. v Word 18 value (first 2 bytes): The current value of the processor day counter. It includes the processor days just added with this transaction. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Manage on/off CoD request An informational log entry is made each time the change that is requested in the number of on/off CoD resources to be activated is accepted. No processor day charges are associated with this entry. v Word 16 value (first 2 bytes): 0500 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC (the number itself, not the difference from the previous value). v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.On/off CoD number change, per user request or during IPL An informational log entry is made when: 1. On/off CoD resources are started during an IPL (logically partitioned mode or full system partition mode) 2. On/off CoD resources are started or stopped due to a customer request (logically partitioned mode)v Word 16 value (first 2 bytes): 0600 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC (the number itself, not the difference from the previous value). v Word 17 value (second 2 bytes): The resulting number of processors. This is the number of on/off processors that the firmware was able to start. It may be less than the number requested. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. At boot time, and on an increase in the number of processor days, the counter is reset to 24 hours. If the number of processor days is decreased, the counter is not changed and will continue to show the current value.On/off CoD number change, system An informational log entry is made when the number of on/off CoD resources is automatically reduced by the firmware due to a failing processor being taken off-line and not replaced. v Word 16 value (first 2 bytes): 0700 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC. This is the number of on/off CoD resources that was previously requested. v Word 17 value (second 2 bytes): The number of on/off CoD resource currently active. It may be smaller than the number requested. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. (This transaction never decrements the number of processor days ) v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. (This transaction never resets the hours remaining.
754
406/90
Service
Guide
On/off processor day charge An informational log entry is made when the system decrements the processor day counter at the start of each 24 hour period (including the first one). The ?account? is being charged for the next day’s use of the processor. v Word 16 value (first 2 bytes): 0800 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC. This is the number of on/off CoD resources that is being requested. v Word 17 value (second 2 bytes): The number of on/off CoD resource currently active. It may be smaller than the number requested due to failing resources. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. (This will always be 24 hours.)Out of compliance due to the expiration of a trial CoD period An informational log entry is made when the system goes out of compliance due to either: 1. The expiration of a trial CoD period. 2. The cancellation of a trial CoD period.v Word 16 value (first 2 bytes): 0900 v Word 17 value (first 2 bytes): The number of trial on/off CoD resources out of compliance. v Word 17 value (second 2 bytes): The new number of trial on/off CoD resources. This value is always 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Out of compliance due to the expiration of an on/off CoD period An informational log entry is made when the system goes out of compliance due to the expiration of an on/off CoD period (when the processor day counter is fully decremented to zero). v Word 16 value (first 2 bytes): 0A00 v Word 17 value (first 2 bytes): The number of on/off CoD resources out of compliance. v Word 17 value (second 2 bytes): The new number of on/off CoD resources. This value is always 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. This value is always 0. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. This value is always 0.Trial CoD termination An informational log entry is made each time a trial CoD period times out. v Word 16 value (first 2 bytes): 0B00 v Word 17 value (first 2 bytes): The number of processors originally requested. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.SPCN Informational Codes The following values of word 13 indicate an SPCN informational message: C006, C007, C00C, C030, C0E0, C0E2, C0F0, C0F1. No action is required by the customer or service representative. Chapter
5.
Error
Code
to
FRU
Index
755
VPD Informational Codes The following values of word 13 indicate a VPD informational message: E104, E10D, E112, E113, E114. No action is required by the customer or service representative. HMC Surveillance Error Codes The HMC surveillance error codes generated only by the HMC are listed in the following table.
Error Code
Description
Action/ Possible Failing FRU
B301 0001
HMC/partition surveillance fault
Contact service support.
B301 0002
HMC/partition surveillance fault
Verify the network between the HMC and the partition by pinging in both directions. If problems are found, correct them. Reboot the HMC.
B301 0012
HMC/partition surveillance fault
Contact service support.
B301 0013
HMC/partition surveillance fault
Contact service support.
B301 0019
HMC/partition surveillance fault
Contact service support.
B301 0020
HMC/partition surveillance fault
Contact service support.
B301 0023
HMC/partition surveillance fault
Contact service support.
B301 0025
HMC/partition surveillance fault
Contact service support.
B301 0026
HMC/partition surveillance fault
Contact service support.
B301 0027
HMC/partition surveillance fault
1. If short host names are being used for the HMC and the partitions, ensure that the short host names are set up correctly according to the instructions in ?Using Network Adapters to Communicate with Partitions? in the Hardware Management Console Operations Guide. Correct any problems that are found, then verify that you can ping in both directions between the HMC and partitions. 2. If short host names are not being used, or the previous step does not correct the problem, contact service support.
B301 0029
HMC/partition surveillance fault
Contact service support.
B301 002B
HMC/partition surveillance fault
Contact service support.
B301 002C
HMC/partition surveillance fault
Contact service support.
B301 002D
HMC/partition surveillance fault
Contact service support.
B302 0001
HMC/partition surveillance fault
1. If short host names are being used for the HMC and the partitions, ensure that the short host names are set up correctly according to the instructions in ?Using Network Adapters to Communicate with Partitions? in the Hardware Management Console Operations Guide. Correct any problems that are found, then verify that you can ping in both directions between the HMC and partitions. 2. If short host names are not being used, or the previous step does not correct the problem, contact service support. 756
406/90
Service
Guide
Problem Determination-Generated Error Codes
Error Code
Description
Action/ Possible Failing FRU
M0BT 0000
The system hung during speaker POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
M0BT 0001
The system hung during “Starting Software...”. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
M0KB D000
The system hung during keyboard POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
1. Primary I/O bookLocation: U1.18-P1-H2 2. Keyboard
M0KB D001
The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Type 101 keyboard.
M0KB D002
The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Type 102 keyboard.
M0KB D003
The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Kanji-type keyboard.
M0ME M002
The system hung during memory POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
M0NE T000
The system hung during network POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
M0PS 0000
Power failure. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1520: Power” on page 192.
M0SC SI00
Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
M0SC SI01
Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” as described in the 406/90.
Go to “MAP 1542: I/O Problem Isolation” on page 318.
Hypervisor Dump Retrieval Procedure This procedure defines the requirements and methods for retrieving a hypervisor dump file following a system crash. This procedure is applicable only if the managed system had been running as a partitioned system before it crashed. Chapter
5.
Error
Code
to
FRU
Index
757
This procedure requires the managed system to be rebooted as a partitioned system. Retrieval must then take place from an AIX partition. Hypervisor Dump Indications If the system operator panel displays error code 20D00997, 20D00998, or 20D00999, a system crash within the run-time firmware has occurred. If error code 20D00997, 20D00998, or 20D00999 is in the service processor error log, a system crash within the run-time firmware has occurred. The following is an example of a detailed service processor error log entry with error code 20D00999: 05/29/2001 21:06:12 Operating System Terminated with Error String Error code: 20D00999 Detail: A2C1 SRC -------------------------------------------------------------- word11: B1194690 word12: 04A0005D word13: A2C1C0FA word14: 00000000 word15: 00007701 word16: 00000008 word17: 00000000 word18: 00000000 word19: 00000000 20D00999 After a system crash has occurred, verify if a hypervisor dump has been collected, as described in the following section. Retrieving the Hypervisor Dump file After it has been determined that the managed system has crashed and a hypervisor dump exists, or if the user wants to test for an existing dump file, the managed system must be rebooted as a partitioned system. A partition or partitions running AIX must then be activated. After rebooting, any partition running AIX may be used to retrieve the dump file. Do the following: 1. Install the devices.chrp_lpar.base.ras fileset. The devices.chrp_lpar.base.ras AIX fileset must be installed and committed. To determine if the package has been previously installed, enter the following command on the AIX command line: lslpp -l | grep devices.chrp_lpar.base.ras If the output on the AIX console is similar to the following, the devices.chrp_lpar.base.ras fileset is already installed and no further action for step 1 is necessary: devices.chrp_lpar.base.ras 5.X.X.X COMMITTED CHRP LPAR RAS Support If no output is returned, the devices.chrp_lpar.base LPP must be installed from the AIX installation media by the customer or system administrator. After the fileset is installed, the dump file is ready to be retrieved. 2. Enter fetchdbg -k at the command prompt. This produces the following output: Retrieving Dump File ............Dump read successful This action results in a file being written into the /tmp directory named dumpMMDDYY where MM is the current month, DD is the current day, and YY is the current year. Forward the /tmp/dumpMMDDYY file to service support. If the following text displays on the AIX console, note the error and report it to service support: Retrieving Dump File Error performing dump read 758
406/90
Service
Guide
Chapter 6. Using the Online and Standalone Diagnostics The AIX diagnostics consist of online diagnostics and standalone diagnostics. Attention: The AIX operating system must be installed in a partition in order to run online diagnostics on that partition. If the AIX operating system is not installed, use the standalone diagnostic procedures. Online diagnostics, when they are installed, reside with AIX in the file system. They can be booted: v In single user mode (referred to as service mode) v To run in maintenance mode (referred to as maintenance mode) v To run concurrently with other applications (referred to as concurrent mode)Standalone diagnostics must be booted before they can be run. If booted, the diagnostics have no access to the AIX error log or the AIX configuration data. Does the system have AIX diagnostics preinstalled? Yes If there is an HMC attached to the system, go to “Operating Considerations for Online and Standalone Diagnostics.” If an HMC is not attached to the system, go to “Running the Online Diagnostics with No HMC Attached” on page 762. No If there is an HMC attached to the system, go to “Loading the Standalone Diagnostics from CD-ROM Using an HMC” on page 764. If an HMC is not attached to the system, go to “Running the Standalone Diagnostics with CD-ROM with No HMC Attached” on page 765. If standalone diagnostics will be run from a NIM server, go to “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. If an HMC is not attached to the system, follow the directions for a full system partition.Operating Considerations for Online and Standalone Diagnostics Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. Running online diagnostics in service mode ensures that the error state of the system that has been captured in NVRAM is available for your use in analyzing the problem. The AIX error log and certain SMIT functions are only available when diagnostics are run from the disk drive. Consider the following items before using the diagnostics: v Standalone diagnostics can run on systems configured for either a full (or single) system partition or a multiple partitioned system. When running in a partitioned system, the device from which you are booting standalone diagnostics must be made available to the partition dedicated to running standalone diagnostics. This may require moving the device from the partition that currently contains the boot device (for example, the CD-ROM or network adapter connected to the NIM server that has a standalone diagnostic image) to the partition used to run standalone diagnostics. For partitions running AIX levels earlier than AIX 5.2, if you move devices, reboot both source and destination partitions. For more information, see “Standalone Diagnostic Operation” on page 764. v When diagnostics are installed, the device support for some devices might not get installed. If this is the case, that device does not display in the diagnostic test list when running disk-based diagnostics. v When running diagnostics in a partitioned system, diagnostics will work only with the resources that were assigned to that partition. You must run diagnostics in the partition containing the resource that you want to test.
759
Identifying the Terminal Type to the Diagnostics When you run diagnostics, you must identify which type of terminal you are using. If the terminal type is not known when the FUNCTION SELECTION menu is displayed, the diagnostics do not allow you to continue until a terminal is selected from the DEFINE TERMINAL option menu. Choose the lft selection when running diagnostics from an HMC. Undefined Terminal Types If you specify an undefined terminal type from the DEFINE TERMINAL option menu, the menu prompts the user to enter a valid terminal type. The menu redisplays until either a valid type is entered or you exit the DEFINE TERMINAL option. For a system attached to an HMC use VT320 emulation. If the AIX operating system is being used, from the command line, type: export term=vt320 Resetting the Terminal If you enter a terminal type that is valid (according to the DEFINE TERMINAL option menu) but is not the correct type for the HMC virtual terminal window being used, you may be unable to read the screen, use the function keys, or use the Enter key. Bypass these difficulties by pressing Ctrl-C to reset the terminal. The screen display that results from this reset depends on the mode in which the system is being run: v Online Normal or Maintenance Mode - The command prompt displays. v Standalone Mode or Online Service Mode - The terminal type is reset to dumb, the Diagnostic Operating Instruction panel displays, and you are required to go through the DEFINE TERMINAL process again.Online Diagnostics Modes of Operation Notes: 1. When running online diagnostics on a partition in a partitioned system, diagnostics can be run only on resources that are allocated to that partition. 2. Online diagnostics can only be run in a partition with AIX installed. The online diagnostics can be run in the following modes: v Service Mode v Concurrent Mode v Maintenance ModeService Mode Service mode provides the most complete checkout of the system resources. This mode also requires that no other programs be running on the partition or on a full system partition. Except for the SCSI adapter and the disk drives used for paging, resources on all partitions or systems on a full system partition can be tested. However, note that the memory and processor are only tested during POST, and the results of the POST tests are reported by diagnostics. Error-log analysis is done in service mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. Consider the following when you run the online diagnostics from a server or a disk: v The diagnostics cannot be loaded and run from a disk until the AIX operating system has been installed and configured. v The diagnostics cannot be loaded on a system (client) from a server if that system is not set up to boot from a server over a network. When the system is set up to boot from a server, the diagnostics are run in the same manner as they are from disk. 760
406/90
Service
Guide
v On a full system partition, if the diagnostics are loaded from disk or a server, you must shut down the AIX operating system before turning off the system unit to prevent possible damage to disk data. Do this in either of the following ways: – If the diagnostics were loaded in standalone mode, press the F3 key until DIAGNOSTIC OPERATING INSTRUCTIONS displays. Then press the F3 key once again to shut down the AIX operating system. – If the diagnostics were loaded in maintenance or concurrent mode, type the shutdown -F command.v Under some conditions, the system might stop, with instructions displayed on attached displays and terminals. Follow the instructions to select a console display.Running Online Diagnostics in Service Mode with an HMC Attached to the System To run the online diagnostics in service mode from the boot hard disk, do the following: 1. From the HMC, select Partition Manager. 2. Right-click on the mouse and select Open Terminal Window. 3. From the Service Processor Menu on the VTERM, select Option 2 System Power Control. 4. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. 5. Select Option 98 to exit the system power control menu. 6. Use the HMC to power on the managed system in a full system partition by selecting the managed system in the Contents area. 7. Right-click or select the desired system in the Contents area. Next, on the menu, choose Selected. 8. Select Power On. 9. Select the Power on Diagnostic Stored Boot list option (refer to “Full System Management Tasks” in the Hardware Management Console , order number SA23-1251, for more information on full system partitions). 10. Make sure that there is no media in the devices in the media drive. 11. Enter any passwords, if requested. Chapter
6.
Using
the
Online
and
Standalone
Diagnostics
761
Running the Online Diagnostics with No HMC Attached Online diagnostics can be loaded by booting from a hard file in the custom boot list by pressing the numeric 6 key on the keyboard. To boot from the devices listed in the service mode boot list, perform the following procedure: 1. Verify with the system administrator and users that all programs may be stopped, then do so. 2. Turn off the system. Note: If AIX is running, type the shutdown command. 3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn the system on. 4. When the keyboard indicator appears during startup, press the numeric 6 key on the keyboard. 5. Enter any requested passwords.After any requested passwords have been entered, the system attempts to boot from the first device of each type found on the list. If no bootable image is found on the first device of each type on the list, the system does not search through the other devices of that type for a bootable image; instead, it polls the first device of the next type. If all types of devices in the boot list have been polled without finding a bootable image, the system restarts. This action gives the user the opportunity to start the System Management Services (by pressing the number 1 key on a directly attached keyboard or the number 1 on an ASCII terminal) before the system attempts to boot again. Concurrent Mode Running the online diagnostics in concurrent mode is started by entering diag at an AIX prompt. Use concurrent mode to run online diagnostics on some of the system resources while the system is running normal activity. Because the system is running in normal operation, the following resources cannot be tested in concurrent mode: v SCSI adapters connected to paging devices v Disk drive used for paging v Some display adapters and graphics related devices v Memory (tested during POST) v Processor (tested during POST)The following levels of testing exist in concurrent mode: v The share-test level tests a resource while the resource is being shared by programs running in the normal operation. This testing is mostly limited to normal commands that test for the presence of a device or adapter. v The sub-test level tests a portion of a resource while the remaining part of the resource is being used in normal operation. For example, this test could test one port of a multiport device while the other ports are being used in normal operation. v The full-test level requires the device not be assigned to or used by any other operation. This level of testing on a disk drive might require the use of the varyoff command. The diagnostics display menus to allow you to vary off the needed resource. 762
406/90
Service
Guide
Error-log analysis is done in concurrent mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. To run the online diagnostics in concurrent mode, you must be logged in to the AIX operating system and have proper authority to issue the commands (if help is needed, see the system operator). The diag command loads the diagnostic controller and displays the online diagnostic menus. Running the Online Diagnostics in Concurrent Mode To run online diagnostics in concurrent mode, do the following: 1. Log in to the AIX operating system as root user, or use CE Login. 2. Enter the diag command. 3. When the DIAGNOSTIC OPERATING INSTRUCTIONS display, follow the instructions to check out the desired resources. 4. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. Be sure to vary on any resource that you had varied to off. 5. Press Ctrl-D to log off from root user or CE Login.Maintenance Mode Maintenance mode runs the online diagnostics using the customer’s version of the AIX operating system. This mode requires that all activity on the partition running the AIX operating system be stopped so that the online diagnostics have most of the resources available to check. All of the system resources, except the SCSI adapters, memory, processor, and the disk drive used for paging, can be checked. Error log analysis is done in maintenance mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. Use the shutdown -m command to stop all activity on the AIX operating system and put the AIX operating system into maintenance mode. The diag command is then used to invoke the diagnostic controller so you can run the diagnostics. After the diagnostic controller is loaded, follow the normal diagnostic instructions. Running the Online Diagnostics in Maintenance Mode To run the online diagnostics in maintenance mode, do the following: 1. Stop all programs running on the partition except the AIX operating system (if help is needed, see the system operator). 2. Log in to the AIX operating system as root user, or use CE Login. 3. Type the shutdown -m command. 4. When a message indicates the system is in maintenance mode, enter the diag command. Note: It might be necessary to set TERM type again. 5. When DIAGNOSTIC OPERATING INSTRUCTIONS screen displays, follow the displayed instructions to check out the desired resources. 6. When testing is complete, use the F3 key to return to DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. 7. Press Ctrl-D to log off from root user or CE Login. Chapter
6.
Using
the
Online
and
Standalone
Diagnostics
763
Standalone Diagnostic Operation Use standalone diagnostics to test the system when the online diagnostics are not installed and as a method of testing the disk drives that cannot be tested by the online diagnostics. Note: No error log analysis is done by the standalone diagnostics. The CD-ROM drive and the SCSI controller that controls it cannot be tested by the standalone diagnostics. Standalone diagnostics: v Are resident on CD-ROM or a Network Installation Management (NIM) server v Provide a method to test the system when the online diagnostics are not installed or cannot be loaded from the disk drive v Allow testing of the disk drives and other resources that cannot be tested by the online diagnostics v Do not have access to the AIX configuration data v Do not have access to the AIX error log v Do not allow for running of error log analysisConsiderations for Running Standalone Diagnostics on a Partitioned System To run standalone diagnostics on a full system partition, you must reboot the entire system. However, for a partition in a partitioned system, you can boot standalone diagnostics either in a given partition or on the entire system (which is the same procedure as a full system partition). For a partitioned system, before running standalone diagnostics on a given partition, the user must move the device from the existing location where standalone diagnostics is booted (the CD-ROM drive or the network adapter connected to the NIM server, in the case of NIM boot of standalone diagnostics), to the partition that will run standalone diagnostics. Devices on a partitioned system are moved on an I/O-slot basis. If the CD-ROM drive is moved from one partition to another, all SCSI devices on the same SCSI adapter card which the CD-ROM drive is attached must be moved to the same partition. It is recommended that you attach few SCSI devices to the same SCSI controller card to prevent moving them, along with the CD-ROM drive, between partitions. For partitions running AIX levels earlier than AIX 5.2, a reboot is required on the partition containing this device and on the moved device, but the system itself is not powered off in order to boot standalone diagnostics in a logical partitioned system partition. Considerations for Running Standalone Diagnostics from CD-ROM Consider the following when you run standalone diagnostics: v The diagnostic CD must remain in the CD-ROM drive for the entire time that diagnostics are running. v The diagnostic CD cannot be ejected from the CD-ROM drive after the diagnostics have loaded. The CD can only be ejected after the system has been turned off and then turned on (standalone mode) or after the diagnostics program has terminated (online concurrent mode). v The CD-ROM drive from which diagnostics were loaded cannot be tested. v The SCSI adapter (or circuitry) controlling the CD-ROM drive from which diagnostics were loaded cannot be tested.Loading the Standalone Diagnostics from CD-ROM Using an HMC To run standalone diagnostics in service mode from CD-ROM, do the following: 1. Verify with the system administrator and users that all programs may be stopped, including the AIX operating system (get help if needed). Stop all programs. 2. Remove all tapes, diskettes, and CD-ROMs. 3. From the following list, select the type of partitioned system you are running the standalone diagnostics in, then follow the power or reboot instructions. 764
406/90
Service
Guide
v If you are running standalone diagnostics in a full machine partition, power off the system unit. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the system. v If you are running standalone diagnostics in an LPAR partition, reboot the partition that will be running the standalone diagnostics.Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h command. 4. Immediately insert the Standalone Diagnostic CD into the CD-ROM drive, located on the media subsystem. Do Not use the HMC’s CD-ROM drive to do this operation. 5. When the keyboard POST indicator appears, press the F5 key on the directly attached keyboard, or the number 5 key on the TTY keyboard. 6. Enter any requested passwords. 7. Follow any instructions to select the console. 8. After the diagnostic controller loads, DIAGNOSTIC OPERATING INSTRUCTIONS appear on the console display. 9. Follow the displayed instructions to test the desired resources. 10. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS.Running the Standalone Diagnostics with CD-ROM with No HMC Attached To load standalone diagnostics from the default boot list, perform the following procedure: To run standalone diagnostics in service mode from CD-ROM, do the following: 1. Verify with the system administrator and users that all programs may be stopped, including the AIX operating system (get help if needed). Stop all programs. 2. Remove all tapes, diskettes, and CD-ROMs. 3. From the following list, select the type of partitioned system you are running the standalone diagnostics in, then follow the power or reboot instructions. v If you are running standalone diagnostics in a full machine partition, power off the system unit. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the system. v If you are running standalone diagnostics in an LPAR partition, reboot the partition that will be running the standalone diagnostics.Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h command. 4. Immediately insert the Standalone Diagnostic CD into the CD-ROM drive, located on the media subsystem. 5. When the keyboard POST indicator appears, press the F5 key on the directly attached keyboard, or the number 5 key on the TTY keyboard. 6. Enter any requested passwords. 7. Follow any instructions to select the console. 8. After the diagnostic controller loads, DIAGNOSTIC OPERATING INSTRUCTIONS appear on the console display. 9. Follow the displayed instructions to test the desired resources. 10. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS.
Chapter
6.
Using
the
Online
and
Standalone
Diagnostics
765
Running Standalone Diagnostics from a Network Installation Management (NIM) Server A client system connected to a network with a Network Installation Management (NIM) server can boot standalone diagnostics from the NIM server if the client-specific settings on both the NIM server and client are correct. Notes: 1. All operations to configure the NIM server require root user authority. 2. If you replace the network adapter in the client, the network adapter hardware address settings for the client must be updated on the NIM server. 3. The Cstate for each standalone diagnostics client on the NIM server should be kept in the diagnostic boot has been enabled state. 4. On the client system, the NIM server network adapter should be put in the bootlist after the boot disk drive. This allows the system to boot in standalone diagnostics from the NIM server if there is a problem booting from the disk drive. Refer to the Multiboot section under “SMS” in the client system’s service guide for information on setting the bootlist.NIM Server Configuration Refer to the “Advanced NIM Configuration Tasks” chapter of the AIX Installation Guide and Reference, order number SC23-4389 for information on doing the following: v Registering a client on the NIM server v Enabling a client to run diagnostics from the NIM serverTo verify that the client system is registered on the NIM server and the diagnostic boot is enabled, run the command lsnim -a Cstate -Z ClientName from the command line on the NIM server. Refer to the following table for system responses. Note: The ClientName is the name of the system on which you want to run standalone diagnostics.
System Response
Client Status
#name:Cstate:ClientName:diagnostic boot has beenenabled:
The client system is registered on the NIM server and enabled to run diagnostics from the NIM server.
#name:Cstate:ClientName:ready for a NIM operation:
or #name:Cstate:ClientName:BOS installation has beenenabled:
The client is registered on the NIM server but not enabled to run diagnostics from the NIM server. Note: If the client system is registered on the NIM server but Cstate has not been set, no data will be returned.
0042-053 lsnim: there is no NIM objectnamed "ClientName"
The client is not registered on the NIM server.
766
406/90
Service
Guide
Booting Standalone Diagnostics from the NIM Server on a System with an HMC Attached To run standalone diagnostics on a client from the NIM server, do the following: 1. Remove any removable media (tape or CD-ROM). 2. Stop all programs including the AIX operating system (get help if needed). 3. If you are running standalone diagnostics in a full system partition, verify with the system administrator and system users that the system unit can shut down. Stop all programs, including the AIX operating system. Refer to the AIX operating system documentation for shutdown command information. Verify with the system administrator and system users using that partition that all applications on that partition must be stopped, and that the partition will be rebooted. Stop all programs on that partition, including the operating system. 4. If you are in a full system partition, power on the system unit to run standalone diagnostics. In a partitioned system, reboot the partition to run standalone diagnostics. 5. When the keyboard indicator is displayed (the word keyboard on a HMC virtual terminal window or the keyboard icon on a graphical display), press the number 1 key on the keyboard to display the SMS menu. 6. Enter any requested passwords. 7. Select Set Up Remote IPL (Initial Program Load). 8. Enter the client address, server address, gateway address (if applicable), and subnet mask. 9. If the NIM server is set up to allow pinging from the client system, use the ping utility in the RIPL utility to verify that the client system can ping the NIM server. Under the ping utility, choose the network adapter that provides the attachment to the NIM server to do the ping operation. If ping returns with an OK prompt, the client is prepared to boot from the NIM server. If ping returns with a FAILED prompt, the client cannot proceed with the NIM boot. Note: If the ping fails, refer to “Boot Problems” on page 409 and follow the steps for network boot problems.To do a one-time boot of the network adapter attached to the NIM server network, do the following: 1. Exit to the SMS Main screen. 2. Select Select Boot Options. 3. Select Install or Boot a Device. 4. On the Select Device Type screen. select Network. 5. Set the network parameters for the adapter from which you want to boot. 6. Exit completely from SMS. The system starts loading packets while doing a bootp from the network.Follow the instructions on the screen to select the system console. v If Diagnostics Operating Instructions Version x.x.x displays, standalone diagnostics have loaded successfully. v If the AIX login prompt displays, standalone diagnostics did not load. Check the following items: – The network parameters on the client may be incorrect. – Cstate on the NIM server may be incorrect. – Network problems might be preventing you from connecting to the NIM server. Verify the settings and the status of the network. If you continue to have problems, refer to “Boot Problems” on page 409 and follow the steps for network boot problems. Chapter
6.
Using
the
Online
and
Standalone
Diagnostics
767
768
406/90
Service
Guide
Chapter 7. Using the Service Processor Notes: 1. The information in this chapter regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). None of this information is applicable to the serial ports, or modems attached to those serial ports, on the Hardware Management Console (HMC). 2. On some of the system management services (or service processor) screens, you will see the term LPAR, which is equivalent to the term partitioned system. The service processor runs on its own power boundary and continually monitors hardware attributes and the environmental conditions within the system. The service processor is controlled by firmware and does not require the operating system to be operational to perform its tasks. The service processor menus allow you to configure service processor options, as well as enable and disable functions. Service processor menus are available using an HMC virtual terminal window when OK is displayed on the operator panel or when the service processor has detected a server problem (such as a surveillance failure). Service Processor Menus The service processor menus are divided into the following groups: v General user menu - the user must know the general-access password. v Privileged user menus - the user must know the privileged-access password.If the server is powered off, the service processor menus can be accessed locally or remotely on the following: v Serial port 1 (S1) v Serial port 2 (S2) v The HMCAccessing the Service Processor Menus Locally Service processor menus can be accessed by opening a virtual terminal window on the HMC, or by attaching an ASCII terminal to serial port 1 (S1) or serial port 2 (S2). After OK displays in the operator panel, press any key on the keyboard to signal the service. Note: The service processor menus cannot be accessed simultaneously on a virtual terminal window on the HMC and on an ASCII terminal. Accessing the menus on the HMC locks out access to the ASCII terminals and the other way around. When you gain access, the service processor prompts you for a password (if one is set), and when verified, displays the service processor menus. The service processor menu prompt, represented by 0> or 1>, indicates the serial port to which the terminal is connected. v An ASCII terminal can have the following prompts: – 0> indicates serial port 1 (S1) – 1> indicates serial port 2 (S2)v The HMC always indicates 0>
769
Accessing the Service Processor Menus Remotely If your system has a modem connected to serial port 1 or serial port 2 and is configured for call-in (see “Modem Configuration Menu” on page 796), access the service processor menus remotely as follows: 1. With the system powered off, call in from a remote terminal. 2. The service processor detects ring-indicate and prompts you for a password (if one is set). When verified, the service processor menus display remotely.Saving and Restoring Service Processor Settings All the settings that you make (except language) from the service processor menus can be backed up either for recovering from a fault that may corrupt these settings, or for replicating these settings to other servers that include a service processor. The service aid, Save or Restore Hardware Management Policies, can be used to save your settings after initial setup or whenever the settings must be changed for system operation purposes. It is strongly recommended that you use this service aid for backing up service processor settings to protect the usefulness of the service processor and the availability of the server. Refer to “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” section of the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If this task cannot be run, or the service processor settings were not previously backed up, the settings should be recorded manually. To record the settings manually, do the following: 1. At the service processor main menu, select option 3, ?System Information Menu. 2. At the system information menu, select option 6, ?Read Service Processor Configuration?. 3. Manually record the settings.Menu Inactivity The service processor exits menu mode after ten minutes of inactivity and displays a message indicating that it has done so. Pressing any key on the virtual terminal window causes the main menu to display. 770
406/90
Service
Guide
General User Menu The menu options presented to the general user are a subset of the options available to the privileged user. The user must know the general-access password, if one is set, to access this menu. GENERAL USER MENU 1. Power-on System 2. Power-off System 3. Read VPD Image from Last System Boot 4. Read Progress Indicators from Last System Boot 5. Read Service Processor Error Logs 6. Read System POST Errors 99. Exit from Menus 0> v Power-on System Allows the user to start the system using the current virtual terminal window as the active console. v Power-off System This option is not available on this system. v Read VPD Image from Last System Boot Displays manufacturer vital product data, such as serial numbers, part numbers, and so on, that were stored from the system boot prior to the one in progress now, for the entire system. v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults.Note: If you are running one or more logical partitions, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. If your system is running in full system partition mode, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 818. v Read Service Processor Error Logs Displays the service processor error logs. For an example, refer to “Service Processor Error Logs” on page 817. v Read System POST Errors Displays additional error log information (this option is only for service personnel). v Exit from Menus Selecting this option will exit the service processor menus. You can reenter the menus by pressing any key on the console. Chapter
7.
Using
the
Service
Processor
771
Privileged User Menus The following menus are available to privileged users only. The user must know the privileged-access password, if one is set, to access these menus. Main Menu A listing at the top of the main menu contains the following: v Your system’s current firmware version v The firmware copyright notice v The system name given to your server during setupYou need the firmware version for reference when you either update or repair the functions of your service processor. The system name, an optional field, is the name that your server reports in problem messages. This name helps your support team (for example, your system administrator, network administrator, or service representative) to more quickly identify the location, configuration, and history of your server. Set the system name, from the main menu, using option 6. Note: The information under the Service Processor Firmware heading in the following Main Menu illustration is example information only. Service Processor Firmware VERSION: RH011007 Copyright 2001 IBM Corporation SYSTEM NAME MAIN MENU 1. Service Processor Setup Menu 2. System Power Control Menu 3. System Information Menu 4. Language Selection Menu 5. Call-In/Call-Out Setup Menu 6. Set System Name 99. Exit from Menus 0> v Service Processor Setup Menu See “Service Processor Setup Menu” on page 773 for more information. v System Power Control Menu See “System Power Control Menu” on page 778 for more information. v System Information Menu See “System Information Menu” on page 781 for more information. v Language Selection Menu See “Language Selection Menu” on page 794 for more information. v Call-In/Call-Out Setup Menu See “Call-In/Call-Out Setup Menu” on page 795 for more information. v Set System Name Allows setting of the system name.
772
406/90
Service
Guide
Service Processor Setup Menu The following Service Processor Setup Menu is accessed from the Main Menu: SERVICE PROCESSOR SETUP MENU 1. Change Privileged Access Password 2. Change General Access Password 3. Enable/Disable Console Mirroring: Currently Enabled 4. Start Talk Mode 5. OS Surveillance Setup Menu NOT supported in LPAR mode. 6. Reset Service Processor 7. Reprogram Flash EPROM Menu 8. Serial Port Snoop Setup Menu NOT supported in LPAR mode. 9. Scan Log Dump Policy: Currently Never 98. Return to Previous Menu 99. Exit from Menus 0> Note: Unless otherwise stated in menu responses, settings become effective when a menu is exited using option 98 or 99. Chapter
7.
Using
the
Service
Processor
773
Passwords Passwords can be any combination of up to eight alphanumeric characters. You can enter longer passwords, but the entries are truncated to include only the first eight characters. The privileged-access password can be set from service processor menus or from System Management Services (SMS) utilities (see Chapter 8, “Using System Management Services,” on page 823). The general-access password can be set only from service processor menus. For security purposes, the service processor counts the number of attempts to enter passwords. The results of not recognizing a password within this error threshold are different, depending on whether the attempts are being made locally (at the server) or remotely (through a modem). The error threshold is three attempts. If the error threshold is reached by someone entering passwords at the server, the service processor commands the server to resume the initial program load (IPL). This action is taken based on the assumption that the server is in an adequately secure location with only authorized users having access. Such users must still successfully enter a login password to access the operating system. If the error threshold is reached by someone entering passwords remotely, the service processor commands the server to power off to prevent potential security attacks on the server by unauthorized remote users. The following table lists what you can access with the privileged-access password and the general-access password.
Privileged Access Password
General Access Password
Resulting Menu
None
None
Service processor MAIN MENU displays.
Set
None
Users with the password see the service processor MAIN MENU. Users without password cannot log in.
Set
Set
Users see menus associated with the entered password. If you forget the password, you must remove the battery for at least 30 seconds to disable the password. v Change Privileged-Access Password Set or change the privileged-access password. It provides the user with the capability to access all service processor functions. This password is usually used by the system administrator or root user. v Change General-Access Password Set or change the general-access password. It provides limited access to service processor menus, and is usually available to all users who are allowed to power on the server, especially remotely.Note: The general-access password can only be set or changed after the privileged access password is set. v Enable/Disable Console Mirroring Note: Console mirroring is disabled in partitioned systems. Console mirroring is supported on serial port 1 (S1) and serial port 2 (S2). When console mirroring is enabled, the service processor sends information to all serial ports. The serial port from which console mirroring is enabled is referred to as the active port. The mirror port is determined when keyboard input is detected from one of the other ports. From this point on, the service processor sends information only to the active port and the mirror port. This capability can be enabled by local or remote users, providing local users with the capability to monitor remote sessions. Console mirroring can be enabled for the current session only. For more information, see “Console Mirroring” on page 816. 774
406/90
Service
Guide
v Start Talk Mode In a console-mirroring session, it is useful for those who are monitoring the session to be able to communicate with each other. Selecting Start Talk Mode activates the keyboards and displays for such communications while console mirroring is established. This is a full duplex link, so message interference is possible. Alternating messages between users works best. v OS Surveillance Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up operating system (OS) surveillance. OS Surveillance Setup Menu 1. Surveillance: Currently Enabled 2. Surveillance Time Interval: 2 minutes 3. Surveillance Delay: 2 minutes 98. Return to Previous Menu 0> – Surveillance Can be set to Enabled or Disabled. – Surveillance Time Interval Can be set to any number from 2 through 255. – Surveillance Delay Can be set to any number from 0 through 255.Refer to “Service Processor System Monitoring - Surveillance” on page 813 for more information about surveillance. v Reset Service Processor If this option is selected, entering Y causes the service processor to reboot. v Reprogram Flash EPROM Menu This option updates the system EPROMs. After entering Y to indicate that you want to continue, you are prompted to enter the update diskettes. Follow the instructions on the screen. When the update is complete, the service processor reboots. All system EPROMs that can be reprogrammed are updated at the same time and are as follows: – System power control network programming – Service processor programming – System firmware programming – Run-Time Abstraction Services Chapter
7.
Using
the
Service
Processor
775
v Serial Port Snoop Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up serial port snooping, in which the user can configure serial port 1 as a ?catch-all? reset device. From the service processor main menu, select option 1, service processor setup menu, then select option 8 (Serial Port Snoop Setup Menu). SERIAL PORT SNOOP SETUP MENU 1. System reset string: Currently Unassigned 2. Snoop Serial Port: Currently Unassigned 98. Return to Previous Menu 1> Use the Snoop Serial Port option to select the serial port to snoop. Note: Only serial port 1 is supported. Use the system reset string option to enter the system reset string, which resets the machine when it is detected on the main console on Serial Port 1. After serial port snooping is correctly configured, at any point after the system is booted to AIX, whenever the reset string is typed on the main console, the system uses the service processor reboot policy to restart. Because pressing Enter after the reset string is not required, make sure that the string is not common or trivial. A mixed-case string is recommended. v Scan Log Dump Policy A scan dump is the collection of chip data that the service processor gathers after a system malfunction, such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and SCOM contents. The scan dump data are stored in the system control store. The size of the scan dump area is approximately 4 MB. During the scan log dump, A8xx (in the range A810 to A8FF) displays in the operator panel. The xx characters will change as the scan log dump progresses. If the xx characters do not change after several minutes, the service processor is hung and must be reset. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: – Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel OR– Attempt to reboot.
776
406/90
Service
Guide
Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0>1 Select from the following options: (As Needed=2, Always=3) Enter New Option: 0>1
The scan log dump policy can be set to the following: 2=As Needed The processor run-time diagnostics record the dump data base on the error type. This is the default value. 3=Always Selecting this option allows the service processor to record a scan log dump for all error types.The scan log dump policy can also be set from the Tasks menu in the AIX service aids. Option 2 displays the following screen: Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0>2 Select from the following options: (As Requested=1, Optimum=2, Complete=3, Minimum=4)) Enter New Option: 0>0 The scan log dump content can be set to the following: 1 = As needed The processor run-time diagnostics will select the the contents of the dump file based on the type of error that occurs. This is the default value. 2 = Optimum The dump will include the smallest amount of information possible (a minimum number of hardware scan log rings). 3 = Complete The dump will include as much information as possible to allow the complete analysis of hardware and software errors. 4 = Minimum The dump will include the smallest amount of information possible (a minimum number of hardware scan log rings).
Chapter
7.
Using
the
Service
Processor
777
The complete dump will take the longest time to finish; it may take as long as 1.5 hours on a fully configured system. Option 3, Immediate Dump, can only be used when the system is in the standby state with power on. It is used to dump the system data after a checkstop or machine check occurs when the system firmware is running, or when the operating system is booting or running. The scan log dump policy can also be set from the Tasks menu in the AIX service aids. If a valid dump file already exists, the dump control code will stop because the contents of the prior dump must be protected.System Power Control Menu This menu is used to set power control options. Other menus that control boot options are available from this menu. SYSTEM POWER CONTROL MENU 1. Enable/Disable Unattended Start Mode: Currently Enabled 2. Ring Indicate Power-On Menu 3. Reboot/Restart Policy Setup Menu 4. Power-On System 5. Power-Off System 6. Enable/Disable Fast System Boot Currently Enabled 7. Boot Mode Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Enable/Disable Unattended Start Mode Use this option to instruct the service processor to restore the power state of the server after a temporary power failure. This option is intended to be used on servers that require automatic power-on after a power failure. For more information, see “System Power-On Methods” on page 802. v Ring Indicate Power-On Menu RING INDICATE POWER-ON MENU 1. Ring indicate power-on : Currently Enabled 2. Number of rings: Currently 3 30. Refresh Modem Settings 98. Return to Previous Menu Ring indicate power-on is enabled by default on both serial port 1 (S1) and serial port 2 (S2). When ring indicate power-on is enabled, call-in is disabled. If ring indicate power-on is enabled and call-in is already enabled, you will be asked to confirm your choice. Refer to the message displayed on your screen. 778
406/90
Service
Guide
If the ring indicate power-on setting is changed, you must select option 30, Refresh Modem Settings to update the modem settings. If Refresh Modem Settings is selected, and the modem(s) have not been configured, you will be asked to configure the modems first. See “Call-In/Call-Out Setup Menu” on page 795 for information on configuring modems. Option 2 is used to set the number of rings. v Reboot/Restart Policy Setup Menu The following menu controls the Reboot/Restart Policy: Reboot/Restart Policy Setup Menu 1. Number of reboot attempts: Currently 1 2. Use OS-Defined restart policy? Currently No 3. Enable supplemental restart policy? Currently Yes 4. Call-Out before restart: Currently Disabled 98. Return to Previous Menu 0> Reboot is the process of bringing up the system hardware; for example, from a system reset or power on. Restart is activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. – Number of reboot attempts - If the server fails to successfully complete the boot process, it attempts to reboot the number of times specified. Entry values equal to or greater than 0 are valid. Only successive failed reboot/restart attempts are counted. – Use OS-Defined restart policy - In a full system partition, this allows the service processor to react in the same way that the operating system does to major system faults by reading the setting of the operating system parameter Automatically Restart/Reboot After a System Crash. This parameter might already be defined, depending on the operating system or its version (or level). If the operating system automatic restart setting is defined, it can be set to respond to a major fault by restarting or by not restarting. See your operating system documentation for details on setting up operating system automatic restarts. The default value is No. On a partitioned system, this setting is ignored. – Enable supplemental restart policy - The default setting is Yes. When set to Yes in a full system partition, the service processor restarts the system when the system loses control as detected by service processor surveillance, and either: - The Use OS-Defined restart policy is set to No. OR - The Use OS-Defined restart policy is set to Yes, and the operating system has no automatic restart policy. If set to Yes in a partitioned system, the service processor restarts the system when the system loses control and it is detected by service processor surveillance.– Call-Out before restart (Enabled/Disabled) - If a restart is necessary due to a system fault, and you are running a full system partition, you can enable the service processor to call out and report the event. This option can be valuable if the number of these events becomes excessive, which might signal a bigger problem. This setting is ignored on a partitioned system. Chapter
7.
Using
the
Service
Processor
779
v Power-On System Allows immediate power-on of the system. v Power-Off System This option is not available on this system. v Enable/Disable Fast System Boot Allows the user to select the IPL type, mode, and speed of the system boot. Attention: Selecting the fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. v Boot Mode Menu Note: This option is disabled in partitioned systems. The Boot Mode Menu allows you to select a boot mode. Boot Mode Menu 1. Boot to SMS Menu: Currently Disabled 2. Service Mode Boot from Saved List: Currently Disabled 3. Service Mode Boot from Default List: Currently Disabled 4. Boot to Open Firmware Prompt: Currently Disabled 98. Return to Previous Menu 0> To select a boot mode, select a number and press Enter. The item corresponding to the selected number toggles between Disabled to Enabled. If a boot mode is Enabled, the boot mode selected is performed, and the Disabled/Enabled selection is reset to Disabled. The following describes each boot mode: – Boot to SMS Menu When this selection is enabled, the system boots to the System Management Services (SMS) Menu. – Service Mode Boot from Saved List This selection causes the system to perform a service mode boot using the service mode boot list saved in NVRAM. If the system boots AIX from the disk drive and AIX diagnostics are loaded on the disk drive, AIX boots to the diagnostics menu. Using this option to boot the system is the preferred way to run online diagnostics. – Service Mode Boot from Default List This selection is similar to Service Mode Boot from Saved List, except the system boots using the default boot list that is stored in the system firmware. This is normally used to try to boot customer diagnostics from the CD-ROM drive. Using this option to boot the system is the preferred way to run standalone diagnostics. – Boot to Open Firmware This option should only be used by service personnel to obtain additional debug information. When this selection is enabled, the system boots to the open firmware prompt. 780
406/90
Service
Guide
System Information Menu This menu provides access to system configuration information, error logs, system resources, and processor configuration. SYSTEM INFORMATION MENU 1. Read VPD Image from Last System Boot 2. Read Progress Indicators from Last System Boot 3. Read Service Processor Error Logs 4. Read System POST Errors 5. Read NVRAM 6. Read Service Processor Configuration 7. Processor Configuration/Deconfiguration Menu 8. Memory Configuration/Deconfiguration Menu 9. Power Control Network Utilities Menu 10. LED Control Menu 11. MCM/L3 Interposer Plug Count Menu 12. Performance Mode Setup Menu 13. L3 Mode Menu Not Applicable 14. Remote I/O (RIO) Link Speed Setup Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Read VPD Image from Last System Boot Displays manufacturer’s vital product data (VPD), such as serial numbers, part numbers, and so on, that was stored from the system boot prior to the one in progress now. VPD from all devices in the system is displayed. v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults.Note: If you are running from a partitioned system, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. In a full system partition, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 818. v Read Service Processor Error Logs Displays error conditions detected by the service processor. Refer to “Service Processor Error Logs” on page 817 for an example of this error log. v Read System POST Errors Chapter
7.
Using
the
Service
Processor
781
This option should only be used by service personnel to obtain additional debug information. v Read NVRAM Displays Non Volatile Random Access Memory (NVRAM) content. v Read Service Processor Configuration Displays current service processor configuration. v Processor Configuration/Deconfiguration Menu Enable/Disable CPU Repeat Gard: CPU repeat gard automatically deconfigures a CPU during a system boot if a processor has failed BIST (built-in self test), caused a machine check or check stop, or has reached a threshold of recoverable errors. The processor will remain deconfigured until repeat gard is disabled or the processor is replaced. CPU repeat gard is enabled by default. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 812. Enable/Disable Dynamic Processor Sparing: This option is part of the Capacity Upgrade on Demand (CUoD) function. Dynamic processor sparing is the capability of the system to deconfigure a failing (or potentially failing) processor and then configure a replacement processor from the unlicensed CUoD processor pool. – If the system is running logical partitions, the dynamic processor sparing operation takes place with the system running. – If the system is booted in a full system partition, the system must be rebooted for the dynamic processor sparing operation to take place.Note: The memory affinity of the failing processor is not taken into account when the replacement processor is assigned; the replacement processor is the next one that is available. The Processor Configuration/Deconfiguration menu allows the user to change the system processor configuration. If it is necessary to take one of the processors offline, use this menu to deconfigure the processor, and then reconfigure the processor at a later time. An example of this menu follows: PROCESSOR CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable CPU Repeat Gard: Currently Enabled 78. Enable/Disable Dynamic Processor Sparing (if available): Currently enabled 1. 0 3.0 (00) Configured by system 2. 1 3.1 (00) Deconfigured by system 3. 2 3.2 (00) Configured by system 4. 3 3.3 (00) Configured by system 5. 4 3.4 (00) Configured by system 6. 5 3.5 (00) Deconfigured by system 7. 6 3.6 (00) Configured by system 8. 7 3.7 (00) Configured by system 98. Return to Previous Menu 0> Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into AIX. The fields of the previous table represent the following: Column 1 (1.) Menu selection index. 782
406/90
Service
Guide
Column 2 (0) Logical processor device number assigned by AIX. You can display these logical device numbers by issuing the following command on the AIX command line: lsdev -C | grep proc Column 3 (3.0) Processor address list used by the service processor. Column 4 (00) Error status of the processors.The error status of each processor is indicated by AB, where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot.A status of 00 indicates that the CPU has not had any errors logged against it by the service processor. To enable or disable CPU repeat gard, use menu option 77. The default is enabled. If CPU repeat gard is disabled, processors that are in the ?deconfigured by system? state will be reconfigured. These reconfigured processors are then tested during the boot process, and if they pass, they remain online. If they fail the boot testing, they are deconfigured even though CPU repeat gard is disabled. The failure history of each CPU is retained. If a processor with a history of failures is brought back online by disabling repeat gard, it remains online if it passes testing during the boot process. However, if CPU repeat gard is enabled, the processor is taken offline again because of its history of failures. Notes: 1. The processor numbering scheme used by the service processor is different from the numbering scheme used by AIX. To ensure the correct processor is selected, consult the AIX documentation before configuring or deconfiguring a processor. 2. The number of processors available to AIX can be determined by issuing the following command on the AIX command line: bindprocessor -q.v Memory Configuration/Deconfiguration Menu Enable/Disable Memory Repeat Gard: Memory repeat gard partially or fully deconfigures a memory book automatically during a system boot. If a memory book has failed BIST POST, caused a machine check or checkstop, or has reached a threshold of recoverable errors, the memory will remain deconfigured until repeat gard is disabled or the memory is replaced. Memory repeat gard is enabled by default. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 812. Runtime Recoverable Error Repeat Gard: The runtime recoverable error repeat gard flag controls the deallocation of the memory if a recoverable error occurs during runtime. If a recoverable memory error occurs, and the runtime recoverable error repeat gard is disabled, the system will continue running with no change in the memory configuration. If a recoverable memory error occurs, and runtime recoverable error repeat gard is enabled, the memory half-book or book in which the error occurred will be taken offline. Runtime Recoverable Error Repeat Gard is disabled by default. These configuration/deconfiguration menus allow the user to change the system memory configuration. If it is necessary to take one of the memory books partially or completely offline, this menu allows you to deconfigure a book, and then reconfigure the book at a later time. This menu also allows you to see if the repeat gard function has partially or completely deconfigured a memory book. When this option is selected, a menu displays. The following is an example of this menu: Chapter
7.
Using
the
Service
Processor
783
MEMORY CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable Memory Repeat Gard: Currently Enabled 78. Runtime Recoverable Error Repeat Gard: Currently Enabled 1. Memory card 98. Return to Previous Menu
After you select the memory card option by entering 1, a menu displays, allowing the selection of a memory book. The following is an example of this menu. MEMORY CONFIGURATION/DECONFIGURATION MENU 1: 16.16(00, -) Configured by system 2: 17.17(00, -) Configured by system 3: 18.18(00, -) Configured by system 4: 19.19(00, 1) Partially deconfigured by system 98. Return to Previous Menu Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into AIX. The fields in the previous table represent the following: Column 1 1. Menu selection index/book number Column 2 xx.xx : Book address used by service processor Column 3 (00, -) Error/deconfiguration statusThe error status of the each memory book is indicated by (AB,-1) where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot. An error status of (00, -) (for example, 11.16(00, -)) indicates that the memory book has not had any errors logged against it by the service processor, and it is fully configured. The field after the error status will be a dash (-), 0, or 1. The dash indicates that the memory book is fully configured. A 0 or 1 indicates that memory repeat gard has deconfigured half of the memory book. If this occurs, the status of the book in the menu is shown as Partially deconfigured by system. To change the memory configuration, select the number of the memory book. The memory book state will change from configured to deconfigured or from deconfigured to configured. This menu only allows the deconfiguration of an entire book; it does not allow the manual deconfiguration of half a book. If half a book has been configured by the system (Partially deconfigured), it can be manually reconfigured using this menu. In the previous example menu, each line shows two books and indicates whether they are configured, deconfigured, or partially deconfigured. To enable or disable Memory Repeat Gard, use menu option 77 of the Memory Configuration/Deconfiguration Menu. The default is enabled. 784
406/90
Service
Guide
To enable or disable runtime recoverable error repeat gard, use option 78 of the Memory Configuration/Deconfiguration Menu. The default is enabled. The failure history of each book is retained. If a book with a history of failures is brought back online by disabling Repeat Gard, it remains online if it passes testing during the boot process. However, if Repeat Gard is enabled, the book is taken offline again because of its history of failures. The four inner memory books, if present, are listed first, followed by the four outer books, if present. The memory books are shown in the following order of physical location code:
1
U1.18-P1-M2
5
U1.18-P1-M4
2
U1.18-P1-M3
6
U1.18-P1-M8
3
U1.18-P1-M7
7
U1.18-P1-M5
4
U1.18-P1-M6
8
U1.18-P1-M1
v Power Control Network Utilities Menu
POWER CONTROL NETWORK UTILITIES MENU 1. Lamp Test for all Operator Panels 2. Display I/O Type 3. Change I/O Type 4. Collect & display SPCN trace data 5. Start SPCN Flash Update 6. Display Power Subsystem FRU Code Levels 7. Power Subsystem Code Update via the Power Control Network Currently Enabled 8. Start Power Subsystem Code Update 98. Return to Previous Menu 0> – Lamp Test for All Operator Panels Selecting this option tests the media drawer operator panel’s indicators by causing them to blink on and off for approximately 30 seconds. – Display I/O Type This option, when selected, will display the SPCN address of the CEC drawer (D0), and the I/O type of the service processor (which is part of the primary I/O book assembly). – Change I/O Type Use this option to change the I/O type of the primary I/O book after a service action or configuration change if the I/O type is incorrect. If this option is chosen, you will be asked to make the following entries: - For the I/O drawer address, type 1. - For the I/O type, type: - If a High Performance Switch is installed in or attached to the system, type 9D. - If a High Performance Switch is not installed in or attached to the system, type 98.If either value is not valid, a failure message displays on the console. Press Enter to return to the Power Control Network Utilities Menu. – Collect & display SPCN trace data This option is used by service personnel to dump the SPCN trace data from the processor subsystem (CEC drawer) to gather additional debug information. Chapter
7.
Using
the
Service
Processor
785
To dump the SPCN trace data for the processor subsystem (CEC drawer), enter 0 when prompted for the SPCN drawer number and enter d0 when prompted for the unit address, as shown in the example screen: Enter the SPCN drawer number: 0 Enter the unit address: d0 Note: It may take up to 5 minutes for the trace data to be dumped to the screen. The SPCN trace data will be displayed on the screen. An example of this output follows: Please wait.... ***** Power Trace Data Start ***** 00000000ffffffffffff0006158800a000061574a00000060200353700060210 031300060242000000060241040b0006110a0040000611090200000611020804 0000000000061102080b0006041000000006d0e3850000061103000000061109 02000006040200000006040200010006041000010006d0e38501000604100002 000000000006d0e385020006041000030006d0e3850300061580100000061540 03180006101116000006101116010006158010010007158156a0000711000010 00000000000910120000000910120001 ***** Power Trace data End ***** (Press Return to Continue) v Start SPCN Flash Update On this system, selecting this option initiates updating (or ?flashing?) of the SPCN (system power control network) firmware. v Display Power Subsystem FRU Code Levels Selecting this option displays information about each power subsystem FRU. The physical location code, and the firmware level (which is shown as the ?Code level?) that is currently installed on the FRU, as well as other information, are displayed. A sample of the output of this option, which shows several of the types of power subystem FRUs in this system, follows. 786
406/90
Service
Guide
Please wait.... Fru type = 00f6 BPC-RH Cage id = ff BPA Port = A Fru id = 80 BPC Location Code = U1.35-P1-X4 Code level = 0000255e Status = 01 Successfully collected Fru type = 0048 DCA-RHP Cage id = 00 CEC Cage Fru id = 90 DCA 1 Location Code = U1.18-P1-V1 Code level = 00000027 Status = 01 Successfully collected Fru type = 0050 MDA-X Cage id = 00 CEC Cage Fru id = 98 MDA 1 Location Code = U1.18-X1 Code level = 00000007 Status = 01 Successfully collected Fru type = 0047 DCA-BC Cage id = 01 I/O Drawer 1 Fru id = 90 DCA 1 Location Code = U1.9-V1 Code level = 0000fb23 Status = 01 Successfully collected Fru type = 0054 MDA-BC Cage id = 01 I/O Drawer 1 Fru id = 98 MDA 1 Location Code = U1.9-V1 Code level = 00000005 Status = 01 Successfully collected
v Power Subsystem Code Update via the Power Control Network If this option is enabled (the default), the power subsystem’s code update will be controlled by the system power control network code. If this option is disabled, the power subsystem’s code update will be controlled by the HMC. Enabling the update by the SPCN locks out the update from the HMC and vise-versa. This option should only be disabled if a High Performance Switch was installed at the factory, or is added to the frame or connected to the system as an MES. If the High Performance Switch is removed from the system, or it is detached from the system, this option should be reenabled. v Start Power Subsystem Code Update Selecting this option forces the service processor to initiate a firmware update for all of the power subsystem codes; this includes the SPCN code. As the various types of firmware are updated, the service processor will report the progress of the update on the screen (as long as the system remains in standby) until the update is complete. An example of this output follows. Chapter
7.
Using
the
Service
Processor
787
0>8 Checking for SPCN code level... Updating SPCN Microcode PLEASE WAIT... System Power Control Network (SPCN) FLASH Update Successful: Initializing for Power Subsys code updates... Please wait...., spcn_rc = 21 Checking Power Subsystem code levels... Updating BPC (a0e00e01)... Please wait ..... Completed... (a0e00e01) Please wait...., spcn_rc = 21 Updating BPC (a0e00e01)... Please wait ..... Completed... (a0e00e01) Please wait...., spcn_rc = 21 Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Checking Power Subsystem code levels... Power Subsystem code update completed (Press Return to Continue)
v LED Control Menu This menu displays the state of the processor subsystem disturbance or system attention LED and the fault/identify LED on the I/O subsystem(s). Use this menu to toggle the attention/fault LEDs between identify (blinking) and off. Option 1 is available only when the system is in the error state (the CEC is powered on and the service processor menus are available). Option 1 is not available when the system is in standby. 788
406/90
Service
Guide
An example of this menu follows: LED Control Menu 1. Set/Reset Identify LED state 2. Clear System Attention Indicator 98. Return to Previous Menu Enter LED index 0 > The processor subsystem disturbance or system attention LED is located on the operator panel in the media drawer. The I/O drawer fault/identify LED is located on the front of each I/O subsystem. If option 1 is selected, a list of location codes of the I/O subsystems and the CEC drawer displays. The screen will be similar to the following: 1. U1.9-P1 2. U1.9-P2 3. U1.5-P1 4. U1.5-P2 5. U1.1-P1 6. U1.1-P2 7. U2.1-P1 8. U2.1-P2 9. U2.5-P1 10. U2.5-P2 Enter number corresponding to the location code, or press Return to continue, or ’x’ to return to the menu. 0>4 If one of the devices is selected using the index number, the present state of its LED will be displayed, and you are given the option to toggle it, as shown in the following screen. The final state of the LED will then be displayed, whether or not it was changed. U1.5-P2 is currently in the OFF state Select from the following (1=IDENTIFY ON, 2=IDENTIFY OFF) 0>2 Please wait... U1.5-P2 is currently in the OFF state (Press Return to continue) Option 2, Clear System Attention Indicator, will clear the attention indicator on the operator panel in the media drawer. v MCM/L3 Interposer Plug Count Menu Attention: Do not power on the system when in this menu. Fully exit this menu before powering on the system. This menu tracks the number of times that the MCMs and L3 cache modules have been plugged into the system backplane. Chapter
7.
Using
the
Service
Processor
789
If an MCM or L3 cache module is reseated or replaced, the plug count for the module must be incremented by 1. If the plug count exceeds the limit of 10 (reaches 11 or greater), a 450x yyyy or 4B2x yyyy error with a detail value of CFF0 that identifies an MCM or L3 cache module will be posted in the service processor error log. The FRU should be replaced during a deferred service call. If an MCM or L3 cache module is reseated or replaced or installed during an MES upgrade, the plug count must be set using the MCM/L3 Interposer Plug Count menu. If the plug count information is not included with the new or replacement module, enter the default value of 7. If the plug count is not entered, a B1xx 4698 error code, with a detail value of E10B or E10C, will be posted in the service processor error log. If the primary I/O book is replaced, the plug counts are retained. However the MCM/L3 Plug Count menu must be accessed and option 50, Commit the values and write to the VPD, must be executed so that the plug counts are revalidated. If the counts are not revalidated, a B1xx 4698 error code, with a detail value of E10B or E10C, will be posted in the service processor error log.Notes: 1. The plug count will be 0 for those positions in which L3 cache modules and MCMs have never been installed. 2. If MCM and L3 modules are removed from the system and not replaced, those plug counts are retained until they are manually changed. A screen similar to the following will be displayed. In this example, all four MCMs and all 16 L3 modules are shown. MCM/L3 Interposer Plug Count Menu 1. L3_13:7 2. L3_0:7 3. MCM_0:7 4. L3_1:7 5. L3_4:7 6. L3_14:7 7. L3_3:7 8. L3_2:7 9. L3_7:7 10. MCM_3:1 11. MCM_2:1 12. L3_15:7 13. L3_10:7 14. MCM_1:7 15. L3_11:7 16. L3_6:7 17. L3_12:7 18. L3_9:7 19. L3_8:7 20. L3_5:7 50. Commit the values and write to the VPD 98. Return to Previous Menu The format of the preceding menu entries is the menu index number followed by the plug count after the colon. The following table matches the index number with the physical location codes.
Menu Index Number
Physical Location Code
1. L3_13
U1.18-P1-C5
2. L3_0
U1.18-P1-C6
3. MCM_0
U1.18-P1-C1
4. L3_1
U1.18-P1-C7
5. L3_4
U1.18-P1-C8
6. L3_14
U1.18-P1-C9
7. L3_3
U1.18-P1-C10
8. L3_2
U1.18-P1-C11
9. L3_7
U1.18-P1-C12
10. MCM_3
U1.18-P1-C2
11. MCM_2
U1.18-P1-C3 790
406/90
Service
Guide
12. L3_15
U1.18-P1-C13
13. L3_10
U1.18-P1-C14
14. MCM_1
U1.18-P1-C4
15. L3_11
U1.18-P1-C15
16. L3_6
U1.18-P1-C16
17. L3_12
U1.18-P1-C17
18. L3_9
U1.18-P1-C18
19. L3_8
U1.18-P1-C19
20. L3_5
U1.18-P1-C20
To
change the plug count for a particular module, enter a menu index number. For example, to change the plug count of the L3 module that is physically in the upper-right corner (U1.18-P1-C8), type 5, then enter the new plug count. When all of the new plug counts have been entered, select 50, Commit the values and write to the VPD. This action stores the new values in NVRAM. Determining the number of MCMs in a system: To determine the number of MCMs installed in the system, do one of the following: – If the managed system is powered down, go to “System Information Menu” on page 781, then to the section titled “Processor Configuration/Deconfiguration Menu” and refer to the example menu. The VPD port is shown as x.y, where x is in the range 3-6 and y is in the range 0-7. The listing of the VPD ports on the screen is used to determine the number of MCMs in the managed system.
3.y=MCM0 (U1.18-P1-C1) 4.y=MCM1 (U1.18-P1-C4) 5.y=MCM2 (U1.18-P1-C3) 6.y=MCM3 (U1.18-P1-C2) – If AIX is booted on the managed system, issue the following command on the AIX command line, to display a line of information for each MCM installed in the system: lscfg -vp | grep MCM The following table shows the plugging order of each MCM and its associated L3 cache modules.
MCM
Physical Location CODE
Interposer Plug Count Menu Index Number
First MCM:L3 Modules
U1.18-P1-C1U1.18-P1-C6U1.18-P1-C7U1.18-P1-C10U1.18-P1-C11
3. MCM_02. L3_0 4. L3_17. L3_38. L3_2
Second MCM:L3 Modules
U1.18-P1-C4U1.18-P1-C14U1.18-P1-C15U1.18-P1-C18U1.18-P1-C19
14. MCM_113. L3_1015. L3_1118. L3_919. L3_8
Third MCM:L3 Modules
U1.18-P1-C3U1.18-P1-C8U1.18-P1-C12U1.18-P1-C16U1.18-P1-C20
11. MCM_25. L3_49. L3_7 16. L3_620. L3_5 Chapter
7.
Using
the
Service
Processor
791
MCM
Physical Location CODE
Interposer Plug Count Menu Index Number
Fourth MCM:L3 Modules
U1.18-P1-C2U1.18-P1-C5U1.18-P1-C9U1.18-P1-C13U1.18-P1-C17
10. MCM_31. L3_136. L3_1412. L3_1517. L3_12
v
Performance Mode Setup Menu If certain types of processor cards are installed in the system, this menu is not applicable. For other types of processor cards, this menu will be active after the first boot as noted below.Note: The first time the system is booted after NVRAM is cleared, Not Applicable displays under Performance Mode Setup Menu on the screen. This may also happen if the service processor is replaced, or the processor MCMs are upgraded. If option 12 is selected when Not Applicable is on the screen, the system responds with Not Applicable and redisplay the system information menu. The setup menu can be displayed after the performance mode is set, which happens the first time the system is booted. The default performance mode is set by the firmware during IPL. The default mode provides the optimum performance for the hardware configuration of the system. The performance mode is systemwide; it cannot be set on a per-partition basis. You can override the default setting by using the performance mode setup menu. The performance mode setup menu will be similar to the following:
Default Performance Mode: Large Commercial System optimization 1. Current Performance Mode: Large Commercial System optimization 98. Return to Previous Menu 0>1 Selecting option 1 displays the following performance modes: Select from the following options: 1. Large Commercial System optimization 2. Standard Operation 3. Turbo Database Mode 0> Notes: 1. Some MCM configuration changes, such as going from an 8-way to a 24-way system, will cause the default performance mode to change. This new setting will not be reflected in the menu until after the system is rebooted with the new configuration. 2. If the NVRAM has been cleared, the default performance mode and the current performance mode will indicate uninitialized. To override the default setting, a brief description of each performance mode follows: – Large Commercial System Optimization is the setting for systems that do not fall into the other two categories, standard operation and turbo database mode. This setting provides the best performance for most applications. – Standard Operation optimizes the system for large memory bandwidth applications where minimal sharing of data occurs and the likelihood of significant hardware data-prefetching exists. 792
406/90
Service
Guide
– Turbo Database Mode optimizes system operation for environments where there is a large amount of data sharing among processes running concurrently on the system.v L3 Mode Menu This function is not applicable to this system. v Remote I/O (RIO) Link Speed Set Up Menu This option is used to change the speed of the RIO interface to external drawers. On this system, setting this speed will have no effect on the system. The hardware capability is detected and set by the system firmware during the boot process. If the hardware capability is unitialized (after NVRAM is cleared), it will be set during the first boot and remain set during subsequent boots. The maximum RIO speed setting will remain uninitialized until set by the user. The user can set the speed lower than or equal to the hardware capability, but not higher. If the hardware capability is 1000 Mbps, the user can set the link speed to 500 Mbps, and the RIO link will run at 500 Mbps. However, if the hardware capability is 500 Mbps and the user selects 1000 Mbps, the user’s selection will be ignored by the system. If this option is chosen, the menu similar to the following is displayed: Remote I/O (RIO) Link Speed Set Up Menu Hardware Capability (internal and external): 1000 Mbps 1. Maximum RIO speed setting (internal and external): Currently Uninitialized 98. Return to previous menu. 0>1 If option 1 is selected, the RIO interface speed can be set as follows: Enter new value for this option: (1 = 1000 MBPS, 2 = 500 Mbps) 0> This setting will remain in effect until it is changed or NVRAM is cleared. Chapter
7.
Using
the
Service
Processor
793
Language Selection Menu The service processor menus and messages are available in various languages. This menu allows selecting languages in which the service processor and system firmware menus and messages are displayed. LANGUAGE SELECTION MENU 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol 98. Return to Previous Menu 99. Exit from Menus 0> Note: Your virtual terminal window must support the ISO-8859 character set to correctly display languages other than English. 794
406/90
Service
Guide
Call-In/Call-Out Setup Menu Note: The information in this section regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). These serial ports are normally used for call-in and call-out. Call-out is disabled in partitioned systems. The call-out function is normally handled by the Service Focal Point application running on the HMC. None of these menus are applicable to the serial ports, or modems attached to those serial ports, on the hardware management console (HMC). CALL-IN/CALL-OUT SETUP MENU 1. Modem Configuration Menu 2. Serial Port Selection Menu 3. Serial Port Speed Setup Menu 4. Telephone Number Setup Menu 5. Call-Out Policy Setup Menu 6. Customer Account Setup Menu 7. Call-Out Test NOT supported in LPAR mode. 98. Return to Previous Menu 99. Exit from Menus 0> v Modem Configuration Menu, see “Modem Configuration Menu” on page 796. v Serial Port Selection Menu, see “Serial Port Selection Menu” on page 797. v Serial Port Speed Setup Menu, see “Serial Port Speed Setup Menu” on page 798. v Telephone Number Setup Menu, see “Telephone Number Setup Menu” on page 798. v Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page 800. v Customer Account Setup Menu, see “Customer Account Setup Menu” on page 801. v Call-Out Test tests the configuration after the modem is installed and configured correctly.Note: If the system is running in partition mode, the call-out option is disabled. Chapter
7.
Using
the
Service
Processor
795
Modem Configuration Menu Note: This option applies only to a modem attached to serial port 1 (S1) or serial port 2 (S2) on the primary I/O book. The first two lines of the Modem Configuration Menu contain status of the current selections. Selections are made in the sections labeled Modem Ports and Modem Configuration File Name. Select the serial port that you want to activate and then select the modem configuration file for the modem on the port. If you want to set up all of the serial ports with modems, make your selections one port at a time. Modem Configuration Menu Port 1 Modem Configuration File Name: Port 2 Modem Configuration File Name: To make changes, First select the port and then the configuration file name Modem Ports: 1. Serial port 1 2. Serial port 2 Modem Configuration File Name: 5. none9. modem_z_sp 6. modem_f_sp 10. modem_m1_sp 7. modem_f0_sp 11. modem_m0_sp 8. modem_f1_sp 12. modem_m1_sp 30. Save configuration to NVRAM and Configure modem 98. Return to Previous Menu 0> For information on choosing a modem configuration file, see “Sample Modem Configuration Files” on page 1061 and “Transfer of a Modem Session” on page 1066. 796
406/90
Service
Guide
Serial Port Selection Menu This menu allows you to enable or disable the call-in and call-out functions of each serial port in any combination. Note: If the system is running in partition mode, call-out is disabled. Serial Port Selection Menu 1. Serial Port 1 Call-Out: Currently Disabled NOT supported in LPAR mode. 2. Serial Port 2 Call-Out: Currently Disabled NOT supported in LPAR mode. 3. Serial Port 1 Call-In: Currently Disabled 4. Serial Port 2 Call-In: Currently Disabled 98. Return to Previous Menu 0> Call-in and ring indicate power-on cannot be enabled at the same time. If ring-indicate power-on is already enabled and you try to enable call-in, a message prompts you for confirmation. Refer to the message displayed on the screen. Chapter
7.
Using
the
Service
Processor
797
Serial Port Speed Setup Menu This menu allows you to set serial port speed to enhance terminal performance or to accommodate modem capabilities. Serial Port Speed Setup Menu 1. Serial Port 1 Speed: Currently 9600 2. Serial Port 2 Speed: Currently 9600 98. Return to Previous Menu 0> A serial port speed of 9600 baud or higher is recommended. The following are valid serial port speeds:
50
600
4800
75
1200
7200
110
1800
9600
134
2000
19200
150
2400
38000
300
3600
57600
115200 Telephone Number Setup Menu Note: These menus are available, however, the function of calling out to report a system failure is normally handled by the Service Focal Point application running on the HMC. Use this menu to set or change the telephone numbers for reporting a system failure.
Telephone Number Setup Menu 1. Service Center Telephone Number: Currently Unassigned 2. Customer Administration Center Telephone Number: Currently Unassigned 3. Digital Pager Telephone Number: Currently Unassigned 4. Customer Voice Telephone Number: Currently Unassigned 5. Customer System Telephone Number: Currently Unassigned 98. Return to Previous Menu 0> 798
406/90
Service
Guide
v Service Center Telephone Number is the number of the service center computer. The service center usually includes a computer that takes calls from servers with call-out capability. This computer is referred to as the ?catcher.? The catcher expects messages in a specific format to which the service processor conforms. For more information about the format and catcher computers, refer to the readme file in the AIX /usr/samples/syscatch directory. Contact your service provider for the correct service center telephone number to enter. Until you have that number, leave this field unassigned. v Customer Administration Center Telephone Number is the number of the System Administration Center computer (catcher) that receives problem calls from servers. Contact your system administrator for the correct telephone number to enter here. Until you have that number, leave this field unassigned. v Digital Pager Telephone Number is the number for a numeric pager carried by someone who responds to problem calls from your server. Contact your administration center representative for the correct telephone number to enter. For test purposes, use a test number, which you can change later (see the note on page 815).Note: If the system is running in partition mode, call-out is disabled. However, if the system is booted in full system partition mode, at least one of the preceding three telephone numbers must be assigned in order for the call-out test to complete successfully. v Customer Voice Telephone Number is the telephone number of a phone near the server or answered by someone responsible for the server. This is the telephone number left on the pager for callback. For test purposes, use a test number, which you can change later. v Customer System Telephone Number is the telephone number to which your server’s modem is connected. The service or administrative center representatives need this number to make direct contact with your server for problem investigation. This is also referred to as the call-in phone number. Chapter
7.
Using
the
Service
Processor
799
Call-Out Policy Setup Menu Note: Call-out is disabled in partitioned systems. Call out settings can be set using the following menu: CALL-OUT POLICY SETUP MENU 1. Call-Out policy (First/All): Currently First 2. Remote timeout, (in seconds): Currently 120 3. Remote latency, (in seconds): Currently 2 4. Number of retries: Currently 2 98. Return to Previous Menu 0> v Call-Out policy can be set to first or all. If call-out policy is set to first, the service processor stops at the first successful call-out to one of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. PagerIf call-out policy is set to all, the service processor attempts a call-out to all of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. Pagerv Remote timeout and remote latency are functions of your service provider’s catcher computer. Either use the defaults or contact your service provider for recommended settings. v Number of retries is the number of times you want the server to retry calls that failed to complete. 800
406/90
Service
Guide
Customer Account Setup Menu This menu allows users to enter information that is specific to their account. Customer Account Setup Menu 1. Customer Account Number: Currently Unassigned 2. Customer RETAIN Login userid: Currently Unassigned 3. Customer RETAIN login password: Currently Unassigned 98. Return to Previous Menu 0> v Customer Account Number is assigned by your service provider for record-keeping and billing. If you have an account number, enter it. Otherwise, leave this field unassigned. v Customer RETAIN Login User ID and Customer RETAIN Login Password apply to a service function to which your service provider might have access. Leave these fields unassigned if your service provider does not use RETAIN.Call-out Test Call-out test is disabled in partitioned systems. Service Processor Parameters in Service Mode (Full System Partition) When the system is in service mode, the following service processor functions are suspended: v Unattended Start Mode v Reboot/Restart Policy v Call-Out v SurveillanceWhen service mode is exited, the service processor functions are re-activated. Chapter
7.
Using
the
Service
Processor
801
System Power-On Methods This section discusses the following system power-on methods: v Hardware Management Console (HMC) Using the HMC is the preferred method of powering on a managed system. For details on using the HMC to power up a system, see the Hardware Management Console Operations Guide. v Power-on Switch v Service Processor Menus Privileged users can power on the system by selecting the System Control Power Menu option from the main menu and then selecting the Power-on System option from the System Power Control Menu. General users should select Power-on System on the General User Menu. v Remote Power-on via Ring-Indicate Signal The server automatically powers on when it detects a ?ring indicate? signal from a modem attached to serial port 1 (S1) or serial port 2 (S2). A remote user can call the server to activate ring detection by the modem. Listen for a few more rings than the threshold number for starting the system. The system powers on without answering the call. v Unattended start mode - refer to Enable/Disable Unattended Start Mode on page 778. The service processor can be enabled to recover from the loss of ac power (see Enable/Disable Unattended Power-On Mode in the SYSTEM POWER CONTROL MENU). When ac power is restored, the system returns to the power state at the time ac loss occurred. For example, if the system was powered on when ac loss occurred, it reboots/restarts when power is restored. If the system was powered off when ac loss occurred, it remains off when power is restored. v Timed power-on - refer to the shutdown -t command on servers using AIX. Working in conjunction with AIX, the Service Processor in your server can operate a timer, much like the wake-up timer on your clock radio. You can set the timer so that your server powers on at a certain time after shutting down. The timer is battery-operated, so power interruptions that occur while the server is off do not affect its accuracy. Refer to the AIX shutdown -t command for details on setting the timer. Note: If an ac power loss (exceeding the hold-up time of any optional IBFs) is in progress when the timed power-on attempt occurs, the server cannot power on when ac power is restored. v Follow-up to a Failed Boot Attempt The service processor initiates a power-on sequence if a failed boot attempt is detected (due to a hardware or software failure). v Fast or Slow Boot (IPL) Using the service processor menus, you can select the IPL type, mode, and speed of your system. Attention: Selecting fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. 802
406/90
Service
Guide
Service Processor Reboot/Restart Recovery Reboot describes bringing the system hardware back up; for example, from a system reset or power-on. The boot process ends when control passes to the operating system process. Restart describes activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. Boot (IPL) Speed When the server enters reboot recovery, slow IPL is automatically started, which gives the POST an opportunity to locate and report any problems that might otherwise be unreported. Failure During Boot Process During the boot process, either initially after system power-on or upon reboot after a system failure, the service processor monitors the boot progress. If progress stops, the service processor can reinitiate the boot process (reboot) if enabled to do so. The service processor can re-attempt this process according to the number of retries selected in the Reboot/Restart Policy Setup Menu. Failure During Normal System Operation When the boot process completes and control transfers to the operating system (OS), the service processor can monitor operating system activity (see the Set Surveillance Parameters option in the SERVICE PROCESSOR SETUP MENU). If OS activity stops due to a hardware- or software-induced failure, the service processor can initiate a reboot/restart process based on the settings in the Service Processor Reboot/Restart Policy Setup Menu and the OS automatic restart settings (see the operating system documentation). If you are using the AIX operating system, the menu item under SMIT for setting the restart policy is Automatically Reboot After Crash. The default is false. When the setting is true, and if the service processor parameter ?Use OS-Defined Restart Policy? is yes (the default), the service processor takes over for AIX to reboot/restart after a hardware or surveillance failure. Service Processor Reboot/Restart Policy Controls The operating system’s automatic restart policy (see operating system documentation) indicates the operating system response to a system crash. The service processor can be instructed to refer to that policy by the Use OS-Defined Restart Policy setup menu. If the operating system has no automatic restart policy, or if it is disabled, then the service processor-restart policy can be controlled from the service processor menus. Use the Enable Supplemental Restart Policy selection. Use OS-Defined restart policy - The default setting is no. If set to yes on a full system partition, this causes the service processor to refer to the OS Automatic Restart Policy setting and take action (the same action the operating system would take if it could have responded to the problem causing the restart). When this setting is no, or if the operating system did not set a policy, the service processor refers to enable supplemental restart policy for its action. This setting is ignored on a partitioned system. Chapter
7.
Using
the
Service
Processor
803
Enable supplemental restart policy - The default setting is yes. When set to yes on a full system partition, the service processor restarts the server when the operating system loses control and either: The Use OS-Defined restart policy is set to No. OR The Use OS-Defined restart policy is set to Yes and the operating system has no automatic restart policy. If set to Yes on a partitioned system, the service processor restarts the system when the system loses control and it is detected by service processor surveillance. Refer to “Service Processor Reboot/Restart Recovery” on page 803. The following table describes the relationship among the operating system and service processor restart controls in a full system partition.
OS Automatic reboot/restart after crash setting
Service processor to use OS-Defined restart policy?
Service Processor Enable supplemental restart policy?
System response
None
No1
No
None
No1
Yes1
Restarts
None
Yes
No
None
Yes
Yes1
Restarts
False2
No1
No
False2
No1
Yes1
Restarts
False2
Yes
No
False2
Yes
Yes1
True
No1
No
True
No1
Yes1
Restarts
True
Yes
No
Restarts
True
Yes
Yes1
Restarts 1 Service processor default 2 AIX default In a partitioned system, the service processor’s supplemental restart policy is the only setting that is used, as shown in the following table:
Service Processor enable supplemental restart policy
System Response
No
Yes (default)
Restarts 804
406/90
Service
Guide
System Firmware Updates Attention: If the firmware on the system is RH021025 or later, the firmware can be updated to the latest level by the customer. If it is RH021024 or earlier, the firmware must be updated by a service representative. To determine the current level of firmware on the system, see “Determining the Level of Firmware on the System” on page 806. This section provides information and instructions for updating the system firmware. You may need to perform these steps if you are installing an option or if your support representative has requested that you update your firmware. If you cannot download from the Web, do the following: v If the service processor programming has been damaged, the service processor will automatically enter recovery mode when power is applied to the system. To check the level of firmware that is currently on the system, see “Determining the Level of Firmware on the System” on page 806. General Information on System Firmware Updates All the system firmware types that can be reprogrammed are updated at the same time. They are: v System power control network programming v Service processor programming v IPL programming v Run-time abstraction servicesRetain and store the latest firmware diskettes each time the firmware gets updated in the event that the firmware becomes damaged and must be reloaded. Chapter
7.
Using
the
Service
Processor
805
Determining the Level of Firmware on the System The firmware level can be denoted in either of the following forms: v RHyymmdd, where RH = the 406/90’s firmware designation, yy = year, mm = month, and dd = day of the release. v vHyymmdd, where v = version number, H= the 406/90’s firmware designation, yy = year, mm = month and dd = day of the release.If your system is running AIX, the platform firmware level can be determined by either of the following methods: v On the AIX command line, by typing: lscfg -vp|grep -p Platform A line that begins with ROM level (alterable).. displays the firmware level that is currently on the system. v Looking at the top of the service processor main menu.If the system is running Linux, the platform firmware level can be determined by any one of the following methods: v If the system was installed prior to May of 2003, and has not had a firmware update in May 2003 or later, type the following on the Linux command line: lscfg -vp | grep RH v If the system was installed in May of 2003 or later, or the system has firmware on it that was released in May 2003 or later, type the following on the Linux command line: lscfg -vp | grep 3H A line similar to Alterable ROM Level RH021114 or Alterable ROM Level 3H030509 displays the firmware level that is currently on the system. v Looking at the top of the service processor menu main menu. 806
406/90
Service
Guide
System Firmware Update in AIX Using a Locally Available Image To update the system firmware using a locally available image, perform the following steps: 1. Log in as root user. 2. If the /tmp/fwupdate directory does not exist, create it by issuing the following command: mkdir /tmp/fwupdate 3. The firmware update file must be downloaded or copied into the /tmp/fwupdate directory on the system. This can be done by using the ftp command to get the image from an ftp server or NFS-mounting the directory on the host system. The firmware updated file can also be transferred to the target system by backing up the image onto diskettes from another server and restoring it into the /tmp/fwupdate directory. After the firmware update file has been downloaded or copied into the /tmp/fwupdate directory, verify its existence by entering either of the following commands, depending on the name of the update image: ls /tmp/fwupdate/RH*.img or ls /tmp/fwupdate/3H*.img The update file will have either the RHyymmdd.img format or the 3Hyymmdd.img format. In both cases, the Hin the second position indicates that this is an update image for your system; yy is the year, mm is the month, and dd is the day of the update file. 4. After the update file has been written to the /tmp/fwupdate directory, enter the following commands: cd /usr/lpp/diagnostics/bin then either of these commands, depending on the name of the update file: ./update_flash -f /tmp/fwupdate/RHyymmdd.img or ./update_flash -f /tmp/fwupdate/3Hyymmdd.img Attention: Do not overlook the periods (.) in the commands shown above. AIX commands are case-sensitive. Type them exactly as shown.You will be asked for confirmation to proceed with the firmware update and the required reboot. If you confirm, the system will apply the new firmware, reboot, and return to the AIX prompt. This may take up to ten minutes depending on the configuration of the system. Attention: On some systems, the message Wait for rebooting before stopping may display on the system console. Do not turn off the system until the system has fully rebooted to the AIX login prompt. If a shutdown is necessary at that time, log in as root user and issue the shutdown command. While the update is in progress, you will see Rebooting... on the display for several minutes. The firmware update is complete. Chapter
7.
Using
the
Service
Processor
807
Recovery Mode Contact your marketing representative for information about obtaining and installing system firmware update packages. The eight-digit codes are as follows:
Code
Action
A1FD 0000
System firmware has been damaged and must be reflashed.
A1FD 0001
Insert update diskette 1.
A1FD 0002
Insert update diskette 2.
A1FD 0003
Insert update diskette 3.
A1FD 000n
Insert update diskette n. Notes: 1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive after it has been read, B1FD 001F is displayed, indicating that the wrong diskette is in the drive. 2. If B1FD 001A is displayed at any time during the process, the service processor must be reset by activating the pinhole reset switch on the operator panel. After the update process is complete, the service processor resets itself and goes through a complete reboot. 808
406/90
Service
Guide
Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed When a 45H/W4 Switch is installed in or attached to the system, the frame firmware, which includes the power subsystem, is updated using a task on the HMC. Contact your marketing representative for code updates (also called corrective service). This task downloads a corrective service for the frame onto the HMC.Note: Because the HMC is a closed system, you cannot install additional applications on your HMC. All of the tasks you need to maintain the managed system, the underlying operating system, and the HMC application code are available by using the HMC’s management applications. To download corrective service on the HMC, do the following: 1. From the HMC interface, in the Navigation area (left side of the panel), open the Software Maintenance folder. 2. In the Contents area (right side of the panel), double-click the Frame icon. The Frame application opens in the Contents area. 3. In the Contents area, click Receive Corrective Service task. 4. Select one of the following by clicking on the circle to the left. v Upload corrective service from diskette. If you select this option, make sure the correct diskette is in the HMC diskette drive. v Download the corrective service file from a remote system. If you select this option, complete the Remote Site (host name), Patch file (file name or remote system), User ID (for remote system), and password (for remote system).5. Click OK to copy the corrective service to this HMC. Chapter
7.
Using
the
Service
Processor
809
Installing Corrective Service on the Frame This task allows you to update the level of firmware on the frame after you have downloaded a corrective service. This procedure should be run after any of the following components are replaced: v Bulk Power Regulator (BPR) v Bulk Power Controller (BPC) v Distributed Converter Assembly (DCA)Notes: 1. To install a corrective service on a frame, you must be a system administrator or a service representative. For information about system administrator or service representative roles, refer to the Hardware Management Console Operations Guide. 2. Do not power off any of the components in the frame at any time during this installation procedure. Interruptions can leave the power subsystem, or one of the other components in the frame, in an unrecoverable state.To install a corrective service, do the following: 1. If you have not installed the corrective fix from ftp or diskette onto this HMC, perform steps to receive frame corrective service on the HMC, as described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809. Then go to Step 4. 2. From the HMC interface, in the Navigation area (left side of the panel), open the Software Maintenance folder. 3. In the Contents area (right side of the panel), double-click the Frame icon. The Frame application opens in the Contents area. 4. In the Contents area, click Install Corrective Service. The Install Corrective Service window opens. 5. Select the Corrective Service Version, and select the frame where the service will be applied.Note: Unless you are directed otherwise, select the highest version number for the most recent fix. 6. Click Install. The installation may take up to an hour, depending on the number of parts in the frame that require an update. When installation is complete, a window opens to indicate installation status.Note: If you have replaced a single frame part and are using this procedure to update it, this process normally takes only two to five minutes. 7. If the installation fails, click Reason for Failure. a. If you are able to correct the problems(s), click Cancel and go to Step 6. b. If you are given a service request code (SRC), perform the steps to service the SRC, then return to this procedure. c. Otherwise, contact the next level of support.8. After the corrective service has been applied successfully, click Cancel to return. 810
406/90
Service
Guide
Reconfiguration Procedure for SNI Adapters After the system has booted to the AIX login prompt, log in and verify that the SNI adapters are configured properly. On the AIX command line, issue the following command: lscfg | grep sni The number of sni adapters identified in the output of the above command (represented by their logical device names) should match the number in the system (2-8). If the number of adapters does not equal the number of physical SNI ports in the system, issue the following commands at the AIX command line, where snix represents each SNI port. Run the following two commands for every sni port in the list generated by running the lscfg command above. ifconfig -d snix down detach rmdev -d -l snix After all sni ports have been deconfigured, run the following command: cfgmgr -v Then reboot the system and verify the presence of two SNI ports. Chapter
7.
Using
the
Service
Processor
811
Configuring and Deconfiguring Processors or Memory All failures that crash the system with a machine check or check stop, even if intermittent, are reported as a diagnostic callout for service repair. To prevent the recurrence of intermittent problems and improve the availability of the system until a scheduled maintenance window, processors and memory books with a failure history are marked ?bad? to prevent their being configured on subsequent boots. A processor or memory book is marked ?bad? under the following circumstances: v A processor or memory book fails built-in self-test (BIST) or power-on self-test (POST) testing during boot (as determined by the service processor). v A processor or memory book causes a machine check or check stop during runtime, and the failure can be isolated specifically to that processor or memory book (as determined by the processor runtime diagnostics in the service processor). v A processor or memory book reaches a threshold of recovered failures that results in a predictive callout (as determined by the processor run-time diagnostics in the service processor).During boot time, the service processor does not configure processors or memory books that are marked “bad.” If a processor or memory book is deconfigured, the processor or memory book remains offline for subsequent reboots until it is replaced or repeat gard is disabled. The repeat gard function also provides the user with the option of manually deconfiguring a processor or memory book, or re-enabling a previously deconfigured processor or memory book. For information on configuring or deconfiguring a processor, see the Processor Configuration/Deconfiguration Menu on page 782. For information on configuring or deconfiguring a memory book, see the Memory Configuration/Deconfiguration Menu on page 783. Both of these menus are submenus under the System Information Menu. You can enable or disable CPU Repeat Gard or Memory Repeat Gard using the Processor Configuration/Deconfiguration Menu. Run-Time CPU Deconfiguration (CPU Gard) L1 instruction cache recoverable errors, L1 data cache correctable errors, and L2 cache correctable errors are monitored by the processor runtime diagnostics (PRD) firmware running in the service processor. When a predefined error threshold is met, an error log with warning severity and threshold exceeded status is returned to AIX. At the same time, PRD marks the CPU for deconfiguration at the next boot. AIX will attempt to migrate all resources associated with that processor to another processor and then stop the defective processor. 812
406/90
Service
Guide
Service Processor System Monitoring - Surveillance Surveillance is a function in which the service processor monitors the system, and the system monitors the service processor. This monitoring is accomplished by periodic samplings called heartbeats. Surveillance is available during two phases: v System firmware bringup (automatic) v Operating system runtime (optional)Note: Operating system surveillance is disabled on partitioned systems. System Firmware Surveillance System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by the user, and the surveillance interval and surveillance delay cannot be changed by the user. If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the system power to attempt a reboot. The maximum number of retries is set from the service processor menus. If the fail condition persists, the service processor leaves the machine powered on, logs an error, and displays menus to the user. If Call-out is enabled, the service processor calls to report the failure and displays the operating-system surveillance failure code on the operator panel. Chapter
7.
Using
the
Service
Processor
813
Operating System Surveillance Note: Operating system surveillance is disabled in partitioned systems. Operating system surveillance provides the service processor with a means to detect hang conditions, as well as hardware or software failures, while the operating system is running. It also provides the operating system with a means to detect a service processor failure caused by the lack of a return heartbeat. Operating system surveillance is not enabled by default, allowing you to run operating systems that do not support this service processor option. You can also use service processor menus and AIX service aids to enable or disable operating system surveillance. For operating system surveillance to work correctly, you must set these parameters: v Surveillance enable/disable v Surveillance interval The maximum time the service processor should wait for a heartbeat from the operating system before timeout. v Surveillance delay The length of time to wait from the time the operating system is started to when the first heartbeat is expected.Surveillance does not take effect until the next time the operating system is started after the parameters have been set. If desired, you can initiate surveillance mode immediately from service aids. In addition to the three options above, a fourth option allows you to select immediate surveillance, and rebooting of the system is not necessarily required. If operating system surveillance is enabled (and system firmware has passed control to the operating system), and the service processor does not detect any heartbeats from the operating system, the service processor assumes the system is hung and takes action according to the reboot/restart policy settings. See “Service Processor Reboot/Restart Recovery” on page 803. If surveillance is selected from the service processor menus that are only available at system boot, then surveillance is enabled by default as soon as the system boots. From service aids, the selection is optional. 814
406/90
Service
Guide
Call-Out (Call-Home) Note: Call-out is disabled on partitioned systems. The call-out function is handled by the Service Focal Point application on the Hardware Management Console (HMC). The service processor can call out (call-home) when it detects one of the following conditions: v System firmware surveillance failure v Operating system surveillance failure (if supported by operating system) v Restarts v Critical hardware failure v Abnormal operating system terminationTo enable the call-out feature, do the following: 1. Connect a modem to any serial port. 2. Set up the following using the service processor menus or diagnostic service aids: v Enable call-out for the serial port where the modem is connected. v Enter the modem configuration file name. v Set up site-specific parameters (phone numbers for call-out, call-out policy, number of call-out retries, and so on).3. To call out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy Setup menu.Note: Some modems, are not designed for the paging function. Although they can be used for paging, they will return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message will cause the service processor to retry, continuing to place pager calls for the number of retries specified in the call-out policy setup menu. These retries result in redundant pages. Chapter
7.
Using
the
Service
Processor
815
Console Mirroring Console mirroring allows a user on one serial port to monitor the service processor activities on another serial port. This can be done on the locally attached HMC virtual terminal window or windows remotely attached through modems. Console mirroring is supported on any combination of locally or remotely attached monitors. The serial port from which console mirroring is enabled is called the primary port. The mirror port is determined when keyboard input is detected from one of the other two serial ports. From this point forward, the service processor sends information to the active port and the mirror port only. Console mirroring ends when the service processor releases control of the serial ports to the system firmware. Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial port 3 (S3). Remote attachment through a modem is supported on serial port 1 (S1) and serial port 2 (S2). System Configuration The following describes the configuration for console mirroring: v Service processor v Modem connected to one serial port and enabled for incoming calls v Local HMC virtual terminal window connected to the other serial port. This local terminal can be connected directly to your server or connected through another modem.Console mirroring can be started by either of the following methods: v Remote session first, then local session added: 1. Remote session is already in progress. 2. Remote user uses service processor menus to enable console mirroring, allowing both consoles to be active.v Local session first, then remote session added: 1. Local session is already in progress. 2. The service processor receives a call from the remote user. 3. The local user selects the option to enable console mirroring. The service processor immediately begins mirroring service processor menus. 816
406/90
Service
Guide
Service Processor Error Logs The service processor error logs, an example of which follows, contain error conditions detected by the service processor. Error Log 1. 11/30/99 19:41:56 Service Processor Firmware Failure B1004999 Enter error number for more details. Press Return to continue, or ’x’ to return to menu. Press "C" to clear error log, any other key to continue. > Note: The time stamp in this error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). Operating system error logs have additional information available and can time stamp with local time. Entering an error number provides nine words of system reference code (SRC) data; an example screen is shown below. Detail: 6005 SRC - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - word11:B1004999 word12:0110005D word13:00000000 word14:00000000 word15:00001111 word16:00000000 word17:B1004AAA word18:0114005D word19:A4F1E909 B1004999 Press Return to continue, or ’x’ to return to menu. If Return is pressed, the contents of NVRAM will be dumped 320 bytes at a time, starting at address 0000. Chapter
7.
Using
the
Service
Processor
817
LCD Progress Indicator Log The following is an example of the LCD progress indicator log. It shows the types of entries that may appear in the log, and is for example purposes only. The progress indicator codes are listed from top (latest) to bottom (oldest). LCD Progress Indicator Log B0FF 0539..17 0538..17 0539..17 0538..17 0539..17 0581 0538..17 0539..12 0538..12 0539.. 0821..01-K1-00 0539.. 0728..01-R1-00-00 0539.. 0664..40-60-00-1,0 0539.. 0777..U1.5-P1-I14/E1 0539.. 0742..U1.5-P1-I12/E1 0539.. 0776..U1.5-P1-I10/T1 E139 E1FB E139 E183 Press Return to continue, or ’x’ to return to menu. > EAA1..U1.9-P1-I14 E172..U1.9-P1 E172..U1.9-P1-I14 E172..U1.9-P1 94BB 9109 9380 9108 9107 9106 9105 9118 9104 9103 9102 90FD 818
406/90
Service
Guide
Resetting the Service Processor Some error conditions might cause the service processor to hang. The service processor must be reset to recover from a hung condition. If the system is powered on, resetting the service processor will cause the system to shut down. When the service processor is reset, it carries out its power-on sequence, including self-tests. Successful completion of the reset sequence is indicated by OK in the operator panel in the media drawer. Before resetting the service processor, if the managed system is powered on and running, shut down all partitions, or the full system partition. This action causes the system to shut down and the system power is turned off. To reset the service processor when the managed system is powered off, perform either of the following: v If the service processor is responding to input from the HMC (or ASCII terminal), select Service Processor Setup Menu from the main menu, then select Reset Service Processor. This can only be done by a privileged user. v Put the UEPO switch in the off position, then back to the on position.Note: Do not use the pinhole reset switch to reset the service processor. Chapter
7.
Using
the
Service
Processor
819
Service Processor Operational Phases This section provides a high-level flow of the phases of the service processor. SPPowerAppliedPre-StandbyPhaseStandbyPhaseSPMenusAvailableBring-UpPhaseSMSMenusAvailableRun-timePhaseOperatingSystemLoginPromptAvailable Pre-Standby Phase This phase is entered when the server is connected to a power source. The server may or may not be fully powered on. This phase is exited when the power-on self-tests (POSTs) and configurations tasks are completed. The pre-standby phase components are: v Service Processor Initialization - service processor performs any necessary hardware and software initialization. v Service Processor POST - service processor conducts power-on self-tests on its various work and code areas. v Service Processor Unattended Start Mode Checks - To assist fault recovery. If unattended start mode is set, the service processor automatically reboots the server. The service processor does not wait for user input or power-on command, but moves through the phase and into the bring-up phase. Access the SMS menus or the service processor menus to reset the unattended start mode. 820
406/90
Service
Guide
Standby Phase The standby phase can be reached in either of two ways: v With the server off and power connected (the normal path), recognized by OK in the LCD display. OR v With the server on after an operating system fault, recognized by an 8-digit code in the LCD display.In the standby phase, the service processor takes care of some automatic duties and is available for menus operation. The service processor remains in the standby phase until a power-on request is detected. The standby phase components are as follows: v Modem Configuration The service processor configures the modem (if installed) so that incoming calls can be received, or outgoing calls can be placed. v Dial In Monitor incoming phone line to answer calls, prompt for a password, verify the password, and remotely display the standby menu. The remote session can be mirrored on the local HMC virtual terminal window if the server is so equipped and if the user enables this function. v Menus The service processor menus are password-protected. Before you can access them, you need either the general user-password or privileged-user password. Service processor menus are available on ASCII terminals attached to the serial ports on the primary I/O book, and on terminal emulators. On HMC-managed systems, service processor menus are also available on the HMC graphical user interface.Bring-Up Phase On a system that is powered-on to full system partition, this phase is entered upon power-on, and exited upon loading of the operating system. On a system that is powered-on to partition standby, this phase is entered upon power-on, and exited when the partition manager and hypervisor have been loaded and become fully operational in system memory. The end of this phase is indicated when LPAR displays on the operator panel. At this point, the server can start multiple logical partitions which would be activated through the HMC graphical user interface. The bring-up phase components are as follows: v Retry Request Check The service processor checks to see if the previous boot attempt failed. If the specified number of failures are detected, the service processor displays an error code and places an outgoing call to notify an external party if the user has enabled this option. v Dial Out The service processor can dial a preprogrammed telephone number in the event of an IPL failure. The service processor issues an error report with the last reported IPL status indicated and any other available error information. v Update Operator Panel The service processor displays operator panel data on the HMC virtual terminal window if a remote connection is active. v Environmental Monitoring The service processor provides expanded error recording and reporting. v System Firmware Surveillance (Heartbeat Monitoring) Chapter
7.
Using
the
Service
Processor
821
The service processor monitors and times the interval between system firmware heartbeats. v Responding to System Processor Commands The service processor responds to any command issued by the system processor.Run-Time Phase This phase includes the tasks that the service processor performs during steady-state execution of the operating system. v Environmental Monitoring The service processor monitors voltages, temperatures, and fan speeds (on some servers). v Responding to System Processor Commands The service processor responds to any command issued by the system processor. v Run-Time Surveillance (not supported on partitioned systems) If the device driver is installed and surveillance enabled, the service processor monitors the system heartbeat. If the heartbeat times out, the service processor places an outgoing call. This is different from the bring-up phase scenario, where the specified number of reboot attempts are made before placing an outgoing call. v HMC surveillance On an HMC-managed system, the service processor monitors the communication link between the managed system and the HMC. If the service processor detects that this communication link has been broken, it will post an error to the operating system running on the managed system.
822
406/90
Service
Guide
Chapter 8. Using System Management Services Use the system management services menus to view information about your system or partition, and to perform tasks such as setting a password, changing the boot list, and setting the network parameters. Notes: 1. On some of the system management services (or service processor) screens, you will see the term LPAR. LPAR is equivalent to the term logically partitioned system or partitioned system. 2. In a partitioned system, only those devices that are assigned to the partition that is being booted display in the SMS menus. In a full system partition, all devices in the system display in the SMS menus. To start the system management services, do the following: 1. For a partitioned system, use the Hardware Management Console (HMC) to restart the partition. For a full system partition, restart the system. 2. For a partitioned system, watch the virtual terminal window on the HMC. For a full system partition, watch the firmware console. 3. Look for the POST indicators memory, keyboard, network, scsi, speaker, which appear across the bottom of the screen. Press the numeric 1 key after the word keyboard appears, and before the word speaker appears. For more information on the POST indicator words, refer to “POST Indicators” on page 12.After the system management services starts, the following screen displays: Main Menu 1 Select Language 2 Change Password Options NOT available in LPAR mode 3 View Error Log 4 Setup Remote IPL (Initial Program Load) 5 Change SCSI Settings 6 Select Console NOT available in LPAR mode 7 Select Boot Options -------------------------------------------------------------------------------------------------- Navigation keys: X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The system management services can also be started using the Service Processor Boot Mode Menu. See page 780. 823
On all menus except the Main Menu, there are several navigation keys: M Return to the main menu. ESC Return to the previous menu. X Exit the system management services and start the operating system. If X is entered, you are asked to confirm your choice to exit the SMS menus and start the operating system. When there is more than one page of information to display, there are two additional navigation keys: N Display the next page of the list. P Display the previous page of the list.Note: The lowercase navigation key has the same effect as the uppercase key that is shown on the screen. For example, m or M returns you to the main menu. On each menu screen, you are given the option of choosing a menu item and pressing Enter (if applicable), or selecting a navigation key. Select Language Note: Your TTY must support the ISO-8859 character set to properly display languages other than English. This option allows you to change the language used by the text-based System Management Services menus. SELECT LANGUAGE 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ 824
406/90
Service
Guide
Change Password Options The Change Password Options menu enables you to select from password utilities. Note: This option is not available in partitioned systems. Password Utilities 1 Set Privileged-Access Password 2 Remove Privileged-Access Password -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The privileged-access password protects against the unauthorized starting of the system programs. Note: If the privileged-access password has been enabled, you are asked for the privileged-access password at startup every time you boot your system. If you previously had set a privileged-access password and want to remove it, select Remove Privileged-Access Password. View Error Log Use this option to view or clear your system’s error log. A menu similar to the following displays when you select this option. Error Log Date Time Error Code Location Entry 1. 01/04/96 12:13:22 25A80011 00-00 Entry 2. no error logged 1. Clear error log -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The time stamp in his error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). The operating system error logs have more information available and can time stamp with your local time. Chapter
8.
Using
System
Management
Services
825
Setup Remote IPL (Initial Program Load) This option allows you to enable and set up the remote startup capability of your system or partition. A list of NIC (network interface card) adapters in the system displays first. An example of this screen follows: Attention: In a partitioned system, only those network adapters that have been assigned to the partition being booted display in the IP Parameters menu. In a full system partition, all network adapters in the system are listed in the adapter parameters menu.NIC Adapters Device Slot Hardware Address 1. 10/100 Mbps Ethernet PCI Adapt U1.9-P1-I4/E1 00096baeac10 2. IBM 10/100/1000 Base-TX PCI U1.5-P2-I3/T1 0009cbce0fde -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When an adapter is selected, the Network Parameters menu displays: Network Parameters 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. IP Parameters 2. Adapter Parameters 3. Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting the IP (Internet Protocol) parameters option displays the following menu: IP Parameters 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Client IP Address [000.000.000.000] 2. Server IP Address [000.000.000.000] 3. Gateway IP Address [000.000.000.000] 4. Subnet Mask [255.255.255.000] -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ To change IP parameters, type the number of the parameters for which you want to change the value. Entering IP parameters on this screen will automatically update the parameters on the ping test screen. 826
406/90
Service
Guide
Attention: If the client system and the server are on the same subnet, set the gateway IP address to [0.0.0.0]. Selecting Adapter Configuration allows the setting of the network speed, enabling or disabling spanning tree, and setting the protocol as shown in the following example menu: Adapter Configuration 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Speed, Duplex 2. Spanning Tree Enabled 3. Protocol -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting the Speed, Duplex option allows you to set the interface speed at which the card will run and half versus full duplex. The current setting is indicated by ?<===?. Adapter Configuration 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. auto, auto 2. 10,half 3. 10,full 4. 100,half 5. 100,full -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Chapter
8.
Using
System
Management
Services
827
Selecting the Spanning Tree Enabled menu allows you to enable or disable the spanning tree flag. If this flag is enabled (because the network the system is being attached to supports spanning trees), the firmware will impose a waiting period of 60 seconds before the adapter is allowed to communicate with the network. If this flag is disabled, the network adapter will be able to access the network immediately after the system is connected. The Protocol option allows you to set the appropriate protocol for your network as shown below. Protocol 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Standard <=== 2. IEEE802.3 -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select the ping test option from the network parameters menu to test an adapter’s network connection to a remote system. After the ping test option is selected, the same series of screens will take you through setting up the IP parameters and the adapter configuration before attempting the ping test. Notes: 1. Once the ping test is initiated, it may take 60 seconds or longer to return a result. 2. If the ping test passes or fails, the firmware will stop and wait for a key to be pressed before continuing.Change SCSI Settings This option allows you to view and change the addresses of the SCSI controllers attached to your system. SCSI Utilities 1. Hardware Spin Up Delay 2. Change SCSI Id -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Console Note: This option is not available on partitioned systems. The Select Console utility allows the user to select which console to use to display the SMS menus. This selection is only for the SMS menus and does not affect the display used by the operating system. Follow the instructions that display on the screen. The firmware automatically returns to the SMS main menu. 828
406/90
Service
Guide
Select Boot Options Use this menu to view and set various options regarding the installation devices and boot devices. 1. Select Install or Boot a Device 2. Select Boot Devices 3. Multiboot Startup -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Install or Boot a Device Allows you to select a device from which to boot or install the operating system. This option is for the current boot only. Select Boot Devices Allows you to set the boot list. Multiboot Startup Toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. If Select Install or Boot a Device is selected, the following menu displays: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If a device is selected that is not in the system, a menu with the following message displays: .---------------------------------------------------------. | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM ! | | Press any key to continue. | `---------------------------------------------------------’ Chapter
8.
Using
System
Management
Services
829
If Hard Drive is selected, the following menu displays: Select Hard Drive Type 1. SCSI 2. SSA 3. SAN 4. None 5. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If List All Devices is selected, a menu similar to the following displays, depending on the devices that are installed in the system: Select Device Device Current Device Number Position Name 1. 1 SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 2. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A9,0) 3. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-Aa,0) 4. None 5. List all devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The appropriate device can then be selected for this boot or installation. When a device is selected for installing the operating system, or to boot from, the Select Task menu allows you to get more information about the device, or to boot from that device in normal mode or service mode. The following is an example of this menu. Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Normal Mode Boot 3. Service Mode Boot -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If either Normal Mode Boot or Service Mode Boot is selected, the next screen will ask, Are you sure?. If you answer yes, the device will be booted in the appropriate mode. If you answer no, the firmware will return to the Select Task menu. 830
406/90
Service
Guide
Select Boot Devices Attention: In a partitioned system, only those devices from which an operating system can be booted that are assigned to the partition that is being booted display on the select boot devices menu. In a full system partition, devices from which an operating system can be booted display on the Select Boot Devices menu.Note: To keep the search time for bootable devices down, these menus follow a hierarchy of: device type -> bus type -> adapter -> devices attached to the adapter To view all of the potentially bootable devices at one time rather than traversing down the hierarchy with the submenus, go to the ?Select Device Type? menu or the ?Select Media Type? menu and select the last option, ?List All Devices?. The ?List All Devices? function may take a long time on a large system with many I/O adapters and devices, such as large disk arrays. Select this option to view and change the customized boot list, which is the sequence of devices read at startup. Configure Boot Device Order 1. Select 1st Boot Device 2. Select 2nd Boot Device 3. Select 3rd Boot Device 4. Select 4th Boot Device 5. Select 5th Boot Device 6. Display Current Setting 7. Restore Default Setting -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When any of the options 1-5 is selected, the Select Device Type screen will be displayed, which is similar to the following: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When a device type is selected, such as item 5, a Select Media Type menu is displayed. The following is an example of that menu. Chapter
8.
Using
System
Management
Services
831
Select Media Type 1. SCSI 2. SSA 3. SAN 4. IDE 5. ISA 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _
When the media type is selected, all adapters of that type are displayed on the Select Media Adapter menu. The following is an example of that menu for a SCSI media type. Select Media Adapter 1. U1.9-P1/Z1 2. U1.9-P1-l1/Z1 3. U1.9-P1-l1/Z2 4. None 5. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Each adapter must then be selected individually to see the devices tat are attached to it. An example of this menu for the first adapter in the previous example is as follows: Select Device Device Current Device Number Position Name 1. 1 SCSI 18 GB Harddisk (loc=U1.9-P1/Z1-A8,0) 2. - SCSI 18 GB Harddisk (loc=U0.9-P1/Z1-A9,0) 3. None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If there are no devices of the type chosen earlier (on the Select Device Type menu) attached to the adapter that is specified, a message similar to the following displays: -------------------------------------------------------------------------------------------------- | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM | | Press any key to continue. | -------------------------------------------------------------------------------------------------- When a device type is selected, the Select Task menu allows you to see detailed information about the device or set the device’s location in the boot list as shown below in the example menu. The following is an example of the menu for a hard disk. 832
406/90
Service
Guide
Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Set Boot Sequence: Configure as 1st Boot Device -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _
Selecting Information displays a menu similar to the following for a hard disk. Device Information /pci@3fffdf0a000/pci@2,4/scsi@1/sd@8,0 : (Integrated) (Bootable) DEVICE : SCSI 18200 MB Harddisk ( loc=U0.1-P2/Z1-A8,0) NAME : sd DEVICE-TYPE : block -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The Set Boot Sequence option allows you to set the location of the device in the boot list. Display Current Settings This option displays the current setting of the customized boot list. An example of this menu, with one device in the boot list, follows. Current Boot Sequence 1. SCSI 18200 MB Harddisk (loc=U0.1-P1/Z1-A8,0) 2. None 3. None 4. None 5. None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Chapter
8.
Using
System
Management
Services
833
Restore Default Settings This option restores the boot list to the default boot list. The default boot list will vary depending on the devices that are installed in the system. The default boot list is as follows: 1. Primary diskette drive (if installed) 2. CD-ROM drive (if installed) 3. Tape drive (if installed) 4. Hard disk drive (if installed) 5. Network adapterMultiboot Startup Multiboot Startup toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. Exiting System Management Services After you have finished using the system management services, type x (for exit) to boot your system or partition. 834
406/90
Service
Guide
Chapter 9. Removal and Replacement Procedures Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .837 Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . .837 Powering Off and Powering On the System . . . . . . . . . . . . . . . . . . . . . .838 Powering Off the System With an HMC Attached and AIX Installed . . . . . . . . . . . . .838 Powering Off the System With an HMC Attached and Linux Installed . . . . . . . . . . . .840 Powering the System On . . . . . . . . . . . . . . . . . . . . . . . . . . . .841 Powering On the System Using the Service Processor . . . . . . . . . . . . . . . . .841 Hot-Pluggable FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .842 Power Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Assembly (BPA) . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Regulator (BPR) . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Controller (BPC) . . . . . . . . . . . . . . . . . . . . . . . . . . .845 Bulk Power Distributor (BPD) . . . . . . . . . . . . . . . . . . . . . . . . . . .847 Bulk Power Fan (BPF) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .848 Bulk Power Enclosure (BPE) . . . . . . . . . . . . . . . . . . . . . . . . . . .849 Unit Emergency Power Off (UEPO) Card Assembly . . . . . . . . . . . . . . . . . .850 Integrated Battery Feature (IBF) . . . . . . . . . . . . . . . . . . . . . . . . .851 Processor Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .853 Installing the MCM/L3/Passthrough Plug Map . . . . . . . . . . . . . . . . . . . . .853 I/O Books and I/O Blanks . . . . . . . . . . . . . . . . . . . . . . . . . . . .854 Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .859 Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks . . . . . .861 Capacitor Books and Blanks . . . . . . . . . . . . . . . . . . . . . . . . . . .862 MCM Module (Processor) and Pass Through Modules . . . . . . . . . . . . . . . . .863 L3 Cache Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .882 Capacity Upgrade on Demand (CUoD) VPD Module . . . . . . . . . . . . . . . . . .902 VPD Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .904 System Clock Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .905 Aluminum Backplane Stiffener . . . . . . . . . . . . . . . . . . . . . . . . . .906 Processor Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . .908 Memory Books and Memory Blanks . . . . . . . . . . . . . . . . . . . . . . . .910 Fan Controller . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .911 Processor Fan Assemblies . . . . . . . . . . . . . . . . . . . . . . . . . . .912 Fan Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .912 Processor Subsystem Air Filters . . . . . . . . . . . . . . . . . . . . . . . . .912 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .913 I/O Subsystem PCI Slot LED Definitions . . . . . . . . . . . . . . . . . . . . . .913 PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .914 Replacing a Double-Wide Blind-Swap Adapter . . . . . . . . . . . . . . . . . . . .985 I/O Subsystem RIO Riser Card . . . . . . . . . . . . . . . . . . . . . . . . . .986 I/O Subsystem Distributed Converter Assembly (DCA) . . . . . . . . . . . . . . . . .987 I/O Subsystem I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . .989 I/O Subsystem DASD Hard Disk Drive Assembly . . . . . . . . . . . . . . . . . . .991 I/O Subsystem DASD 4-Pack . . . . . . . . . . . . . . . . . . . . . . . . . .993 I/O Subsystem Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .994 I/O Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . . . .995 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .998 CD-ROM, DVD-RAM, or Tape Drive . . . . . . . . . . . . . . . . . . . . . . . .998 Diskette Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .999 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1000 Media Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . .1001 Media Subsystem Internal Cables . . . . . . . . . . . . . . . . . . . . . . . .1002 835
p690 Cable Installation and Management . . . . . . . . . . . . . . . . . . . . . .1003 836
406/90
Service
Guide
Introduction Before performing any of the removal or replacement procedures in this chapter, read the following notices. DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C01 CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing.C21 Handling Static-Sensitive Devices Attention: Disk drives, tape drives, and CD-ROM drives are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v If you have an antistatic wrist strap available, use it while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or pins. v If you need to lay the device down while it is out of the antistatic container, lay it on the antistatic container. Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v To prevent permanent damage, handle the devices carefully. Chapter
9.
Removal
and
Replacement
Procedures
837
Powering Off and Powering On the System This section provides procedures for powering off and powering on the system. Powering Off the System With an HMC Attached and AIX Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, AIX in each partition may be stopped individually to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. AIX provides hot-swap procedures for adapters and devices that support hot-swap removal and installation. You do not have to shutdown a partition that controls an adapter or device that supports hot-swap. Individual power components and fans may be serviced with power on for power systems equipped with redundant power and cooling. Check removal and installation procedures before servicing the system. As the system goes off, the power LED on the operator panel starts blinking at a fast rate. B0FF appears in the primary operator panel display. When the power-off sequence is complete, the system goes into standby mode, as evidenced by the following: v OK displays in the operator panel display. v The operator-panel LED will start blinking at a slow rate. v The processor subsystem power LED will start blinking at a slow rate.To completely remove power from a subsystem in the frame, use one of the following procedures based on the presence or absence of the High Performance Switch (7045-SW4): v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power to a subsystem must be removed by turning off the UEPO switch and turning off the circuit breakers on all optional IBF subsystems (if provided). v If a High Performance Switch (7045-SW4) is installed in the processor subsystem frame, input power to a subsystem may be selectively removed from the processor subsystem, an I/O subsystem, or switch subsystem using the HMC as described below: The HMC user interface provides a power-off function to turn off the power to an individual subsystem. For information about shutting down and powering off an individual subsystem, see ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide. Attention: If a High Performance Switch is present in the frame, using the UEPO switch to remove power from a subsystem being serviced will remove power from ALL of the subsystems that are receiving power from the frame. This may result in unintended outages and extended down time. If you are servicing an individual I/O subsystem, refer to the information about deactivating a system’s I/O drawers, found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from both of the I/O subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are using the resources of the I/O subsystem being serviced. After deactivating the I/O subsystem, both the Power-In LED and the Power-Out LED on the front of the I/O subsystem DCAs should be off. If not, call service support. If you are servicing the processor subsystem, refer to the information about deactivating a managed system’s service processor found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from the processor subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are running on the processor subsystem being serviced. After deactivating the processor subsystem, both the Power-In LED and the Power-Out LED on the front of the processor subsystem DCA should be off. If not, do not continue. Call service support. v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power must be removed by turning off the UEPO switch, turning off the circuit breakers on all optional IBF subsystems (if provided), and then disconnecting both power cords from the customer’s receptacle.
838
406/90
Service
Guide
If the system is operating in a full system partition under AIX, typing the shutdown command causes the system to shut down and power off. The -F flag shuts down the system without warning system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or the appropriate command at the HMC. Attention: Using the power button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped before hand. Chapter
9.
Removal
and
Replacement
Procedures
839
Powering Off the System With an HMC Attached and Linux Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, Linux may be stopped individually in each partition to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. Linux does not provide hot-swap procedures for adapter and device removal and installation. You must shutdown all partitions and the system power must be turned off to service an adapter or device. Individual power components and fans may be serviced with power on for power systems equipped with redundant power and cooling. Check removal and installation procedures before servicing this system. As the system goes off, the power LED on the operator panel starts blinking at a fast rate. B0FF appears in the primary operator panel display. When the power-off sequence is complete, the system goes into standby mode, as evidenced by the following: v OK displays in the operator panel display. v The operator-panel LED will start blinking at a slow rate. v The processor subsystem power LED will start blinking at a slow rate.To completely remove power from a subsystem in the frame, use one of the following procedures based on the presence or absence of the High Performance Switch (7045-SW4): v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power to a subsystem must be removed by turning off the UEPO switch and turning off the circuit breakers on all optional IBF subsystems (if provided). v If a High Performance Switch (7045-SW4) is installed in the processor subsystem frame, input power to a subsystem may be selectively removed from the processor subsystem, an I/O subsystem, or switch subsystem using the HMC as described below: The HMC user interface provides a power-off function to turn off the power to an individual subsystem. For information about shutting down and powering off an individual subsystem, see ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide. Attention: If a High Performance Switch is present in the frame, using the UEPO switch to remove power from a subsystem being serviced will remove power from ALL of the subsystems that are receiving power from the frame. This may result in unintended outages and extended down time. If you are servicing an individual I/O subsystem, refer to the information about deactivating a system’s I/O drawers, found in ?Basic System Management Tasks, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from both of the I/O subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are using the resources of the I/O subsystem being serviced. After deactivating the I/O subsystem, both the Power-In LED and the Power-Out LED on the front of the I/O subsystem DCAs should be off. If not, call service support. If you are servicing the processor subsystem, refer to the information about deactivating a managed system’s service processor, found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from the processor subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are running on the processor subsystem being serviced. After deactivating the processor subsystem, both the Power-In LED and the Power-Out LED on the front of the processor subsystem DCA should be off. If not, do not continue. Call service support. v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power must be removed by turning off the UEPO switch, turning off the circuit breakers on all optional IBF subsystems (if provided), and then disconnecting both power cords from the customer’s receptacle. If the system is operating in a full system partition under Linux, typing the shutdown now -h command causes the system to shut down and power off. The -F flag shuts down the system without warning 840
406/90
Service
Guide
system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or the appropriate command at the HMC. Attention: Using the power button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped beforehand. Powering the System On The system can be powered on after all of the following steps have been completed: v All I/O drawer cables are connected v UPIC cables are connected v All PCI cables to supported subsystems are connected v The hardware management console (HMC) is connected v Power is connected to the system and the UEPO switch is set to OnAfter the required cables are installed, and the power cables are connected, the HMC graphical user interface provides a power-on function to turn on the power to the system. The power button on the media subsystem operator-panel can be pushed to initialize the system, but the preferred method is to use the HMC. Progress indicators, also referred to as checkpoints, are visible on the media subsystem operator panel display as the system power is turned on. The power LED on the media subsystem stops blinking and stays on, indicating the system power is on. The processor subsystem and I/O subsystems are powered on through the system power control network (SPCN). When power is applied, the power LEDs on the media subsystem go from blinking to on continuously, and the power LEDs on the processor subsystem and I/O subsystem(s) come on and stay on. This indicates that power levels are satisfactory in the subsystems. Powering On the System Using the Service Processor The system can be powered on using the System Power Control menu, which is a Service Processor menu that is available to the privileged user. Chapter
9.
Removal
and
Replacement
Procedures
841
Hot-Pluggable FRUs Notes: 1. The Linux operating system does not support some hot-pluggable procedures. 2. Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shutdown and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. 3. You do not have to power off the system to remove a hot-pluggable FRU from the processor subsystem or from I/O subsystems. FRUs can be considered as not concurrently replaceable, concurrently replaceable with hot plug, or concurrently replaceable. The following table indicates which FRUs are hot-pluggable or non-hot-pluggable. Each FRU shown is identified as available for concurrent maintenance if applicable.
Processor Subsystem FRUs
Hot-Pluggable Concurrent Maintenance
Notes
Blowers
Yes
Processor subsystem and power blowers can be replaced with power on, follow the appropriate removal and replacement procedures.
DCA
Yes
DCAs can be replaced with power on, follow the appropriate removal and replacement procedures.
Bulk Power Enclosure
Yes
One Bulk Power Enclosure can be replaced with the system running, follow the appropriate removal and replacement procedures. Note: If you use the Start Service button and the ENBL (Good) LEDs on the enclosure components do not go off, do not proceed. Contact the customer and schedule a time for replacement of the enclosure with power off.
Bulk Power Controller (BPC)
Yes
BPCs can be replaced with power on, follow the appropriate removal and replacement procedures.
Bulk Power Regulator (BPR)
Yes
BPRs can be replaced with power on, follow the appropriate removal and replacement procedures.
Bulk Power Distributor (BPD)
Yes
BPDs can be replaced with power on, follow the appropriate removal and replacement procedures.
UEPO Switch panel
Yes
UEPO Switch panel can be replaced with power on, follow the appropriate removal and replacement procedures.
BPF
Yes
BPF can be replaced with power on, follow the appropriate removal and replacement procedures.
IBF
Yes
IBF can be replaced with power on, follow the appropriate removal and replacement procedures.
Capacitor book
No
Processor subsystem chassis
No
MCM
No
Memory cards
No
L3 cache
No
Clock card
No
I/O books
No 842
406/90
Service
Guide
I/O Subsystem FRUs
Hot-Pluggable Concurrent Maintenance
Notes
- I/O backplane and riser card
No
- DASD backplane
No
- Disk drives
Yes (AIX only)
Might require additional consideration or preparation by the customer for replacement.
- DCA (power supplies)
Yes
Only one DCA can be removed at a time.
- I/O fan assemblies
Yes
Can be replaced only if the DASD backplane is not installed.
- PCI adapters
Yes (AIX only)
Might require additional consideration or preparation by the customer for replacement.
Operator Panel
No
Diskette drive
Yes (AIX only)
Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement.
CD ROM
Yes (AIX only)
Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement.
Optional media SCSI devices
Yes (AIX only)
Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement.
Chapter
9.
Removal
and
Replacement
Procedures
843
Power Subsystem This section contains the removal and replacement procedures for the power subsystem. Notes: 1. All pluggable BPA FRUs are replaced concurrently with system operation when the removal and replacement instructions are followed. Exceptions to this occur when multiple failures exist in the BPA. 2. Removal and replacement of the entire BPA enclosure (BPE) is also concurrent with system operation, unless there are multiple BPE failures. Attention: Be sure to check for FRU redundancy to determine if the repair can be performed concurrently. If you cannot determine redundancy, delay the repair until the system can be shut down. See the “Hot-Pluggable FRUs” on page 842 for more specific details. Bulk Power Assembly (BPA) The bulk power assembly is on the top of the rack. This assembly provides power to the whole system. There are two sides to the BPA so that power is never completely removed from the system unless the system power is shut down, and the power cables are removed from the customer’s power source. Bulk Power Regulator (BPR) The BPRs are located in the bottom positions of the bulk power enclosure (BPE).Notes: 1. Before installing the BPR in the BPE (Bulk Power Enclosure) during a service or repair action, the Lock/Unlock switch located on the BPR, MUST be set to the Unlock position. Failure to do so may damage the BPR guide rails in the BPE resulting in extended outages and unnecessary, additional parts replacement. 2. The field stocks two versions of BPR. The older version is identified by FRU part number 44P1995. Order the older version of the BPR if you are not replacing all of the BPRs installed in the system and the firmware on the system is below firmware release 3H030916. Install the upgrade BPR, identified by FRU part number 44P2645, only if the firmware on the system is 3H030916 or later. Do not mix the two versions without the firmware level being 3H030916 or later. 3. Removal Attention: The BPR unit weighs approximately 12.47 kg (27.5 lbs). 1. Locate the BPR to be replaced. 2. Note the state of the BPR ENBL (Good) LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Locate the BPR to be replaced and manually slide its Lock/Unlock switch to unlock (power switch to the right). At this point, the ENBL (Good) LEDs turn off. 6. If the integrated battery feature (IBF) is installed, manually turn off the IBF circuit breaker attached to the BPR being repaired. Unplug the battery cable from the front face of the BPR. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Push up on the BPR release levers, and slide the unit out of the slot.Replacement 1. The field stocks two versions of BPR. The older version is identified by FRU part number 44P1995. Order the older version of the BPR if you are not replacing all of the BPRs installed in the system and the firmware on the system is below firmware release 3H030916. Install the upgrade BPR, identified by 844
406/90
Service
Guide
FRU part number 44P2645, only if the firmware on the system is 3H030916 or later. Do not mix the two versions without the firmware level being 3H030916 or later. 2. Make sure that the BPR release levers are in the UP position. Slide the unit into slot until one inch from BPE. Using both hands on the front of the BPR, firmly and quickly seat the BPR against the backplane. 3. Tighten the left fastener four turns with torque tool followed by four turns to the right fastener. Torque the left fastener to completion, followed by the right fastener. Return to the left fastener and ensure that it is properly torqued. 4. Move the BPR release levers to the DOWN position. 5. If the battery feature is installed, reconnect the battery cable to the connector on the front face of the BPR. 6. Manually turn on the IBF circuit breaker. 7. Move the BPR Lock/Unlock switch to lock (power switch to the left). 8. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure, if this repair was completed during a system outage, bring the system back up. 9. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPR contains the correct code.Bulk Power Controller (BPC) The BPC is located immediately above the top BPR. Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed and the Service Complete button is pressed, during concurrent repair. If normal system operation continues, as described by this procedure, ignore these error codes. Removal Attention: The BPC unit weighs approximately 3.18 kg (7 lbs). 1. Locate the BPC to be replaced. 2. Note the state of the BPC LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Check the ENBL (Good) LED for all the BPRs (LED on right side of FRU) on same BPA side. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Attention: Do not switch off units where this LED is already off. 6. Verify that all cables are labeled for plug location and then unplug the cables from the BPC. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Slide the unit out of the slot. Chapter
9.
Removal
and
Replacement
Procedures
845
Replacement Attention: Before proceeding, make sure that all the BPR ENBL (Good) LEDs are OFF. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Do not switch off units where this LED is already off. Attention: Make sure that the UEPO bypass switch on the BPC is in the BYPASS position. 1. Slowly slide the BPC into the slot and carefully seat the BPC against the backplane. 2. Tighten the left and right fasteners with the torque tool. 3. Plug all cables to the BPC, ensuring that the plug location and the cable label agree. 4. Manually move any BPR power switches that are to the right (OFF) position, to the left (ON) position. 5. Move the UEPO bypass switch to the NORMAL position. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPC contains the correct code. 846
406/90
Service
Guide
Bulk Power Distributor (BPD) There can be two BPDs, located immediately above the BPC. If only one BPD is required, an airflow baffle must installed in the top BPD position. Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed, and the Service Complete button is pressed, during concurrent repair. If normal system operation continues, as described by this procedure, ignore these error codes. Removal Attention: The BPD unit weighs approximately 2.27 kg (5 lbs). 1. Locate the BPD to be replaced. 2. Note the state of the BPD ENBL (Good) LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Check the ENBL (Good) LED for all the BPRs (LED on right side of FRU) on same BPA side. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Attention: Do not switch off units where this LED is already off. 6. Verify that all cables are labeled for plug location and then unplug the cables from the BPD. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Slide the unit out of the slot.Replacement Attention: Before proceeding, make sure that all the BPR ENBL (Good) LEDs are OFF. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Do not switch off units where this LED is already off. Attention: Make sure that the UEPO bypass switch on the BPC is in the BYPASS position. 1. Slowly slide the BPD into the slot and carefully seat the BPD against the backplane. 2. Tighten the left and right fasteners with the torque tool. 3. Plug all cables to the BPD, ensuring that the plug location and the cable label agree. 4. Manually move any BPR power switches that are to the right (OFF) position, to the left (ON) position. 5. Move the UEPO bypass switch to the NORMAL position. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPD contains the correct code. Chapter
9.
Removal
and
Replacement
Procedures
847
Bulk Power Fan (BPF) The BPF is located immediately to the right of the BPC and positioned vertically in the BPA. Removal Attention: The BPF unit weighs approximately 3.63 kg (8 lbs). 1. Locate the BPF to be replaced. 2. Note the state of the BPC ENBL (Good) LED on the same BPA side as the BPF to be replaced. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Verify that the BPF cable is labeled for plug location and then unplug the cable from the BPC. 6. Remove the fan cover plate and set aside for the replacement procedure. 7. Loosen the upper and lower fasteners using the torque tool (part number 6422789)l. Move the BPR release, levers if necessary, and slide the unit out of the slot.Replacement 1. Ensure that the BPF fasteners are to the left side. 2. Slowly slide the BPF into the slot and carefully seat the BPF against the front of the BPE, moving the BPR release levers if necessary. 3. Tighten the upper and lower fasteners with the torque tool. 4. Return the BPR release levers to the down position, if necessary. 5. Plug the BPF cable to the BPC, ensuring that the plug location and the cable label agree. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPF contains the correct code. 848
406/90
Service
Guide
Bulk Power Enclosure (BPE) Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed and the Service Complete button is pressed, during concurrent repair. If the system behaves normally, as described by this procedure, ignore these error codes. Removal Attention: The BPE unit weighs approximately 18.14 kg (40 lbs). 1. Locate the BPE to be replaced. 2. Note the state of all ENBL (Good) LEDs in the BPE. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Manually slide all Lock/Unlock switches (power switches to the right, unlock) on all BPRs attached to the BPA side being repaired. At this point, the ENBL (Good) LEDs turn off. 6. Manually turn off all IBF circuit breakers attached to the BPA side being repaired. 7. Verify that all cables are labeled for plug location and then unplug the cables from the BPA in the following order: a. BPC b. BPD c. BPR d. IBF (if installed) e. ac line power cord 8. Remove all pluggable FRUs from the BPE (BPRs, BPCs, BPDs, and BPF). 9. Remove the four mounting screws.Note: Do not remove the two screws at the bottom left, as these fasten the cage rail to the frame. 10. Slide the unit out of the frame.Replacement Attention: Make sure that the all BPR Lock/Unlock switches (power switches) are in the unlock (right, OFF) position on this BPA before proceeding. Attention: Make sure that the UEPO bypass switch on the BPC front panel is in the NORMAL position. 1. Slowly slide the BPE into the frame using the right side guide pins until the side flanges contact the frame. 2. Fasten the four mounting screws. 3. Replace all pluggable FRUs to the BPE (BPR, BPC, BPDs, BPF, and baffles).Note: Be sure to check the board connector for each FRU for any visible signs of damage (such as broken sockets.) before plugging into the backplane. Replace any FRU that is damaged. 4. Plug all cables to the BPA FRUs, ensuring that the plug location and the cable label agree. 5. Plug in the ac line power cords. 6. Manually turn on all IBF circuit breakers (if installed) attached to the BPA side being repaired. 7. Manually move all Lock/Unlock switches (power switches) to the lock position (the left, ON position) on all BPRs attached to the BPA. 8. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. Chapter
9.
Removal
and
Replacement
Procedures
849
Unit Emergency Power Off (UEPO) Card Assembly 41235667891011
1
Simplified UEPO Front Panel
7
J02 Connector (White)
2
Simplified UEPO Side Panel
8
Power Switch
3
Simplified UEPO Rear Panel
9
System Fault LED
4
J00 Connector (Back)
10
Start Service Button (Green)
5
J01 Connector (Back)
11
Service Complete Button (White)
6
Room EPO Bypass Interlock
Removal Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 1. Locate the system with the faulty UEPO switch. 2. Press the green Start Service button on the UEPO switch. 3. Switch the UEPO bypass switches on both BPCs to the ?Bypass? position (to the right). 4. Ensure that the red switch on the EPO panel is in the ON position. 5. Verify that all cables are labeled for plug location and then unplug the cables from the UEPO card assembly. 6. Loosen, but do not remove, the two 8-mm M5 mounting screws. Slide the assembly forward and away from the frame.Replacement Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 1. Place the assembly over the two 8-mm M5 screws, slide it downward, and tighten the screws. 2. Plug all cables to the UEPO card assembly, ensuring that the plug location and the cable label agree. 3. Install the external cable, if required, to connector J02. a. If the external EPO connection is installed, the internal toggle switch is mechanically forced into the ROOM EPO ACTV position (to the right). 850
406/90
Service
Guide
b. If there is no external EPO cable, manually set the internal toggle switch to the ROOM EPO BYPASS position. To set this switch, pull the knob down, to the ROOM EPO BYPASS position, and release the knob (to the left).4. Ensure that the red UEPO switch is on (up position). 5. Switch the UEPO BYPASS switches on both BPCs back to NORMAL mode. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure.Integrated Battery Feature (IBF) Removal Attention: This process requires: v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The lift tool wheels must be chocked to prevent it from moving during the operation. The frame must not be on its casters, and the leveling pads must be engaged to prevent the frame from moving during the operation.The IBFs (if installed) are mounted under the media subsystem. 1. Locate the IBF to be replaced. 2. Press the green Start Service button on the UEPO switch. 3. Turn off the IBF circuit breaker. 4. Unplug the battery cable from the front of the IBF, and remove the 7-mm M5 screw (closest to the connector) that is holding the cable to the frame. This action allows the cable to move so that the IBF can be removed. 5. Remove the two mounting bolts from the guide rails and retain for reuse during the replacement procedure. 6. Read all safety instructions before performing this operation. Unpack the lift tool and follow assembly instructions included with the tool. Inspect the tool for damage (do not perform the removal procedure with a damaged lift tool). Replace the blue lift plate with the orange lift plate (part number 11P4369). Orient the two round holes in the lift plate to the side of the lift tool that will be closest to the frame. Install the wheel chocks and engage the brake. 7. Raise the plate to a height such that the lift plate is just below the base of the IBF you are removing. Release the pins on the lift plate and fully extend the center portion of the plate toward the IBF and lock it in place. Release the pins on the extender arms and position them on both left and right side. The arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 8. One person should gently pull and guide the IBF onto the lift plate. The IBF must be removed far enough to clear the front face of the frame. Retract the extender arms and center portion of lift plate. Note: Ensure that the IBF is in the center of the lift plate. 9. Lower the IBF to the bottom position of the lift tool. 10. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the IBF to the staging area. The brake and wheel chocks must be in place when transferring the IBF to or from the packaging material. Chapter
9.
Removal
and
Replacement
Procedures
851
Replacement Note: Install rails in the frame if not already installed. 1. Position the lift tool such that the new IBF can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. 3. Move the new IBF onto the tool with the rear side toward the two holes of the lift plate. The side with the handles is considered the front side.Note: Ensure the IBF is in the center of the lift plate.Remove the wheel chocks and disengage the brake. 4. Position the lift tool as close as possible to the rack. Install the wheel chocks and engage the brake. 5. Raise the plate to a height such that the lift plate is just above the rails of the slot you are installing it in. Release pins on the lift plate and fully extend the center portion of the plate toward the frame and lock it in place. Release the pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 6. One person should gently push the IBF into the frame. Adjust the height of the lift plate, as needed. 7. Lower the lift plate and remove it from the lift tool, repackage the lift tool. 8. Fasten the IBF to the mounting rails, using two mounting bolts. 9. Ensure the circuit breaker is in the off (right) position if it is not already off. 10. Plug the battery cable into the IBF. Fasten the cable-retaining clamp (closest to the cable’s connector) to the frame, using a 7-mm M5 screw 11. Reinstall all components removed during the removal process. 12. Set the circuit breaker to the on (left) position. 13. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. 852
406/90
Service
Guide
Processor Subsystem This section contains the removal and replacement procedures for the processor subsystem. Handle all processor subsystem FRUs with extreme care. Installing the MCM/L3/Passthrough Plug Map If the MCM/L3/Passthrough Plug Map is not installed in the system, install it before continuing with any repair action for MCMs or L3 cache module. With the front cover open, locate the right frame extender (located on the opposite side of the UEPO switch). The 50 mm (approximately 2 inch) wide label should be installed vertically, along this extender, approximately in line with EIA frame positions 22 - 28. The trifold label has two hook-and-loop fastener strips attached to the back. Remove the adhesive backing from the hook-and-loop fastener and attach it to the frame extender. A small piece of hook-and-loop fastener is used to help keep it folded while attached to the frame. The label appears similar to the following: P/N11P4914 Using this Label Whenever an MCM/Passthrough module is added or replaced/reseated during a repair action, a sticker with the module’s serial number is provided for use with the plug map label according to its plugging location. If serial number label is not available, then record the serial number of the MCM/Passthrough module is added, reseated, or replaced along with the number of times plugged. This module is then allowed three plugs with each plug recorded on label. If the module should need to be removed and reinstalled a fourth time, it must be replaced. The process is similar for the L3 module, except a separate serial number label is not provided, instead the serial number is located on a sticker attached to the L3 heatsink. Record the serial number of the L3 modules added, replaced, and reseated along with the number of times plugged. If the module should need to be removed and reinstalled a fourth time, it must be replaced. Chapter
9.
Removal
and
Replacement
Procedures
853
I/O Books and I/O Blanks Attention: Before starting the installation of the primary I/O book FRU, the customer must back up the HMC partition profiles, save upgrade data, and back up critical console data. The following procedures are for removing, inspecting, and replacing primary and secondary I/O books. Before proceeding to the removal and replacement procedures, read and perform the procedures in “Before I/O Book Replacement.” Before I/O Book Replacement Read and then perform the following procedures before replaceing any primary or secondary I/O book. Save the Service Processor Settings: If the primary I/O book is being replaced, locate the backup copy of the service processor settings. If a backup copy is not available, use the “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” chapter of the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, to back up the service processor settings saved in NVRAM. If you cannot run the appropriate task, or the service processor settings were not previously backed up, record the service processor settings manually. See “Saving and Restoring Service Processor Settings” on page 770 for information about saving the service processor settings manually. The service processor settings must then be reentered manually after the primary I/O book is replaced. Record the System Name and Bootlists: From the ?Server and Partition Management? screen on the HMC, do the following: 1. Record the managed system’s name. This is the name that appears in the first line under the line that displays the machine type, model, and serial number of the managed system. 2. If the customer has not already backed up the profile data on the HMC, do so now: a. Right-click on the managed system’s name. b. Select the ?Profile Data? task with the ?Backup? option and perform that action. c. Record the filename that was entered as the backup file’s name.3. Record the LPAR mode bootlists. If the managed system is running in full system partition mode, skip to step 4. If the system is running in LPAR mode, continue. Ensure that all partitions have been shut down, then do the following for each partition: a. Right-click on the profile name and select ?Properties?. b. Select the ?Other? tab and change the boot mode to SMS. c. Click on ?OK?. d. Right-click on the profile name and select ?Activate?. e. On the ?Activate Partition? screen, click on the ?Open terminal? checkbox. f. Click on ?OK? to boot the partition to the SMS menus. g. At the SMS main menu, select ?Select Boot Options?, then select ?Select Boot Devices?, then select ?Display Current Settings?. h. Record the boot list found in ?Display Current Settings?. i. Click on the partition name and choose ?Operating system reset?; on the operating system reset screen select ?Hard Reset?. j. Click on ?OK? to shut down the partition. k. If the managed system is not powered down at this point, power it down now. 854
406/90
Service
Guide
4. Record the full system partition mode bootlist if one has been set. a. Right-click on the full system partition name, then select ?PowerOnSMSProfile?, click on the ?Open terminal window? box, then click on ?OK? to power the system up to the SMS menus. b. At the SMS main menu, choose ?Select Boot Options?, then choose option 2, ?Select Boot Devices?. Choose option 6, ?Display Current Settings?. Record this boot list. c. Click on the partition name and choose ?Operating system reset?; on the operating system reset screen select ?Hard Reset?, then click on ?OK? to shut down the system. d. Continue to step 5.5. Continue to “Checking the platform firmware level.”Checking the platform firmware level: When the managed system is powered off with standby power still on, open a virtual terminal window on the HMC and access the service processor menus. On the main menu, look for the platform firmware level (which is in the format RHyymmdd or vHyymmdd, where v is the version number) above the main menu on the screen. Record the firmware level.Note: If the firmware is RH020312 or earlier, it is GA1 firmware. If it is between RH020413 and RH021024 (inclusive), or 2Hyymmdd, it is GA2 firmware. If it is RH021025 or later, or 3Hyymmdd or later, it is GA3 firmware. Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. At the back of the system, remove the safety bars from the I/O books and blanks. 3. To ease replacement, record the connector locations of all cables attached to the I/O book. Disconnect all of the cables to the I/O book. 4. If the primary I/O book is being replaced, remove all VPD cards from the old primary I/O book and install them into the new primary I/O book. See “VPD Card” on page 904 for instructions on removing and replacing the VPD cards. 5. Grasp the blue release latches located at the top and bottom of the memory book between your thumb and index finger, and press the tabs together. Actuate both blue handles simultaneously until the book fully unseats from the connector. Pivot the tabs and pull the book straight back. Chapter
9.
Removal
and
Replacement
Procedures
855
Inspection You must examine the connectors for any connector damage on the removed book using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If you identify a defect on the book, do not install the replacement book without contacting your next level support for instructions. A damaged connector on a defective book may indicate a damaged system backplane. If an I/O book connector is damaged, then inspect the system backplane for damage. If the removed I/O book is not damaged, carefully remove the new I/O book from the packaging. Be sure to protect the connector and the blue handles when you remove the I/O book from the cardboard box or ESD bag. Do not handle the I/O book by its connector or either of its blue handles. Remove the protective cover from the connector and install it on the defective I/O book.Notes: 1. Do not stack books on top of each other. ESD bags do not provide sufficient stacking protection. 2. Do not rest an I/O book on its connector or on its blue handles at any time.After the I/O book has been removed from the ESD bag, examine the Book VHDM connector using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If any connector shows signs of damage, do not use it. Replacement To replace an I/O book, do the following: 1. Inspect the new I/O book for damage. For more information, see “Inspection.” 2. Unlatch the two blue handles on the replacement book. 3. Taking care not to hit the connector against the CEC cage, use both hands to carefully place the book on the CEC guides. 4. Slide the book in until it reaches the alignment pins. 5. Using even pressure, actuate both blue handles simultaneously until the book is fully seated. 6. Restore the service processor settings previously saved. 7. Repeat this process for all replacement I/O books. 8. If any cables were removed during the removal procedure, plug all cables into their correct locations. 9. Turn the UEPO switch to the ON position. The service processor goes into standby. Standby is indicated by the OK prompt in the operator panel display.Checking the Platform Firmware Level on the FRU Just Installed: The primary I/O book FRU might have a later level of firmware than the HMC recognizes. This causes a ?version mismatch? error message on the HMC. If this happens, the HMC can no longer communicate with the managed system. To determine if this is a potential problem, check the HMC’s software level, as follows: 1. Click Help at the top of the HMC console. 2. Click About the Hardware Management Console. 3. Install the new primary I/O book FRU and follow the procedure in one of the following cases: v Case 1: Version Mismatch Error Displays: If the ?version mismatch? error appears on the HMC, the firmware level on the new primary I/O book must be determined.Note: You need a laptop computer and (cable part number 62H4857) capable of connecting to serial port 2 on the primary I/O book. a. Attach the laptop computer and cable to serial port 2 on the primary I/O book. b. When OK is displayed in the operator panel on the media drawer, press a key on the virtual terminal on the laptop computer. The firmware version is displayed above the main menu, in the format RHyymmdd or vHyymmdd. 856
406/90
Service
Guide
If the HMC is at release 2 version 1.2 (or earlier), and the system firmware is at GA3, which is any level including RH021025 and later, and 3Hyymmdd and later, there are two options for resolving the version mismatch: – The customer can upgrade to HMC release 3 version 1.0 (or later), also referred to as HMC recovery version 1.3 (or later). OR – Flash the GA2 level of platform firmware on the primary I/O book after it is installed. GA2 in this case might be RH021024; however, if a level of firmware with the name in the format of 2Hyymmdd or later is available, use that level instead of RH021024.v Case 2: Version Mismatch Error Does Not Display: If the ?version mismatch? error does not appear on the HMC, the firmware level of the new primary I/O book must still be determined.Note: You need a laptop computer (and cable part number 62H4857) capable of connecting to serial port 2 on the primary I/O book. a. Open a virtual terminal window on the HMC. b. When OK is displayed in the operator panel on the media drawer, press a key on the virtual terminal on the laptop computer. The firmware version (from the new I/O book) is displayed above the main menu, in the format RHyymmdd or vHyymmdd. – If the HMC is at release 2, no further action is required, (ensure that the system firmware is updated to the latest GA2 firmware level because it is a minimum mandatory level). – If the HMC is at release 3 version 1.0 (or later), also referred to as HMC recovery version 1.3 (or later), no further action is required, (ensure that the system firmware is updated to the latest GA2 firmware level because it is a minimum mandatory level).
Chapter
9.
Removal
and
Replacement
Procedures
857
Set the I/O Type: Verify, and change if necessary, the I/O type as described in “Change I/O Type” on page 785. Set the System Name: If you are replacing a primary I/O book on a system with an HMC attached, you must set the system name through the Main Menu (for a privileged user) of the service processor menus. Reboot the HMC after setting the system name to allow the HMC to recognize the attached managed system by its new name. If the HMC does not recognize the attached managed system, the default assigned name is unnamed. Restore the Service Processor Settings: If the service processor settings must be reentered manually, do so now. If the service processor settings were saved using the AIX diagnostic service aid, restore them using the service aid after the AIX partition with service authority is booted. Restore the Partition Data on the HMC: Using the HMC, bring the managed system up to the LPAR Ready state. After the system is selectable by its new name on the HMC console, the recover managed system task can be executed at the HMC console to restore partition information. The partition information is copied from the HMC console to the new service processor (part of the primary I/O book) NVRAM and may take 15 to 30 minutes to complete. Wait for the operation to complete as indicated on the HMC. Reenter the Boot Lists on the HMC: Using the HMC, reenter the partitions’ boot lists or the full system partition boot list using the lists that were recorded before the primary I/O book was replaced. Set the Date and Time: Set the system date and time using the operating system. 858
406/90
Service
Guide
Battery CAUTION: A lithium battery can cause fire, explosion, or a severe burn. Do not recharge, disassemble, heat above 100 degrees C (212 degrees F), solder directly to the cell, incinerate, or expose cell contents to water. Keep away from children. Replace only with the part number specified for your system. Use of another battery may represent a risk of fire or explosion. The battery connector is polarized; do not attempt to reverse the polarity. Dispose of the battery according to local regulations.C9 The battery is located in the primary I/O book. Before removing the primary I/O book to access the battery, follow the procedures titled “Save the Service Processor Settings” on page 854 and “Record the System Name and Bootlists” on page 854. Removal 1. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854). 2. Use a T10 torx driver to remove the screws for the short cover near the connectors. 3. Locate the battery near the spare VPD card connector as shown in the following illustration. L3CUoD 4. Use one finger to gently press the top edge of the battery in and down from the battery socket. Then use your thumb and index finger to pull the battery out of the socket, as shown in the following illustration. Chapter
9.
Removal
and
Replacement
Procedures
859
Replacement Replace in reverse order. Insure that the battery polarity is correct: place the battery in the socket with the positive side facing out. After the primary I/O book has been reinstalled, turn the UEPO switch to the ON position. The service processor will come up to standby, indicated by OK in the operator panel in the media drawer. Perform these procedures in the order shown here: 1. “Set the I/O Type” on page 858 2. “Set the System Name” on page 858 3. “Restore the Service Processor Settings” on page 858 4. “Reenter the Boot Lists on the HMC” on page 858 5. “Set the Date and Time” on page 858. 860
406/90
Service
Guide
Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks Removal The following safety notice applies to power-related field replaceable units. DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Locate the DCA to be replaced. 2. Note the state of the DCA LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Loosen the upper and lower fasteners, using the torque tool (part number 6422789), alternating between the fasteners until the unit can slide out of the slot. 6. Grasp both sides of the DCA and pull it out of the drawer.Replacement 1. Carefully slide the DCA into the slot as far as possible (the back edge of the top of the DCA is about 0.25 inch from the cage bracket).Note: Do not force the DCA against the backplane, as this may damage the connector. 2. Hold the upper part of the DCA against the backplane, and use the torque tool to engage the upper fastener about two turns into the cage bracket. 3. Repeat the previous step for the lower fastener. 4. Alternate tightening the upper and lower fasteners until both are fully torqued in. 5. Check torque on both fasteners. 6. Plug all cables into their correct location in the DCA. 7. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. Chapter
9.
Removal
and
Replacement
Procedures
861
Capacitor Books and Blanks Removal The following safety notice applies to power-related field replaceable units. DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. If you have not already done so, open the rear door of the rack unit. 3. Locate the capacitor book or blank to be removed. 4. Loosen the upper and lower fasteners, using the torque tool (part number 6422789), alternating between the fasteners until the unit can slide out of the slot. 5. Grasp both sides of the capacitor book or blank, and pull it out of the drawer.Replacement 1. Carefully slide the capacitor book into the slot as far as possible (the back edge of the top of the capacitor book is about .25 inch from the cage bracket).Note: Do not force the capacitor book against the backplane, as this may damage the connector. 2. Hold the upper part of the capacitor book against the backplane, and use the torque tool to engage the upper fastener about two turns into the cage bracket. 3. Repeat the previous step for the lower fastener. 4. Alternate tightening the upper and lower fasteners until both are fully torqued in. 5. Check torque on both fasteners. 6. If any cables were removed during the removal procedure, plug all cables into their correct locations. 7. Turn on the power as described in “Powering the System On” on page 841. 862
406/90
Service
Guide
MCM Module (Processor) and Pass Through Modules Attention: All components must be replaced to the location from which they were removed. Before replacing MCMs, check with your support center to ensure you have been directed to replace the MCM(s) by a valid error code in Chapter 5, “Error Code to FRU Index,” on page 413. Handling Static-Sensitive Modules Attention: MCM and L3 modules are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v Use an antistatic wrist strap while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp modules by the heatsink. Avoid touching the interposer buttons. v If you need to lay the device down while it is out of the antistatic container: – Lay the MCMs with the posts down on the antistatic mat. – Lay the L3s with the LGA connector down in the antistatic tray.Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v Handle the devices carefully in order to prevent permanent damage. Attention: Before you remove or replace any MCM or L3 cache module, stop, read and understand this entire procedure (MCM Module (Processor) and Pass Through Modules) and the procedure for “L3 Cache Modules” on page 882. Also, check to ensure that the MCM/L3 Label Map is installed. Refer to “Installing the MCM/L3/Passthrough Plug Map” on page 853. Attention: The MCM modules in this system can be attached to the backplane a maximum of three times. Check the MCM/L3 Label Map to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. If an MCM module is removed for a fourth time, it must be replaced with a new MCM module. New MCMs are shipped with a new MCM/L3 plug count adhesive replacement label. When you install a new L3 cache module, place the new label in the correct position on the MCM/L3 Label Map. If a new MCM/L3 Label Map is needed, see “Tools” on page 1033 for replacement label part number.Note: The MCM module and its associated VPD card are a single FRU, and must be replaced as a set. See “VPD Card” on page 904 for VPD card removal and replacement procedure. Attention: If the base configuration of the system has only one MCM, The pass through modules are replaced with a filler cover that is attached only by the cam locks. Chapter
9.
Removal
and
Replacement
Procedures
863
Removal Attention: If more than one module is to be replaced or reseated, begin with the upperpmost module, replace it and then go to module(s) in the next lower position, until the lowermost module has been replaced or reseated. Before any MCM, pass through module, or cover is removed, perform a general cleaning operation, as described in step 3 of the following procedure. 1. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the tool part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris that the brush generates. 4. Note the location of the module being removed on the MCM/L3 Label Map on the system to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. Attention: Each MCM must be returned to the location from which it was removed.Note: If this is the first removal/reseat for the module, record the serial number of the module and the date of the action. 5. Determine the type of MCM modules used on your system. The following illustration shows the difference between the aluminum heatsink and copper heatsink MCMs.
1
MCM (with aluminum heatsink)
2
MCM Alignment Hole (on copper heatsink MCMs only)
3
MCM (with copper heatsink)
6. If the MCM modules that you are removing have an aluminum heatsink and a plastic stiffener attached to the backplane, go to substep 10 on page 865. 7. If the MCM modules that you are removing have a copper heatsink and an aluminum stiffener attached to the backplane, do the following:Note: Remove each MCM processor from the processor drawer, one at a time, until all MCM processors that you planned to remove have been removed. 864
406/90
Service
Guide
8. Beginning with the upper most module to be removed, loosen the two captive heatsink bushing screws until they are free floating. See the following illustration for heatsink bushing screw locations. 9. Insert the two steel alignment rods through the MCM alignment holes and into the aluminum stiffener. See the following illustration for MCM clearance hole locations.
1
Aluminum Backplane Stiffener
4
MCM Alignment Hole (quantity 2) (on MCM with copper heatsink only)
2
Steel Alignment Rods
5
Captive Heatsink Bushing Screw
3
MCM (with copper heatsink)
6
Locking Cam
10. Ensure that the locking cams at the front of the MCM module are engaged with the module. To adjust the locking cams, use a 5-mm socket wrench for systems containing the aluminum backplane stiffener and an 8-mm socket wrench for systems containing the plastic backplane stiffener. 11. Remove DCA books and blanks 2 through 6, as well as both capacitor books. See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861 and “Capacitor Books and Blanks” on page 862. Chapter
9.
Removal
and
Replacement
Procedures
865
12. Ensure that the actuation bolt behind the MCM module is in the down position. AAFront1243566418Bab777RightSideBack
1
L3 Filler
5
System Clock Card
2
L3 Module Assembly
6
MCM Filler or Pass Through Card
3
MCM Locking Cam
7
System Board
4
MCM Heatsink / Module Assembly
8
Actuation Bolt
13. Use the torque wrench (part number 11P4486, supplied with the system tool kit) to loosen the actuation bolt in the center of the MCM module. If installing a new MCM, remove the actuation bolt. 14. If installing a new MCM, insert the new actuation bolt supplied with the new MCM module. 15. Move the actuation bolt to the up position and hand-tighten to hold the assembly in the up position. 16. Hold the MCM module that you are removing with one hand. Use an 8-mm socket wrench (plastic backplane stiffener) or 5-mm socket wrench (aluminum backplane stiffener) to unlatch the locking cams. 17. If you are removing a MCM with the aluminum heatsink, go to substep 19 on page 867. 18. If you are removing a MCM with the copper heatsink, using the steel alignment rods, previously installed, slide the MCM processors out and away from the system backplane. Notes: a. The steel alignment rods are used to relieve some of the weight of the MCM processor when removing them from the processor drawer. b. The field restocking plan, is to replace the 1.7 GHz copper heatsink MCM (factory installed) with a 1.7 GHz aluminum heatsink MCM. 866
406/90
Service
Guide
19. With both hands, grasp the module across the narrow span of the heatsink cap, and pull the MCM module from the system board and place it, posts-down, on the ESD mat.
1
Processor Backplane
2
MCM (aluminum heatsink version)
3
MCM Pass Through
20. Remove the two steel alignment rods and move them to the next MCM processor to be removed. Continue doing the previous steps until all of the MCM processors you plan to remove have been removed. 21. Using the new brush supplied with the new module, follow the procedures described in this step to clean the LGA site on the system backplane. The LGA site is the position from which you removed the MCM module.Note: Before you remove the brush from its protective bag, loosen the bristles by pressing them on the ESD mat until they are 90 degrees from the handle.
Chapter
9.
Removal
and
Replacement
Procedures
867
Attention: Each time an MCM module is removed from system backplane, the LGA site where the module connects must be cleaned. With the brush supplied with the replacement kit, use only downward strokes, starting in the upper-left location. Using firm pressure, brush left to right, top to bottom, using overlapping strokes as shown in the following illustration. 1122553344................ The brush must be discarded after cleaning only one LGA site. 22. Each time an MCM module (or pass through) is installed, the associated interposer must be inspected. Inspect the MCM (or pass through) interposer, using a magnifying glass (part number 11P4747). If either of the following conditions occur, the interposer is not acceptable and the module must be returned: Shorts If the button, a loop of the button wire, or an individual button wire lead extends more than three times the width of the wire outside of a connector hole chamfer (top side) or counter bore pin indent (bottom side). Missing buttons 23. Record on the system log sheet on the system the date that this MCM has been removed.
868
406/90
Service
Guide
Returning MCM Modules: The following procedure provides handling instructions for MCM modules to be packaged and returned to the plant of manufacture. 1. If you are replacing an MCM, use the packaging from the new module to return the old module. If the packaging is not available, order the following parts for MCM or pass through protective packaging: v 1 Black Plastic Case (Top and Bottom) 02P3750 v 1 MCM Heat Sink Cap - 7335949 or 7335963 (not required for pass through) v 1 Corrugated Carton - 7335743 (carton and foam assembly including: 1 Bottom Foam Cushion, 1 Top Foam Cushion, 1 Desiccant (Optional for return packaging) v 1 ESD Bag - 7352456 v 1 Foam Sheet - 73329792. Inspect the base of packaging part number 02P3750 and verify to see if an ?AA? is labeled. 3. If the base has an ?AA,? place the MCM or pass through module in the package base. If the base does not have an ?AA,? place one sheet of thin pink foam (part number 7332979) over the base and place the MCM module in the package base. 4. Place top adapter foam piece (part number 7335949) on the heat sink of the MCM module (not required for pass through module). 5. Complete the inner package by placing the top cover over the module and onto the base. Ensure the cover is secured to the base. 6. Reuse ESD bag part number 7352456 and place the packaged module inside the bag along with the desiccant. Seal with tape. The desiccant is optional for returned parts. 7. Place this MCM or pass through assembly in the foam cushion packaging carton (part number 7335743). 8. Seal the carton and ship. Chapter
9.
Removal
and
Replacement
Procedures
869
Replacement or New Installation Note: This procedure is to be performed only by trained service personnel. 1. Go to “Methods for Testing MCMs and Pass Through Modules for a Short Circuit” on page 872 and install the MCM/L3 module short-circuit test tool. Use the test tool during the MCM replacement procedure to ensure that the MCMs are installed without a short circuit. If you install an MCM and encounter a short circuit, the MCM you installed must be replaced. 2. If installing a new MCM, insert the new actuation bolt supplied with the new MCM module and move the actuation bolt and spring assembly to the up position and hand-tighten to hold the assembly in the up position. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates. 3. Loosen the captive bolt in the center of the protective cover and remove the cover from the new MCM module, as shown in the following illustration. 4. If you are installing MCMs with the aluminum heatsink, go to substep 5. If you are installing MCMs with the copper heatsink, do the following:Note: The field restocking plan, is to replace the 1.7 GHz copper heatsink MCM (factory installed) with a 1.7 GHz aluminum heatsink MCM. a. Ensure that the stiffener located on the backplane is aluminum and not plastic. b. Insert the two steel alignment rods into the aluminum backplane stiffener. c. Ensure that the locking cams located on the aluminum backplane stiffener are in the open position d. Grasp the module across the narrow span of the heatsink cap. Align the corner chamfer (flat corner) on the MCM module or the pass through to match the outline on the backplane. e. Align the two steel alignment rods with the clearance holes located on each side of the MCM. f. Insert the MCM clearance holes over the two steel alignment rods and slide the MCM module into the socket, and maintain pressure while closing both locking cams. g. Use an 5-mm locking socket wrench to secure the locking cams over the MCM. h. Go to substep 6. 5. Grasp the module across the narrow span of the heatsink cap. Align the corner chamfer (flat corner) on the MCM module or the pass through to match the outline on the backplane. Attention: Verify that the guide pins protrude above the connectors.Carefully push the MCM module into the socket. If your system has the plastic backplane stiffener, use an 8-mm locking socket wrench to secure the locking cams over the MCM. Be sure to maintain pressure on the MCM when securing both locking cams. 6. Loosen the actuation bolt at the rear of the MCM module. 7. Push the actuation bolt in and slide the spring plate assembly to the down position. 870
406/90
Service
Guide
8. Use the torque wrench (supplied with the system tool kit, see “Tools” on page 1033 for the tool part number) to tighten the actuation bolt. When the wrench clicks, followed by about 15 degrees of free motion, the bolt is seated. 9. If you are installing MCMs with the copper heatsink, remove the two steel alignment rods and tighten the two captive heatsink bushing screws with a 5 mm socket wrench. Attention: Do not over-torque the bolt. Check the MCM/L3 short-circuit test tool to ensure that short circuit was not created when the MCM was installed. If this is the last MCM you are installing, remove the MCM/L3 module short-circuit test tool and meter. 10. Begin the above procedure again until all MCM modules have been replaced.Note: Remove and install the MCMs with the copper heatsink one at a time, completing the above substeps before servicing another MCM. 11. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854) and replace the VPD card that was provided with the MCM module.L3CUoD 12. Replace the primary I/O book. 13. Replace the DCAs and DCA fillers that you removed earlier. See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861. 14. Replace the fan chassis. See “Fan Chassis” on page 912. 15. If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select System Information Menu, then select MCM/L3 Interposer Plug Count Menu. If the MCM/L3 Interposer Plug Count Menu is not available, the system firmware level is too old, and the MCM plug count must be recorded as described in “Installing the MCM/L3/Passthrough Plug Map” on page 853 and “Using this Label” on page 853. You can also upgrade the system firmware to the latest level to make the MCM/L3 Interposer Plug Count Menu available, then continue with 16 on page 872. Chapter
9.
Removal
and
Replacement
Procedures
871
16. Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter the number 7 as the new plug count to account for the number of times the module was plugged during the manufacturing process. If the MCM/L3 was replaced from field spares, enter the number 8. Exit the menu completely before powering on the system. 17. Turn on power as described in “Powering the System On” on page 841.Note: The 4-way processor MCM FRU can have odd-numbered processors or even-numbered processors. If you have just replaced a 4-way MCM module, notify the customer that they can determine their processor numbering by issuing the command lsdev -C on the partition that is using the new processors. If the customer finds that they have new available processors, and four processors in the defined state, they can remove the defined processors by using the rmdev command, and then use the cfgmgr command. Methods for Testing MCMs and Pass Through Modules for a Short Circuit The following testing methods are available for use depending on your testing hardware and system requirements. Use the following list to determine which of the testing methods you should use: Method 1 — Multimeter and Original Capacitor Book (part number 11P1540) Use this testing method if you have a handheld multimeter and the original capacitor book. Use this method only when Method 2 and Method 3 are unavailable. Refer to “Method 1. Testing MCMs and Pass Through Modules for a Short Circuit” on page 873. Method 2 — Multimeter and New Capacitor Book (part number 44P2471) Use this testing method if you have a handheld multimeter and a new capacitor book, or any capacitor books with the necessary LGA test-support features. Refer to “Method 2. Testing MCMs and Pass Through Modules for a Short Circuit” on page 876. Method 3 — New Capacitor Book (part number 44P2471) and LGA Shorts Tester (LGA-ST) (part number 44P4107) This method is required for systems with a 1.5 GHz MCM or higher systems. To perform this procedure, you must have a new capacitor book (or any capacitor books with the necessary LGA test-support features), and the new LGA-ST. Refer to “Method 3. Testing MCMs and Pass Through Modules for a Short Circuit” on page 879. 872
406/90
Service
Guide
Method 1. Testing MCMs and Pass Through Modules for a Short Circuit Performing this test ensures that a short circuit was not created when installing an MCM or a pass through module. Follow the procedure to install and use the MCM/L3 module short-circuit test tool (part number 44P0209) and a multimeter during MCM replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing these procedures.MCM Installation Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter
9.
Removal
and
Replacement
Procedures
873
MCM Installation Shorts Test, Step 2: * See the Fail Criteria value in “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 874
406/90
Service
Guide
MCM Installation Shorts Test, Step 3: BAre there anyL3 cache modulesto install?YesRemove MeterNORemoveThe Meter &Test CardReinstall the DCAs and reassemblethe systemPower On the System and Perform aslow bootWere anyfaults detected?YesNOReturn System toCustomerUpdate the historyLog on SFPFollow maintenancepackage to isolatefault.If the problempersists contactsupportFollow the L3 cache modulereplacement and and testprocedures *Fold the paper with therecorded data and placeinside the plug trackingchart and then fold thechart * See “L3 Cache Modules” on page 882. Chapter
9.
Removal
and
Replacement
Procedures
875
Method 2. Testing MCMs and Pass Through Modules for a Short Circuit Performing this test ensures that a short circuit was not created when installing an MCM or a pass through module. Follow the procedure to install and use a multimeter and capacitor book (part number 44P2471 or later) during MCM replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing these procedures.MCM Installation Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 876
406/90
Service
Guide
MCM Installation Shorts Test, Step 2: * See the Fail Criteria value in “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter
9.
Removal
and
Replacement
Procedures
877
MCM Installation Shorts Test, Step 3: * See “L3 Cache Modules” on page 882. 878
406/90
Service
Guide
Method 3. Testing MCMs and Pass Through Modules for a Short Circuit You must test the LGA-ST unit before you can begin testing the MCM and pass through modules for a short circuit with the following steps: 1. Plug the LGA-ST test head on the test box into the 9-pin female self-test D-shell, located on the LGA-ST control box, which looks similar to the following:
1
Tester D-shell
5
Toggle switch
2
Test box
6
Self-test D-shell
3
Jack screw
7
Battery access
4
Control box
2. Perform the following steps to ensure that the LGA-ST battery is charged and that the LGA-ST is functioning properly to detect shorts: a. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Forward (to the MCM/L3 PWR/GND position)
Off
On
Off
On
Off
1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. b. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for the valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Backward (to the L3 PWR/PWR position)
Off
Off
Off
On
Off
1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. Chapter
9.
Removal
and
Replacement
Procedures
879
c. Power off the system. d. Set UEPO to OFF. e. Disengage all DCAs from the back plane. f. Plug the LGA-ST Tester D-Shell into the Capacitor card 9-pin D-Shell and tighten the screws. g. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Forward (to the MCM/L3 PWR/GND position)
On
Off
On
Off
Off
1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. h. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Backward (to the L3 PWR/PWR position)
Off
Off
On
Off
Off
1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. 880
406/90
Service
Guide
3. Install an MCM or pass through module. 4. Press the toggle switch on the LGA-ST forward to the MCM/L3 PWR/GND position. This action will test the MCM or pass through modules for power-to-ground shorts. Observe the lit LEDs on the LGA-ST and take the corresponding following actions: Green MCM LED is lit This is a pass state. No further action is required. Amber MCM LED is lit This is a fail state. There might be a short across the MCM or pass through voltage and ground. Complete the following steps to correct this condition: a. Remove the MCM or pass through module and inspect the interposer for shorts. b. Replace the MCM or pass through module and repeat step 4. If the fail state persists, replace the MCM or pass through.Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There might be a short between one of the two L3 voltages and ground, which is a failing condition, or the combination of memory books, I/O books, and L3 cache modules in parallel might be creating a low resistance. To ensure that no shorts are present on the L3 modules before power the system on, perform “Method 3. Testing L3 Cache Modules for a Short Circuit” on page 898.5. Install another MCM or pass through module and repeat step 4 until all required MCMs are installed. Chapter
9.
Removal
and
Replacement
Procedures
881
L3 Cache Modules Handling Static-Sensitive Modules Attention: MCM and L3 modules are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v Use an antistatic wrist strap while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp modules by the heatsink. Avoid touching the interposer buttons. v If you need to lay the device down while it is out of the antistatic container: – Lay the MCMs with the posts down on the antistatic mat. – Lay the L3s with the LGA connector down in the antistatic tray.Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v Handle the devices carefully in order to prevent permanent damage. Attention: Before you remove or replace any L3 cache module, stop, read and understand this entire procedure. Also, check to ensure that the MCM/L3 Label Map is installed, refer to “Installing the MCM/L3/Passthrough Plug Map” on page 853. Attention: The L3 cache modules in this system can be attached to the backplane a maximum of three times. If an L3 cache module is removed for a fourth time, it must be replaced with a new L3 cache module. Check the MCM/L3 Label Map to determine how many times the module has been removed before. New L3 cache modules are shipped with a new MCM/L3 plug count adhesive replacement label. When you install a new L3 cache module, place the new label in the correct position on the MCM/L3 Label Map. If a new MCM/L3 Label Map is needed, see “Tools” on page 1033 for replacement label part number. Removal Attention: If more than one module is to be replaced or reseated, begin with the uppermost module, replace it and then go to the next highest position module, until the lowermost module has been replaced or reseated. 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Perform a general cleaning operation using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for tool part numbers) to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates. 4. Note the location of the module being removed. Note: Arrange the four 4-position trays on the ESD mat in a pattern that corresponds to the locations of the L3 cache modules on the system backplane. Attention: Each L3 cache module must be returned to the location from which it was removed.Check the MCM/L3 Label Map on the system to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. If this is the first removal/reseat for the module, record the serial number of the module and the date of the action. 882
406/90
Service
Guide
5. Use a 5-mm nut driver to loosen the set screw in the center of the L3 module spring plates as shown in the following illustration..165432
1
Protective Cover
4
Rear Spring Plate
2
L3 Cache Module and Heatsink
5
Front Spring Plate with Set Screw
3
Load Posts
6
L3 Filler (Used if no L3 cache module is installed)
6. Slide both plates up and pull them away from the L3 cache module. 7. Remove the sleeves from the posts. 8. Use a disposable lint-free cloth to clean the sleeves. 9. Pull the L3 cache module straight back off the posts, and place it with the interposer, facing down in the four-position tray from the tool kit. 10. Clean the LGA site using the brush supplied with the new module.Note: Before you remove the brush from its protective bag, loosen the bristles by pressing them on the ESD mat until they are 90 degrees from the handle.
Chapter
9.
Removal
and
Replacement
Procedures
883
Attention: Each time an L3 cache module is removed from the system backplane, the LGA site where the module connects must be cleaned. With the brush supplied with the replacement kit, use only downward strokes, starting in the upper left corner. Move left to right, top to bottom, using overlapping strokes as shown in the following illustration. 1122553344................ The brush must be discarded after cleaning four L3 sites. 11. Each time an L3 cache module is installed, inspect the interposer before installation. The associated interposer must be inspected, using a magnifying glass. The interposer is not acceptable and the module must be returned if the either of the following conditions occur: Shorts If the button, a loop of the button wire, or an individual button wire lead extends more than three times the width of the wire outside of a connector hole chamfer (top side) or counter bore pin indent (bottom side). Missing buttons 12. Record on the system log sheet the date that this L3 cache module was removed.
884
406/90
Service
Guide
Returning L3 Modules: The following procedure provides handling instructions for L3 modules to be packaged and returned to the plant of manufacture. 1. If you are replacing an L3 module, use the packaging from the new module to return the old module. If the packaging is not available or does not include all the parts described below, order the following part for L3 module protective packaging: v 1 L3 protective package part number 7335692, which includes: 1 carton, 1 top foam cushion, 1 bottom foam cushion with 4 cavities, 1 ESD bag, 1 desiccant)2. Place bottom foam cushion, wrapped with ESD Bag, inside the corrugated carton. Leaving the ESD bag open, place the L3 module with the cover into the bottom cushion, heatsink down. This will fit snugly into the cavity of the foam. 3. Add the desiccant and seal the ESD bag with tape. The desiccant is optional for returned parts. 4. Place the top foam cushion over the assembly inside the carton. 5. Seal the carton and ship.Replacement or New Installation 1. Go to “Methods for Testing L3 Cache Modules for a Short Circuit” on page 887 and install the MCM/L3 module short-circuit test tool. Use the test tool during the L3 cache module replacement procedure to ensure that the L3 cache modules are installed without a short circuit. If you install an L3 cache module and encounter a short circuit, the L3 cache you installed must be replaced. 2. If installing a new L3 cache module, perform a general cleaning operation, using a sash brush, part number 450732 (not the one included with the FRU kit) and vacuum cleaner to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates, then remove the L3 filler from the site where the new L3 cache module is to be located. 3. Loosen the captive thumbscrews on the protective cover, and remove the cover from the new L3 cache module as shown in the following illustration. 4. Align the flat corner on the L3 cache module to match the outline on the system backplane. Carefully slide the L3 cache module straight onto the posts until it meets the system backplane. 5. If you are installing a new L3 cache module, replace the sleeves that you removed earlier with the new sleeves provided with the L3 cache module kit. If you are reseating an existing module, use a lint-free cloth to wipe any dust from the sleeves that you removed earlier. Chapter
9.
Removal
and
Replacement
Procedures
885
6. Be sure to align the notches at the upper right corner of each spring plate as shown in the following illustration. Replace the spring plates that you removed earlier with the new spring plates provided with the L3 cache module kit.
1
Notch on upper right corner
2
5-mm set screw
3
Reinforced center
7. Use your 5-mm nut driver to tighten the set screw in the center of the L3 cache module spring plates until the screw is seated. Check the MCM/L3 module short-circuit test tool to ensure that short circuit was not created when the L3 cache module was installed. If this is the last L3 cache module you are installing, remove the MCM/L3 module short-circuit test tool and meter. 8. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854) and replace the VPD card that was provided with the L3 cache module.L3CUoD 886
406/90
Service
Guide
9. Replace the primary I/O book. 10. Replace the fan chassis. See “Fan Chassis” on page 912. 11. Turn on power as described in “Powering the System On” on page 841. 12. If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select System Information Menu, then select MCM/L3 Interposer Plug Count Menu. If the MCM/L3 Interposer Plug Count Menu is not available, the system firmware level is too old, and the MCM plug count must be recorded as described in “Installing the MCM/L3/Passthrough Plug Map” on page 853 and “Using this Label” on page 853. You can also upgrade the system firmware to the latest level to make the MCM/L3 Interposer Plug Count Menu available, and then continue with 13. 13. Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter the number & as the new plug count to account for the number of times the module was plugged during the manufacturing process. If the MCM/L3 was replaced from field spares, enter the number 8. Exit the menu completely before powering on the system.Methods for Testing L3 Cache Modules for a Short Circuit The following testing methods are available for use depending on your testing hardware and system requirements. Use the following list to determine which of the testing methods you should use: Method 1. Multimeter and Original Capacitor Book (part number 11P1540) Use this testing method if you have a handheld multimeter and the original capacitor book. Use this method only when Method 2 and Method 3 are unavailable. Refer to “Method 1. Testing L3 Cache Modules for a Short Circuit” on page 888. Method 2. Multimeter and New Capacitor Book (part number 44P2471) Use this testing method if you have a handheld multimeter and a new capacitor book, or any capacitor books with the necessary LGA test support features. Refer to “Method 2. Testing L3 Cache Modules for a Short Circuit” on page 893. Method 3. New Capacitor Book (part number 44P2471) and LGA Shorts Tester (LGA-ST) (part number 44P4107) This method is required for systems with a 1.5 GHz MCM or higher systems. To perform this procedure, you must have a new capacitor book, (or any capacitor books with the necessary LGA test support features), and the new LGA-ST. Refer to “Method 3. Testing L3 Cache Modules for a Short Circuit” on page 898. Chapter
9.
Removal
and
Replacement
Procedures
887
Method 1. Testing L3 Cache Modules for a Short Circuit Performing this test ensures that a short circuit was not created when an L3-cache module is installed. Follow this procedure to install and use the MCM/L3 module short-circuit test tool (part number 44P0209) and a multimeter during L3-cache module replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing this procedure.L3 Cache Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 888
406/90
Service
Guide
L3 Cache Shorts Test, Step 2: Enter recorded Fail Criteria *, MLR, 1.9vL3TR in theequation below1.9vL3TR - MLR > Fail CriteriaIs the left Side ofthe equation Greaterthan the Right Side?NOYesRecord Resistance.Call this resistance" 1.9v L3Total Resistance"( 1.9vL3TR)Is the left Sideof the equationgreater than 1Ohm?NOYesInspect Interposerfor shortsAre ShortsVisible?YesReject the L3cache moduleReinstall theL3 cache moduleHas this L3been removed &reinstalledNOYesCNOConnect Meter"V" toTest Card "1.9V L3 LGA" (Blue Jack)Meter "COM" toTest Card either "COMMON RET" (Black Jack)Install another L3cache module[U1.18-P1-Cx]Examples:0.10 > 0.40 Follow NO (Fail)0.90 > 0.40 FollowYES (Pass)Remove L3cache moduleNODYesAre all thememory & I/Odisengaged?DO NOT POWERONContact NextLevel Of SupportAre allThe MemoryDisengaged?Disengage A Memory BookMemory are removed one at a timeNODisengage One I/O BookPrimary I/O Book FirstSecondary I/O Books secondYes Chapter
9.
Removal
and
Replacement
Procedures
889
L3 Cache Shorts Test, Step 3: * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 890
406/90
Service
Guide
L3 Cache Shorts Test, Step 4: Chapter
9.
Removal
and
Replacement
Procedures
891
L3 Cache Shorts Test, Step 5: 892
406/90
Service
Guide
Method 2. Testing L3 Cache Modules for a Short Circuit Performing this test ensures that a short circuit was not created when an L3-cache module is installed. Follow this procedure to install and use the capacitor book (part number 44P2471) and a multimeter during L3-cache module replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing this procedure.L3 Cache Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter
9.
Removal
and
Replacement
Procedures
893
L3 Cache Shorts Test, Step 2: 894
406/90
Service
Guide
L3 Cache Shorts Test, Step 3: * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter
9.
Removal
and
Replacement
Procedures
895
L3 Cache Shorts Test, Step 4: 896
406/90
Service
Guide
L3 Cache Shorts Test, Step 5: Chapter
9.
Removal
and
Replacement
Procedures
897
Method 3. Testing L3 Cache Modules for a Short Circuit You must test the LGA-ST unit before you can begin testing the MCM and pass through modules for a short circuit with the following steps: 1. Plug the LGA-ST test head on the test box into the 9-pin female self-test D-shell, located on the LGA-ST control box, which looks similar to the following:
1
Test head
5
Toggle switch
2
Test box
6
Self-test D-shell
3
Jack screw
7
Battery access
4
Control box
2. Perform the following steps to ensure that the LGA-ST battery is charged and that the LGA-ST is functioning properly to detect shorts: a. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Forward (to the MCM/L3 PWR/GND position)
Off
On
Off
On
Off
1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. b. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for the valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Backward (to the L3 PWR/PWR position)
Off
Off
Off
On
Off
1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. 898
406/90
Service
Guide
c. Power off the system. d. Set UEPO to OFF. e. Disengage all DCAs from the back plane. f. Plug the LGA-ST Tester D-Shell into the Capacitor card 9-pin D-Shell and tighten the screws. g. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Forward (to the MCM/L3 PWR/GND position)
On
Off
On
Off
Off
1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. h. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for valid LED states:
Toggle Switch Position
MCM Green LED
MCM Amber LED
L3 Green LED
L3 Amber LED
Battery Low Amber LED
Action
Backward (to the L3 PWR/PWR position)
Off
Off
On
Off
Off
1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. 3. Install an L3 module. Chapter
9.
Removal
and
Replacement
Procedures
899
4. Press the toggle switch on the LGA-ST forward to the MCM/L3 PWR/GND position. This action will test the L3 modules for power-to-ground shorts. Observe the lit LEDs on the LGA-ST and take the corresponding following actions: Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There might be a short between one of the two L3 voltages and ground, which is a failing condition, or the combination of memory books, I/O books, and L3 cache modules in parallel might be creating a low resistance. Complete the following steps to correct this condition: a. Disengage memory books followed by I/O books one at a time. b. Repeat step 2a on page 898 through step 2h on page 899 until either the L3 test results in a pass state, or all memory and I/O are disengaged and the L3 test remains in a fail state. c. If the L3 module test remains in a fail state after all memory and I/O are disengaged, remove the L3 module and inspect the interposer for shorts. d. Reinstall the L3 module once and repeat the test. If the fail persists, replace the L3 module.Green MCM LED is lit This is a pass state. No further action is required. Amber MCM LED is lit This is a fail state. There might be a short across the MCM voltage and ground. Perform the “Method 3. Testing MCMs and Pass Through Modules for a Short Circuit” on page 879 and correct any short conditions before powering on.5. Place the toggle switch into the L3 PWR/PWR position. Green and Amber MEM LEDs are not lit Power is removed from these circuits when the switch is in the L3 PWR/PWR position. Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There is a short between the two L3 power domains. Complete the following steps to correct this condition: a. Remove the L3 module and inspect the interposer for shorts. b. Reinstall the L3 module once and repeat the test. If the fail condition persists, replace the L3 module.6. Install another L3 module and repeat step 3 on page 899 through step 5 until all required L3 modules are installed.Shorts Test Qualified Service Meters and Fail Criteria
Manufacturer
Model
Range
Accuracy @ 1.0 ohms or less
Fail Criteria
Fluke
8060A
200 Ohms
+/-0.04
+0.40
Fluke
8060A/AA
200 Ohms
+/-0.04
+0.40
Fluke
8062A
200 Ohms
+/-0.04
+0.40
Fluke
187
500 Ohms
+/-0.10
+0.40
Fluke
189
500 Ohms
+/-0.10
+0.40
Fluke
83 (original models)
400 Ohms
+/-0.10
+0.40
Fluke
85 (original models)
400 Ohms
+/-0.10
+0.40 900
406/90
Service
Guide
Manufacturer
Model
Range
Accuracy @ 1.0 ohms or less
Fail Criteria
Fluke
87 (original models)
400 Ohms
+/-0.10
+0.40
Fluke
87-4
500 Ohms
+/-0.10
+0.40
Fluke
89-4
500 Ohms
+/-0.10
+0.40
+/-0.20 Accuracy
Fluke
10
400 Ohms
+/-0.20
+0.50
Fluke
11
400 Ohms
+/-0.20
+0.50
Fluke
110
600 Ohms
+/-0.20
+0.50
Fluke
111
600 Ohms
+/-0.20
+0.50
Fluke
112
600 Ohms
+/-0.20
+0.50
Fluke
12
400 Ohms
+/-0.20
+0.50
Fluke
12B
400 Ohms
+/-0.20
+0.50
Fluke
16
400 Ohms
+/-0.20
+0.50
Fluke
175
600 Ohms
+/-0.20
+0.50
Fluke
177
600 Ohms
+/-0.20
+0.50
Fluke
179
600 Ohms
+/-0.20
+0.50
Fluke
18
400 Ohms
+/-0.20
+0.50
Fluke
21-2
320 Ohms
+/-0.20
+0.50
Fluke
21-3
320 Ohms
+/-0.20
+0.50
Fluke
23-2
320 Ohms
+/-0.20
+0.50
Fluke
26-3
400 Ohms
+/-0.20
+0.50
Fluke
29-2
400 Ohms
+/-0.20
+0.50
Fluke
70-2
320 Ohms
+/-0.20
+0.50
Fluke
70-3
320 Ohms
+/-0.20
+0.50
Fluke
73
320 Ohms
+/-0.20
+0.50
Fluke
73-2
320 Ohms
+/-0.20
+0.50
Fluke
73-3
320 Ohms
+/-0.20
+0.50
Fluke
75
320 Ohms
+/-0.20
+0.50
Fluke
75-2
320 Ohms
+/-0.20
+0.50
Fluke
75-3
320 Ohms
+/-0.20
+0.50
Fluke
76
400 Ohms
+/-0.20
+0.50
Fluke
77
320 Ohms
+/-0.20
+0.50
Fluke
77-2
320 Ohms
+/-0.20
+0.50
Fluke
79-2
400 Ohms
+/-0.20
+0.50
Fluke
79-3
400 Ohms
+/-0.20
+0.50
Fluke
83 (with CAT III front panel marking)
400 Ohms
+/-0.20
+0.50
Fluke
83-3
400 Ohms
+/-0.20
+0.50
Fluke
85 (with CAT III front panel marking)
400 Ohms
+/-0.20
+0.50
Fluke
85-3
400 Ohms
+/-0.20
+0.50 Chapter
9.
Removal
and
Replacement
Procedures
901
Manufacturer
Model
Range
Accuracy @ 1.0 ohms or less
Fail Criteria
Fluke
87 (with CAT III front panel marking)
400 Ohms
+/-0.20
+0.50
Fluke
87-3
200 Ohms
+/-0.20
+0.50
Capacity Upgrade on Demand (CUoD) VPD Module Use the following instructions to replace the CUoD VPD module. This action should only be taken if the VPD module is listed as a failing FRU. For more information about FRUs and error codes, see Chapter 5, “Error Code to FRU Index,” on page 413. 1. Power off the system and ensure standby power has been removed by moving the red switch to the OFF position. 2. Remove the primary I/O book (for more information, see “I/O Books and I/O Blanks” on page 854) and open the cover to expose the VPD cards. 3. Remove and replace the CUoD VPD card, which is located next to the installed L3 VPD modules, with the procedures in “VPD Card” on page 904. 4. Replace the primary I/O book in the system and connect the cables with the procedures in I/O Books and I/O Blanks “Replacement” on page 856. 5. Power on the system. 6. Move the red switch into the ON position. 7. Contact service support to obtain the correct processor activation codes for the customer. v If the customer is entitled to any processor activations, proceed to step 9. v If the customer is not entitled to any processor activations, service support will give you the zero activation code. Complete step 8. 8. After you complete this step, the procedure is complete for this customer. a. Right-click on the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Activate The Activate CUoD Processors window displays. c. Enter the 34-character string of zeros (0s) into the Activation Code field. d. Reboot the system and return control of the system to the customer. 9. Service support will ask for system information to generate an activation code. To access this information, complete the following steps: a. Right-click on the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Capacity Settings The Processor Capacity Settings window displays. 902
406/90
Service
Guide
c. Click the Display Order Information button in the Processor Capacity Settings window. This will display the Processor Order Information panel, which appears similar to the following: d. Dictate the contents of each field on this panel to service support.10. After service support generates the activation code, go to the Activate CUoD Processors menu by completing the following steps: a. Right-click the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Activate The Activate CUoD Processors window displays.11. Type the 34-character activation code that service support dictates to you into the Activation Code field. 12. Return to the Processor Capacity Settings window by completing steps a and b in 9 on page 902. Verify that the number of processors shown in this window match the number of processors service support tells you the customer is entitled to. v If the number of processors match, reboot the system and return control of the system to the customer. v If the number of processors does not match, return to the Processor Order Information panel and verify the information with service support. Chapter
9.
Removal
and
Replacement
Procedures
903
VPD Card Notes: 1. The MCM module and its associated VPD card are a single FRU and must be replaced as a set. 2. There is only one VPD card for all of the L3 modules. Replacing any L3 module requires replacing the associated VPD card. The VPD cards are located in the primary I/O book. Removal 1. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854). 2. Use a T10 torx screw to remove the short cover near the connectors. 3. Locate the VPD card that must be replaced.L3CUoD 4. Remove the VPD card for the associated MCM or L3 modules.Replacement Replace in reverse order with the VPD card that was provided with the new MCM or L3 modules. 904
406/90
Service
Guide
System Clock Card Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Squeeze the tabs on the clock card, and pull the card straight out as shown in the following illustration.
1
Processor Backplane
2
System Clock Card
4. Clean the system board as described at the start of this procedureReplacement Note: Be sure to align the guide feature on the card with the guide feature on the system board. 1. Squeeze the tabs on the clock card and push the card straight in. 2. Replace the fan chassis. See “Fan Chassis” on page 912. 3. Turn on power as described in “Powering the System On” on page 841. Chapter
9.
Removal
and
Replacement
Procedures
905
Aluminum Backplane Stiffener Use the following procedure when populating the system with MCM processors that employ the use of copper heatsinks, or if you will be replacing a system backplane prior to November 2003. Removal 1. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis as described in“Fan Chassis” on page 912. 3. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the tool part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris that the brush generates. 4. Remove the system clock card as described in “System Clock Card” on page 905 5. Remove the MCM modules as described in “MCM Module (Processor) and Pass Through Modules” on page 863. 6. Remove all LE Modules from the backplane as described in “L3 Cache Modules” on page 882. 7. Using a 5.5 mm deep well socket wrench (part number 39F8452), remove the eight retaining screws that secure the aluminum stiffener to the system backplane. See the following illustration for screw locations. Notes: a. Due to the zero clearance between the aluminum stiffener and the sides of the processor drawer, you must have the 5.5 mm deep well socket to loosen and remove the two outer-side retaining screws. b. (Service Tip) When removing the aluminum stiffener, remove the outer-top retaining screw last. When installing or replacing the aluminum stiffener insert the outer-top retaining screw first. 906
406/90
Service
Guide
8. Carefully remove the aluminum stiffener and put it in a safe place.
1
System Backplane
5
Clock Card
2
L3 Cache Assembly
6
MCM pass through Module
3
Aluminum Backplane Stiffener
7
MCM with VPD Card Kit
4
MCM Alignment Rods
8
Aluminum Backplane Stiffener Retaining Screws (quantity 8)
Replacement To replace the aluminum stiffener follow the removal procedure in reverse order. Chapter
9.
Removal
and
Replacement
Procedures
907
Processor Subsystem Chassis CAUTION: This unit weighs more than 55 kg (121.2 pounds). Material handling systems such as levers, slings, or lifts are required to safely move it. When this is not possible, specially trained persons or services (such as riggers or movers) must be used.C06 Three people are required to handle this operation: one person to push the chassis, and two people to pull the chassis. Removal Attention: All components must be replaced to the location from which they were removed. Before replacing processor subsystem chassis (backplane), check with your support center to ensure you have been directed to replace the processor subsystem chassis (backplane) by a valid error code in Chapter 5, “Error Code to FRU Index,” on page 413. Attention: This process requires: v A safe area reserved to store parts that are removed from the system during this procedure. v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The following components must be removed from the processor subsystem chassis to reduce its weight to the minimum. – DCA Books – Capacitor Books – I/O Books – Memory Books – Processor Fan Assemblies – MCM(s) (remove the MCMs only if the chassis is to be replaced) – L3 Cache (remove the L3 cache only if the chassis is to be replaced) – System Clock Cardv The lift tool wheels must be chocked to prevent it from moving during the operation. The frame must not be on its casters, and the leveling pads must be engaged to prevent the frame from moving during the operation. 1. Read all safety instructions before performing this operation. Unpack the lift tool and follow assembly instructions included with the tool. Inspect the tool for damage. Do not perform the removal procedure with a damaged lift tool. Replace the blue lift plate with the orange lift plate (part number 11P4369). Orient the two round holes in the lift plate to the side of the lift tool that will be closest to the frame. 2. Turn off power as described in “Powering Off and Powering On the System” on page 838. 3. Open the rack doors and remove them from the rack. 4. Verify that all cables are labeled. If not, label all cables and components along with their location at the front and rear of the system. 5. Unplug all power cables from the line power source. Attention: Because this procedure is performed with power removed from the system, references to using the white or green service buttons on the UEPO switch do not apply during the remainder of this procedure. 6. Unplug any cables attached to the I/O books and processor subsystem DCAs. 7. At the back of the system, remove the safety bars from the I/O books and blanks, using an allen wrench on the captive screws. 8. Remove the I/O books (and blanks if applicable). See “I/O Books and I/O Blanks” on page 854. 9. Remove the DCAs and capacitor books (and blanks if applicable). See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861. 10. At the front of the system, remove the safety bars from the memory books and blanks. 908
406/90
Service
Guide
11. Remove the memory books (and blanks if applicable). See “Memory Books and Memory Blanks” on page 910. 12. Remove the four fan assemblies. See “Processor Fan Assemblies” on page 912. 13. Remove the fan chassis. See “Fan Chassis” on page 912. 14. Remove the MCM modules. See “MCM Module (Processor) and Pass Through Modules” on page 863. 15. Remove the L3 modules. See “L3 Cache Modules” on page 882. 16. Remove the clock card. See “System Clock Card” on page 905. 17. Remove the media chassis. See “Media Subsystem Chassis” on page 1001. 18. Remove two frame extender mounting bolts from EIA position 17 (10-mm bolt). 19. Remove the UEPO switch. See “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 20. Using an 8-mm extension socket, remove the screws attaching the hold-down bracket to the vertical EIA rails at the rear of the system. Remove the bracket and store it in a safe place.Note: The rear screws are used to mount the hold-down bracket. 21. Using an 8-mm extension socket, remove the screws attaching the chassis to the vertical EIA rails at the front of the system. 22. Position the lift tool at the front of the rack, and ensure that the extender arms on the lift plate are on the side facing the frame. Align the lift tool such that processor subsystem chassis will not be blocked (at the lip of the orange lift plate) from removal. 23. Raise the plate to a height such that the lift plate is just below the base of the processor subsystem chassis. Release the pins on the lift plate, and fully extend the center portion of the plate toward the processor subsystem chassis and lock it in place. Release pins on the extender arms and position them on both left and right side, arms must fit around the frame extenders and lock in place (do not adjust the lift tool once these arms are locked in place). Engage the lift tool brake. Install wheel chocks on lift tool casters.Note: When you attach the arms to the frame, ensure that any cables are clear of the extender arms. 24. One person should gently push the processor subsystem chassis toward the tool from the rear of the system (do not bend the vertical processor-subsystem chassis support bars). Two people should gently pull and guide the processor subsystem chassis onto the lift plate. The processor subsystem chassis must be removed far enough to clear the front face of the frame. Use the tie-down strap to hold the processor subsystem chassis to the lift plate. Retract the extender arms and center portion of lift plate. 25. Lower the processor subsystem chassis to the bottom position of the lift tool. 26. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the processor subsystem chassis to the staging area. The brake and wheel chocks must be in place when transferring the processor subsystem chassis to or from the packaging material.Replacement Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. Attention: Look for small amounts of debris on the connecting surfaces between the backplane and processor modules. Mating the two surfaces is critical for the processor to function properly. 1. Position the lift tool such that the new processor subsystem chassis can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. Chapter
9.
Removal
and
Replacement
Procedures
909
3. Position the plate at the correct level, and move the new processor subsystem chassis onto the tool with the rear side flush with the lift plate. Strap the processor subsystem chassis to the lift plate using the tie-down strap. 4. Position the lift tool as close as possible to the front of the rack. 5. Raise the lift tool until it is level with the rails. 6. Release the pins on the lift plate and fully extend the center portion of the plate toward the frame and lock it in place. Release the pins on the extender arms and position them on both left and right side. The arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). Engage the lift tool brake. Install wheel chocks on lift tool casters.Note: When you attach the arms to the frame, ensure that any cables are clear of the extender arms. 7. Remove the tie-down strap. One person should gently pull the processor subsystem chassis into the frame from the rear of the system (do not bend the vertical processor-subsystem chassis support bars). Two people should gently push and guide the processor subsystem chassis into the frame. Note: Adjust the height of the lift plate, as needed. 8. After the processor subsystem chassis is installed in the frame, lower the lift plate and remove it from the lift tool. Repackage the lift tool. 9. Reinstall all components removed during the removal process by reversing steps 22 on page 909 through 2 on page 908 of the removal procedure.Memory Books and Memory Blanks Removal 1. Turn off the system, and completely remove power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the safety bar from the memory slot. 3. Grasp the blue release latches located at the top and bottom of the memory book between your thumb and index finger, and press the tabs together. Pivot the tabs and pull the book straight back.Inspection You must examine the connector for any connector damage on the removed book using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If you identify a defect on the book, do not install the replacement card without contacting your next level support for instructions. A damaged connector on a defective book may indicate a damaged system backplane. If the memory book connector is damaged, the adjacent memory book may need to be removed in order to inspect the system backplane for damage. If the removed book is not damaged, carefully remove the new book from the packaging. Be sure to protect the connector and the blue handles when you remove the book from the cardboard box or ESD bag. Do not handle the book by its connector or either of its blue handles. Remove the protective cover from the connector and install it on the defective book.Notes: 1. Do not stack books on top of each other. ESD bags do not provide sufficient stacking protection. 2. Do not rest a book on its connector or on its blue handles at any time.After the memory book is removed from the ESD bag, examine the book VHDM connector using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If any connector shows signs of damage, do not use it. 910
406/90
Service
Guide
Replacement Note: If there is no book installed in the position next to the one that you are replacing, install a filler (part number 21P8218) in that adjacent location. To replace a book, do the following: 1. Inspect the new book for damage. For more information, see “Inspection” on page 910. 2. Unlatch the two blue handles on the replacement book. 3. Taking care not to hit the connector against the CEC cage, use both hands to carefully place the book on the CEC guides. 4. Slide the book in until it reaches the alignment pins. 5. Using even pressure, actuate both blue handles simultaneously until the book is fully seated. 6. Repeat this process for all replacement books. 7. Reinstall any safety bars that were removed to perform this procedure. 8. If any cables were removed during the removal procedure, plug all cables into their correct locations. 9. Turn on the power as described in “Powering the System On” on page 841.Fan Controller Removal Attention: The fan controller weighs approximately 0.45 Kg (1 lb.) 1. Locate the fan controller to be replaced. 2. Note the state of the fan controller’s LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the fan controller’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Label and remove the plugs from the fan controller. 6. Loosen the two knurled captive screws that attach the fan controller to the fan assembly. 7. Grasp the loosened screws, and pivot and pull the fan controller away from the fan assembly.Replacement 1. Align the metal tabs opposite the captive screws into the corresponding slots in the fan assembly. 2. Pivot gently to seat the autodocking connector. 3. Tighten the knurled captive screws to the fan assembly. 4. Plug all cables into their correct location in the DCA. 5. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. Chapter
9.
Removal
and
Replacement
Procedures
911
Processor Fan Assemblies Removal Attention: The fan controller weighs approximately 5 Kg (11 lb.) 1. Locate the fan assembly to be replaced. 2. Note the state of the fan controller’s LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the fan controller’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Verify that all cables are labeled for plug location, and unplug the cables from the fan controller on the fan assembly. 6. Remove the fan controller (see “Fan Controller” on page 911). 7. Loosen the two knurled captive screws that attach the fan assembly to the enclosure. 8. Pull the fan assembly away from the enclosure opening.Replacement 1. Position the fan assembly in front of the opening through the spring-loaded doors in the enclosure. 2. Slide the fan assembly into the enclosure opening and tighten the upper and lower fasteners. 3. Replace the fan controller onto the face of the fan assembly. 4. Plug all cables into their correct location in the DCA. 5. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure.Fan Chassis Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the top two fans. See “Processor Fan Assemblies.” 3. Loosen the four retaining screws that attach the fan chassis to the processor subsystem chassis. 4. Pull the fan chassis out of the processor subsystem chassis.Replacement Replace in reverse order. Processor Subsystem Air Filters Attention: To maintain proper airflow, the air filters (part number 11P3781) must be inspected every six months and replaced as necessary. To replace the air filters, it is not necessary to remove power from the system or blower assemblies . Removal To remove, grasp finger tabs on the air filter and pull straight out. Replacement To remove, slide the air filter into the slot. The filter is keyed to ensure correct orientation. 912
406/90
Service
Guide
I/O Subsystem I/O Subsystem PCI Slot LED Definitions 2264411533
1
I/O LED (Currently Unused)
4
I/O Subsystem Backplane Power On (Green)
2
Auxiliary Power Good (Green)
5
I/O Card Power On LED (Green)
3
I/O Subsystem Backplane Fault (Amber)
6
Failing I/O Card LED (Amber)
The following table describes the possible states of the LEDs associated with the I/O subsystem PCI slots. These LEDs are located at the rear of the I/O subsystem. One LED is associated with each PCI slot.
Green PCI Slot LED Indication
Amber PCI Slot LED
PCI Slot Power/Status
Definition
On
Off
On/No action
The PCI slot power is on and the slot is operational. Do not remove or replace adapters.
On
Flashing
On/Identify
The PCI slot power is on, and the slot has been identified by the software or firmware. Do not remove or replace adapters at this time.
Off
Flashing
Off/IdentifyOff/Action
The PCI slot power is off, and the slot has been identified by the software or firmware. Perform PCI adapter remove, replace, or install only when directed by the PCI hot-plug procedures.
Off
Off
Off/No power
The PCI slot power is off. If the system power is off, perform non-hot-plug PCI adapter remove, replace, or install procedure. Chapter
9.
Removal
and
Replacement
Procedures
913
PCI Adapters Note: Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. Most PCI adapters can be removed and replaced, or installed in the system without turning off the power to the entire system. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are not hot-pluggable and power must be removed from the system for adapter removal, replacement, or installation. Though some adapters are not hot-pluggable, they must still be installed in the cassette. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Before you remove or replace an adapter, determine if the PCI adapter you are working with is hot-pluggable. See the PCI Adapter Placement Reference, order number SA23-2504. When assembling, inserting or removing a blindswap cassette, verify that the foam ?T-shaped? EMC gasket strip is firmly attached to the sheet metal bezel. This includes all applications where these gaskets are used (for example, riser cards, blank fillers or any of the various cassette assemblies). Visually inspect the top and bottom of the gasket strips insuring that they are firmly bonded to the sheet metal. Look for any of the following: v gaskets that are starting to unwrap v gaskets lifting or peeling off of the sheet metal v gaskets protruding from the sheet metal bezel when all cassettes, fillers, or risers are plugged into the system. Inspect both the gasket on the cassette or filler that was removed and the gasket visible in the PCI slot. If a bad gasket is found on a cassette, then the entire cassette must be replaced with a new cassette and gasket. If a bad gasket is found on a riser card, then the I/O Planar containing the riser card needs to be replaced. If you are removing a PCI adapter that is: v Not hot-pluggable, go to “Removing a Non-Hot-Pluggable PCI Adapter” on page 920. v Hot-pluggable, go to “Hot-Pluggable PCI Adapter” on page 921. 914
406/90
Service
Guide
Removing a PCI Adapter Cassette Attention: Perform this procedure only when you are instructed to by the removal and replacement procedures in “Removing a Non-Hot-Pluggable PCI Adapter” on page 920 or “Hot-Pluggable PCI Adapter” on page 921.Note: An adapter or a blank filler must be installed into the PCI adapter cassette before it is reinstalled in a processor subsystem or an I/O subsystem. The following illustration shows the PCI adapter cassette.
1
PCI Adapter Cassette
2
Handle-Release Lever Latch
3
Locking Bar
4
Handle Release Lever
5
Handle
6
PCI Adapter Tailstock Tab
7
Adapter Cassette Pull Tab
Chapter
9.
Removal
and
Replacement
Procedures
915
To remove a PCI adapter cassette, do the following: 1. Determine which PCI adapter cassette you are removing. 2. Disconnect any cables that are connected to the PCI adapter. 3. Press the handle release lever, then push the handle to the down position. See the following illustration.
1
Locking Bar
2
Handle Release Lever
3
Handle in the Down Position
4. Push in the locking bar until it clicks into the locked position. See the following illustration.Note: In this position, the locking bar locks the handle in the down position and prevents it from being pulled up while the PCI adapter cassette is being removed.
916
406/90
Service
Guide
1
Locking Bar
2
PCI Adapter Tailstock Tab
3
Adapter Cassette Pull Tab
5. Position your index finger under the handle and push up the adapter tailstock tab. Grasp the adapter cassette pull tab with the other hand. Pull the PCI adapter cassette from the adapter slot by pulling on the adapter cassette pull tab. Do not pull on the handle when removing the adapter cassette. Notes: a. Pulling on the handle may reengage the adapter connector, preventing the cassette from sliding out. b. Do not force the PCI adapter cassette out of the PCI adapter slot. The cassette will slide out smoothly when this step is performed correctly. c. Do not remove more than one adapter cassette at a time.
Chapter
9.
Removal
and
Replacement
Procedures
917
Replacing a PCI Adapter Cassette: Attention: Perform this procedure only when you are instructed to from “Removing a Non-Hot-Pluggable PCI Adapter” on page 920 or “Hot-Pluggable PCI Adapter” on page 921. Before you replace the PCI adapter cassette, check the following: v Before installing a PCI adapter cassette in the system, an adapter or blank filler must be installed in each PCI adapter cassette and the top cover for the system must be installed. v Check the adapter in the cassette that you are about to install, and make sure that the adapter is held firmly on all four corners by the retaining arms and ratchet clips. v Review the procedures beginning with “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935 to make sure that the adapter is correctly installed in the cassette.To replace a PCI adapter cassette, do the following: 1. Depress the handle release lever located in the center of the PCI adapter cassette handle. Move the handle down completely to the fully down position. See the following illustration.
1
Cassette Dovetail Rails
2
Handle-Release Lever Latch
3
Handle Release Lever
2. Push the top of the locking bar in until it clicks into the locked position. 918
406/90
Service
Guide
Note: In this position, the locking bar locks the handle in the down position and prevents it from being pulled up while the PCI adapter cassette is being removed. 3. Examine the PCI adapter slot opening carefully, and note the location of the dovetail rails and the cassette guide rails. A flashlight can be used to enable you to see the dovetail rails and the cassette guide rail slots, as shown in the following illustration.
1
Cassette Left Dovetail Rail
4
Cassette Bottom Guide Rail
2
Cassette Right Dovetail Rail
5
Side Guide Rail Slot on System Board
3
Cassette Side Guide Rail
6
Bottom Guide Rail Slot on System Board
4. Hold the PCI adapter cassette at the PCI adapter slot so that the PCI adapter cassette is perpendicular to the slot and level with the slot opening. 5. Align the bottom edge of the PCI cassette cover with the PCI cassette guide rail on the system board or I/O unit. 6. Slide the cassette partially into the guide. Attention: If the cassette does not slide in smoothly, the cover might not have engaged the PCI cassette guide rail. Carefully pull the adapter back out and start again. 7. As you slide the cassette into the slot, ensure that the dovetail rail on the top of the PCI adapter cassette aligns with the mating slots on any cassettes that might be installed on the left and right sides. 8. Push the cassette in until it is flush with any other installed cassettes. Chapter
9.
Removal
and
Replacement
Procedures
919
9. Visually check to ensure that the cassette is parallel with the cassettes to the left or right. If the cassette is not parallel with the neighboring cassettes, the cassette dovetail rails might not be correctly engaged. If the cassette is not correctly aligned, you might need to pull it out so you can see the ends of the dovetail rails. If so, reinsert the cassette, ensuring that the dovetail rails engage on both sides of the cassette.Note: If there are no other cassettes installed, it is not necessary to check the alignment of the dovetail rails. 10. Depress the handle release latch. Lift the handle up until the handle-release lever clicks when it locks into position. Note: To aid you in seating the PCI adapter connector, it may be necessary to press down on the adapter tailstock tab. The PCI adapter is completely installed into the PCI adapter slot when the handle is in the fully raised position and the tailstock is seated into the slots located on the system board. Attention: Do not force the handle into the raised position.Removing a Non-Hot-Pluggable PCI Adapter: To remove a non-hot-pluggable adapter, do the following: 1. Turn off power and remove the power from the system as described in “Powering Off and Powering On the System” on page 838. 2. Determine the slot from which you are removing the adapter. 3. Refer to “Removing a PCI Adapter Cassette” on page 915. When you have completed the PCI adapter cassette removal procedure, return here. 4. If you are installing another adapter in this slot, follow the instructions given in “Replacing a Non-Hot-Pluggable PCI Adapter.” If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot. Refer to “Replacing a PCI Adapter Cassette” on page 918. 5. Connect power to the system and turn the power on. 6. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus SystemsDiagnostic Information for Multiple Bus Systems.Replacing a Non-Hot-Pluggable PCI Adapter: Replace the adapter using the following procedure:Note: Because the adapter you are installing is not hot-pluggable, shut down the system and remove power before performing this procedure. 1. If you have just removed an adapter and you are replacing it in the system, use the procedures in “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935 to install the replacement adapter into the PCI adapter cassette. 2. Go to “Replacing a PCI Adapter Cassette” on page 918 to replace the appropriate cassette in the PCI slot, then return here. 3. Connect power to the system. 4. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Refer to ?Starting the System Without an HMC Attached? in your installation or service guides. 5. Set the color slide to blue. 6. Ensure that the adapter is configured when the system completes the boot process.
920
406/90
Service
Guide
Hot-Pluggable PCI Adapter: Before performing the following procedure, read “Safety Notices” on page vii. Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The following hot-plug procedures take you through removing and replacing hot-plug PCI adapters using software that presents procedures on your display. The LEDs on a PCI adapter cassette are described in either your installation guide or service guide. Removing and Replacing a Hot-Pluggable PCI Adapter: Notes: 1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “Removing and Replacing a Hot-Pluggable PCI Adapter,” and then to “Installing a New Hot-Pluggable PCI Adapter” on page 923. 2. When a system containing a defective adapter is booted, the system considers that the PCI slot is empty, and adapter LEDs do not light. Under this condition, it is safe to remove the adapter from the system at this time. To replace an adapter, perform the following steps: 1. Determine the slot from which you are removing the adapter. 2. Ensure that any processes or applications that might use the adapter are stopped. Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. 3. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 4. At the command line, type smitty. 5. Select Devices. 6. Select PCI Hot Plug Manager. 7. Select Unconfigure a Device and press Enter. 8. Press F4 to display the Device Names menu. 9. From the menu, select the adapter you are removing. 10. If you are replacing the adapter, go to step 11 on page 922. If you are not replacing the adapter, do the following: a. Use the Tab key to answer NO to Keep Definition. Press Enter. b. Go to step 12 on page 922. Chapter
9.
Removal
and
Replacement
Procedures
921
11. Answer YES to Keep Definition. Press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select List PCI Hot-Plug Slots. 15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to step 17. 16. If the List PCI Hot-Plug Slots menu indicates that the slot is empty, do the following: a. Press F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Look at the adapter you intend to remove. A slow blinking amber LED located next to the adapter on the PCI riser card indicates that the slot has been identified. Press Enter. The adapter has been put into the action state. d. Disconnect any cables that are connected to the adapter being removed. e. Completely remove the hot-plug adapter from the slot. The LED goes off. f. Go to step 21.17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed, and then refer to “Removing a PCI Adapter Cassette” on page 915. 18. Install the replacement adapter into the PCI adapter cassette. See “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. 19. With the replaced adapter installed into the PCI adapter cassette, refer to “Replacing a PCI Adapter Cassette” on page 918. 20. If you installed an adapter into the adapter cassette, connect the appropriate cables and devices to the adapter connectors. 21. Continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 22. If you did not install an adapter into the adapter cassette, go to step 23. If you did install an adapter into the adapter cassette, do the following: a. Press the F3 key to return to the PCI Hot-Plug Manager menu. b. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen.23. If you do not have other adapters to replace, continue with the next step.ORIf you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14. 24. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration.
922
406/90
Service
Guide
Installing a New Hot-Pluggable PCI Adapter: Note: Before installing a new adapter, ensure the device driver for the adapter is installed. Check with the system administrator if the device driver is not already installed. To install an adapter, perform the following steps: 1. Install the new adapter into a PCI adapter cassette. See “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. 2. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 3. At the command line, type smitty. 4. Select Devices. 5. Select PCI Hot Plug Manager. 6. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 7. For adapter placement information, see the PCI Adapter Placement Reference, order number SA23-2504. Select an empty PCI slot for the adapter. 8. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 9. Follow the instructions on the screen to install the adapter until the visual indicator (LED) for the specified PCI slot is set to the Action state. 10. Install the new adapter in the adapter slot by performing the following: a. Lower the black handle completely. b. Push the locking bar until you hear a clicking sound. c. Hold the assembly straight on and level with the slot. d. Align the bottom edge of the PCI cassette cover with the PCI adapter guide rail on the I/O backplane. Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with the ?tick? in the notch of the neighboring cassette. Chapter
9.
Removal
and
Replacement
Procedures
923
e. Slide the cassette partially into the guide. f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides.
1
I/O Subsystem
2
PCI Adapter Cassette
g. When the cassette is fully inserted, prepare to activate the handle by lowering the locking bar. Lift the handle all the way up until you hear a click. The PCI adapter should be completely seated. h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable.11. Connect the appropriate cables and devices to the adapter. 12. Continue to follow the screen instructions until you receive a message that the installation is successful. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press the F3 key to return to the PCI Hot-Plug Manager menu. 14. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 15. If you do not have other adapters to install, continue with the next step. OR If you have other adapters to install, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 6 on page 923. 16. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration.PCI Hot-Plug Manager Access: Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is appropriate to perform them. Note: A PCI adapter is only hot-pluggable if the PCI adapter supports hot-plug applications. See the PCI Adapter Placement Reference, order number SA23-2504.
924
406/90
Service
Guide
Accessing Hot-Plug Management Functions: Note: Removing or installing a Hot-Pluggable PCI adapter requires the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. To access the hot-plug menus, do the following: 1. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. For a description of the menu options, see below.PCI Hot-Plug Manager Menu: The following options are available from the PCI Hot Plug Manager menu: Note: For information about the PCI slot LED states, refer to either your installation guide or service guide. List PCI Hot-Plug Slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the adapter in that slot. Add a PCI Hot-Plug Adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure Devices Added After IPL option to configure the adapter. Replace/Remove a PCI Hot-Plug Adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see “Unconfigure a Device” option below). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Chapter
9.
Removal
and
Replacement
Procedures
925
Identify a PCI Hot-Plug Slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. Unconfigure a Device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. Configure a Defined Device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. Install/Configure Devices Added After IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, see the following: v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative.
926
406/90
Service
Guide
PCI Adapter or Blank Filler Removal from a Screw Assembly Type Cassette: Note: Before doing the following procedure determine if your PCI adapter cassette is snapped together or held together with three screws. If your PCI adapter cassette is held together with three screws, follow this procedure. If your PCI adapter cassette is snapped together, see “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. PCI adapter cassettes that are snapped together, can be identified by the blue grommet and snap button located next to the PCI adapter cassette release handle.Removal: 1. Remove the PCI card assembly from the I/O subsystem (for hot-pluggable adapters see “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921, for non-hot-pluggable adapters, see “Removing a Non-Hot-Pluggable PCI Adapter” on page 920). 2. Position the assembly on a flat surface, with the tailstock to the left and the card connector toward you. 3. Remove the three screws from the top cover groove, and the two screws at the heel corner. 4. Pop the rear end of the cover off the guide bump and remove the cover. 5. Position the assembly on a flat surface, with the tailstock to the right and the card connector toward you. 6. Remove the three screws from the top of the bezel assembly. 7. Position the assembly on a flat surface, with the tailstock to the left and the card connector toward you. 8. Flex the support arm downward until the top clip on the card arm is accessible. 9. Use a small screwdriver to lift the guide lock, and slide the card arm away from the PCI card until the card end is free. 10. Push the lower-front heel off the card, and remove the PCI card from the assembly.
Chapter
9.
Removal
and
Replacement
Procedures
927
Replacement: 1. Ensure that you have the following parts: v Bezel and linkage subassembly v Cover v Clip v Insertion Clip (see the following note) v Short card arm v Long card arm v Screws (10--2 are extra)Note: If you are installing a Gigabit Ethernet (FC 2969 type 9U), Ultra3 SCSI (FC 6203 Type 4Y), or 10/100 BaseT Ethernet (FC 4962 Type AF), use the additional clips listed below to help when you are seating the adapter during installation. There are different clips (1) for each adapter type (2): 12 44P0323 Clip for Gigabit Ethernet FC 2969 type 9U 44P0324 Clip for Ultra3 SCSI FC 6203 Type 4Y 44P0321 Clip for 10/100 BaseT Ethernet FC 4962 Type AF 928
406/90
Service
Guide
2. Position the bezel and linkage subassembly with the bezel to the left, on a flat ESD surface. 3. Pivot the bezel so that the lower end is away from the linkage subassembly.Note: Ensure that the top front clip is in the uppermost position before continuing. a. If the adapter card is full-height (that is, taller than the tailstock of the card), do the following: 1) Place the top corner of the PCI card so that it seats in the top front clip, as shown in the following diagram:
1
Cassette Top Linkage Bracket
4
Front Heel Clip
2
Top Front Clip
5
Adapter Cassette Bezel
3
Long Adapter
2) Rotate the bottom of the tailstock to fit in the front heel clip. 3) To remove any slack in the fit, use pliers or screwdriver to adjust the top front clip downward.b. If the adapter card is not full-height (that is, shorter than the tailstock of the card), do the following: 1) Place the bottom corner of the PCI card on the front heel clip. 2) To remove any slack between the top and bottom clips, use pliers or screwdriver to adjust the top front clip downward. If the top clip is pushed too far down without engaging the card, remove the top front clip by popping it off the rail. Reattach the clip and start again. Chapter
9.
Removal
and
Replacement
Procedures
929
4. If the new PCI adapter does not extend beyond the mark at the top of the linkage subassembly, attach the appropriate short card arm card. If the adapter extends beyond this mark, attach the long card arm to the support arm of the linkage subassembly as follows:
1
Mark at the Top of Linkage Subassembly
5
Short Adapter
2
Top Rear Clip (Shown on the long, and short card arms and middle-spacer arm)
6
Top Front Clip
3
Long Card Arm
7
Long Adapter
4
Short Card Arm
Note: Ensure the card arm is fully on the track and clears the end of the support arm of the linkage subassembly. A flat head screwdriver might be needed to push the card arm in the remaining distance. a. Bend the support arm downward, to allow the middle spacer (long cards only) and card arm to slide onto the end of the support arm. b. Slide the card arm until the top-rear and lower clips align with the edge of the card. 930
406/90
Service
Guide
c. Press the lower clip under the bottom edge of the card. The bottom of the clip should be even with the bottom of the card. d. Slide the top rear clip of the card arm downward to engage the top corner of the card. The card should now be securely stabilized in all the clips, as shown in the following illustration.
1
Short Card Arm Top Clip
6
Short Card Arm Bottom Clip
2
Short Card Arm
7
Short Adapter Card
3
Long Card Arm Top Clip
8
Bottom Front Clip
4
Long Card Arm
9
Adapter Cassette Bezel
5
Long Card Arm Bottom Clip
10
Front Top Clip
5. Tilt the top of the bezel away from the PCI card while lowering the bottom of the tailstock into bottom of the bezel, as shown in the following illustration.
1
Adapter Cassette Bezel
2
Short Card Arm
3
Long Card Arm
Chapter
9.
Removal
and
Replacement
Procedures
931
6. When the lower corner of the card is fully seated into the bezel, rotate the top of the bezel over the tailstock of the card.
1
Adapter Card
2
Adapter Card (Lower Corner)
3
Front Bottom Clip
4
Adapter Cassette Bezel
7. Position the bezel fully onto the assembly by snapping the three small holes on bezel arm over the three small embossments on the linkage assembly, as shown in the following illustration.
1
Alignment Studs
2
Adapter Card
3
ESD Surface
932
406/90
Service
Guide
8. Secure the bezel arm to the linkage assembly with three small screws in the sequence shown in the following illustration.
1
Retaining Screws
9. With the tailstock to the left on a flat surface, position the cover over the linkage assembly. 10. Tuck the top and bottom tabs at the left of the cover under the bezel.
1
Front Top Tab Chapter
9.
Removal
and
Replacement
Procedures
933
2
Front Bottom Tab
11. Snap the top-right corner of the cover over the guide bump on the linkage assembly. 12. Align the holes at the top of the cover with those in the linkage subassembly and screw the cover to the linkage subassembly in the sequence shown in the following illustration.15432 13. Snap the tailstock clip onto the tailstock by rotating it onto the bracket while inside the bezel opening. 14. Raise the lever to the horizontal extended position and snap the tailstock clip onto the tailstock tab as follows. 15. Visually inspect the assembly and ensure that the following are correct: v The card moves up and down correctly when the handle is raised and lowered. v The tailstock clip rides within the bezel opening.
934
406/90
Service
Guide
PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette: CAUTION: Metal edges might be sharp.C38Note: Before doing the following procedure determine if your PCI adapter cassette is snapped together or held together with three screws. If your PCI adapter cassette is held together with three screws, go to “PCI Adapter or Blank Filler Removal from a Screw Assembly Type Cassette” on page 927. PCI adapter cassettes that are snapped together, can be identified by the blue grommet and snap button located next to the PCI adapter cassette release handle. Use this procedure when you are preparing to install or replace an adapter or an adapter blank filler in the system unit or an I/O drawer. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Determine the slot of the system or I/O drawer in which you plan to install or remove an adapter, and refer to the procedures for removing an adapter from the system. Before beginning this procedure, remove the PCI adapter cassette assembly from the system unit or I/O drawer.Note: It may take approximately 30 to 40 minutes to perform this procedure the first time. This time includes using the instructions in this guide and performing the steps. Thereafter, performing this procedure usually takes approximately 10 minutes. Before performing the following steps, familiarize yourself with the entire procedure. To remove a PCI adapter or blank filler from a cassette assembly, do the following: 1. Place the PCI adapter cassette assembly on a flat work surface with the cover facing up, and the top of the adapter facing you. See the following illustration.
1
Cover
3
Top of Cover
2
Handle
Chapter
9.
Removal
and
Replacement
Procedures
935
2. Using two fingers, remove the bushing-lock pin from the bushing. The bushing-lock pin can be removed by pulling it out of the bushing with your fingernails.
1
Bushing
2
Bushing-Lock Pin
1
Bushing-Lock Pin
936
406/90
Service
Guide
3. Remove the bushing. The bushing can be removed by pulling it out of the PCI adapter cassette assembly with your fingernails.
1
Bushing
4. Turn over the PCI adapter cassette assembly so that the top is facing away from you.
1
Top of Adapter
2
Handle
Chapter
9.
Removal
and
Replacement
Procedures
937
5. Remove the bezel, as follows: a. Locate the plastic latch fingers in the top part of the cassette.
1
Top of Cassette
2
Bezel
3
Plastic Latch Fingers
b. Using one hand, pinch the plastic latch fingers, and with your other hand, carefully lift the top part of the bezel extension out until the tabs clear the slots in the PCI adapter cassette assembly.
1
Slots
2
Tab
3
Bezel Extension
938
406/90
Service
Guide
c. While holding the bezel extension out, push the plastic cover latch out of the bezel hook, as shown in the following illustration. This action allows the bezel to be removed.
1
Bezel Extension
2
Plastic Cover Latch in Bezel Hook
d. On the opposite side of the cassette (cover side), push the cover latch to release the bezel.
1
Cover Side
2
Cover Latch
Chapter
9.
Removal
and
Replacement
Procedures
939
e. While holding the bezel extension out, carefully slide the bezel off. You might have to work from both sides to loosen the bezel assembly from the cassette assembly. When the bezel is free, slide it completely off the cassette assembly, and set it aside.
1
Bezel Extension
2
Bezel
1
Bezel
940
406/90
Service
Guide
6. Turn over the cassette so that the cover is facing up, as shown in the following illustration. Remove the cover from the cassette as follows: a. Slide the cover until it releases from the cassette assembly. Attention: The cover might be tight and difficult to slide. If you grasp the left end (handle end) of the cassette and the right end of the cover, you can use enough force to pull the cover off the PCI adapter cassette assembly.
1
Left End of the Cassette
2
Cassette Cover
b. Lift the cover off the assembly, and set it aside. c. Depress the handle release lever to unlock the locking bar. The locking bar is located on the handle, ensure that the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly until it locks into the up position (the blank filler or adapter moves downward).
1
Handle Release Lever
2
Locking Bar
Chapter
9.
Removal
and
Replacement
Procedures
941
7. Remove the metal EMC shield from the top of the tailstock.12
1
Metal EMC Shield
8. Remove the blank filler or adapter that is installed in the cassette linkage assembly.Note: If there is a blank filler in the cassette linkage assembly, as shipped from the manufacturer, there are two adapter arms. One adapter arm is used with short adapters (short adapter arm) and the other adapter arm is used with long adapters (long adapter arm). If you are removing a short adapter or blank filler, see “Short Adapter or Blank Filler Removal” on page 943. If you are removing a long adapter, see “Long Adapter Removal” on page 946.
942
406/90
Service
Guide
Short Adapter or Blank Filler Removal: To remove a short adapter or blank filler, do the following: 1. Slide the long and short adapter arms away from the adapter or blank filler by doing the following: a. Each adapter arm has a release tab that allows the arm to be moved away from the adapter or blank filler in the cassette assembly. Use your fingernail to lift the tab, to allow each arm to be moved away from the adapter or blank filler. Lift the release tab on the short adapter arm, and push on the slotted tab to release the end of the blank filler.Note: If you plan to install a short adapter, leave the long adapter arm on the cassette linkage assembly. If you plan to install a long adapter, remove both the long and short adapter arms from the cassette linkage assembly in the next step. b. Slide the long and short adapter arms away from the blank filler or adapter.
1
Long Adapter Arm
2
Release Tab
3
Release Tab
4
Short Adapter Arm
5
Slotted Tab
Note: Some dual linkage PCI adapter cassettes might contain a middle-spacer arm to be used with long adapter cards only. The middle-spacer arm and its installation can be seen in the following Chapter
9.
Removal
and
Replacement
Procedures
943
illustration.213453
1
Short PCI Adapter Card
2
Short Adapter Arm
3
Middle-Spacer Arm
4
Long Adapter Arm
5
Long PCI Adapter Card
944
406/90
Service
Guide
2. Remove the adapter or blank filler from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter or blank filler in a safe place.123
1
Cassette Linkage Assembly
2
Bottom of Tailstock
3
Handle
3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a New Hot-Pluggable PCI Adapter” on page 923.
Chapter
9.
Removal
and
Replacement
Procedures
945
Long Adapter Removal: To remove a long adapter, do the following: 1. Each adapter arm has a release tab that allows the arm to be moved away from the adapter in the cassette assembly. Use your fingernail to lift the tab, to allow the arm to be moved away from the adapter. Lift the release tab on the long adapter arm, and slide it off the cassette linkage assembly.
1
Release Tab
2. Remove the adapter from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter in a safe place.
1
Cassette Linkage Assembly
3
Bottom of Tailstock
2
Adapter
4
Handle
3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a New Hot-Pluggable PCI Adapter” on page 923.Note: If you are going to install a short adapter card, and your cassette linkage has the middle-spacer arm installed, remove the middle-spacer arm and install the short adapter arm, refer to page 1a on page 943. The middle-spacer arm and long adapter arm can be stored on the cassette linkage. See the illustration on page 944. To install a short adapter into a cassette see “Replacing an Adapter in a PCI Adapter Cassette.”Replacing an Adapter in a PCI Adapter Cassette CAUTION: Metal edges might be sharp.C38
946
406/90
Service
Guide
This procedure is performed when preparing to install a new adapter or a blank filler in the processor subsystem or an I/O drawer. Familiarize yourself with the entire procedure before performing the following steps. 1. Place the empty PCI Adapter Cassette Assembly linkage on a flat work surface in front of you. Position the cassette so that the handle is in the raised (up) position, and on the left, with the top of the linkage facing away from you.Note: Your PCI Adapter Cassette Assembly linkage might not have the short or long adapter retaining arms or the middle-spacer arm installed if you removed them during the removal procedure. In the following illustration, both arms are shown.
1
Handle
2
Cassette Linkage Assembly
3
Retaining Arms
4
Middle-Spacer Arm
Chapter
9.
Removal
and
Replacement
Procedures
947
Note: If you are installing an adapter that is in the following list, use the additional brackets listed to help when you seat the adapter during installation. There are unique brackets (1) for each adapter type (2):12
Feature Code
Description
Bracket Part Number
2751
S/390 ESCON Channel PCI Adapter
31L8728
2969
Gigabit Ethernet-SX PCI Adapter
44P2676
2975
10/100/1000 Base-T Ethernet PCI Adapter
44P2676
4962
10/100 Mbps Ethernet PCI Adapter II
44P0321
5700
IBM Gigabit Ethernet-SX PCI-X Adapter
53P5450
5701
IBM 10/100/1000 Base-TX Ethernet PCI-X Adapter
53P5450
5706
IBM 2-Port 10/100/1000 Base-TX Ethernet PCI-X Adapter
53P5450
5707
IBM 2-Port Gigabit Ethernet-SX PCI-X Adapter
53P5450
6203
PCI Dual Channel Ultra3 SCSI Adapter
44P2661
6228
2 Gigabit Fibre Channel Adapter for 64-bit PCI Bus
44P2675
6239
2 Gigabit Fibre Channel PCI-X Adapter
44P3912
948
406/90
Service
Guide
2. To determine if you are installing a long adapter or a short adapter, use the following steps. If you know the length of your adapter, skip the following steps and proceed to step 3 on page 951. a. Remove the adapter from its protective packaging and, if present, remove the plastic extension handle from the end of the adapter. b. Place the adapter or a blank filler over the cassette, with the upper-left corner aligned into the top adapter-retaining clip. 54231
1
Top Adapter-Retaining Clip
3
Adapter
2
Cassette Linkage Assembly
4
Bottom of Tailstock
5
Handle
Chapter
9.
Removal
and
Replacement
Procedures
949
c. The cassette linkage assembly is marked to show long or short adapters similar to the adapter shown in the following illustration. Check the right end of the adapter to determine if it is long enough to fall into the long area of the cassette linkage assembly. Ensure that the corner on the left end of the adapter is still aligned into the top adapter-retaining clip, and determine the length of the adapter.LONGARMARMSHORT12
1
Top Adapter-Retaining Clip
2
Long Adapter-Retaining Arm and Clip
950
406/90
Service
Guide
3. Remove the adapter or blank filler and ensure that the handle is in the up position. If you need to move the handle, ensure that the locking bar is pulled into the unlocked position, and then rotate the adapter handle until the handle is in the up position (the cassette linkage assembly will extend down beyond the cassette top). If you are installing a short adapter or blank filler, go to “Short Adapter or Blank Filler Installation.” If you are installing a long adapter, go to “Long Adapter Installation” on page 968.21
1
Handle Release Lever
2
Lock
Short Adapter or Blank Filler Installation To install a short adapter or blank filler, do the following: Chapter
9.
Removal
and
Replacement
Procedures
951
1. Place the adapter or blank filler into the cassette so that the upper-left corner of the adapter engages the adjustable top adapter-retaining clip as shown in the following illustration.4312
1
Cassette Linkage Assembly
2
Bottom of Tailstock
3
Retaining Clip
4
Handle
952
406/90
Service
Guide
2. Rotate the adapter so that the adapter engages the slot in the bottom adapter-retaining clip and the top corner of the adapter is seated into the adjustable top adapter-retaining clip.Note: If the adapter is not a full-height adapter, you must slide the adjustable top adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom adapter-retaining clip.321
1
Adapter
2
Slot
3
Bottom Retaining Clip
Chapter
9.
Removal
and
Replacement
Procedures
953
3. Slide the short adapter-retaining arm toward the adapter or blank filler on the cassette linkage rail. If the short-adapter retaining arm has been removed from the cassette linkage rail, install the short-adapter retaining arm and then slide the arm until it contacts the adapter or blank filler as shown in the following illustration.123
1
Short-adapter Retaining Arm
2
Adapter or Blank Filler
3
Cassette Linkage Rail
954
406/90
Service
Guide
4. Ensure that the adjustable top retaining clip catches the corner of the adapter as shown in the following illustration: 1
1
Short Adapter Top Retaining Clip
5. Use the lower short adapter arm clip to engage and hold the bottom of the adapter.Note: It might be necessary to apply pressure to engage and hold the bottom of the adapter.1
1
Lower Short Adapter Arm Clip
Chapter
9.
Removal
and
Replacement
Procedures
955
6. To hold the top of the adapter, slide down the adjustable top adapter-retaining clip on the retaining arm. Ensure that the bottom edge of the adapter is held by the lower part of the adapter retaining arm.
1
Adjustable Top Adapter-Retaining Clip
956
406/90
Service
Guide
7. Press the lock on the handle, and rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 21
1
Handle Release Lever
2
Lock
Ensure that the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown in the following illustration:1
1
Right End of Cassette Linkage
Chapter
9.
Removal
and
Replacement
Procedures
957
8. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you.21
1
Top of Cassette
2
Handle
958
406/90
Service
Guide
9. Install the cover on the cassette assembly, as follows: a. Place the cassette cover on the cassette assembly as shown in the following illustration. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 21
1
Handle
2
Holes Aligned
Chapter
9.
Removal
and
Replacement
Procedures
959
10. Install the bezel assembly using the following procedure: a. Carefully slide the bezel onto the cassette assembly.1
1
Bezel
b. Align and insert the cover arm latch in the hooked notch in the bezel.12
1
PCI Adapter Pull Tab
2
Hooked Notch in the Bezel for the Cover Arm Latch
960
406/90
Service
Guide
c. Align the top of the bezel assembly into the grooves on the top of the cassette assembly. Push the bezel onto the cassette linkage assembly until the tab on the top of the bezel is seated in the recess of the cassette assembly.1234
1
Recess
3
Tab
2
Groove
4
Bezel Assembly
d. Insert the two tabs on the bezel extension into the two slots on the cassette linkage assembly.1
1
Tabs
Chapter
9.
Removal
and
Replacement
Procedures
961
11. Check for the following: a. Ensure that the extension arm engages the pins on the cassette. You should be able to see the three pins in the holes in the arm. The bezel tab should be seated as shown in the following illustration.111
1
Pin
b. Ensure that the cover arm latch is completely pressed into the hooked notch on the bezel as shown.12
1
PCI Adapter Pull Tab
2
Hooked Notch in the Bezel
962
406/90
Service
Guide
c. Turn the cassette assembly over so the cover is facing up. Check the cover latch to ensure that it is holding the bezel to the cover as shown.1
1
Cover Latch
12. Ensure the holes are aligned, and insert the bushing-lock pin into the bushing as shown.
1
Bushing
2
Bushing-Lock Pin
Chapter
9.
Removal
and
Replacement
Procedures
963
The following illustration shows the location of the bushing in the cassette.1
1
Location of the Installed Bushing and Bushing-Lock Pin
13. Insert the bushing-lock pin into the hole in the bushing, and push it in until it seats as shown in the following illustration.1
1
Bushing-Lock Pin
964
406/90
Service
Guide
14. Depress the handle release lever to ensure the locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly until it locks into the up position (the blank filler or adapter moves downward).
1
Handle Release Lever
2
Locking Bar
15. To install the metal EMC shield on the adapter bracket, grasp the EMC shield as shown in the following illustration, and do the following: 12
1
Metal EMC Shield
2
Installation Location for the Metal EMC Shield
Chapter
9.
Removal
and
Replacement
Procedures
965
a. Ensure that the metal EMC shield slides up inside the top of the cassette.12
1
Metal EMC Shield
b. The metal EMC shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the metal EMC shield to the tailstock.1
1
Top of Tailstock
966
406/90
Service
Guide
16. Depress the handle release lever on the handle, and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1
1
Handle
17. Using the system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange color to be visible. If the adapter is not hot-swappable, the blue color is visible.12
1
Orange indicates Hot-Swappable
2
Blue indicates Not Hot-Swappable
18. The adapter is ready to be installed into a system or an I/O drawer. Chapter
9.
Removal
and
Replacement
Procedures
967
Long Adapter Installation To install a long adapter, use the following procedure. 1. If installed, remove both long and short adapter retaining arms. See step 1a on page 943 to remove the arms. 2. Install the middle-spacer arm, see the following illustration. Note: Some long adapters are shipped with a middle-spacer arm. If your adapter uses the middle-spacer arm, slide it onto the cassette linkage as shown in the following illustration. If there is an embossed line on the face of the top rail, align the middle-spacer arm as shown in the following illustration. If there is no line embossed on the face of the top rail, align the middle-spacer arm with the leading edge of the adapter’s connector, as shown in the following illustration.465312
1
Embossed Middle-Spacer Alignment Lines
4
Adapter Connector (Leading Edge)
2
Top Cassette Rail
5
Long Adapter Card
3
Middle-Spacer Arm
6
Long Adapter Arm
968
406/90
Service
Guide
3. Place the adapter into the cassette so that the upper-left corner of the adapter engages the top adjustable adapter-retaining clip as shown in the following illustration:54231
1
Top Adapter-Retaining Clip
3
Adapter
2
Cassette Linkage Assembly
4
Bottom of Tailstock
5
Handle
Chapter
9.
Removal
and
Replacement
Procedures
969
4. Rotate the adapter so that the adapter engages the slot in the bottom retaining clip and the top corner of the adapter is seated into the top adjustable adapter-retaining clip.Note: If the adapter is not a full-height adapter, slide the top adjustable adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom retaining clip.213
1
Adapter
3
Bottom Retaining Clip
2
Slot
970
406/90
Service
Guide
5. Install the long adapter-retaining arm onto the cassette linkage rail, as shown in the following illustration, and slide the arm toward the adapter.123
1
Cassette Linkage Assembly
3
Top Adapter Retaining Clip
2
Adapter Retaining Arm
Chapter
9.
Removal
and
Replacement
Procedures
971
6. Ensure that the top adjustable adapter-retaining clip catches the corner of the adapter as shown in the following illustration.
1
Top Adjustable-Retaining Clip
7. Ensure that the bottom edge of the adapter is held by the groove in the lower part of the adapter-retaining arm.12
1
Top Adjustable-Retaining Clip
2
Long Adapter Retaining Arm Slot
972
406/90
Service
Guide
8. To hold the top of the adapter, slide down the top adjustable adapter-retaining clip on the retaining arm. 1
1
Long Adapter Retaining Arm
9. If you installed a middle-spacer arm, ensure that the adapter-retaining clip is seated around the top edge of the adapter. 10. Depress the handle release lever on the handle. Rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 21
1
Handle
2
Handle Release Lever
Chapter
9.
Removal
and
Replacement
Procedures
973
Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown:1
1
Right End of Cassette Linkage
11. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you.12
1
Handle
2
Top of Cassette
974
406/90
Service
Guide
12. Install the cover on the cassette assembly by placing the cassette cover on the cassette assembly as shown. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 21
1
Handle
2
Hole Aligned in the Cover
Chapter
9.
Removal
and
Replacement
Procedures
975
13. Install the bezel assembly using the following procedure: a. Carefully slide the bezel onto the cassette assembly.1
1
Bezel
b. Align and insert the cover arm latch in the hooked notch in the bezel.12
1
PCI Adapter Pull Tab
2
Hooked Notch in the Bezel
976
406/90
Service
Guide
c. Align the top of the bezel assembly into the grooves on the top of the cassette assembly, and push the bezel onto the cassette linkage assembly until the tab on the top of the bezel is seated in the recess of the cassette assembly.1234
1
Recess
3
Tab
2
Grooves
4
Bezel
d. Insert the two tabs on the bezel extension into the two slots on the cassette assembly.1
1
Tabs
Chapter
9.
Removal
and
Replacement
Procedures
977
14. Check for the following: a. Ensure that the extension arm engages the three pins on the cassette. You can see the pins in the holes in the arm. The bezel tab is seated as shown in the following illustration.111
1
Pins
b. Ensure that the cover arm latch is completely pressed into the hooked notch on the bezel as shown.12
1
PCI Adapter Pull Tab
2
Hooked Notch in the Bezel
978
406/90
Service
Guide
c. Turn the cassette assembly over so that the cover is facing up. Check the cover latch to ensure that it is holding the bezel to the cover as shown.1
1
Cover Latch
15. Position the adapter with the cover side facing up. Ensure that the holes are aligned, and insert the bushing as shown.
1
Bushing
2
Bushing-Lock Pin
Chapter
9.
Removal
and
Replacement
Procedures
979
1
1
Bushing
2
Bushing-Lock Pin
16. Insert the bushing-lock pin into the hole in the bushing. Push on the pin until it seats.1
1
Location of the Installed Bushing and Bushing-Lock Pin
980
406/90
Service
Guide
17. Depress the handle release lever to ensure the locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the blank filler or adapter moves downward).
1
Handle
2
Locking Bar
18. To install the metal EMC shield on the adapter bracket, grasp the metal EMC shield as shown in the following illustration, and do the following: 12
1
Metal EMC Shield
2
Adapter Bracket
Chapter
9.
Removal
and
Replacement
Procedures
981
a. Ensure that the shield slides up inside the top of the cassette.12
1
Metal EMC Shield
2
Adapter Bracket
982
406/90
Service
Guide
b. The metal EMC shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the EMC shield to the tailstock.1
1
Top of Tailstock
19. Press the lock on the handle and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1
1
Handle
20. Using your system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange color to be Chapter
9.
Removal
and
Replacement
Procedures
983
visible. If the adapter is not hot-swappable, the blue color is visible.12
1
Orange indicates Hot-Swappable
2
Blue indicates Not Hot-Swappable
21. The adapter is ready to be installed into a system or an I/O drawer. For more information, refer to your system documentation. 984
406/90
Service
Guide
Replacing a Double-Wide Blind-Swap Adapter Before installing a replacement adapter in a cassette that uses two adapter positions, referred to as a double-wide cassette, perform the following checks: v Check that the screws for the cassette are in place: – Across the male dovetail side – Holding the extension bracket – In the top center holding the cover – In the end of the top trackv Check to ensure that the: – Rail and the metal bezel are aligned – Rail is not caught on the top of the bezel – Tailstock is aligned with the bezelv Check the handle assembly to ensure that the: – Pivot block is not broken – Handle is set to the blue color – Handle gray bar is not brokenv Check that all the EMC brackets are intact and in place. v Check that the light pipes are intact and free from damage. v Check the bezel hole to ensure that the: – Handle is activated – Hole in the tailstock aligns with the hole in the bezelv Check the cover assembly to ensure that the cover end is held by the rear arm of the bezel assembly.To replace the adapter, use the following procedure: 1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 2. When the handle is completely lowered, push the gray locking cross bar. 3. Hold the assembly straight on with the PCI slots. 4. Place the double-wide blind-swap cassette assembly with the lower leading corner between installed single-wide cassettes. 5. Slide the cassette partially into the guide. 6. Ensure the dovetail on the top track aligns with its mating component(s) on both sides. After the dovetails are engaged, push the cassette fully into the drawer. You will feel some resistance when the cassette is fully inserted. When the cassette is fully inserted, it aligns with the other installed cassettes. 7. If the cassette is fully inserted, press the button on the handle to unlock the gray crossbar. 8. Raise the handle with one hand and at the same time, use your other hand to press down on the metal tab (adapter tailstock tab just below the handle mechanism) until the handle latches. You should hear a click as you lift the handle all the way up. Note: To ensure that the PCI adapter is completely seated, you may have to release the handle and repeat this process while pressing on the tailstock tab. When you are sure the adapter is completely seated, be sure that the handle is latched. 9. Connect power to the system and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Refer to “Powering the System On” on page 841 if necessary. 10. Ensure that the adapter is configured when the system completes the boot process. Chapter
9.
Removal
and
Replacement
Procedures
985
I/O Subsystem RIO Riser Card Removal Note: The RIO Riser Card is part of the I/O Subsystem Backplane FRU. It cannot be ordered separately. 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Disconnect all cables from the RIO riser card. 3. Remove the I/O subsystem backplane, as described in “I/O Subsystem I/O Backplane Assembly” on page 989. 4. Remove the two screws that attach the RIO riser card to the I/O subsystem backplane. 5. Carefully pull the card out of the slot as shown in the following illustration.
1
RIO Riser Card
Replacement Replace in reverse order. 986
406/90
Service
Guide
I/O Subsystem Distributed Converter Assembly (DCA) Notes: 1. If multiple I/O planars or I/O Drawer DCAs are being replaced during a maintenance window, then they must be replaced one at a time in order to ensure the I/O Drawer serial number does not become damaged. After a single part is replaced, the I/O drawer needs to have power applied to it and the system brought to the OK prompt. At this point, the system can be powered off to replace the next part. An algorithm is used (shortly after the UEPO is turned on) to overwrite the new part. If more than one part is replaced at a time, the VPD will not be updated and the system will not be able to boot in LPAR mode. 2. Each I/O subsystem can have up to two DCAs. You can replace each DCA separately. The removal and replacement procedures are identical for each DCA. 3. This procedure applies to the I/O subsystem DCAs. 4. If a subsystem has two DCAs, each is hot-pluggable. If only one needs to be removed, you do not have to power off the system. Do not attempt to replace two DCAs at once or to move DCAs from one I/O drawer to another to troubleshoot a problem. The machine type/serial number will be damaged when these actions are attempted.Removal DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Open the rack front door 2. Remove the I/O subsystem front cover. 3. Locate the DCA to be replaced. 4. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Press the green Start Service button on the UEPO switch. 6. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled (System outage - EPO off), 7. Verify the cable labeling and locations, and then unplug the cables from the DCA. 8. Loosen the left and right captive thumbscrews, using the torque tool (P/N 6422789), alternating between the fasteners until the unit can slide out of the slot. Chapter
9.
Removal
and
Replacement
Procedures
987
9. Grasp both sides of the front DCA, and pull it out of the drawer.
1
I/O Subsystem
3
I/O Subsystem Front Cover
2
DCA Captive Thumbscrew (quantity 2)
4
Distributed Converter Assembly (DCA)
Replacement 1. Rotate, counter clockwise, the two captive thumbscrews located on the front of the DCA. Ensure that the each thumbscrew has been rotated fully into the counter clockwise position. Attention: Rotating the thumbscrews counter clockwise engages the stop mechanism located on the bottom of the DCA. Failure to do so could result in the system powering off during a concurrent replacement operation. The following illustrations shows a DCA stop mechanism completely engaged.
1
DCA Captive Thumbscrew (quantity 2)
2a
Stop Mechanism (disengaged)
2
Stop Mechanism (engaged)
3
Distributed Converter Assembly (DCA)
2. Carefully slide the DCA into the slot as far as possible (the back edge of the top of the DCA is about .25 inch from the cage bracket).Note: Do not force the DCA against the backplane, as this may damage the connector. 3. Hold the upper part of the DCA against the backplane, and use the torque tool to engage the left captive thumbscrew about two turns into the cage bracket. 4. Repeat the previous step for the right captive thumbscrew. 5. Alternate tightening the left and right thumbscrews until both are fully torqued in. 6. Check torque on both thumbscrews. 7. Plug all cables into their correct location in the DCA. 8. Press the white Service Complete button on the UEPO switch. 988
406/90
Service
Guide
I/O Subsystem I/O Backplane Assembly Removal Note: If multiple I/O planars or I/O Drawer DCAs are being replaced during a maintenance window, then they must be replaced one at a time in order to ensure the I/O Drawer serial number does not become damaged. After a single part is replaced, the I/O drawer needs to have power applied to it and the system brought to the OK prompt. At this point, the system can be powered off to replace the next part. An algorithm is used (shortly after the UEPO is turned on) to overwrite the new part. If more than one part is replaced at a time, the VPD will not be updated and the system will not be able to boot in LPAR mode. 1. Power off the system as described in “Powering Off and Powering On the System” on page 838. 2. Disconnect and label all cables from the adapters on the affected I/O backplane. 3. Remove and label adapters. (See “PCI Adapters” on page 914). 4. Remove the two screws securing the I/O backplane to the subsystem. 5. Insert the I/O subsystem backplane removal tool (part number 44P0549), and pry the tool backward to remove the board from the connector.Note: If the I/O subsystem is in the bottom position of the rack, the I/O subsystem may need to be removed from the rack to allow correct usage of the I/O subsystem backplane removal tool to install a new I/O backplane. 6. Slide the I/O subsystem backplane assembly to the rear.Note: Be sure to hold the board level to ensure that the board can clear the step in the cover and then again to clear the EMC gasket. 7. Remove the assembly. 12
1
Backplane removal tool (part number 11P4789)
2
Backplane removal and insertion tab
Chapter
9.
Removal
and
Replacement
Procedures
989
Replacement 1. Align the I/O subsystem backplane assembly with the chassis opening and ensure that the roll bar clears the top-edge EMC gasket without damaging it. 2. Slide the I/O subsystem backplane assembly into the chassis. Ensure that the two pins in the chassis align with the bottom of the backplane. Align the backplane with the connector. 3. Continue to slide the backplane in slowly until it makes contact with the connector. 4. Upon contact, insert the insertion tool (part number 11P4789) to cam the backplane into the connector. 5. Pry with the tool until the screw holes on the chassis and the board align. 6. Secure with two screws. 7. Reinstall all adapters and reconnect all cables to the replaced I/O backplane. 990
406/90
Service
Guide
I/O Subsystem DASD Hard Disk Drive Assembly Note: Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. Removal 1. This procedure is a hot-swap procedure. Refer to the ?“SCSI Hot-Swap Manager” in the “Introduction to Tasks and Service Aids”chapter in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. Use the SCSI Hot-Swap Manager Task to prepare to remove and replace a SCSI disk drive. Then continue with the following steps. 2. Press on the blue latch to release it from hooking the body. 3. Move the black handle all the way down and pull out the assembly. A slight initial resistance is normal. 4. Ensure that the 4-pack has not been damaged. If the DASD assembly was installed incorrectly, the removal could force the metal tab in the 4-pack to bend over the camming slot.
1
DASD 4-Pack Assembly
4
Cover
2
I/O Subsystem Chassis
5
Disk Drive Assembly
3
EMC Filler Shield
6
DASD Filler
Chapter
9.
Removal
and
Replacement
Procedures
991
Replacement Note: All four bays in each DASD 4-pack must be occupied. If a bay does not have a DASD assembly, it must be occupied by a filler assembly. 1. Inspect the DASD 4-pack to ensure it is not damaged. 2. Ensure that the lower retaining screws are inserted and secured. 3. Ensure that the front retaining screws are inserted and secured without interfering with the DCA. 4. Open fully the latch on the DASD assembly by pressing the blue handle and pulling the black handle. A thump signals that it is fully open. 5. With the handle to the bottom, insert the assembly into the appropriate slot. 6. Slide the assembly forward until you feel some resistance. Verify that the spring on the top is contacting the edge of the 4-pack. This should be the only resistance. 7. Push the assembly forward with slightly more force until you feel more resistance. Verify that the tab on the black handle is located above the cam slot in the 4-pack.Note: If the tab is not aligned with the slot, do not raise the handle. Remove the assembly and start again. 8. Lift the black handle to move the assembly in slightly. 9. Lift the handle and push forward until the blue latch hooks the assembly body. 992
406/90
Service
Guide
I/O Subsystem DASD 4-Pack Removal 1. Press the green Start Service button on the UEPO switch. 2. Locate the DASD backplane to be replaced. 3. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 4. Remove the front cover from the I/O subsystem by grasping the pins at either side of the cover, pulling them towards the center of the drawer, and simultaneously lifting the cover up and to the front to release it from the I/O subsystem chassis. 5. Remove the DASD disk drive assemblies or DASD filler(s) from the affected 4-pack as shown in the following illustration.
1
DASD 4-Pack Assembly
4
Cover
2
I/O Subsystem Chassis
5
Disk Drive Assembly
3
EMC Filler Shield
6
DASD Filler
6. Remove two M4x0.7 screws at the lower inside front of the DASD 4-pack with a short-blade screwdriver. 7. Remove two M4x0.7 hex socket-head cap screws at the lower inside rear of the DASD 4-pack. 8. Grasp the DASD 4-pack through the cutouts in the top of the I/O subsystem, and pull the DASD 4-pack out of the front of the I/O subsystem.Replacement 1. Perform steps 4 through 8 of the removal instructions in reverse order. 2. Turn on the power as described in “Powering Off and Powering On the System” on page 838. 3. Press the white Service Complete button on the UEPO switch. Chapter
9.
Removal
and
Replacement
Procedures
993
I/O Subsystem Fan Removal 1. Press the green Start Service button on the UEPO switch. 2. Locate the fan to be replaced. 3. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 4. Remove the DASD cage from the front of the fan to be removed (see “I/O Subsystem DASD 4-Pack” on page 993). 5. Slide the black retention latch on the fan assembly upward and, with the handle located on the fan, pull the fan from the front of the I/O subsystem. A door automatically closes when the fan is removed, preventing loss of airflow while the fan is out of the enclosure.123
1
DCA (DCA 1 shown)
2
I/O Subsystem Chassis
3
Fan (Fan 4 shown)
Replacement Note: All four openings to the front of the fans must be covered either by a DASD 4-pack or by a DASD 4-pack filler. Replace in reverse order. When you have completed the replacement of the DASD fan, turn on the power as described in “Powering Off and Powering On the System” on page 838, and press the white Service Complete button on the UEPO switch. This completes the replacement. 994
406/90
Service
Guide
I/O Subsystem Chassis Removal Attention: This process requires: v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The lift tool wheels must be chocked to prevent the chassis from moving during the operation. The frame must not be on its casters, the leveling pads must be engaged to prevent the frame from moving during the operation. 1. Read all safety instructions before performing this operation. Unpack lift tool and follow assembly instructions included with the tool. Inspect the tool for damage (do not perform the remove procedure with a damaged lift tool). Replace the blue lift plate with the orange lift plate (P/N 11P4369). Orient two round holes in lift plate to the side of the lift tool that will be closest to the frame. 2. Turn off power as described in “Powering Off and Powering On the System” on page 838. 3. Unplug all power cables from the line power source. Attention: Because this procedure is performed with power removed from the system, references to using the white or green service buttons on the UEPO switch do not apply during the remainder of this procedure. 4. Open the rack doors and remove them from the rack. 5. Remove the I/O subsystem front cover. 6. If your system is configured with a half I/O drawer filler plate, remove the filler plate. 7. Verify that all cables are labeled. If not, then label all cables and components along with their location at the front and rear of your system. 8. Remove all DASD hard disk drives or DASD fillers as described in the removal section of “I/O Subsystem DASD Hard Disk Drive Assembly” on page 991. 9. Remove all DASD 4-pack or 4-pack filler assemblies as described in the removal section of “I/O Subsystem DASD 4-Pack” on page 993. 10. Remove all DCA units as described in the removal section of “I/O Subsystem Distributed Converter Assembly (DCA)” on page 987. 11. Remove all I/O subsystem fan assemblies as described in the removal section of “I/O Subsystem Fan” on page 994. 12. Remove all PCI adapter cassettes as described in the removal section of “PCI Adapters” on page 914. 13. Remove the I/O backplanes (up to 2) as described in the removal section of “I/O Subsystem I/O Backplane Assembly” on page 989. 14. Remove the screws that attach the I/O subsystem chassis to the rack. 15. Position the lift tool at the front of the rack. Align the lift tool so that I/O subsystem chassis will not be blocked (at the lip of the orange lift plate) from removal. Install wheel chocks on the lift tool casters. Engage the lift tool brake. 16. Raise the plate to a height such that the lift plate is just below the base of the I/O subsystem chassis you are removing. Release pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 17. One person should gently pull and guide the I/O subsystem chassis onto the lift plate. The I/O subsystem chassis must be removed far enough to clear the front face of the frame. Retract the extender arms. 18. Lower I/O subsystem chassis to the bottom position of the lift tool. 19. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the I/O subsystem chassis to the staging area. The brake and wheel chocks must be in place when transferring the I/O subsystem chassis to or from the packaging material.
Chapter
9.
Removal
and
Replacement
Procedures
995
Front:
1
I/O Subsystem Chassis
6
SCSI DASD Disk Drives
2
Fan assembly
7
DASD-4 Pack Cage
3
DASD 4-Pack Filler
8
DCA
4
DASD Filler
9
Half I/O drawer filler plate
5
Front Cover
Rear:
1
PCI I/O Subsystem Chassis
3
Blind-swap cassette or PCI filler with no adapter
2
RIO Riser Card, part of the I/O subsystem backplane assembly
4
PCI I/O Backplane
996
406/90
Service
Guide
Replacement Notes: 1. If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 2. Before beginning the replacement procedure, install rails in the frame if they are not already installed. 1. Position the lift tool such that the new I/O subsystem chassis can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. 3. Move the new I/O subsystem chassis onto the tool with its rear (PCI adapter end) flush with the lift plate. 4. Position the lift tool as close as possible to the front of the rack. Install wheel chocks on the lift tool casters. Engage the lift tool brake. 5. Raise the plate to a height such that the lift plate is just above the rails onto which the I/O subsystem chassis will be installed. Release pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 6. One person should gently push the I/O subsystem chassis toward the frame from the front of the system. Adjust the height of the lift plate, as needed. After the I/O subsystem is installed in the frame, retract the extender arms. 7. Lower the lift plate and remove it from the lift tool. Repackage the lift tool. 8. Align the I/O subsystem chassis, then secure it to the rack with retaining screws. 9. Replace the I/O backplanes (up to 2) as described in the replacement section of “I/O Subsystem I/O Backplane Assembly” on page 989. 10. Replace all PCI adapter cassettes or fillers as described in “PCI Adapters” on page 914. 11. Replace all I/O subsystem fan assemblies as described in the replacement section of “I/O Subsystem Fan” on page 994. 12. Replace all DCA units as described in the replacement section of “I/O Subsystem Distributed Converter Assembly (DCA)” on page 987. 13. Replace all DASD 4-pack or 4-pack filler assemblies as described in the replacement section of “I/O Subsystem DASD 4-Pack” on page 993. 14. Replace all DASD hard disk drives or DASD fillers as described in the replacement section of “I/O Subsystem DASD Hard Disk Drive Assembly” on page 991. 15. Reconnect all cables to the front and rear of your system. 16. If your system is configured with a half I/O drawer filler plate, remove the filler plate. 17. Remove the I/O subsystem front cover. Chapter
9.
Removal
and
Replacement
Procedures
997
Media Subsystem CD-ROM, DVD-RAM, or Tape Drive Attention: DVD-RAM and 4-mm tape drives can be located in the rear of the media subsystem only if the room is maintained at a nominal room temperature of 24°C (75.2°F). Attention: Do not open the drive; no user adjustments or serviceable parts are inside. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles.C25, C26 Removal 1. Ensure that the SCSI drive is in idle mode (the activity light remains off). 2. Loosen the thumbscrew that holds the SCSI drive assembly in the subsystem (the top screw is for the drive to the right, and the bottom screw is for the left drive).2SCSISCSI314 3. Grasping the thumbscrew and not the bezel, carefully pull the drive assembly out until you can access the cables at the rear. Attention: When performing the next two steps, always disconnect the power cable before disconnecting the SCSI cable. 4. Disconnect the power cable from the rear of the drive assembly. 5. Disconnect the SCSI cable from the rear of the drive assembly. 6. Remove the four mounting screws that hold the drive or blank to the carrier bracket and remove it from the carrier.Replacement Note: Ensure that the cables are routed through the grommet at the rear of the carrier. Attention: When replacing a SCSI drive, always connect the SCSI cable before connecting the power cable. Replace in reverse order. 998
406/90
Service
Guide
Diskette Drive Removal 1. Ensure that the diskette drive is in idle mode (the activity light remains off). 2. Loosen the thumbscrew that attaches the diskette carrier to the media subsystem.2SCSISCSI314 3. Carefully grasping the thumbscrew and not the bezel, pull the drive assembly out until you can access the cables at the rear. Attention: When performing the next two steps, always disconnect the power cable before disconnecting the diskette signal cable. 4. Disconnect the power cable from the rear of the drive assembly. 5. Disconnect the diskette signal cable from the rear of the drive assembly. 6. Remove the four screws that connect the carrier to the diskette drive.Replacement Replace in reverse order. Attention: When replacing a diskette drive, always connect the diskette signal cable before connecting the power cable. Chapter
9.
Removal
and
Replacement
Procedures
999
Operator Panel Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the upper-left and lower-right screws that hold the media subsystem in the rack, and pull the subsystem out no more that two inches. 3. Pry the spring clip on the left side just enough to release the panel while grasping the operator panel by the top and bottom front (do not pull on the bezel). Pull the operator panel out of the chassis. 4. Note the locations and disconnect the cables from the operator panel, and then remove the operator panel as shown in the following illustration. Replacement 1. Remove the VPD module from the old operator panel and insert it in the new operator panel. Make sure you insert the module in the same position as it was in the old panel. 2. Reconnect the cables to the operator panel. 3. Push the operator panel into the chassis until the spring clip engages. 4. Push the chassis into the rack, and replace the two mounting screws. 5. Turn on the power as described in “Powering the System On” on page 841. 1000
406/90
Service
Guide
Media Subsystem Chassis Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Label and disconnect all cables from the rear of the media subsystem.28361SCSISCSI754 3. Remove the screw that holds the left side of the media subsystem chassis to the rack. 4. Remove the screw that holds the right side of the media subsystem chassis to the rack.2SCSISCSI314 5. Carefully pull the chassis out of the front of the rack.Replacement Replace in reverse order. If the new media drawer contains a new operator panel, remove the VPD module from the old operator panel and insert it in the new operator panel. Make sure you insert the module in the same position as it was in the old panel. Be sure to move any other parts or features from the old media drawer to the new media drawer if they are not included as part of the new media drawer. Chapter
9.
Removal
and
Replacement
Procedures
1001
Media Subsystem Internal Cables Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. If you have not already done so, open the front door of the rack unit. 3. Remove the media subsystem from the rack as described in “Media Subsystem Chassis” on page 1001, and place it on a work surface. 4. Remove the six screws and the eight 7-mm nuts that hold the cover to the subsystem, as shown in the following illustration. 5. Loosen the four captive screws (one for each SCSI drive bay) to free the two truss brackets. 6. Remove the cover. 7. Release the ties that hold the affected cable. If ties must be cut, cut them from the underside of the chassis. 8. Disconnect the cable connector from the chassis. 9. Note the cable routing, and remove the cable from the subsystem chassis.Replacement Replace in reverse order.Note: Be sure to secure the cables with the ties as they were originally installed. Allow enough length for a service loop in case the drives are removed in the future. 1002
406/90
Service
Guide
p690 Cable Installation and Management In some configurations, cables that are not properly supported might damage the associated connectors. Connector damage can be prevented by following the procedures described in this section. These procedures must be used for installing I/O cables, administrative LAN cables, and High Performance Switch (HPS) cables in a M/T 7040-W42 frame. Note: Planning information for switch cable installation is detailed in ?Appendix D. Cabling the pSeries HPS? of the 45H/W4 Switch Planning, Installation, and Service Guide. This section describes the procedures you must follow when installing a 7040 frame, especially with administrative LAN and High Performance Switch cables. Failure to follow these procedures might damage network components. The objectives of these procedures are to: v Provide HPS cables with continuous support and strain relief. v Minimize the possibility of pin damage to the SNI ports. v Form consistent cable groups. v Maintain efficient cooling.A high-level workflow for cable installation is as follows: 1. Install the frame. 2. Route all signal cables up through the tailgate. 3. Separate cables into groups based on their location and size.Note: If a High Performance Switch is in the configuration, the cables must not be connected to SNIs or switch port connection cards until the weight of each cable is fully supported. 4. Make certain that cable groupings do not interfere with PCI adapter slots. 5. Make certain that the cable bend radius allows clearance for closing the rear door but is not so sharp that it places excessive strain on the cable or the associated connectors. 6. Cable the frame starting with the upper cable connections and working down. 7. Apply tailgate insert clamp supports for the cables from left to right.The following illustration shows the tailgate assembly and standard frame extenders of a properly cabled frame. Refer to the service guide for more parts information. Chapter
9.
Removal
and
Replacement
Procedures
1003
Note: Do not connect any High Performance Switch cables (if installed) until instructed to do so. 1. Move the frame into position over the floor tile cutouts. 2. If you have not already done so, install the frame using the appropriate installation instructions. 3. Verify that the cable management bracket (tailgate) is properly installed on the base at the rear of the frame, if not: a. Remove the shipping bracket from the rear of the frame. b. Install the stiffener into the rear cutout on the base of the frame. c. Attach the cable management bracket (tailgate) to the stiffener. d. Make sure that the extender is available for installation after all cables have been installed.4. Route all HMC, Ethernet, I/O, and HPS cables up through the tailgate.Notes: a. Main power cables do not pass through the tailgate assembly.
1
Rear Cutout
4
Cable Management Backet
2
Extender
5
Stiffener
3
Tailgate Insert
1004
406/90
Service
Guide
b. High Performance Switch cables must not be connected to SNIs or switch port connection cards until the cable weight is supported. The following illustration shows a properly cabled frame. Before starting this part of the procedure: v Verify that all HMC, Ethernet, I/O, and HPS cables (if present) are inside the tailgate. v Cables are clamped into the tailgate from left to right. If HPS cables are present, locate these cables closer to the right side of the tailgate by using the foam-block tailgate inserts. Chapter
9.
Removal
and
Replacement
Procedures
1005
– Install one or more foam-block tailgate inserts into any open slot (refer to the tailgate illustration above).Note: Do not connect any HPS cables until instructed to do so. After the preliminary steps are complete: 1. Sort the HPS cables into groups based on SNI port location, but do not clamp or connect cables until instructed to do so. An overview of the next few steps is as follows: a. Switch cables will be clamped into the tailgate in groups of either four or two. b. High Power Switch cables get clamped into the next available tailgate slot. c. Continue this cable pattern working down the frame and clamping to the right in the tailgate.2. Before routing the switch cables, position the HMC, Ethernet, and I/O cables but do not clamp these cables into the tailgate until instructed to do so. a. Route the HMC and Ethernet cables to the appropriate connectors and connect each cable. v If present, make certain the HMC and Ethernet cables do not interfere with the SNI ports.b. Wire tie the HMC and Ethernet cables into a bundle running down the side of the frame. c. Wire tie any I/O cables into a bundle running down the side of the frame.3. If HPS cables are present, without plugging the switch cables into the SNI ports, take the switch cables and raise them into position. a. Check any cable labels and verify that you have the correct cables for each SNI port. b. Make certain that the cable for the upper SNI port extends further than the cable for the lower SNI port. c. Place a wire tie far enough down the cable set so that the cables will have sufficient length for a gentle bend radius. d. Wire tie the cables together.4. Clamp any HPS cables into the next available tailgate slot using a rubber-faced, tailgate insert with the appropriate thickness of rubber. 5. If they are used, clamp the I/O cables into the tailgate using tailgate inserts. Depending on the number and size of the I/O cables, you may have to use one or two tailgate slots and clamping inserts. 6. Clamp the HMC and Ethernet cables into an available tailgate slot using a tailgate insert. 7. Insert foam-block tailgate inserts into any remaining tailgate slots.If you have properly cabled the frame, the cable layout should look like the following illustration. 1006
406/90
Service
Guide
HMCandEthernetcablesHMCandEthernetcablesI/Ocables(ifused)I/Ocables(ifused)I/Ocables(ifused)I/Ocables(ifused)Switchcablestoframeslots8and7Switchcablestoframeslots6and5Switchcablestoframeslots4and3Switchcablestoframeslot2Switchcablestoframeslot1Switchcablestoframeslots8and7Switchcablestoframeslots6and5Switchcablestoframeslots4and3Switchcablestoframeslot1Switchcablestoframeslot2Cablemanagementbracket(tailgate)Foamblocktailgateinsert
Chapter
9.
Removal
and
Replacement
Procedures
1007
1008
406/90
Service
Guide
Chapter 10. Parts Information This chapter contains parts information for the 406/90, which includes: v 406/1R Rack Subsystem v 406/81 Processor Subsystem v 406/1D I/O Subsystem 1009
406/1R Rack Subsystem The rack subsystem consists of the rack, power subsystem, and the integrated battery feature. 17953281064 11 - Integrated Battery Feature (not pictured) 1010
406/90
Service
Guide
Index
Part Number
Units Per Assembly
Description
1
44P010744P0127
1
Rear DoorRear Door (Accoustic)
2
11P394844P012544P012611P289011P2923
11112
42U FrameCover, side leftCover, side, rightTop hatShipping bar
3
44P042644P0434
1
Front Door Primary RackFront Door Secondary Rack
11P109711P1096
1
Front Latch Primary RackFront Latch Secondary Rack
11P1093
2
Latch Bracket
44P1152
1
Vertical Extender Latch Side
44P1153
1
Extender Bottom Front
44P1154
1
Extender Bottom Rear
11P4106
2
Door Hinge
11P3990
1
Door key
54G2882
12
Screw
11P3535
2
Hinge
2665525
Bolt
4
44P1124
1
Universal Emergency Power Off (UEPO) Switch
5
44P1958
2
Bulk Power Enclosure (BPE)
6
44P1959
4
Bulk Power Distribution (BPD)
7
11P3645
2
Bulk Power Controller (BPC)
11P1597
Y-Cable (CSP)
8
44P1995
6
Bulk Power Regulator (BPR) Note: Do not mix (BPR) 44P1995 in the same system with (BPR) 44P2645.
8
44P2645
6
Bulk Power Regulator (BPR) Notes: 1. To use this BPR your firmware must be at 3H030916 or later.Note: Do not mix (BPR) 44P2645 in the same system with (BPR) 44P1995.
9
11P1787
2
Bulk Power Fan (BPF)
11J5335
2
Bulk Power Assembly Rail
10
11P0287
1
Fan Cover Plate
11
11P3732
Up to 6
Integrated Battery Feature (IBF)
44P2471
1
Capacity card for power subsystem
12
31L8302 (Not Shown)
Rack Joining Kit - Contains bolts, washers, and standoffs to be used when bolting two or more rack cabinets together.
Chapter
10.
Parts
Information
1011
406/81 Processor Subsystem (Front) This processor subsystem front view consists of the processor chassis, memory books, fans, and filters. 3214 1012
406/90
Service
Guide
Index
Part Number
Units Per Assembly
Description
1
00P4015
1
Processor subsystem chassis (with backplane)
2
11P2935
1 to 8
Memory book retainerSee “406/81 Processor Subsystem (Front)” on page 1014 for Memory books part numbers.
3
31L7627
4
Fan Controller (MDA)
07H5349
4
Fans
4
11P3781
2
Air filter
Chapter
10.
Parts
Information
1013
406/81 Processor Subsystem (Front) This processor subsystem front view consists of the memory books, MCM and L3 modules. 1014
406/90
Service
Guide
Index
Part Number
CCIN
Units Per Assembly
Description
1
00P4015
25C0
1
System Backplane
2
NA
Protective Cover (Included as part of the L3 Cache Module FRU.) Note: Do not discard the protective cover. Use it when returning L3 FRUs.
3
00P2839
25D9
4 to 16
128 MB (4x32)/400 MHz L3 Cache kitIncludes: v PN 21P7972 400 MHz VPD card (CCIN 25D9) v PN 11P3873 L3 Cache Module v PN 04N6923 Brush v PN 11P4055 Label v PN 44P0360 Mech Spring Kit
00P2849
25F1
4 to 16
128 MB (4x32)/433 MHz L3 Cache kitIncludes: v PN 21P788175 433 MHz VPD card 433 MHz VPD card (CCIN 25F1) v PN 11P3874 433 MHz L3 Cache Module v PN 04N6923 Brush v PN 11P4055 Label v PN 44P0360 Mech Spring Kit
00P6346
500 MHz L3 Cache kitIncludes: v PN 00P5159 VPD Card v PN 04N6923 Pad Cleaning Brush v PN 11P4055 Label
00P6258
25E7
600 MHz L3 Cache kitIncludes: v PN 00P4355 VPD Card v PN 04N6923 Pad Cleaning Brush v PN 11P4055 Label
4
21P4589
Load Post Sleeve
5
11P2788
Rear Spring Plate
6
11P2785
Front Spring Plate with Set Screw
7
00P4668
272D
Up to 3
MCM Passthru Module
8
53P4833
25C1
1
Clock Card
9
00P2821
25D5
1 to 4
1.1 GHz 8-way MCM with VPD card kitIncludes: v PN 11P4952 3.2 STND MCM Module (and VPD Smart Card, CCIN 25D5 ) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label
00P2796
25C7
1 to 4
1.3 GHz 4-way MCM with VPD card kitIncludes: v PN 11P4949 3.2 Fast MCM Module(and VPD Smart Card, CCIN 25C7)orPN 11P4950 3.2 Fast MCM Module(and VPD Smart Card, CCIN 25C7) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label Chapter
10.
Parts
Information
1015
Index
Part Number
CCIN
Units Per Assembly
Description
00P2818
25D3
1 to 4
1.3 GHz 8-way MCM with VPD card kitIncludes: v PN 11P4951 3.2 Fast MCM Module (and VPD Smart Card, CCIN 25D3) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label
00P4629
25CF
1 to 4
1.5 GHz 8-way processorIncludes: v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label
00P4687
25E9
1.7 GHz 8-way processorIncludes: v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label
00P2833
526C
Up to 3
CUoD 1.1 GHz Processor MCM Includes: v MCM Module (and VPD Smart Card, CCIN 527C) v etc...
00P2834
526D
Up to 3
CUoD 1.3 GHz Processor MCMIncludes: v MCM Module (and VPD Smart Card, CCIN 527A) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label
80P3442
528B
CUoD 1.3 GHz MCM and VPD smart chip card
00P5882
526B525E
CUoD 1.5 GHz Processor MCMIncludes: v PN 00P4351 Capacity Card v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label
80P3428
529B
CUoD 1.5 GHz MCM and VPD smart chip card
00P5883
525F526E
CUoD 1.7 GHz Processor MCMIncludes: v PN 00P4353 Capacity Card v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label
80P3438
529C
CUoD 1.7 GHz MCM and VPD smart chip card
80P2941
25B0
CUoD 1.9 GHz MCM and VPD smart chip card
80P2936
25CC
1.9 GHz L3 VPD card
53P6170
527C
1
CUoD 1.1 GHz Processor MCM Capacity Card 1016
406/90
Service
Guide
Index
Part Number
CCIN
Units Per Assembly
Description
53P6168
527A
1
CUoD 1.3 GHz Processor MCM Capacity Card
80P3708
527D
CUoD 1.9 GHz Processor Capacity Card
10
11P2793
Up to 12
L3 Filler
11
53P560353P562797P346897P3489 53P5604 53P562897P347197P349253P560653P563097P347497P349553P560853P563297P347997P350097P348497P350597P403297P4035
305C305D30E030E1302A 302B30E2 30E330E4 30E5305E305F302C302D30E630E730E830E930EE30EF
1 to 8
4 GB Memory Book (Inner)4 GB Memory Book (Outer)4 GB Memory Book (Inner)4 GB Memory Book (Outer)8 GB Memory Book (Inner)8 GB Memory Book (Outer)8 GB Memory Book (Inner)8 GB Memory Book (Outer)16 GB Memory Book (Inner)16 GB Memory Book (Outer)16 GB Memory Book (Inner)16 GB Memory Book (Outer)32 GB Memory Book (Inner)32 GB Memory Book (Outer)32 GB Memory Book (Inner)32 GB Memory Book (Outer)64 GB Memory Book (Inner)64 GB Memory Book (Outer)128 GB Memory Book (Inner)128 GB Memory Book (Outer)
53P424253P4273 53P424753P427853P425253P4283 53P425953P429053P426653P4297
307A308A307B308B307C308C307D308D307E308E
1 to 8
4 GB Memory Book I, 567 MHz4 GB Memory Book O, 567 MHz8 GB Memory Book I, 567 MHz8 GB Memory Book O, 567 MHz16 GB Memory Book I, 567 MHz16 GB Memory Book O, 567 MHz32 GB Memory Book I, 567 MHz32 GB Memory Book O, 567 MHz64 GB Memory Book I, 567 MHz64 GB Memory Book O, 567 MHz
53P425453P428597P347697P349753P426153P429297P348197P3502
30BD30C630EA30EB30C130CA30EC30ED
CUoD 8/16 GB Memory Book ICUoD 8/16 GB Memory Book OCUoD 8/16 GB Memory Book ICUoD 8/16 GB Memory Book OCUoD 16/32 GB Memory Book ICUoD 16/32 GB Memory Book OCUoD 16/32 GB Memory Book ICUoD 16/32 GB Memory Book O
12
44P4420
1
Aluminum Backplane Stiffener
13
44P4415
2
Steel Alignment Rods
14
Use the 1.7 GHz MCM FRU part number when ordering this FRU.
MCM (with copper heat sync)
Chapter
10.
Parts
Information
1017
406/81 Processor Subsystem (Rear) This processor subsystem rear view consists of the RIO books, DCAs, and capacitor books, MCM and L3 modules, as well as the media subsystem. 1018
406/90
Service
Guide
Index
Part Number
CCIN
Units Per Assembly
Description
1
53P1573
2738
1
Fourth I/O book without RIO-2 ports
53P4554
2806
1
Fourth I/O book with RIO-2 ports
11P2936
I/O book retainer
2
44P3024
1
Distributed converter assembly DCA1
11P1543
1
Cable assembly (DCA1) P00
11P1544
1
Cable assembly (DCA1) P01Attention: Do not plug DCA PN 11P4741 into the U1.18-P1-V1 location.
3
44P3024
1
Distributed Converter Assembly DCA2
11P1545
1
Cable assembly (DCA2) P00
11P1546
1
Cable assembly (DCA2) P01Attention: Do not plug DCA PN 11P4741 into the U1.18-P1-V2 location.
4
11P1540
1
Capacitor card
44P2471
1
Capacitor card with LGA test support
5
44P3024
1
Distributed converter assembly DCA3
11P1547
1
Cable assembly (DCA3) P00
11P1548
1
Cable assembly (DCA3) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V4 (DCA3) location.
6
44P3024
1
Distributed converter assembly DCA4
11P1549
1
Cable assembly (DCA4) P00
11P1550
1
Cable assembly (DCA4) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V5 (DCA4) location.
7
44P3024
1
Distributed converter assembly DCA5
11P1551
1
Cable assembly (DCA5) P00
11P1552
1
Cable assembly (DCA5) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V6 (DCA5) location.
8
11P1540
1
Capacitor card
44P2471
1
Capacitor card with LGA test support
9
44P3024
1
Distributed converter assembly DCA6
44P4085
1
Cable assembly (DCA6) P00
44P4086
1
Cable assembly (DCA6) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V8 (DCA6) location.
10
09P5687
273C
1
Primary I/O book without RIO-2 ports
09P3462
273D
Primary I/O book with RIO-2 ports Chapter
10.
Parts
Information
1019
Index
Part Number
CCIN
Units Per Assembly
Description
11P2936
I/O book retainer
16G8095
1
TOD Battery (inside primary I/O book)
44P0206
1
9-9 pin serial cable connects the managed system primary I/O book to the HMC
31L7196
1
9-25 pin serial cable connects the managed system primary I/O book to HMC async adapter
11
53P1573
2738
1
Third I/O book without RIO-2 ports
53P4554
2806
1
Third I/O book with RIO-2 ports
11P2936
1
I/O book retainer
12
53P1573
2738
1
Secondary I/O book without RIO-2 ports
53P4554
2806
1
Secondary I/O book with RIO-2 ports
11P2936
I/O book retainer
1020
406/90
Service
Guide
406/81 Media Subsystem 12345678910 Chapter
10.
Parts
Information
1021
Index
Part Number
Units Per Assembly
Description
1
24L108905G2882
1
Operator PanelScrew
2
See Note
1
Optional Media Device (SCSI Drive)
3
See Note
1 or 2
CD-ROM Drive (SCSI Drive)
4
76H4091
1
Diskette Drive
5
44P1799
1
Operator panel cable (J05)
6
44P18005420242
1
Diskette Drive Signal Cable (J04)Cable Tie
7
11P42155420242
2
Media Device SCSI Cables from J01 to rear drives, J03 to front drivesCable Tie
8
11P42145420242
2
Media Device Power Cables (J00 to rear drives, J02 to front drives)Cable Tie
9
See Note
1
Optional Media Device (SCSI Drive)
10
See Note
1
Optional Media Device (SCSI Drive)
44P2566
Up to 3
Media Bay Cover
44P1055
Cassette, PCI adapter
44P3592
Cassette, PCI adapter (type 2)
97P2686
1
Ultra320 SCSI
44P1055
1
Ultra320 SCSI cassette
44P0239
1
BSM
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
1022
406/90
Service
Guide
Power and SCSI Cables to the Media Subsystem 123546 Chapter
10.
Parts
Information
1023
Index
Part Number
Units Per Assembly
Description
1
11P4301
1
Cable Connecting the Operator Panel to the Media Subsystem
2
11P4300
1
Cable Connecting the Diskette Drive to the Media Subsystem
3
11P2352
1
Power Cable for Media Subsystem Front Components
4
11P4303
1
SCSI Cable for Media Subsystem Rear Components
5
11P4303
1
SCSI Cable for Media Subsystem Front Components
6
11P2351
1
Power Cable for Media Subsystem Rear Components
1024
406/90
Service
Guide
406/1D I/O Subsystem The system can contain up to eight I/O subsystems. This section lists detail for one I/O subsystem. Depending on your configuration, the quantities of parts listed may increase by the actual number of I/O subsystems in your system. Front
Index
Part Number
Units Per Assembly
Description
1
44P0705
1
I/O Subsystem Chassis
07H5247
2
Frame Rail
2
11P4624
1 to 4
Fan assembly
3
44P1277
1 to 4
DASD 4-Pack Filler
4
11P3662
Up to 15
DASD Filler
5
44P0189
1
Front Cover
6
See Note
1 to 16
SCSI DASD Disk Drives
7
11P485511P243611P3457
4
DASD-4 Pack Cage/Backplane AssemblyScrew - used on the DASD backplane assemblyScrew
11P485511P3667
4
DASD-4 Pack Cage/Backplane AssemblyScrew - used on the DASD backplane assembly
8
11P3582
2
DCA
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
Chapter
10.
Parts
Information
1025
Rear 123456
Index
Part Number
Units Per Assembly
Description
1
44P0705
1
PCI I/O Subsystem Chassis
1
44P2285
1
PCI-X I/O Subsystem Chassis
2
NA
2
RIO Riser Card, part of the I/O subsystem backplane assembly
3
11P4861
1 to 20
PCI Adapter Cassette Assembly
44P0239
PCI Adapter Cassette Assembly
11P4089
1 to 20
PCI Cassette Filler
4
44P1055
1 to 20
Blind-swap cassette, for standard PCI adapter
44P1390
As needed
Blind-swap cassette, for two-adapter-wide PCI adapter
44P3320
As needed
Blind-swap cassette, for Gigabit Ethernet PCI adapter
5
44P032344P032444P032144P0406
As needed
Adapter clip, used with blind-swap cassette: for Gigabit Ethernet FC 2969 type 9U for Ultra3 SCSI FC 6203 Type 4Y for 10/100 BaseT Ethernet FC 4962 Type AF for Gigabit Fibre Channel PCI FC 6228 Type 4W
11P4861
1 to 20
PCI filler, for slots with no adapter
6
44P1414
1 to 2
PCI I/O Backplane
44P1970
1 to 2
PCI-X I/O Backplane
(Not Shown)
21P5454
RIO-2 Remote I/O-2 Cable, 1.2 M
1026
406/90
Service
Guide
Power Cabling Chapter
10.
Parts
Information
1027
Index
Part Number
Units Per Assembly
Description
1
NA11P1598
Bulk Power Distribution on the A side of the system (BPD A Side)Cable A35-BPC-P00 to B35-BPC-P00
2
NA11P1598
Bulk Power Distribution on the B side of the system (BPD B Side)Cable B35-BPC-P00 to A35-BPC-P00
3
NA
1
A35-BPC-P03 to Bulk Power Fan P1-F1, Cable attached to FAN (BPF-A)
4
NA
1
B35-BPC-P03 to Bulk Power Fan P2-F1, Cable attached to FAN (BPF-B)
5
11P4449
1
B35-BPC-P08 to DCA-1-P01, Red
6
11P4448
1
A35-BPC-P08 to DCA-1-P00, Red with white stripe
7
11P4451
1
B35-BPC-P09 to DCA-2-P01, Red
8
11P4450
1
A35-BPC-P09 to DCA-2-P00, Red with white stripe
9
11P4453
1
B35-BPC-P10 to DCA-3-P01, Red
10
11P4452
1
A35-BPC-P10 to DCA-3-P00, Red with white stripe
11
11P4455
1
B35-BPD1-P08 to DCA-4-P01, Red
12
11P4454
1
A35-BPD1-P08 to DCA-4-P00, Red with white stripe
13
11P4457
1
B35-BPD1-P09 to DCA-5-P01, Red
14
11P4456
1
A35-BPD1-P09 to DCA-5-P00, Red with white stripe
14a
1
A35-BPD1-P06 to DCA-6-P01, Red with white stripe
14b
1
A35-BPD1-P06 to DCA-6-P00, Red
15 to 17
11P4734
1
I/OB9-P00 to Media P02
16 to 18
11P4735
1
I/OB9-P01 to Media P00
19, 20,21,22
NA
B35-BPDx-P0x to I/O Subsystem #x “RIO Cables and I/O Power Cables” on page 1029
23
11P4462
1
B35-BPC-P05 to (AMD2-P01) MDA2-J02, Red
24
11P4461
1
A35-BPC-P05 to (AMD2-P00) MDA2-J01, Red with white stripe
25
11P4460
1
B35-BPC-P04 to (AMD1-P01) MDA1-J02, Red
26
11P4459
1
A35-BPC-P04 to (AMD1-P00) MDA1-J01, Red with white stripe
27
11P4463
1
A35-BPC-P06 to (AMD3-P00) MDA3-J01, Red with white stripe
28
11P4464
1
B35-BPC-P06 to (AMD3-P01) MDA3-J02, Red
29
11P4466
1
A35-BPC-P07 to (AMD4-P01) MDA4-J02, Red
30
11P4465
1
B35-BPC-P07 to (AMD4-P00) MDA4-J01, Red with white stripe
31
11P1596
1
B35-BPC-P02 to UEPO J01, Red
32
11P1595
1
A35-BPC-P02 to UEPO J00, Red with white stripe
33
05N6740
1
A35-BPR-P00 to A13-IBF1-P00B35-BPR-P00 to B13-IBF2-P00
34
05N6740
1
A35-BPR-P00 to A13-IBF1-P00B35-BPR-P00 to B13-IBF2-P00
35
11P2998
1
A35-BPR-P00 to A-IBF3-P00 (Secondary Frame)
35
11P4248
1
B35-BPR-P00 to B-IBF3-P00 (Secondary Frame)
1028
406/90
Service
Guide
RIO Cables and I/O Power Cables
Index Number
FRU Part Number
Units Per Assembly
Description
11P2355
1
RIO cable 0.5-meter, per drawer
11P0272
Up to 2
RIO cable 2-meter, per drawer
44H9137
Up to 2
RIO cable 15-meter, per drawer
00P5238
Up to 2
RIO-2 cable 1.75-meter
00P5239
Up to 2
RIO-2 cable 2.5-meter
21P5477
Up to 2
RIO to RIO-2 cable 6-meter, per drawer
53P5242
1
RIO to RIO-2 cable 1-meter, per drawer
53P5243
Up to 2
RIO to RIO-2 cable 3-meter, per drawer
53P5454
1
RIO-2 to RIO-2 cable 1-meter, per drawer
53P2675
Up to 2
RIO-2 to RIO-2 cable 8-meter, per drawer
53P2676
Up to 2
RIO-2 to RIO-2 cable 3.5-meter, per drawer
11P4490
1
I/O Subsystem 1 power cable (DCA1-P01)
11P4491
1
I/O Subsystem 1 power cable (DCA1-P00)
11P4492
1
I/O Subsystem 1 power cable (DCA2-P01)
11P4493
1
I/O Subsystem 1 power cable (DCA2-P00)
11P4494
1
I/O Subsystem 2 power cable (DCA1-P01)
11P4495
1
I/O Subsystem 2 power cable (DCA1-P00)
11P4496
1
I/O Subsystem 2 power cable (DCA2-P01)
11P4497
1
I/O Subsystem 2 power cable (DCA2-P00)
11P4498
1
I/O Subsystem 3 power cable (DCA1-P01)
11P4499
1
I/O Subsystem 3 power cable (DCA1-P00)
11P4500
1
I/O Subsystem 3 power cable (DCA2-P01)
11P4501
1
I/O Subsystem 3 power cable (DCA2-P00)
11P4502
1
I/O Subsystem 4 power cable, primary rack (DCA1-P01)
11P4503
1
I/O Subsystem 4 power cable, primary rack (DCA1-P00)
11P4504
1
I/O Subsystem 4 power cable, primary rack (DCA2-P01)
11P4505
1
I/O Subsystem 4 power cable, primary (DCA2-P00)
11P4514
1
I/O Subsystem 4 power cable, secondary rack (DCA1-P01)
11P4515
1
I/O Subsystem 4 power cable, secondary rack (DCA1-P00)
11P4516
1
I/O Subsystem 4 power cable, secondary rack (DCA2-P01)
11P4517
1
I/O Subsystem 4 power cable, secondary rack (DCA2-P00)
11P4506
1
I/O Subsystem 5 power cable (DCA1-P01)
11P4507
1
I/O Subsystem 5 power cable (DCA1-P00)
11P4508
1
I/O Subsystem 5 power cable (DCA2-P01)
11P4509
1
I/O Subsystem 5 power cable (DCA2-P00)
11P4510
1
I/O Subsystem 6 power cable (DCA1-P01)
11P4511
1
I/O Subsystem 6 power cable (DCA1-P00)
11P4512
1
I/O Subsystem 6 power cable (DCA2-P01)
11P4513
1
I/O Subsystem 6 power cable (DCA2-P00)
44P0190
1
I/O Subsystem 7 power cable (DCA1-P01)
44P0191
1
I/O Subsystem 7 power cable (DCA1-P00)
44P0192
1
I/O Subsystem 7 power cable (DCA2-P01)
44P0193
1
I/O Subsystem 7 power cable (DCA2-P00)
44P0194
1
I/O Subsystem 8 power cable (DCA1-P01)
44P0195
1
I/O Subsystem 8 power cable (DCA1-P00)
44P0196
1
I/O Subsystem 8 power cable (DCA2-P01) Chapter
10.
Parts
Information
1029
Index Number
FRU Part Number
Units Per Assembly
Description
44P0197
1
I/O Subsystem 8 power cable (DCA2-P00)
1030
406/90
Service
Guide
Keyboards and Mouse
Index Number
FRU Part Number
Units Per Assembly
Description
1
24P0440
1
Keyboard, 101 US English (ID 103P) USB
24P0450
1
Keyboard, 101 French (ID 189) USB
24P0458
1
Keyboard, 101 Italian (ID 142) USB
24P0453
1
Keyboard, 101 German/Austrian (ID 129) USB
24P0476
1
Keyboard, 101 UK English (ID 166) USB
24P0470
1
Keyboard, 101 Spanish (ID 172) USB
24P0459
1
Keyboard, 101 Japanese (ID 194) USB
24P0444
1
Keyboard, 101 Brazilian Portuguese (ID 275) USB
24P0431
1
Keyboard, 101 Swedish, Finnish (ID 153) USB
24P0471
1
Keyboard, 101 LA Spanish (ID 171) USB
1
Keyboard, 101 Korean (ID 413) USB
24P0446
1
Keyboard, 101 Chinese (ID 467) USB
24P0466
1
Keyboard, 101 Russian (ID 443) USB
2
09N5532
1
Three Button Mouse (USB) 8841
Chapter
10.
Parts
Information
1031
System Power Cables
Index Number
FRU Part Number
Units Per Assembly
Description
1
11P0916
2
Line Power Cord, US, Canada, Japan, 30 AMP 480 V ac
1
11P0367
2
Line Power Cord, US, Canada, Japan, 60 AMP 240 V ac
1
11P0914
2
Line Power Cord, US, Chicago, 30 AMP 480 V ac
1
11P0365
2
Line Power Cord, US, Chicago, 60 AMP 240 V ac
1
11P0918
2
World Trade Power Cord, 4.27m (14 ft.), 8AWG, high voltage
1032
406/90
Service
Guide
Tools The following tools are needed to perform service actions on the 406/90. These tools can be ordered through your normal tools supplier.
Index
Part Number
Units Per Assembly
Description
1
46H9635
1
Tool box
2
11P4486
1
Torque tool MCM
3
04N6923
1
Sash brush
4
11P4747
1
Magnifying glass
5
11P4654
1
L3 cache tray
6
44P0549
1
I/O backplane insertion/removal tool
7
6422789
1
DCA removal tool
8
31L8313
1
Leveling tool (for rack)
9
47H0182
1
Torx, T10 (this tool might not be available in the shipped tool kit. A T10 tip used with a replacable driver handle can be substituted)
10
11P4861
1
Blank cassette
11
11P3990
1
Door key
12
44P0182
1
DASD service tool
13
11P4914
1
MCM/L3/Pass-through plug-location map (tri-fold chart) should be installed in the primary rack
14
11P4055
NA
Plug count adhesive replacement label
15
44P0209
1
Test card asm, MCM/L3 Short-To-GND
11P473211P473411P4735
1
Cabe asmCabe asmCabe asm
16
00P7023
NA
Vacuum cleaner (110 Vac)
36P243536P243636P2437
NA
Vacuum cleaner (220 Vac)OMEGA FILTER 0.3 MICRON CORD OMEGA UK PLUG
17
07H6824
12
Bracket, insert
18
07H6823
2
Bracket, 11 x 6 insert
19
11P1262
4
Bracket, asm w/ foam
20
07H6827
4
Blank filler plate - 3
21
07H6826
3
Blank filler plate - 1
22
54G2882
16
Screws
23
31L7174
1
Hook-and-loop fastener (roll)
24
08J5557
1
Wheel chocks
25
44P0144
Tailgate assembly
26
44P4107
1
Shorts Tester
27
00P5758
1
PCI short adapter Chapter
10.
Parts
Information
1033
Fiber Optic Connector Cleaning Tool Kit To clean fiber optic connectors in the field a fiber optic cleaning procedure has been developed. Use order number SY27-2604 to order the fiber optic cleaning procedure. Along with the cleaning procedure you should order the following: v Fiber Optic Cleaning Kit 46G6844 v Extra Alcohol Saturated Pads 9900679 The kits can be ordered through Mechanicsburg. The kit contains some parts which can be replenished by ordering direct from Texwipe, which are the lint-free cloth pads (Texwipe p/n TX 304) and microswaps (TX 732). 1034
406/90
Service
Guide
Chapter 11. SRNs for Linux The x in the following group of SRNs is encoded as follows: 8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot swap supported | Software might be the cause Reserved Note: Some SRNs in this chapter may have 4 rather than 3 digits after the dash (-). Use the physical location codes and FRU numbers listed on the diagnostics Problem Report Screen.Failing Function Codes (FFCs) are not used in this group of SRNs. For more detailed information refer to “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
Service Request Number
Description and Action
651-88x
The CEC or SPCN reported an error. Action: Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support.
651-89x
The CEC or SPCN reported an error. Action: Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support.
652-88x
The CEC or SPCN reported a non-critical error. Action: Schedule deferred maintenance. Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support.
652-89x
The CEC or SPCN reported a non-critical error. Action: Schedule deferred maintenance. Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support.
A00-FF0
Description: Error log analysis is unable to determine the error. The error log indicates the following physical FRU locations as the probable causes. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-00x
Description: Error log analysis indicates an error detected by the CPU, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-01x
Description: CPU internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-02x
Description: CPU internal cache or cache controller error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-03x
Description: External cache parity or multi-bit ECC error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-04x
Description: External cache ECC single-bit error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-05x
Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-06x
Description: Time-out error waiting for I/O. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1035
Service Request Number
Description and Action
A01-07x
Description: System bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A01-08x
Description: System bus protocol/transfer error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-00x
Description: Error log analysis indicates an error detected by the memory controller, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-01x
Description: Uncorrectable Memory Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-02x
Description: ECC correctable error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-03x
Description: Correctable error threshold exceeded. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-04x
Description: Memory Control subsystem internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-05x
Description: Memory Address Error (invalid address or access attempt). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-06x
Description: Memory Data error (Bad data going to memory). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-07x
Description: Memory bus/switch internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-08x
Description: Memory time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-09x
Description: System bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-10x
Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-11x
Description: System bus protocol/transfer error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-12x
Description: I/O Host Bridge time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-13x
Description: I/O Host Bridge address/data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-15x
Description: System support function error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A02-16x
Description: System bus internal hardware/switch error.
A03-00x
Description: Error log analysis indicates an error detected by the I/O device, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-01x
Description: I/O Bus Address parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-02x
Description: I/O Bus Data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1036
406/90
Service
Guide
Service Request Number
Description and Action
A03-03x
Description: I/O bridge/device time-out, access or other error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-04x
Description: I/O bridge/device internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-05x
Description: I/O Error on non-PCI bus. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-06x
Description: Mezzanine bus address parity error.
A03-07x
Description: System bus address parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-08x
Description: Mezzanine bus data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-09x
Description: System bus data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-10x
Description: Mezzanine bus time-out, transfer or protocol error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-11x
Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-12x
Description: Error on System bus. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-13x
Description: I/O Expansion bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-14x
Description: I/O Expansion bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-15x
Description: I/O Expansion bus connection failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A03-16x
Description: I/O Expansion unit not in an operating state. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-00x
Description: Error log analysis indicates an environmental and power warning, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-01x
Description: Sensor indicates a fan has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-02x
Description: System shutdown due to a fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-03x
Description: Sensor indicates a voltage outside normal range. Action: Use “MAP 1520: Power” on page 192.
A05-04x
Description: System shutdown due to voltage outside normal range. Action: Use “MAP 1520: Power” on page 192. Chapter
11.
SRNs
for
Linux
1037
Service Request Number
Description and Action
A05-05x
Description: Sensor indicates an abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-06x
Description: System shutdown due to abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-07x
Description: Sensor indicates a power supply has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-08x
Description: System shutdown due to power supply failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-09x
Description: Sensor detected a FRU that has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-10x
Description: System shutdown due to FRU that has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-14x
Description: System shutdown due to power fault with an unspecified cause. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-16x
Description: System shutdown due to internal power supply failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-19x
Description: System shutdown due to Fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-21x
Description: System shutdown due to Over temperature condition. Action: Verify the following: v The room ambient temperature is within the system operating environment. v There is unrestricted air flow around the system. v All system covers are closed.If all conditions are met, then use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-22x
Description: System shutdown due to over temperature and fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A05-24x
Description: Power Fault specifically due to internal battery failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-00x
Description: Error log analysis indicates an error detected by the Service Processor, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1038
406/90
Service
Guide
Service Request Number
Description and Action
A0D-01x
Note: Ensure that the system IPLROS and Service Processor are at the latest firmware level before removing any parts from the system. Description: Time-out communication response from Service Processor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-02x
Description: I/O (I2C) general bus error. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-03x
Description: Secondary I/O (I2C) general bus error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-04x
Description: Internal Service Processor memory error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-05x
Description: Service Processor error accessing special registers. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-06x
Description: Service Processor reports unknown communication error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-07x
Description: Internal service processor firmware error or incorrect version. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-08x
Description: Other internal Service Processor hardware error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-09x
Description: Service Processor error accessing Vital Product Data EEPROM. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-10x
Description: Service Processor error accessing Operator Panel. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-11x
Description: Service Processor error accessing Power Controller. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-12x
Description: Service Processor error accessing Fan Sensor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-13x
Description: Service Processor error accessing Thermal Sensor. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP).Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-14x
Description: Service Processor error accessing Voltage Sensor. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP).Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-17x
Description: Service Processor error accessing serial port. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-18x
Description: Service Processor detected NVRAM error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-19x
Description: Service Processor error accessing Real Time Clock/Time-of-Day Clock. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-20x
Description: Service Processor error accessing scan controller/hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-21x
Description: Service Processor detect error with Time-of-Day Clock backup battery. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-23x
Description: Loss of heart beat from Service Processor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter
11.
SRNs
for
Linux
1039
Service Request Number
Description and Action
A0D-24x
Description: Service Processor detected a surveillance time-out. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-25x
Description: Power Control Network general connection failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-26x
Description: Power Control Network node failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-29x
Description: Service Processor error accessing Power Control Network. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-30x
Description: Non-supported hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-31x
Description: Error detected while handling an attention/interrupt from the system hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-33x
Description: Array of Logic Built in Self Test Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-34x
Description: Wire Test Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-35x
Description: Mainstore or Cache IPL Diagnostic Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-36x
Description: Other IPL Diagnostic Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-37x
Description: Clock or PLL Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-38x
Description: Hardware Scan or Initialization Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-39x
Description: Chip ID Verification Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A0D-40x
Description: FRU Presence/Detect Error (Mis-Plugged). Action: Use“MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A10-100
Description: The resource is unavailable due to an error. System is operating in degraded mode. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A10-200
Description: The resource was marked failed by the platform. The system is operating in degraded mode. Action: Schedule maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A10-210
Description: The processor has been deconfigured. The system is operating in degraded mode. Action: Schedule maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-00x
Description: A non-critical error has been detected. Error log analysis indicates an error detected by the CPU, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-01x
Description: A non-critical error has been detected, a CPU internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-02x
Description: A non-critical error has been detected, a CPU internal cache or cache controller error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1040
406/90
Service
Guide
Service Request Number
Description and Action
A11-03x
Description: A non-critical error has been detected, an external cache parity or multi-bit ECC error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-04x
Description: A non-critical error has been detected, an external cache ECC single-bit error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-05x
Description: A non-critical error has been detected, a system bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-06x
Description: A non-critical error has been detected, a time-out error waiting for an I/O device. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-07x
Description: A non-critical error has been detected, a system bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-08x
Description: A non-critical error has been detected, a system bus protocol/transfer error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-50x
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-510
Description: Resource has been deconfigured and is no longer in use due to a trend toward an unrecoverable error. Action: The system is operating in a degraded mode. Schedule maintenance. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-520
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: Try removing the processes from the failing processor and retry the deconfiguration, use the ha_star -C command. An alternative is reboot, and the processor will be deconfigured. Then the system can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-530
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Run-time processor deconfiguration can be enabled to deconfigure the processor. Action: To enable run-time processor deconfiguration, use the chdev -a cpuguard=enable -l sys0 command. Then to retry the deconfiguration, use the ha_star -C command. An alternative is reboot, and the processor will be deconfigured. Then the system can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-540
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter
11.
SRNs
for
Linux
1041
Service Request Number
Description and Action
A11-550
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A11-560
Description: Recoverable errors on a resource indicate a potential for unrecoverable errors. The resource cannot be deconfigured and is still in use. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168.
A11-570
Description: A processor has been deconfigured due to a trend toward an unrecoverable error. The system is operating in a degraded mode. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168.
A11-580
Description: Recoverable errors on a processor indicate a potential for an unrecoverable error. The resource cannot be deconfigured and is still in use. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168.
A12-00x
Description: A non-critical error has been detected. Error log analysis indicates an error detected by the memory controller, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-01x
Description: A non-critical error has been detected, an uncorrectable memory error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-02x
Description: A non-critical error has been detected, an ECC correctable error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-03x
Description: A non-critical error has been detected, a correctable error threshold exceeded. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-04x
Description: A non-critical error has been detected, a memory control subsystem internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-05x
Description: A non-critical error has been detected, a memory address error (invalid address or access attempt). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-06x
Description: A non-critical error has been detected, a memory data error (bad data going to memory). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-07x
Description: A non-critical error has been detected, a memory bus/switch internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-08x
Description: A non-critical error has been detected, a memory time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-09x
Description: A non-critical error has been detected, a system bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1042
406/90
Service
Guide
Service Request Number
Description and Action
A12-10x
Description: A non-critical error has been detected, a system bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-11x
Description: A non-critical error has been detected, a system bus protocol/transfer error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-12x
Description: A non-critical error has been detected, an I/O host bridge time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-13x
Description: A non-critical error has been detected, a I/O host bridge address/data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-15x
Description: A non-critical error has been detected, a system support function error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-16x
Description: A non-critical error has been detected, a system bus internal hardware/switch error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A12-50x
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-00x
Description: A non-critical error has been detected, a error log analysis indicates an error detected by the I/O device, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-01x
Description: A non-critical error has been detected, an I/O bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-02x
Description: A non-critical error has been detected, an I/O bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-03x
Description: A non-critical error has been detected, an I/O bus time-out, access or other error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-04x
Description: A non-critical error has been detected, an I/O bridge/device internal error. Action: Schedule deferred maintenance. Use v.
A13-05x
Description: A non-critical error has been detected, an I/O error on non-PCI bus. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-06x
Description: A non-critical error has been detected, a mezzanine bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-07x
Description: A non-critical error has been detected, a system bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-08x
Description: A non-critical error has been detected, a mezzanine bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter
11.
SRNs
for
Linux
1043
Service Request Number
Description and Action
A13-09x
Description: A non-critical error has been detected, a system bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-10x
Description: A non-critical error has been detected, a mezzanine bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-11x
Description: A non-critical error has been detected, a system bus time-out error Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-12x
Description: A non-critical error has been detected, an error on system bus. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-13x
Description: A non-critical error has been detected, an I/O expansion bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-14x
Description: A non-critical error has been detected, an I/O expansion bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-15x
Description: A non-critical error has been detected, an I/O expansion bus connection failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-16x
Description: A non-critical error has been detected, an I/O expansion unit not in an operating state. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A13-50x
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-01x
Description: Sensor indicates a fan is turning too slowly. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-03x
Description: Sensor indicates a voltage outside normal range. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-05x
Description: Sensor indicates an abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-07x
Description: Sensor indicates a power supply has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-09x
Description: Sensor indicates a FRU has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-11x
Description: Sensor detected a redundant fan failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1044
406/90
Service
Guide
Service Request Number
Description and Action
A15-12x
Description: Sensor detected redundant power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-13x
Description: Sensor detected a redundant FRU that has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-14x
Description: Power fault due to unspecified cause. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-16x
Description: Internal power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-17x
Description: Internal redundant power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-19x
Description: Fan failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-20x
Description: Non-critical cooling problem, loss of redundant fan. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-21x
Description: Over temperature condition. Action: Verify the following: v The room ambient temperature is within the system operating environment. v There is unrestricted air flow around the system. v All system covers are closed.If all conditions are met, then use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-22x
Description: Fan failure and Over temperature condition. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-23x
Description: Non-critical power problem, loss of redundant power supply. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-24x
Description: Power Fault specifically due to internal battery failure. Action: Schedule deferred maintainence. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A15-50x
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-00x
Description: A non-critical error has been detected. Error log analysis indicates an error detected by the Service Processor, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-01x
Note: Ensure that the system IPLROS and Service Processor are at the latest firmware level before removing any parts from the system. Description: A non-critical error has been detected. Time Out communication response from Service Processor Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-02x
Description: A non-critical error has been detected, an I/O (I2C) general bus error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-03x
Description: A non-critical error has been detected, a secondary I/O (I2C) general bus error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter
11.
SRNs
for
Linux
1045
Service Request Number
Description and Action
A1D-04x
Description: A non-critical error has been detected, an internal service processor memory error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-05x
Description: A non-critical error has been detected, a service processor error accessing special registers. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-06x
Description: A non-critical error has been detected, a service processor reports unknown communication error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-07x
Description: A non-critical error has been detected,: Internal service processor firmware error or incorrect version. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-08x
Description: A non-critical error has been detected, an other internal service processor hardware error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-09x
Description: A non-critical error has been detected, a service processor error accessing vital product data EEPROM. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-10x
Description: A non-critical error has been detected, a service processor error accessing operator panel. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-11x
Description: A non-critical error has been detected, a service processor error accessing power controller. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-12x
Description: A non-critical error has been detected, a service processor error accessing fan sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-13x
Description: A non-critical error has been detected, a service processor error accessing a thermal sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-14x
Description: A non-critical error has been detected, a service processor error accessing voltage sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-17x
Description: A non-critical error has been detected, a service processor error accessing serial port. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-18x
Description: A non-critical error has been detected, a service processor detected NVRAM error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-19x
Description: A non-critical error has been detected, a service processor error accessing real time clock/time-of-day clock. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-20x
Description: A non-critical error has been detected: Service processor error accessing scan controller/hardware. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-21x
Description: A non-critical error has been detected, a service processor detected error with time-of-day clock backup battery. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1046
406/90
Service
Guide
Service Request Number
Description and Action
A1D-23x
Description: A non-critical error has been detected: Loss of heart beat from Service Processor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-24x
Description: A non-critical error has been detected, a service processor detected a surveillance time-out. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-25x
Description: A non-critical error has been detected, a power control network general connection failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-26x
Description: A non-critical error has been detected, a power control network node failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-29x
Description: A non-critical error has been detected, a service process error accessing power control network. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-30x
Description: A non-critical error has been detected: Non-supported hardware. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-31x
Description: A non-critical error has been detected: Error detected while handling an attention/interrupt from the system hardware. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-33x
Description: A non-critical error has been detected: Array of Logic Built in Self Test Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-34x
Description: A non-critical error has been detected: Wire Test Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-35x
Description: A non-critical error has been detected: Mainstore or Cache IPL Diagnostic Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-36x
Description: A non-critical error has been detected: Other IPL Diagnostic Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-37x
Description: A non-critical error has been detected: Clock or PLL Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-38x
Description: A non-critical error has been detected: Hardware Scan or Initialization Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-39x
Description: A non-critical error has been detected: Chip ID Verification Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
A1D-40x
Description: A non-critical error has been detected: Presence/Detect Error (Mis-Plugged). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter
11.
SRNs
for
Linux
1047
Service Request Number
Description and Action
A1D-50x
Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163.
1048
406/90
Service
Guide
Appendix A. Environmental Notices Product Recycling and Disposal Components of the system unit, such as structural parts and circuit boards, can be recycled where recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic products. Contact your account representative for more information. This system unit contains parts such as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is disposed, these materials must be removed and recycled or discarded according to applicable regulations. This book contains specific information on each battery type where applicable. This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries, contact your local waste disposal facility. 1049
Detteproduktkanindeholdeetforsegletbatteri,derindeholderbly,nikkel-kadmium,nikkel-metal-hydrid,litiumellerlitium-ion.Dererflereoplysningerombatterietibruger-ellerservicevejledningen.Batterietmåikkekasseressammenmeddetalmindeligeaffald.Batterietskalkasseresihenholdtildelokalebestemmelser.Spørgeventueltkommunenstekniskeforvaltning(Miljøafdelingen).Ditproductbevatmogelijkeenafgeslotenbatterijvanhettypelood/zuur,nikkel/cadmium,nikkel/metaalhydride,lithium,oflithiumionen.Raadpleeghethandboekvoordegebruikerofhetservicehandboekvoorinformatieoverdebatterij.Debatterijmoetcorrectwordengerecycledofweggegooid.MogelijkzijnerinuwregiogeenKCA-inzamelingspunten.Meerinformatieoverhetverwerkenvangebruiktebatterijenkuntuverkrijgenbijuwafvalverwerkingsbedrijf.DiesesProduktenthälteineauslaufsichereBlei-,Nickel-Cadmium-,Nickel-metallhydrid-,Lithium-oderLithiumionenbatterie.SpezifischeInformationenzurBatterieentnehmenSiebittedemBenutzer-oderServicehandbuch.DieBatteriemusswiederverwertetoderalsSondermüllentsorgtwerden.Esistmöglich,dassesinIhrerNähekeineWiederverwertungsanlagegibt.WeitereInformationenzurEntsorgungvonBatterienerhaltenSievonIhremörtlichenMüllentsorgungsunternehmen.
1050
406/90
Service
Guide
Questoprodottopotrebbecontenereunabatteriaalpiombo,alnichel-cadmio,all’idrurodinichel,allitiooagliionidilitio.Consultareilmanualeutenteoilmanualediassistenzaperinformazionispecifichesullabatteria.Labatteriadeveessereappropriatamentericiclataosmaltita.Ilservizioperilriciclaggiopotrebberononesseredisponibilenellavostrazona.Perinformazionisullosmaltimentodellebatterie,contattarel’aziendaprepostaperlosmaltimentodeirifiuti.
Appendix
A.
Environmental
Notices
1051
Produktenkaninnehållaettförseglatbatteriavnågonavtypernablyackumulator,nickel-kadmium,nickel-metallhydrid,litiumellerlitiumjon.DetaljeradbatteriinformationfinnsIanvändar-ellerservicehandboken.Batterietmåstelämnastillåtervinningellerkastaspåettmiljösäkertsätt.Detkanskeintefinnsnågraåtervinningsmöjligheterdärdubor.Närdetgälleromhändertagandeavbatterierberviattfåhänvisatillmyndigheternasanvisningarirespektiveland.
1052
406/90
Service
Guide
Appendix
A.
Environmental
Notices
1053
Acoustical Noise Emissions The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure level at the 1-meter bystander positions) does not exceed 70 dB(A). Der Geräuschpegel der Einheit ist kleiner oder gleich 70 db(A). Declared Acoustical Noise Emissions
Acoustical Characteristic
Product Configuration
Declared A-Weighted Sound Power Level, LWAd (B)
Declared A-Weighted Sound Pressure Level, LpAm (dB)
Operating
Idle
Operating
Idle
Processor Subsystem
6.15
6.15
446
446
Notes: 1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions;1 B =10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296.
1054
406/90
Service
Guide
Appendix B. Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any country where such provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ?AS IS? WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Information concerning products made by other than the manufacturer was obtained from the suppliers of those products, their published announcements, or other publicly available sources. The manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products made by other than the manufacturer. Questions on the capabilities of products made by other than the manufacturer should be addressed to the suppliers of those products. 1055
1056
406/90
Service
Guide
Appendix C. Service Processor Setup and Test Attention: This procedure applies to modems attached to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). It does not affect the operation of the modem attached to the Hardware Management Console (HMC). The call-out function is normally handled by the Service Focal Point application running on the HMC. Note: The call-out and surveillance options are disabled in partitioned systems. For your convenience, an example of a basic service processor setup checklist is included here. Your setup may include more or fewer of the available features, so you can adjust this checklist for your own application. Service Processor Setup Checklist 1. Shut down the managed system, and wait for the OK in the physical operator panel. 2. Open a virtual terminal window on the HMC. 3. Start the service processor menus. 4. Set the system name. 5. Enable Surveillance. 6. Configure call-in/call-out. 7. Attach modems if necessary. If modem attachment is not necessary, proceed to step 8. To attach a modem, do the following: a. Exit the service processor menus. b. Disable the 350 V dc outputs from the Bulk Power Assembly (BPA) by placing the UEPO switch in the off position. Attention: With the UEPO off, power is still present within the BPA. c. Attach the cables and modems. d. Place the UEPO switch in the on position.8. Test both of the following: v Call-In, go to “Testing Call-In” on page 1058 v Call-Out, go to “Testing Call-Out” on page 1058 Your service processor is now ready for use. 1057
Testing the Service Processor Setup This section contains sample testing procedures to help ensure your service processor setup is working. These tests include communicating with the server operating system. Before you start, ensure that the necessary serial port(s) is configured. If you need assistance, refer to “Serial Port Configuration” on page 1059. The server should be powered off as a result of the “Service Processor Setup Checklist” on page 1057. Testing Call-In 1. At your remote terminal, call in to your server. Your server answers and offers you the service processor Main Menu after requesting your privileged-access password. 2. Select System Power Control. 3. Select Power-On System. When you are asked if you wish to continue powering on the system, type Y. 4. After the system firmware and operating system have initialized the server, the login prompt displays at your remote terminal if you set up seamless modem transfer. This may take several minutes. When the login prompt displays, you have successfully called the service processor. 5. Type logout to disconnect from the operating system. The message No Carrier displays on your remote terminal. 6. Call your server again. The operating system answers and offers you the login prompt. If these tests are successful, call-in is working. 7. Log in and type shutdown -F to shut down your server. 8. The message No Carrier displays on your remote terminal.Testing Call-Out During the service processor setup, you entered your phone numbers for the pager and customer voice. These numbers are used for this test. 1. Your remote terminal is disconnected as a result of the Call-In test. 2. Call your server again. 3. At the service processor Main Menu, select Call-In/Call-Out Setup menu, then select Call-Out test. This action causes a simulated error condition for the purposes of this test. 4. After a few moments, a message displays, regarding an illegal entry. Press Enter to clear the message and return to the main menu. 5. When your telephone rings, answer the call. You should hear the sound of a telephone being dialed. This is your computer trying to page you.If this test is successful, call-out is working correctly. 1058
406/90
Service
Guide
Serial Port Configuration To configure the serial port on an AIX system, do the following from an AIX console: 1. Log in as root user. 2. To determine if you have any serial ports already configured, type: lsdev -Cc tty If no serial ports are configured, none are listed. If you want to configure serial ports that are not listed, continue with the remaining steps. 3. Identify the serial port(s) with the modem(s). 4. Type smit tty. 5. Select add tty. 6. Select RS232. 7. Select Baud rate 9600 or higher. 8. Select login enable and set the flow control to RTS. 9. Commit the selections and set up any other needed serial ports. 10. Exit SMIT.
Appendix
C.
Service
Processor
Setup
and
Test
1059
1060
406/90
Service
Guide
Appendix D. Modem Configurations Note: This appendix applies only to modems attached to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). These configuration files do not apply to the modem attached to the Hardware Management Console (HMC). Sample Modem Configuration Files This appendix contains information about several sample modem configuration files that either work directly with your modem or provide a good starting point for a custom setup. The sample modem configuration files are located in the firmware in /usr/share/modems subdirectory (if your server is using AIX) with the following names. A listing of each specific file is included in this appendix. Generic Modem Configuration Files
AIX File Name
Service Processor Firmware File Name
modem_z.cfg
modem_z_sp
modem_z0.cfg
modem_z0_sp
modem_f.cfg
modem_f_sp
modem_f0.cfg
modem_f0_sp
modem_f1.cfg
modem_f1_sp
Specific Modem Configuration Files
AIX File Name
Service Processor Firmware File Name
modem_m0.cfg
modem_m0_sp
modem_m1.cfg
modem_m1_sp
Use the following selection procedures and your modem manual to determine which of the configuration files is suitable for your use. 1061
Configuration File Selection Use the following steps to select a configuration file: 1. Is your modem an IBM 7852-400? If Yes, use modem configuration file modem_m0.cfg and go to step 7. Note: The IBM 7852-400 modem has DIP switches on the right side of the unit. See “IBM 7852-400 DIP Switch Settings” on page 1064 for the correct switch settings. If No, continue with step 2. 2. Is your modem an IBM 7857-017? If Yes, use modem configuration file modem_m1.cfg and go to step 7. Note: The IBM 7857-017 modem has two telephone line connections on the back of the unit. One is marked LL (for Leased Line), and the other is marked PTSN (for Public Telephone Switched Network). The service processor expects to use the modem on the public network, so the telephone line should attach to the PTSN connector. If No, continue with step 3. 3. Does your modem respond to the extended command set (prefixed with &)? If Yes, go to step 5. If No, continue with step 4. 4. Does your modem respond to: v ATZ reset command or v ATZn reset commands, where n can be 0, 1, and so on? If ATZ, configuration file modem_z.cfg is recommended. If ATZn, configuration file modem_z0.cfg is recommended.Go to step 7. 5. Does your modem command set include a test for V.42 error correction at the remote modem (often called “Auto-Reliable Mode”)? If Yes, disable this test. You can use sample configuration files /usr/share/modem_m0.cfg or /usr/share/modem_m1.cfg as models to help you create a file for your particular modem. See “Customizing the Modem Configuration Files” on page 1064 for more information. Go to step 7. If No, go to step 6. 6. Does your modem respond to: v AT&F reset command or v AT&Fn reset commands, where n can be 0, 1, and so on?If AT&F, configuration file modem_f.cfg is recommended. If AT&Fn, configuration file modem_f0.cfg or modem_f1.cfg is recommended, depending on which provides the hardware flow control profile. 7. You have completed selection of the configuration file. If your modem configuration selection is not available in the Service Processor Modem Configuration Menu, you must access it through the Configure Remote Maintenance Policy Service Aid. If you find it necessary to adjust any of these configuration files, use the manual provided with your modem to accomplish that task. It is recommended you select settings that enable hardware flow control and respond to DTR. Note: Some older modems do not respond to the X0 or &R1 commands. Edit out these commands from the modem configuration file if yours is such a modem. See your modem manual for more information.
1062
406/90
Service
Guide
Some modems, such as the IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message causes the service processor to retry, continuing to place pager calls for the number of retries specified in the Call-Out Policy Setup Menu. These retries result in redundant pages. Examples for Using the Generic Sample Modem Configuration Files The following table contains information to help you determine which modem configuration file to use with various modems.
Modem
Setup Z
Setup Z0 (Rare)
Setup F
Setup F0
Setup F1
AT&T DataPort 2001 (Ring interrupt only on first ring)
X
Bocamodem 1440E
X
Hayes Smart Modem 300
X
IBM 5841
X
IBM 5843
X
IBM 7851
X
IBM 7852-10
X
IBM 7855
X
USRobotics 36.6K Sportster
X
Zoom V.32
X
Appendix
D.
Modem
Configurations
1063
Customizing the Modem Configuration Files You can create your own modem configuration files or modify the samples provided. After you customize your modem configuration files, you must access them through the Configure Remote Maintenance Policy Service Aid rather than from the service processor menus. If you have already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aid to specify your customized modem configuration files. If you have not already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aids to set them while you specify your customized modem configuration files. To disable Auto-Reliable Mode testing of the remote modem, use the sample modem configuration file /usr/share/modems/modem_f.cfg as a model that you can modify, as follows: 1. Find the necessary command in your modem manual. 2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name (for example, modem_fx.cfg). 3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send "ATcccE0T\r" where ccc is the added command as specified in your modem manual, as follows. Change the third line of each of the following stanzas: v condout v condin v ripo4. Save the changes.IBM 7852-400 DIP Switch Settings If you are using a 7852-400 modem to enable service processor communications, for proper operation, the dual inline package (DIP) switches must be set according to the following table:
Switch
Position
Function
1
Up
Force DTR
2
Up
Flow Control &E4
3
Down
Result Codes Enabled
4
Down
Modem Emulation Disabled
5
Up
Auto Answer Enabled
6
Up
Maximum Throughput Enabled
7
Up
RTS Normal Functions
8
Down
Enable Command Mode
9
Down
Remote Digital Loopback Test Enabled
10
Up
Dial-Up Line Enabled
11
*Up
AT Responses Enabled (Extended Responses Disabled)
12
*Down
Asynchronous Operation
13
UP
28.8KB Line Speed
14
Up
15
Up
CD and DSR Normal Functions
16
Up
2-Wire Leased Line Enabled
* Only switches 11 and 12 are changed from the factory default settings. 1064
406/90
Service
Guide
Xon/Xoff Modems Some early modems assume software flow control (Xon/Xoff) between the computer and the modem. Modems with this design send extra characters during and after the transmitted data. The service processor cannot accept these extra characters. If your configuration includes such a modem, your functional results may be unpredictable. The sample modem configuration files included in this appendix do not support these modems, so custom configuration files are necessary. Anchor Automation 2400E is an example of such a modem. If you experience unexplainable performance problems that may be due to Xon/Xoff characters, it is recommended that you upgrade your modem. Ring Detection Most modems produce an interrupt request each time they detect a ring signal. Some modems generate an interrupt only on the first ring signal that they receive. AT&T DataPort 2001 is an example of such a modem. The service processor uses the ring interrupt request to count the number of rings when Ring Indicate Power-On (RIPO) is enabled. If your modem produces an interrupt on only the first ring, set Ring Indicate Power-On to start on the first ring. Otherwise, you can choose to start Ring Indicate Power-On on any ring count. Terminal Emulators The service processor is compatible with simple ASCII terminals, and therefore compatible with most emulators. When a remote session is handed off from the service processor to the operating system, agreement between terminal emulators becomes important. The server’s operating system will have some built-in terminal emulators. You may also have a commercially available terminal emulation. It is important that the local and host computers select the same or compatible terminal emulators so that the key assignments and responses match, ensuring successful communications and control. For best formatting, choose line wrap in your terminal emulator setup. Recovery Procedures Situations such as line noises and power surges can sometimes cause your modem to enter an undefined state. When it is being used for dial-in, dial-out or ring indicate power-on, your modem is initialized each time one of these actions is expected. If one of these environmental conditions occur after your modem has been initialized, it might be necessary to recover your modem to a known state. If your modem communicates correctly with remote users, it is probably in control. It may be wise to occasionally change some of the functional settings and then change them back, just for the sense of security that the modem is communicating, and to ensure it has been initialized recently. If your system is particularly difficult to access physically, another strategy is to protect it with an Uninterruptible Power Source (UPS) and a phone-line surge protector. In case recovery becomes necessary, shut down your system using established procedures. Disconnect the power cable and press the power button to drain capacitance while power is disconnected. Disconnect and reconnect modem power, and then reconnect system power to completely reinitialize your system. Appendix
D.
Modem
Configurations
1065
Transfer of a Modem Session Because many modem command variations exist, the sample modem configuration files located at the end of this appendix have been written to capture the largest number of workable modem settings. The modem command &Dn (where n is a number) generally sets the modem response to the Data Terminal Ready (DTR) signal from the server’s serial port. The desired response is that the modem will hold a connection while DTR is enabled, and drop the connection when DTR is released. Using this mechanism, the server hangs up on a connection under normal conditions. Consult your modem manual for its specific response scheme for the &Dn command. Two strategies are available for dealing with the modem’s response to DTR: v Recovery v PreventionBefore you use one of these strategies, determine if your server’s modem is set up correctly to respond to DTR. With the remote terminal connected to serial port 1 and defined as the Primary Console Device, there are two tests you can perform: v Will the modem drop the connection after the System initialization complete message displays at the remote terminal? If Yes, the modem is set up correctly. If No, try another &Dn setting for your server’s modem. See your modem manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files, as follows: v Will the server’s modem disconnect when the power drops? You can make this observation at the remote terminal by commanding your server to shut down and power off. (Use the AIX command shutdown -F.) Watch for the message NO CARRIER on your remote terminal. If Yes, this is the correct response. The modem is set up correctly. If No, try another &Dn setting for your server’s modem. See your model manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files. Only the following sample modem configuration files contain the &Dn command (in three places each): – modem_f.cfg – modem_f0.cfg – modem_f1.cfgIf you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response. If your remote terminal does not disconnect after logging off, you must command the remote terminal emulator to hang up. This then breaks the connection. 1066
406/90
Service
Guide
Recovery Strategy The recovery strategy consists of making two calls to establish a remote session. This solution is the easiest to implement and allows more freedom for configuring your server’s serial ports. To set up a remote terminal session, dial into the service processor and start the system. After the operating system is loaded and initialized, the connection will be dropped. At this point, call the server back and the operating system will answer and offer you the login prompt. Prevention Strategy The disconnect is caused by the operating system when it initializes the Primary Console. The tests listed in “Transfer of a Modem Session” on page 1066 are conducted with the remote terminal selected as the primary console to manifest the modem’s response to DTR transitions. v If a local ASCII terminal or a graphics console is to be a permanent part of your server, then make one of them the primary console. Your remote terminal will no longer experience the connection loss. v If a local console is not a permanent part of your server, you can still assign either the unused graphics console or the unused serial port as the primary console. This gives you the desired seamless connection at your remote terminal.If you choose to use the unused serial port as the primary console, some initialization traffic will be sent to any serial device attached to that port. As a result, that serial device’s connection and function could be affected. These impacts may make that port unattractive for devices other than a temporary local ASCII terminal. Appendix
D.
Modem
Configurations
1067
Modem Configuration Sample Files Sample File modem_m0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP: modem_m0 # # FUNCTIONS: Modem configuration file specifically for IBM 7852-400 # modem with Auto-Reliable feature. This feature must be turned off # for Catcher calls. This example uses the AT&F reset command to # choose the factory defaults. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # The modem has configuration switches. They should be set to the # factory default settings, except switches 11 and 12. These must be # to UP ("AT" responses) and DOWN (Asynchronous operation), respectively. ICDelay 1 DefaultTO 10 CallDelay 120 # # %N Call-Out phone number %R Return phone number # # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F&E2E0T\r" # Reset to factory defaults # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "AT&SF1&S0S9=1\r" # DSR independent of CD # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto-Answer off expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or
1068
406/90
Service
Guide
"19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=2\r" # Numberic response code # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pickup 2nd ring or timeout # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number expect "0\r" timeout 60 # Confirm successful command. delay 2 # Wait before hanging up. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms
Appendix
D.
Modem
Configurations
1069
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto Answer OFF expect "0\r" timeout 2 # Confirm commands successful. done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
1070
406/90
Service
Guide
Sample File modem_m1.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1 # # FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with # Auto-Reliable feature. This feature must be turned off for Catcher calls. # This example uses the AT&F reset command to choose the factory defaults. # # To allow dial commands for digital pagers, it is necessary to reduce # the number of characters in the dial command. Each comma (delay) has # been set to 6 seconds (S8=6) for that reason. # # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # ICDelay 1 DefaultTO 10 CallDelay 120 # # %N Call-Out phone number %R Return phone number # # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2S8=6\r" # Trellis modulation disabled # Retrain with adaptive rate # Set ,=6second expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done retry: send "A/" # Repeat the previous command. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
Appendix
D.
Modem
Configurations
1071
send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=2\r" # Numeric response code # AT compatible messages # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "1\r" timeout 60 done page: send "ATD%N,%R\r" # %N = pager call center number # commas=6sec wait time to # enter paging number. # %R = return number expect "0\r" or "3\r" timeout 30 # Confirm successful command. delay 2 # Wait before hanging up. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" timeout 2 # Confirm commands successful. done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "4\r" or "3\r" delay 2 done
1072
406/90
Service
Guide
Sample File modem_z.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z # # FUNCTIONS: Modem configuration file for many early Hayes* compatible modems. # This example uses the ATZ reset command to choose the factory defaults. # This setup will work for many modems, but it is required for early vintage # modems which respond to neither the ATZ0 reset command nor the extended (&) # commands. Refer to your modem manual. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z Reset to factory defaults Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60
Appendix
D.
Modem
Configurations
1073
done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled.
1074
406/90
Service
Guide
error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
Appendix
D.
Modem
Configurations
1075
Sample File modem_z0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z0 # # FUNCTIONS: Modem configuration file for some early Hayes* compatible modems. # This example uses the ATZ0 reset command to choose the factory defaults. # This setup is recommended for modems that will respond to the ATZ0 command # and which do not respond to the extended (&) commands. Refer to your modem # manual. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z0 Reset. Restore Profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
1076
406/90
Service
Guide
timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful.
Appendix
D.
Modem
Configurations
1077
done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
1078
406/90
Service
Guide
Sample File modem_f.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), setup file # F0 or F1 is recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F Reset to default profile Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done Appendix
D.
Modem
Configurations
1079
connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&FQ0T\r" # Reset to factory defaults.
1080
406/90
Service
Guide
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
Appendix
D.
Modem
Configurations
1081
Sample File modem_f0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F0 # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F0 reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), but doesn’t # work properly with this setup file, setup F1 is recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F0 Reset. Restore profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful.
1082
406/90
Service
Guide
done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command.
Appendix
D.
Modem
Configurations
1083
done ripo: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
1084
406/90
Service
Guide
Sample File modem_f1.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F1 # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F1 reset command to choose the factory defaults. # This set up is for modems with extended (&) commands and which do not work # properly with setup F0. For early vintage modems, setup Z or Z0 is # recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F1 Reset. Restore profile 1 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done
Appendix
D.
Modem
Configurations
1085
connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done
1086
406/90
Service
Guide
ripo: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done
Appendix
D.
Modem
Configurations
1087
1088
406/90
Service
Guide
Appendix E. Ground Path The ground path for the 406/90 is shown in the following figure. 1089
1090
406/90
Service
Guide
Index Numerics406/90 overview 1 Aabout this book xiii acousticalnoiseemissions 1054 addingFRUs 9 serviceable event comments 8 air filters 912 AIX location codes 66, 71 aluminum backplane stiffener 906 attention LED 22 HMC 22 automatic call homeoverview 5 Bbatterydisposal, recycling 1049 removal 859 bay locations 20 bay locations, DASD 26 boot list 762 boot mode menu 780 boot problems 409 bypass interlock, room EPO 16, 850 Ccable installation and management procedures 1003 cablingI/O subsystem power 30, 33 rulesRIO 40 RIO-2 40 call-intesting 1058 call-in/call-out setup menu 795 call-outtesting 1058 call-out (call-home) 815 call-out policy setup menu 800 capacitor books and capacitor blanks 862 CD-ROM removal 998 change general access password 774 change privileged access password 774 chassisfan 912 checkpoints 369 description 145 firmware 379 service processor 372 CoD transactions 752 code, error 413 codes, spcn informational 755 codes, VPD informations 756 common service processor error codes 737 configurationclient 767 modemcustomizing files 1064 file selection 1062 sample files 1061 specific files 1061 NIM server 766 service processorserial port 1059 configurationsmodem 1061 configurememory 812 processor 812 console mirroring 816 enable/disable 774 customer account setup menu 801 customizing filesmodemconfiguration 1064 Ddata flow 11 DCA books and DCA blanks 861 DCA power cables 30 deconfigurememory 812 processor 812 description, ipl flow 369 diagnosticsconsiderations 759 loading 764 online 760 concurrent mode 762, 763 maintenance mode 763 overview 145 runningservice mode 761 running diagnostics on S1 port 148 service mode 760 standalone 764, 765 considerations 764 NIM server 766 partition considerations 764 using 759 disable console mirroring 774 diskette drive removal 999 documentationAIX 763 DVD-RAM removal 998 1091
Eenable console mirroring 774 entering open firmware 13 entry MAP 145, 149 EPO bypass interlock 16, 850 error code to FRU index 413 error code, B1xx 8FF0 752 error codescommon firmware 737 firmware/POST 488 HMC surveillance 756 operator panel 419 POST 413 problem determination-generated 757 scan dump 736 service processor 516 SPCN 420 system firmware 736 error logsservice processor 817 extended error datamanaging 8 external cables 1029 Ffan assembly 912 fan chassis 912 fan controller 911 fan power cables 30 file selectionmodemconfiguration 1062 firmware 806 frame (power subsystem controller firmware update) 809 level 806 systemupdates 805 update 806 firmware checkpoints 379 firmware/POST error codes 488 flow, data 11 flow, IPL 369 focal point procedures, service 145 FRUhot-pluggable 842 isolation 147 updating information 8 FRU index 413 FRU LEDsactivating 9 deactivating 9 FRUsadding 9 replacing 9 Ggeneral user menu 771 Hhandling static-sensitive devices 837 handling static-sensitive L3 modules 863, 882 handling static-sensitive MCM modules 863, 882 hardware service functionsactivating and deactivating FRU LEDs 9 HMCattention LED 22 cabling 59 resetting the system attention LED 22 HMC (hardware management console)system error access 21 HMC systemresetting attention LED 22 HMC system error access 21 hot-pluggable FRUs 842 II/O blanks 854 I/O books 854 I/O subsystem backplane assembly 989 I/O subsystem chassis 995 I/O subsystem DASD 4-pack 993 I/O subsystem disk drive assembly 991 I/O subsystem I/O bus adapter 994 I/O subsystem power cables 33 I/O subsystem RIO riser card 986 IBF 37 IBF power cables 30 index, error code to FRU 413 indicators, POST 12 information, reference 1 installcorrective service on the frame 810 reconfiguration procedure for SNI adapters 811 installation and managementswitch and network cables 1003 integrated battery feature 35 introducing the system 1 IPLflow 369 phases 369 isolation, FRU 147 Kkeyboards 1031, 1032 keysnumeric 1 key 12 numeric 5 12 numeric 6 12 numeric 8 key 13 Llanguage selection menu 794 laser compliance statement ix, 998 laser safety information ix LCD progress indicator log 818 1092
406/90
Service
Guide
LEDindicatormenu 788 LED status 21 Linux SRNs 1035 location codeformat 64 location codes 64, 71 AIX 66 physical 64 locations 13 AIX 68 DASD bays 26 DASD SCSI IDs 26 I/O subsystem front 24, 26 I/O subsystem rear 25 location code reference tables 68 logical 64 media bays 20 media SCSI IDs 20 media subsystem operator panel 21 media, front 20 media, rear 20 physical 64, 68 processor subsystem front 13 processor subsystem rear 18, 19 Mmaintenance analysis procedures 145, 149 MAP 1230, platform error problem resolution 163 MAP 1235, system array self-repair problem resolution 168 quick entry MAP 151 maintenance analysis procedures (MAPs) 149 hot-plug MAP 188 minimum configuration MAP 318 power MAP 192 problem determination MAP 158 managingextended error data 8 MAP hot-plug 188 media subsystem cable removal 1002 media subsystem operator panel locations 21 media subsystem, front 20 media subsystem, rear 20 memoryconfigure 812 deconfigure 812 memory books and memory blanks 910 memory configuration/deconfiguration menu 783, 784 menuboot mode 780 call-in/call-out 795 call-out policy setup 800 customer account setup 801 generaluser 771 language selection 794 LED indicator 788 main 772 menu (continued)memory configuration/deconfiguration 783 modem configuration 796 OS surveillancesetup 775 power control network utilities 785 privileged user 772 processor configuration/deconfiguration 782 reboot/restart policy setup 779 ring indicate 778 serial port selection 797 serial port speed setup 798 setup 773 system information 781 system power control 778 telephone setup 798 messages, service processor checkpoints 372 minimum configuration MAP 318 modemconfigurationcustomizing files 1064 file selection 1062 sample files 1061 specific files 1061 configurations 1061 modem sessiontransfer 1066 modem_f.cfg, sample file 1079 modem_f0.cfg, sample file 1082 modem_f1.cfg, sample file 1085 modem_z.cfg, sample file 1073 modem_z0.cfg, sample file 1076 NNIM serverconfiguration 766 client 767 standalone diagnostics 766 noise emissionsacoustical 1054 non-HMC attached system error access 22 Oonline diagnosticsconcurrent mode 762, 763 maintenance mode 763 modes of operation 760 service mode 760 open firmware, entering 13 operating considerationsonline and standalone diagnostics 759 operational phasesservice processor 820 operator panel error codes 419 operator panel removal 1000 OS defined restart policy 779 overview 1 diagnostics 145 service focal point 4 Index
1093
overview (continued)system 1 Pparametersservice processorservice mode 801 partitionsconsiderationsstandalone 764 partskeyboard 1031, 1032 parts information 1009 external cables 1029 media subassembly 1021 power subsystem 1010 processor subsystem 1012, 1014, 1018 rack subsystem 1010 subassembly 1025 passwords 774 general, changing 774 privileged, changing 774 PCIslot LED definitions 913 PCI adapters 914 accessing hot-plug functions 925 hot-plug manager access 924 hot-pluggable adapters 921 hot-pluggable manager menu 925 install a short adapter into cassette 951 installing a long adapter into a cassette 968 installing a new hot-pluggable adapter 923 long adapter removal 946 removing adapter from cassette 935 removing and replacing hot-pluggable adapter 921 removing non-hot-pluggable adapter 920 removing PCI adapter cassette 915 replacing a PCI adapter cassette 918 replacing an adapter in a cassette 946 replacing non-hot-pluggable adapter 920 short adapter removal 943 phases, ipl 369 physical location codes 64, 71 platform error problem resolution, MAP 1230 163 POST 413 POST indicators 12 POST keysnumeric 1 12 numeric 5 key 12 numeric 6 key 12 numeric 8 key 13 power cablesDCAs 30 fans 30 I/O subsystem 33 IBF 30 power cables, I/O subsystem 33 power control network utilities menu 785 power MAP 145, 192 power source 133 power subsystem 29 power supply fan removal 987 power supply removal 987 power-onmethods 802 power-on self-test 12 powering off 838 powering on 838, 841 powering on and off 838 powering on with service processor 841 preface xiii preventive maintenance 142 air filters 142 primary console 1067 primary I/O book 854 privileged user menus 772 problem determination MAP 158 problem determination-generated error codes 757 problem isolation MAP 145 problems, boot 409 procedurescable installation and management 1003 procedures, removal and replacement 834 procedures, service focal point 145 processorconfigure 812 deconfigure 812 fans 912 processor configuration/deconfiguration menu 782 processor fan assembly 912 processor fan controller 911 processor fans 912 processor subsystem chassis 908 product disposal 1049 Qquick entry MAP 145, 151 Rracksecondary 37, 38, 39, 43, 48, 49, 52, 53 reboot/restart policy setup menu 779 recoveryprocedures 1065 strategy 1067 recycling 1049 reference information 1 reference tablesAIX location code 68 physical location code 68 related publications xiv removal and replacement introduction 837 removal and replacement procedures 834 air filters 912 battery 859 capacitor books and capacitor blanks 862 CD-ROM drive 998 DCA books and DCA blanks 861 diskette drive 999 1094
406/90
Service
Guide
removal and replacement procedures (continued)DVD-RAM drive 998 I/O blanks 854 I/O books 854 I/O subsystem backplane assembly 989 I/O subsystem chassis 995 I/O subsystem DASD 4-pack 993 I/O subsystem DASD assembly 991 I/O subsystem I/O bus adapter 994 I/O subsystem RIO riser card 986 media subsystem cables 1002 memory books and memory blanks 910 operator panel 1000 power supply 987 power supply fan 987 primary I/O book 854 processor subsystem chassis 908 removal procedures 863 L3 cache modules 882 mcm filler covers 863 mcm modules 863 pass through modules 863 replacement procedures 870 I/O book 856 L3 cache modules 885 mcm modules 870 memory modules 911 replacingFRUs 9 resetting the system attention LED 22 HMC 22 restart policyOS defined 779 supplemental 779 restart recovery 803 RETAIN 801 retries 800 ring indicate power-on menu 778 RIOcabling 40 rules 40 RIO-2cabling 40 rules 40 room EPO bypass interlock 16, 850 running online diagnostics with no attached HMC 762 running standalone diagnostics with no attached HMC 765 SS1 port testingnative serial port 148 safetyelectrical vii safety notices vii laser compliance statement ix safety statement, laser 998 sample filesmodemconfigurations 1061 scan dump messages 736 scan log dump policy 776 SCSI IDs 20, 26 self-test, power-on 12 serial portselection menu 797 snoop setup menu 776 speed setup menu 798 service focal pointactivating and deactivating FRU LEDs 9 adding FRUs 9 call home overview 5 closing a serviceable event 8 enabling surveillance notifications 6 overview 4 procedures 145 replacing FRUs 9 saving and managing extended error data 8 serviceable events 7 setting up surveillance 6 settings 5 testing error reporting 5 updating FRU information 8 viewing and adding serviceable event comments 8 viewing error details 8 viewing service processor error details 8 viewing serviceable event details 7 viewing serviceable event partition information 9 viewing serviceable events 7 service inspection guide 142 service processor 769 accesslocal 769 remote 770 boot mode menu 780 call-intest 1057 call-in/call-out setup menu 795 call-outtest 1057 call-out call-home 815 call-out policy setup menu 800 checklist 1057 checkpoints 372 console mirroring 816 customer account setup menu 801 error logs 817 general user menu 771 language selection menu 794 main menu 772 memory configuration/deconfiguration menu 783, 784 menus 769 modem configuration menu 796 monitoring - surveillance 813 operational phases 820 OS surveillance setup menu 775 parametersservice mode 801 passwords 774 power control network utilities menu 785 Index
1095
service processor (continued)privileged user menus 772 procedures in service mode 801 processor configuration/deconfiguration menu 782 reboot/restart policy setup menu 779 reboot/restart recovery 803 ring indicate power-on menu 778 serial portconfiguration 1059 serial port selection menu 797 serial port snoop setup menu 776 serial port speed setup menu 798 settingssaving and restoring 770 setup 1057 setup checklist 1057 setup menu 773 system information menu 781 system power control menu 778 telephone setup menu 798 test 1057 service processor checkpoints 372 service processor error codes 516 serviceable eventviewing partition information 9 serviceable eventsadding comments 8 selecting and viewing 7 updating FRU information 8 viewing comments 8 viewing details 7 viewing error details 8 viewing service processor error details 8 working with 7 setup and testservice processor 1057 setup menu 773 SPCN error codes 420 spcn informational codes 755 specifc filesmodemconfiguration 1061 specifications 129 distribution, weight 135 environmental 135 height 133 noise emissions 135 processor subsystem 129 weight 133, 134 SRNs for Linux 1035 standalone diagnostics 764, 765 NIM server 766, 767 partition considerations 764 supplemental restart policy 779 surveillancemonitoring 813 switch cableinstallation and management procedures 1003 systemattention LED 21 HMC attached system error access 21 system (continued)non-HMC attached system error access 22 resetting attention LED 22 system array self-repair problem resolution , MAP 1235 168 system firmware update messages 736 system firmware updates 805 system information menu 781 system introduction 1 system management services 823 boot devices, select 831 boot options, select 829 error log, view 825 exiting SMS 834 IPL, remote setup 826 IPL, setup 826 multiboot startup 834 password, change options 825 password, privileged-access 825 SCSI settings, change 828 select console 828 select language 824 settings, display current 833 settings, restore default 834 system power control menu 778 system power-on methods 802 system specifications 129 system, powering 841 Ttelephone setup menu 798 terminal emulators 1065 terminal typeidentifying 760 testing the setupcall-in 1058 call-out 1058 trademarks xiv transactions, CoD 752 transfer of a modem session 1066 UUEPO card assembly 850 UEPO locations 16 UEPO removal and replacement 850 unit emergency power off card assembly 850 updatingframe firmware 809 FRU information 8 Vviewingserviceable event comments 8 serviceable event details 7 serviceable event error details 8 serviceable event partition information 9 VPD informational codes 756 1096
406/90
Service
Guide
Wweb sitesfirmware updates 806 microcode updates 806
Index
1097
1098
406/90
Service
Guide
Vos remarques sur ce document / Technical publication remark form
Titre / Title : Bull ESCALA PL 3200R Service Guide
Nº Reférence / Reference Nº : 86 A1 82EF 03 Daté / Dated : March 2004
ERREURS DETECTEES / ERRORS IN PUBLICATION
AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION
Vos remarques et suggestions seront examinées attentivement.
Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète.
Your comments will be promptly investigated by qualified technical personnel and action will be taken as required.
If you require a written reply, please furnish your complete mailing address below.
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
Remettez cet imprimé à un responsable BULL ou envoyez-le directement à :
Please give this technical publication remark form to your BULL representative or mail to:
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Technical Publications Ordering Form
Bon de Commande de Documents Techniques
To order additional publications, please fill up a copy of this form and send it via mail to:
Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à :
BULL CEDOC
ATTN / Mr. L. CHERUBIN
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Phone / Téléphone : +33 (0) 2 41 73 63 96
FAX / Télécopie +33 (0) 2 41 73 60 19
E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr
Or visit our web sites at: / Ou visitez nos sites web à:
http://www.logistics.bull.net/cedoc
http://www–frec.bull.com http://www.bull.com
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
[ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
PHONE / TELEPHONE : FAX :
E–MAIL :
For Bull Subsidiaries / Pour les Filiales Bull :
Identification:
For Bull Affiliated Customers / Pour les Clients Affiliés Bull :
Customer Code / Code Client :
For Bull Internal Customers / Pour les Clients Internes Bull :
Budgetary Section / Section Budgétaire :
For Others / Pour les Autres :
Please ask your Bull representative. / Merci de demander à votre contact Bull.
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 82EF 03
ORDER REFERENCE
Utiliser les marques de découpe pour obtenir les étiquettes.
Use the cut marks to get the labels.
86 A1 82EF 03
Service Guide
ESCALA PL
3200R
86 A1 82EF 03
Service Guide
ESCALA PL
3200R
86 A1 82EF 03
Activation SAS (Serial-Attached SCSI)
RAID pour les modèles E1-700, E1-705,
E3-700 ou E3-705
REFERENCE
86 F1 89FF 03
ESCALA Power7
Activation SAS (Serial-Attached SCSI) RAID pour les
modèles E1-700, E1-705, E3-700 ou E3-705
Les publications pour ESCALA Power7 concernent les modèles suivants :
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
Les références au modèle Power 755 / 8236-E8C sont sans objet.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 F1 89FF 03
L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y
limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés.
Copyright Bull SAS 2011
Imprimé en France
Marques déposées
Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs.
Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des
brevets.
La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques
et/ou des brevets.
Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas
être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage
p ouvant résulter de son application.
Table des matières
Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Activation SAS RAID et bloc de batteries de cache pour le modèle 31E/2B,
31E/1C ou 31E/2C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Activation SA RAID - Nouveautés. . . . . . . . . . . . . . . . . . . . . . . . . 3
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA (FC 5267 et FC
EJ0D). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installation du fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH (FC
5263 et FC EJ0E). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175
Mo/Adaptateur d'E-S double/Port SAS externe (FC 5268 et FC EJ0F) . . . . . . . . . 13
Retrait et remise en place des adaptateurs et batteries SAS RAID . . . . . . . . . . 23
Retrait et remplacement du fond de panier d'unités de disque . . . . . . . . . . . . . . . . . . 23
Retrait du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C. . . . . . . . . 23
Remplacement du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C . . . . . . 29
Retrait et remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . 34
Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . . . 34
Remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . 45
Considérations relatives à la maintenant du sous-système SAS 31E/2B . . . . . . . . . . . . . . . 60
Considérations de maintenance pour le bloc de batterie de cache . . . . . . . . . . . . . . . . 61
Retrait de la carte d'activation RAID/cache31E/2B . . . . . . . . . . . . . . . . . . . . . . 63
Remplacement de la carte d'activation RAID/cache31E/2B . . . . . . . . . . . . . . . . . . . 64
Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C. . . . . . . . . 66
Remplacement du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C . . . . . . 70
Retrait et installation de la carte de batterie de cache . . . . . . . . . . . . . . . . . . . . . 73
Retrait et remplacement du bloc de batterie de cache . . . . . . . . . . . . . . . . . . . . . 80
Procédures communes aux dispositifs installables . . . . . . . . . . . . . . . . 83
Avant de commencer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Identification d'un composant défaillant. . . . . . . . . . . . . . . . . . . . . . . . . . 86
Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 88
Localisation d'un composant défaillant sur un système AIX ou une partition logique. . . . . . . . . 88
Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 88
Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 89
Identification d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . . 89
Localisation d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 89
Localisation d'un composant défaillant sur un système ou une partition logique Linux . . . . . . . . 89
Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 90
Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 90
Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server . . . . . 90
Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . . 91
Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 91
Démarrage d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . 91
Démarrage d'un système ou d'une partition logique via la console HMC. . . . . . . . . . . . . . 93
iii
Démarrage d'un système ou d'un serveur virtuel via la console SDMC . . . . . . . . . . . . . . 93
Arrêt d'un système ou d'une partition logique. . . . . . . . . . . . . . . . . . . . . . . . 93
Arrêt d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . . . 94
Arrêt d'un système via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . . 95
Arrêt d'un système via la console SDMC . . . . . . . . . . . . . . . . . . . . . . . . 96
Retrait et remplacement des capots sur le système 31E/2B, 31E/1C ou 31E/2C. . . . . . . . . . . . . 96
Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . . 96
Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . 97
Mise en position de maintenance ou de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C. . . . . . . 98
Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire . . . . . . . 98
Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire . . . . . . 99
Déconnexion des cordons d'alimentation du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . 101
Connexion des cordons d'alimentation au système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . 101
Installation d'un dispositif via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 102
Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . 103
Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . 103
Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 104
Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 107
Vérification d'un composant installé via les programmes de diagnostic autonome . . . . . . . . . . 107
Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console). . . . . 108
Activation/Désactivation des voyants via la console HMC . . . . . . . . . . . . . . . . . 109
Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 109
Activation ou désactivation d'un voyant d'identification . . . . . . . . . . . . . . . . . 110
Affichage des événements réparables à l'aide de la console HMC . . . . . . . . . . . . . . . 111
Vérification du composant installé via la console SDMC . . . . . . . . . . . . . . . . . . . 111
Activation/Désactivation de voyants via la console de gestion Systems Director . . . . . . . . . . 112
Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 112
Activation ou désactivation de voyants d'identification via la console SDMC . . . . . . . . . . 113
Visualisation d'événements réparables via la console de gestion Systems Director. . . . . . . . . . 113
Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à
l'aide des outils Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Installation d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . . 117
Remplacement d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . 117
Retrait d'un composant via la console SDMC. . . . . . . . . . . . . . . . . . . . . . . . 118
Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
iv Activation SAS (Serial-attached SCSI) RAID
Avis aux lecteurs canadiens
Le présent document a été traduit en France. Voici les principales différences et particularités dont vous
devez tenir compte.
Illustrations
Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la
France.
Terminologie
La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au
besoin.
IBM France IBM Canada
ingénieur commercial représentant
agence commerciale succursale
ingénieur technico-commercial informaticien
inspecteur technicien du matériel
Claviers
Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier
français-canadien, de type QWERTY.
OS/2 - Paramètres canadiens
Au Canada, on utilise :
v les pages de codes 850 (multilingue) et 863 (français-canadien),
v le code pays 002,
v le code clavier CF.
Nomenclature
Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit
du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau
pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre
clavier.
v
Recommandations à l'utilisateur
Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications
radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur
(instructions d'utilisation, manuels de référence et manuels d'entretien).
Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors
tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou
plusieurs des mesures suivantes :
v Réorienter l'antenne réceptrice ;
v Déplacer l'équipement par rapport au récepteur ;
v Éloigner l'équipement du récepteur ;
v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent
sur des circuits distincts ;
v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien
serrés ;
v Vérifier la mise en place des obturateurs sur les connecteurs libres.
Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des
câbles blindés mis à la terre, à travers des filtres si nécessaire.
En cas de besoin, adressez-vous à votre détaillant.
Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des
modifications non autorisées ont été effectuées sur l'équipement.
vi Activation SAS (Serial-attached SCSI) RAID
L'obligation de corriger de telles interférences incombe à l'utilisateur.
Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus
amples renseignements.
Brevets
Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur
certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie
pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos
demandes de renseignements relatives aux permis d'utilisation au directeur général des relations
commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7.
Assistance téléphonique
Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des
publications IBM, contactez IBM direct au 1 800 465-1234.
Avis aux lecteurs canadiens vii
viii Activation SAS (Serial-attached SCSI) RAID
Consignes de sécurité
Les consignes de sécurité peuvent être imprimées tout au long de ce guide.
v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles.
v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de
certaines circonstances réunies.
v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une
unité, un système ou des données.
Consignes de sécurité relatives au commerce international
Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du
produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est
inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité
dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication
en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous
familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce
livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez
claires pour vous.
Informations sur les appareils à laser
Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des
lasers ou des diodes électroluminescentes (LED).
Conformité aux normes relatives aux appareils à laser
Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique.
ix
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les cordons d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les cordons d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
x Activation SAS (Serial-attached SCSI) RAID
DANGER
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Consignes de sécurité xi
ATTENTION :
Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du
déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment,
procédez comme suit :
v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si
possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette
configuration, procédez comme suit :
– Retirez toutes les unités de la position 32U et plus.
– Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire.
– Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en
dessous du niveau 32U.
v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier.
v Vérifiez l'itinéraire envisagé pour éliminer tout risque.
v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus
d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre
armoire.
v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm.
v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles
sont bien fixés.
v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute.
v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement.
v N'utilisez pas de rampe inclinée à plus de dix degrés.
v Dès que l'armoire est à son nouvel emplacement, procédez comme suit :
– Abaissez les quatre vérins de mise à niveau.
– Installez des équerres de stabilisation sur l'armoire.
– Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie
inférieure à la partie supérieure de l'armoire.
v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez
l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les
vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez
l'armoire à celle-ci.
(R002)
(L001)
(L002)
xii Activation SAS (Serial-attached SCSI) RAID
(L003)
ou
Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le
sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont
certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes
sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité.
ATTENTION :
Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM,
DVD-RAM ou module à laser. Notez les informations suivantes :
v Ne retirez pas les capots. En ouvrant le produit à laser, vous vous exposez au rayonnement
dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur.
v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation
des commandes, ainsi que les procédures décrites dans le présent manuel.
(C026)
Consignes de sécurité xiii
ATTENTION :
Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux
de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à
l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027)
ATTENTION :
Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028)
ATTENTION :
Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez
connaissance des informations suivantes. Rayonnement laser lorsque le capot est ouvert. Evitez toute
exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide
d'instruments optiques. (C030)
ATTENTION :
Cette pile contient du lithium. Pour éviter tout risque d'explosion, n'essayez pas de la recharger et ne
la faites pas brûler.
Ne pas :
v ___ la jeter à l'eau
v ___ l'exposer à une température supérieure à 100 °C
v ___ chercher à la réparer ou à la démonter
Ne la remplacez que par une pile agréée. Pour le recyclage ou la mise au rebut, reportez-vous à la
réglementation en vigueur. (C003a)
Informations sur l'alimentation électrique et sur le câblage relatives au document
GR-1089-CORE du NEBS (Network Equipment-Building System)
Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document
GR-1089-CORE du NEBS (Network Equipment-Building System) :
Cet équipement peut être installé :
v dans des infrastructures de télécommunications réseau
v aux endroits préconisés dans les directives NEC (National Electrical Code).
Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles
internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux
interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être
exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document
GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de
dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie
métallique au câblage du réseau extérieur.
Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités.
Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif
externe de protection contre les surtensions (SPD).
Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne
de retour de la batterie en courant continu ne doit pas être connectée à la masse.
xiv Activation SAS (Serial-attached SCSI) RAID
Activation SAS RAID et bloc de batteries de cache pour le
modèle 31E/2B, 31E/1C ou 31E/2C
Vous pouvez être amené à configurer des disques RAID (Redundant Array of Independent Disks) sur les
unités de disque SAS (serial-attached SCSI) du boîtier système. Suivez les instructions de cette rubrique
pour installer sur le système les dispositifs d'activation RAID.
Les instructions suivantes permettent d'installer des cartes d'activation SAS RAID afin de préparer le
système à l'utilisation d'unités de disque SAS dans une configuration RAID.
1
2 Activation SAS (Serial-attached SCSI) RAID
Activation SA RAID - Nouveautés
Prenez connaissance des nouveautés et des modifications significatives apportées à "Activation SAS
RAID" depuis la dernière mise à jour de cet ensemble de rubriques.
Octobre 2011
v Ajout d'informations relatives aux systèmes 31E/1C et 31E/2C.
3
4 Activation SAS (Serial-attached SCSI) RAID
Installation du fond de panier de stockage - 6 disques
SFF/DVD SATA (FC 5267 et FC EJ0D)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 6 disques SFF/DVD SATA. Cette procédure s'applique à l'installation de nouveaux dispositifs.
Avant de commencer
Remarque :
FC 5267 s'applique à 31E/2B et FC EJ0D s'applique à 31E/1C ou 31E/2C.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC
(Hardware Management Console), utilisez la console HMC pour installer un dispositif sur le serveur.
Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 102.
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la
console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un
composant via la console SDMC», à la page 117.
Procédure
Effectuez les étapes suivantes pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 6 disques SFF/DVD SATA.
1. Effectuez les tâches prérequises telles qu'elles sont décrites dans «Avant de commencer», à la page
83.
Important : Ne coupez pas d'alimentation du système.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B,
31E/1C ou 31E/2C», à la page 96.
5
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Installez l'interposeur système. Pour des instructions, voir Remplacement hors tension de
l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C.
9. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond
de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C.
10. Installez les unités de disque.
v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un
système 31E/2B, 31E/1C ou 31E/2C.
v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD.
6 Activation SAS (Serial-attached SCSI) RAID
11. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage
extra-plate sur le modèle 31E/2B, 31E/1C, ou 31E/2C.
12. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
14. Rebranchez la source d'alimentation.
15. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la
page 91.
16. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA (FC 5267 et FC EJ0D) 7
8 Activation SAS (Serial-attached SCSI) RAID
Installation du fond de panier de stockage - 3 disques
SFF/DVD SATA/Bande HH (FC 5263 et FC EJ0E)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 3 disques SFF/DVD SATA/Bande HH. Cette procédure s'applique à l'installation de nouveaux
dispositifs.
Avant de commencer
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système.
Si votre système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la
console HMC», à la page 102.
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la
console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un
composant via la console SDMC», à la page 117.
Procédure
1. Effectuez les tâches prérequises telles qu'elles sont décrites dans «Avant de commencer», à la page
83.
Important : Ne coupez pas d'alimentation du système.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B,
31E/1C ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
9
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Installez l'interposeur système. Pour des instructions, voir Remplacement de l'interposeur de
stockage 31E/2B, 31E/1C ou 31E/2C avec mise hors tension.
9. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remise en place du fond
de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C.
10. Installez les unités de disque.
v Pour les unités de disque internes, voir Remise en place de l'unité de disque interne dans un
système 31E/2B, 31E/1C ou 31E/2C .
v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD.
11. Installez l'unité de bande SAS. Pour des instructions, voir Installation d'une unité de bande SAS dans
le système 31E/2B, 31E/1C ou 31E/2C .
12. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
10 Activation SAS (Serial-attached SCSI) RAID
13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
14. Rebranchez la source d'alimentation.
15. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la
page 91.
16. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Installation du fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH (FC 5263 et FC EJ0E) 11
12 Activation SAS (Serial-attached SCSI) RAID
Installation du fond de panier de stockage - 6 disques
SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port
SAS externe (FC 5268 et FC EJ0F)
Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de
stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe. Cette
procédure s'applique à l'installation de nouveaux dispositifs.
Avant de commencer
Remarque :
FC 5267 s'applique à 31E/2B et FC EJ0F s'applique à 31E/1C ou 31E/2C. FC EJ0F ne dispose pas de la
carte d'activation RAID.
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 87.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Avertissement : Si vous effectuez la maintenance d'un composant défaillant, voir les procédures de
maintenance suivantes :
v Pour la carte d'activation RAID, voir Retrait de la carte d'activation 31E/2B RAID/cache et Remise en
place de la carte d'activation 31E/2B RAID/cache.
v Pour le contrôleur de stockage RAID/cache, voir Retrait du contrôleur de stockage 31E/2B, 31E/1C ou
31E/2C RAID/cache et Remise en place du contrôleur de stockage 31E/2B, 31E/1C ou 31E/2C
RAID/cache.
Pourquoi et quand exécuter cette tâche
Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en
charge ce dispositif est installé sur votre système.
Si votre système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la
console HMC», à la page 102.
13
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la
console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un
composant via la console SDMC», à la page 117.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83.
Important : Ne coupez pas d'alimentation du système.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B,
31E/1C ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation a été débranchée du système.
(L003)
ou
7. Attachez le bracelet antistatique.
14 Activation SAS (Serial-attached SCSI) RAID
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Pour 31E/2B, localisez la carte d'activation RAID (P1-C12), le contrôleur de stockage RAID/cache
(P1-C18) et la carte de batterie de cache (P1-C13).
9. Pour 31E/1C ou 31E/2C, localisez le contrôleur de stockage RAID/cache (P1-C18) et la carte de
batterie de cache (P1-C13).
Figure 1. Emplacement d'installation de la carte d'activation RAID, du contrôleur de stockage et de la carte de batterie
de cache pour 31E/2B
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS
externe 15
10. Pour 31E/2B, installez la carte d'activation RAID en procédant comme suit :
Remarque : Pour 31E/1C ou 31E/2C, passez à l'étape 11, à la page 17
a. Alignez la carte d'activation RAID (A) sur l'emplacement dans le système. La carte d'activation
RAID comporte une plaque métallique, comme illustré ci-après.
b. Faites glisser la carte d'activation RAID dans le système jusqu'au verrouillage du loquet, comme
indiqué dans l'illustration suivante.
Figure 2. Emplacement d'installation du contrôleur de stockage et de la carte de batterie de cache pour 31E/1C ou
31E/2C
16 Activation SAS (Serial-attached SCSI) RAID
11. Installez le contrôleur de stockage RAID/cache en procédant comme suit :
a. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système.
b. Faites glisser le contrôleur de stockage RAID/cache ou cache dans le système jusqu'au
verrouillage du loquet comme indiqué dans l'illustration suivante.
Figure 3. Modèle monté en armoire : remplacement de la carte d'activation RAID
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS
externe 17
12. Installez la carte de batterie de cache en procédant comme suit :
Figure 4. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID/cache pour 31E/2B
Figure 5. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID/cache pour 31E/1C ou 31E/2C
18 Activation SAS (Serial-attached SCSI) RAID
Avertissement : Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes
avant de connecter la nouvelle carte de batterie.
a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce
qu'elle s'insère dans le connecteur de batterie.
b. Une fois que la batterie est correctement positionnée dans le connecteur, ramenez la languette en
position verrouillée pour immobiliser la batterie sur le connecteur.
Figure 6. Remise en place de la carte de batterie de cache pour 31E/2B
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS
externe 19
13. Remplacez l'interposeur système par l'interposeur système à haute fonctionnalité. Pour des
instructions, voir Remplacement hors tension de l'interposeur de stockage 31E/2B, 31E/1C ou
31E/2C.
14. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond
de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C.
15. Installez les unités de disque.
v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un
système 31E/2B, 31E/1C ou 31E/2C.
v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD.
16. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage
extra-plate sur le modèle 31E/2B, 31E/1C, ou 31E/2C.
17. Installez le câble de connexion d'E-S.
Figure 7. Remise en place de la carte de batterie de cache pour 31E/1C ou 31E/2C
20 Activation SAS (Serial-attached SCSI) RAID
18. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
19. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
20. Rebranchez la source d'alimentation.
Figure 8. Câble de connexion d'E-S SAS pour 31E/2B
Figure 9. Câble de connexion d'E-S SAS pour 31E/1C ou 31E/2C
Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS
externe 21
21. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91).
22. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
22 Activation SAS (Serial-attached SCSI) RAID
Retrait et remise en place des adaptateurs et batteries SAS
RAID
Il se peut que vous deviez retirer ou remettre en place les adaptateurs SAS RAID et les batteries. La
présente section comprend les procédures vous permettant d'effectuer ces tâches.
Remarque : Le remplacement du dispositif incombe à l'utilisateur. Vous pouvez soit exécuter la tâche
vous-même, soit faire appel à un prestataire agréé. Il vous imputera probablement les frais
correspondants.
Retrait et remplacement du fond de panier d'unités de disque
Utilisez cette procédure pour retirer, remplacer ou installer le fond de panier d'unités de disques sur les
serveurs 7/10 et 7/30 (31E/2B, 31E/1C et 31E/2C) et
Les figures suivantes représentent le fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C
ou 31E/2C.
Retrait du fond de panier d'unités de disques sur les modèles 31E/2B,
31E/1C, 31E/2C
Cette procédure permet de retirer le fond de panier d'unités de disque du serveur.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour retirer le fond de panier d'unité de disque du serveur. Pour obtenir des informations sur l'utilisation
de la console HMC en vue de retirer le fond de panier d'unité de disque, voir Retrait d'un composant via
la console HMC.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour retirer le fond de panier d'unité de disque du serveur. Pour des instructions, voir Retrait
d'un composant via la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de
panier d'unité de disque :
Figure 10. Fond de panier d'unités de disques
Figure 11. Fond de panier d'unités de disques avec bande magnétique
23
Avertissement : Veillez à retirer le panneau de commande avant le fond de panier d'unité de disque pour
ne pas endommager les câbles du panneau de commande.
Procédure
1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de
commande.
2. Effectuez les tâches prérequises telles qu'elles sont décrites dans Avant de commencer.
3. Effectuez l'une des opérations suivantes :
a. Si vous retirez le fond de panier d'unité de disques en vue d'une mise à niveau système ou dans
le cadre d'une autre procédure, passez à l'étape 4.
b. Si vous retirez le fond de panier d'unités de disques en raison d'une défaillance du système,
consultez le journal de maintenance pour identifier le composant défaillant. Voir Identification
d'un composant défaillant.
4. Arrêtez le système comme indiqué dans Arrêt d'un système ou d'une partition logique.
5. Mettez le système en position de maintenance, comme indiqué dans Mise en position de
maintenance d'un modèle monté en armoire.
6. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans Déconnexion des
cordons d'alimentation.
Remarque : Si le système est équipé d'un second bloc d'alimentation, vérifiez que la source
d'alimentation a été débranchée du système.
(L003)
ou
7. Retirez le capot d'accès, comme indiqué dans Retrait du capot d'accès d'un modèle monté en
armoire.
24 Activation SAS (Serial-attached SCSI) RAID
8. Aidez-vous des voyants de maintenance pour identifier le composant, comme indiqué dans
Identification d'un composant.
9. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
10. Si vous devez remplacer le fond de panier en vue d'une mise à niveau ou d'un incident système,
procédez comme suit afin de retirer les unités de disques, en fonction de votre fond de panier :
Remarque : Si vous retirez un fond de panier comportant des unités de disques, le fond de panier
est lourd. Assurez-vous de pouvoir effectuer cette procédure en toute sécurité.
a. Pour retirer l'unité de disque du fond de panier, procédez comme suit:
Remarque : Notez l'emplacement de chaque unité de disque retirée, de manière à remettre en
place les unités de disque à leur emplacement d'origine.
1) Déverrouillez la poignée de l'unité de disques (A) et tirez-la vers vous pour la dégager.
2) Placez une main sous l'unité de disques et tirez avec précaution l'unité de disques du
système, comme indiqué dans la figure 12.
Figure 12. Retrait de l'unité de disque du fond de panier
Retrait et remise en place des adaptateurs et batteries SAS RAID 25
11. Retirez l'unité de stockage en appuyant sur le taquet (A) et faites-la glisser hors du système. Voir la
figure 2 ou 3 en fonction de la configuration mécanique de votre système.
Pour plus d'informations sur le retrait des différents types d'unités de stockage, voir Retrait d'une
unité de stockage SAS sur les modèles 31E/2B, 31E/1C ou 31E/2C.
Remarque : L'unité est maintenue en place par une pièce d'attache par friction. Il n'y a aucune vis
ou rivet qui la maintient en place. Veillez à retirer l'unité avec précaution afin de ne pas faire tomber
la pièce.
Figure 13. Retrait de l'unité de stockage extra-plate du fond de panier d'unités de disques
26 Activation SAS (Serial-attached SCSI) RAID
12. Retirez le panneau de commande et le câble d'interface du fond de panier. Pour plus d'informations,
voir Panneau de commande et câbles d'interface.
13. Déverrouillez le fond de panier de l'unité de disques en tirant les poignées (A) afin de les mettre en
position ouverte, au centre du fond de panier, comme indiqué dans la figure 15, à la page 28 et
figure 16, à la page 28.
Figure 14. Retrait de l'unité de stockage SAS du fond de panier d'unités de disques
Retrait et remise en place des adaptateurs et batteries SAS RAID 27
14. Extrayez le fond de panier du serveur.
15. Si vous avez retiré le fond de panier dans le cadre d'une autre procédure, reprenez cette procédure.
Pour remettre en place le fond de panier, voir Remplacement du fond de panier d'unités de disques
sur les modèles 31E/2B, 31E/1C, 31E/2C.
Figure 15. Retrait du fond de panier comportant des unités de disque
Figure 16. Retrait du fond de panier ne comportant pas d'unités de disque
28 Activation SAS (Serial-attached SCSI) RAID
Remplacement du fond de panier d'unités de disques sur les modèles
31E/2B, 31E/1C, 31E/2C
Utilisez cette procédure pour remplacer le fond de panier d'unités de disque.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour
plus d'informations, voir la rubrique Remplacement d'un composant via la console HMC.
Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console
SDMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur.
Pour obtenir des instructions, voir la rubrique Remplacement d'un composant via la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de
panier d'unité de disque :
Procédure
1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de
commande.
2. Effectuez les tâches prérequises telles qu'elles sont décrites dans Avant de commencer.
3. Arrêtez le système comme indiqué dans Arrêt d'un système ou d'une partition logique.
4. Mettez le système en position de maintenance, comme indiqué dans Mise en position de
maintenance d'un modèle monté en armoire.
5. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans Déconnexion des
cordons d'alimentation.
Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette
procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
Retrait et remise en place des adaptateurs et batteries SAS RAID 29
6. Retirez le capot d'accès, comme indiqué dans Retrait du capot d'accès d'un modèle monté en
armoire.
7. Aidez-vous des voyants de maintenance pour identifier le composant, comme indiqué dans
Identification d'un composant.
8. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
9. Installez le panneau de commande et le cordon d'interface. Pour plus d'informations, voir Panneau
de commande et câbles d'interface.
Remarque : Vérifiez que le câble d'interface n'est pas connecté lors de l'installation du panneau de
commande.
10. Vérifiez la présence éventuelle d'un capot de protection sur les broches sur la partie inférieure droite.
Si un capot de protection est installé, retirez-le.
11. Soulevez délicatement le fond de panier sur les deux côtés et alignez-le avec son emplacement dans
le serveur.
12. Insérez le fond de panier fermement dans le serveur.
13. Fixez le fond de panier à l'aide des taquets de verrouillage (A), comme indiqué dans les figures
suivantes.
30 Activation SAS (Serial-attached SCSI) RAID
14. Réinstallez les unités de disques si vous les avez préalablement retirées. Utilisez les informations
d'emplacement que vous avez enregistrées lors du retrait des unités pour les installer dans les
emplacements appropriés. Pour remplacer les unités de disque, procédez comme suit :
Figure 17. Installation d'un fond de panier sans unités de disque dans un modèle monté en armoire
Figure 18. Installation d'un fond de panier doté d'unités de disque dans un modèle monté en armoire
Retrait et remise en place des adaptateurs et batteries SAS RAID 31
a. Tenez l'unité de disque par les bords hauts et bas pour insérer le lecteur dans l'unité d'extension.
Ne la tenez pas par la poignée.
b. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis
verrouillez l'unité dans son emplacement en poussant la poignée (A), comme indiqué dans la
figure suivante.
15. Replacez l'unité de stockage Slimline en procédant comme suit :
a. Alignez la languette (a) sur le côté de l'unité de stockage.
b. Alignez les petits taquets dans les emplacements de l'unité de stockage (B) et insérez l'unité et le
taquet en même temps. Voir la figure 4 ou 5 en fonction de la configuration mécanique de votre
système.
Pour plus d'informations sur le remplacement des différents types d'unités de stockage, voir Retrait
d'une unité de stockage SAS sur les modèles 31E/2B, 31E/1C ou 31E/2C.
Figure 19. Installation des unités de disque du fond de panier
32 Activation SAS (Serial-attached SCSI) RAID
16. Remplacez le capot d'accès, comme décrit dans Installation du capot d'accès sur un système 31E/2B,
31E/1C, 31E/2C monté en armoire.
Figure 20. Installation de l'unité de stockage SAS
Figure 21. Installation de l'unité de stockage extra-plate
Retrait et remise en place des adaptateurs et batteries SAS RAID 33
17. Mettez le système en position de fonctionnement, comme indiqué dans Mise en position de
fonctionnement d'un système 31E/2B, 31E/1C, 31E/2C monté en armoire.
18. Connectez les cordons d'alimentation au système, comme décrit dans Connexion des cordons
d'alimentation.
19. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique.
20. Vérifiez le composant installé.
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Retrait et remplacement du fond de panier système 31E/2B, 31E/1C ou
31E/2C
Découvrez comment retirer ou remplacer le fond de panier système.
Pourquoi et quand exécuter cette tâche
Les procédures suivantes expliquent comment retirer et remplacer le fond de panier système.
Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C
Prenez connaissance des instructions de retrait du fond de panier système d'un serveur en vue du
remplacement d'un composant défaillant ou dans le cadre d'une autre procédure de maintenance.
Avant de commencer
Remarques :
v Avant de commencer cette procédure, vérifiez que vous disposez des droits de connexion en tant que
fournisseur de services agréé. Les différents niveaux d'autorisation sont décrits dans Niveaux
d'autorisation ASMI.
v Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour de plus amples informations, voir Voyant du panneau de
commande.
Pourquoi et quand exécuter cette tâche
Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer le
fond de panier système. Pour obtenir des instructions, voir Retrait d'un composant à l'aide de la console
SDMC.
Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour
retirer le fond de panier système. Pour des instructions, voir Retrait d'un composant à l'aide de la console
SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de
panier système :
Procédure
1. Accédez à l'interface ASMI.
L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service
requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la
lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du
produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface
34 Activation SAS (Serial-attached SCSI) RAID
ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à
l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI,
passez à l'étape suivante.
2. Vérifiez et enregistrez le type de boîtier et le code dispositif.
Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre
placée sur l'unité centrale.
Remarque : Le code dispositif du fond de panier système 31E/2B, 31E/1C ou 31E/2C est 78AB-001.
3. Vérifiez et notez les paramètres du serveur.
Avant de remplacer cette unité remplaçable sur site (FRU), vérifiez et enregistrez tous les paramètres
du serveur étant donné que vous devrez peut-être mettre à jour le système après avoir remplacé
l'unité FRU. Cela peut être effectué via l'interface ASMI (Advanced System Management Interface).
a. Recherchez et notez le niveau de microprogramme du serveur indiqué sur le bord droit de la
fenêtre d'état, la zone dans laquelle se trouve le bouton Déconnexion. Vous devrez peut-être
mettre à jour le système avec le niveau de code de microprogramme de serveur le plus récent
après avoir remplacé l'unité FRU.
b. Vérifiez et notez les paramètres du processeur de service que vous avez précédemment définis à
l'aide de l'interface ASMI. Vous devrez réinitialiser les paramètres du processeur de service après
la mise à jour de cette unité FRU, faute de quoi les paramètres par défaut seront utilisés. Vous
devez noter les paramètres suivants :
v Paramètre de nom de système
v Paramètres de contrôle de l'alimentation système
v Paramètres de l'aide à la maintenance système
v Paramètres de configuration système
v Paramètres des services réseau
v Paramètres des options de performances
v Paramètres de profil de connexion
v Paramètres de liste des unités d'amorçage
v Paramètres de carte Ethernet hôte
c. Notez tous les paramètres du processeur de service que vous avez éventuellement définis à l'aide
des commandes du système d'exploitation.
4. Arrêtez les partitions et mettez le système hors tension.
L'entretien de cette unité remplaçable sur site ne peut pas s'effectuer en simultané. Chaque partition
doit être arrêtée et le système mis hors tension pour poursuivre la réparation. Le client doit donner
l'autorisation d'arrêter toutes les partitions en cours d'exécution et de mettre le système hors tension.
Si le client n'autorise pas l'arrêt des partitions ou la mise hors tension du système, la procédure de
maintenance sera retardée. Si le client utilise une secondaire pour gérer le système, assurez-vous qu'il
ne met pas le système sous tension à ce moment là. Pour obtenir des instructions, voir Arrêt du
système ou de la partition logique.
5. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système est équipé en option d'un second bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
Retrait et remise en place des adaptateurs et batteries SAS RAID 35
ou
6. Débranchez les câbles externes du système. Etiquetez et débranchez les câbles externes connectés au
système dont vous effectuez la maintenance.
7. Placez le système en position de maintenance, comme indiqué dans la rubrique Mise en position de
maintenance d'un système 31E/2B, 31E/1C ou 31E/2C monté en armoire.
8. Retirez le capot d'accès comme indiqué dans la rubrique Retrait du capot d'accès d'un système
31E/2B, 31E/1C ou 31E/2C monté en armoire.
9. Retirez les modules de mémoire. Pour obtenir des instructions, voir Retrait du module de mémoire
31E/2B, 31E/1C ou 31E/2C.
10. Retirez les adaptateurs GX. Pour obtenir des instructions, voir Retrait des cartes GX sur 31E/2B,
31E/1C ou 31E/2C.
11. Retirez les cartes PCI. Pour obtenir des instructions, voir Retrait hors tension d'une carte PCI sur le
serveur 31E/2B, 31E/1C ou 31E/2C.
12. Retirez la carte Ethernet hôte (31E/2B). Pour obtenir des instructions, voir Retrait hors tension de la
carte Ethernet hôte 31E/2B.
13. Retirez le régulateur de tension. Pour obtenir des instructions, voir Retrait du régulateur de tension
31E/2B, 31E/1C ou 31E/2C.
14. Retirez les ventilateurs. Pour obtenir des instructions, voir Retrait des ventilateurs 31E/2B, 31E/1C
ou 31E/2C.
15. Retirez les blocs d'alimentation. Pour obtenir des instructions, voir Retrait hors tension des blocs
d'alimentation 31E/2B, 31E/1C ou 31E/2C.
16. Débranchez tous les câbles internes du fond de panier système. Exemples de câbles à débrancher :
a. Débranchez le câble du panneau de commande (A).
36 Activation SAS (Serial-attached SCSI) RAID
b. Débranchez le câble SAS (A).
Figure 22. Débranchement du câble du panneau de commande
Figure 23. Débranchement du câble SAS
Retrait et remise en place des adaptateurs et batteries SAS RAID 37
c. Débranchez les câbles de l'unité de stockage (A).
d. Retirez le câble SPCN à l'arrière du châssis (31E/2C).
17. Retirez le boîtier de l'unité de disque :
a. Saisissez les loquets de l'unité de disque et tirez cette dernière vers le haut pour déverrouiller le
boîtier de l'unité de disque.
b. Saisissez les poignées, puis tirez le boîtier d'unité de disque vers vous.
Figure 24. Débranchement des câbles de l'unité de stockage
38 Activation SAS (Serial-attached SCSI) RAID
18. Retirez le régulateur de tension et le répartiteur d'air.
Figure 25. Retrait du boîtier d'unité de disque
Figure 26. Retrait du régulateur de tension et du répartiteur d'air (31E/2B)
Retrait et remise en place des adaptateurs et batteries SAS RAID 39
19. Retirez le logement de barrette DIMM :
a. Retirez les quatre vis qui maintiennent le boîtier de barrette DIMM (A).
b. Retirez le boîtier de barrette DIMM en le soulevant vers le haut.
Figure 27. Retrait du régulateur de tension et du répartiteur d'air (31E/1C ou 31E/2C)
40 Activation SAS (Serial-attached SCSI) RAID
20. Retirez le câble du ventilateur avant.
21. Retirez le boîtier de ventilation :
Figure 28. Retrait du boîtier de barrette DIMM
Figure 29. Retrait du câble du ventilateur avant
Retrait et remise en place des adaptateurs et batteries SAS RAID 41
a. Desserrez les deux vis (A) qui maintiennent le boîtier de ventilation.
b. Retirez la vis (B) qui maintient le boîtier de ventilation à l'extérieur du boîtier du système.
c. Retirez le boîtier de ventilation en le soulevant vers le haut.
22. Retirez le commutateur de verrouillage d'alimentation. Pour obtenir des instructions, voir Retrait
hors tension du commutateur de verrouillage d'alimentation 31E/2B, 31E/1C ou 31E/2C.
23. Retirez la carte de données techniques essentielles. Pour obtenir des instructions, voir Retrait de la
carte de données techniques essentielles 31E/2B, 31E/1C ou 31E/2C.
24. Retirez l'interposeur de stockage. Pour obtenir des instructions, voir Retrait hors tension de
l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C.
25. Retirez le second régulateur de tension, le cas échéant. Pour obtenir des instructions, voir Retrait du
régulateur de tension 31E/2B, 31E/1C ou 31E/2C.
26. Retirez le support métallique du second régulateur de tension :
a. Retirez les deux vis qui maintiennent le support au châssis.
b. Soulevez le support du châssis.
27. Retirez les modules du processeur système. Pour obtenir des instructions, voir Retrait du module
processeur système 31E/2B, 31E/1C ou 31E/2C.
28. Retirez le fond de panier système.
Remarque : Retirez le fond du panier système avec précaution. Ne le soulevez pas à l'aide des
modules fixés.
a. Retirez les vis (A).
Remarque : Ne retirez aucune des vis Torx. Retirez uniquement les cinq vis à tête hexagonale et
à fente.
Figure 30. Retrait du boîtier de ventilation
42 Activation SAS (Serial-attached SCSI) RAID
b. Saisissez le fond de panier système par les deux bras de chargement du dissipateur thermique
(B).
Figure 31. Retrait du fond de panier système (cinq vis pour 31E/2B)
Figure 32. Retrait du fond de panier système (quatre vis pour 31E/1C ou 31E/2C)
Retrait et remise en place des adaptateurs et batteries SAS RAID 43
c. Soulevez l'avant du fond de panier système pour dégager les chevilles de positionnement (6,35
mm environ).
d. Faites coulisser le fond de panier vers l'avant du système de façon à ce que ses connecteurs
laissent apparaître le châssis arrière.
e. Une fois les connecteurs dégagés du châssis, soulevez l'arrière du fond de panier système et
retirez ce dernier en l'inclinant.
Figure 33. Retrait du fond de panier système (31E/2B)
44 Activation SAS (Serial-attached SCSI) RAID
Remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C
Informations sur le remplacement du fond de panier système dans le cadre d'une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Avant d'effectuer cette procédure, vous devez avoir effectué le retrait. Pour connaître les instructions de la
procédure de retrait, voir Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C.
Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour
remplacer le fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un
composant à l'aide de la console HMC.
Si votre système est géré par la console Systems Director Management Console (SDMC), utilisez-la pour
remplacer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à
l'aide de la console SDMC.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de
panier système :
Figure 34. Retrait du fond de panier système (31E/1C ou 31E/2C)
Retrait et remise en place des adaptateurs et batteries SAS RAID 45
Avertissement :
Attachez un bracelet antistatique à une surface métallique non peinte de votre matériel afin
d'empêcher d'éventuelles décharges électrostatiques d'endommager celui-ci.
Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers
électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente
en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques.
Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer le produit de son emballage antistatique et
d'installer ou de remettre en place le matériel.
Procédure
1. Remplacez le fond de panier système.
Remarque : Remplacez le fond de panier système avec précaution.
a. Saisissez avec précaution le fond de panier système par les deux bras de chargement du
dissipateur thermique (A).
b. Abaissez le fond de panier système vers l'avant du boîtier du système.
c. Abaissez l'arrière du fond de panier système (1), puis faites-le glisser vers l'arrière du système
jusqu'à ce que les connecteurs laissent apparaître les ouvertures dans le boîtier.
d. Abaissez l'avant du fond de panier système (2) sur les broches d'alignement (b).
Figure 35. Remplacement du fond de panier système (31E/2B)
46 Activation SAS (Serial-attached SCSI) RAID
e. Serrez les vis (C) qui maintiennent le fond de panier système au boîtier du système.
Figure 36. Remplacement du fond de panier système (31E/1C ou 31E/2C)
Figure 37. Remplacement du fond de panier système (cinq vis pour 31E/2B)
Retrait et remise en place des adaptateurs et batteries SAS RAID 47
f. Rebranchez tous les câbles Ethernet qui ont été précédemment retirés.
2. Remettez en place les modules du processeur système. Pour obtenir des instructions, voir
Remplacement du module processeur système 31E/2B, 31E/1C ou 31E/2C.
3. Remplacez le support métallique du second régulateur de tension :
a. Alignez le support sur les broches du châssis
b. Replacez les deux vis pour maintenir le support au châssis.
4. Remplacez le second régulateur de tension, le cas échéant. Pour obtenir des instructions, voir
Remplacement du régulateur de tension 31E/2B, 31E/1C ou 31E/2C.
5. Remplacez le commutateur de verrouillage d'alimentation. Pour obtenir des instructions, voir
Remplacement hors tension du commutateur de verrouillage d'alimentation 31E/2B, 31E/1C ou
31E/2C.
6. Remplacez l'interposeur de stockage. Pour obtenir des instructions, voir Remplacement hors tension
de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C.
7. Remplacez la carte de données techniques essentielles. Pour obtenir des instructions, voir
Remplacement de la carte de données techniques essentielles 31E/2B, 31E/1C ou 31E/2C.
8. Remplacez le boîtier de ventilation :
a. Faites glisser le boîtier vers le bas afin que les trois encoches (A) et (C) s'adaptent sur les
goupilles de montage (B) et (D).
b. Serrez la vis (F) pour fixer au côté du châssis.
c. Serrez les deux vis captives (E) pour fixer au fond du châssis.
Figure 38. Remplacement du fond de panier système (quatre vis pour 31E/1C ou 31E/2C)
48 Activation SAS (Serial-attached SCSI) RAID
9. Connectez le câble du ventilateur (A) en l'enfonçant dans le connecteur correspondant.
10. Remplacez le logement de barrette DIMM :
Figure 39. Remplacement du boîtier de ventilation
Figure 40. Connexion du câble du ventilateur
Retrait et remise en place des adaptateurs et batteries SAS RAID 49
a. Positionnez le boîtier de la barrette DIMM en le baissant et en vous assurant que sa marque
d'alignement (A) et le guide 'alignement (B) sont alignés.
b. Insérez et serrez les six vis (C) qui maintiennent le boîtier de barrette DIMM.
11. Remplacez le régulateur de tension et le répartiteur d'air.
a. Abaissez le régulateur de tension (a) pour le positionner et serrez les trois vis (B) qui le
maintiennent en place.
b. Abaissez le répartiteur d'air (C) pour le positionner, puis appuyez fermement jusqu'à ce qu'il se
mette en place.
Figure 41. Remplacement du boîtier de barrette DIMM
50 Activation SAS (Serial-attached SCSI) RAID
12. Remplacez le boîtier de l'unité de disque :
Figure 42. Remplacement du répartiteur d'air du régulateur de tension (31E/2B)
Figure 43. Remplacement du répartiteur d'air du régulateur de tension (31E/1C ou 31E/2C)
Retrait et remise en place des adaptateurs et batteries SAS RAID 51
a. Les loquets de l'unité de disque étant ouverts, poussez le boîtier de l'unité de disque à l'intérieur
du système.
b. Poussez les loquets de l'unité de disque vers le système jusqu'à ce qu'ils soient verrouillés.
13. Connectez les câbles internes au fond de panier :
a. Connectez les câbles de l'unité de stockage. Connectez les câbles de l'unité de stockage, le cas
échéant, en appuyant fermement sur les fiches du câble pour les insérer dans les prises
correspondantes (A).
Figure 44. Remplacement du boîtier de l'unité de disque
52 Activation SAS (Serial-attached SCSI) RAID
b. Connectez le câble SAS, le cas échéant, en appuyant fermement sur sa fiche (A) pour l'insérer
dans la prise correspondante.
Figure 45. Connexion des câbles de l'unité de stockage
Figure 46. Connexion du câble SAS
Retrait et remise en place des adaptateurs et batteries SAS RAID 53
c. Connectez le câble du panneau de commande en appuyant fermement sur ses fiches pour les
insérer dans les prises correspondantes (A).
d. Replacez le câble SPCN à l'arrière du châssis (31E/2C).
14. Remplacez les blocs d'alimentation. Pour obtenir des instructions, voir Remplacement hors tension
des blocs d'alimentation 31E/2B, 31E/1C ou 31E/2C ou Remplacement sous tension des blocs
d'alimentation 31E/2B, 31E/1C ou 31E/2C.
15. Remplacez les ventilateurs. Pour obtenir des instructions, voir Remplacement des ventilateurs
31E/2B, 31E/1C ou 31E/2C.
16. Remplacez les modules de mémoire. Pour obtenir des instructions, voir Remplacement du module
de mémoire 31E/2B, 31E/1C ou 31E/2C.
17. Remplacez le régulateur de tension. Pour obtenir des instructions, voir Remplacement du régulateur
de tension31E/2B, 31E/1C ou 31E/2C.
18. Remplacez la carte Ethernet hôte (31E/2B). Pour obtenir des instructions, voir Remplacement hors
tension de la carte Ethernet hôte du modèle 31E/2B.
19. Remplacez les cartes GX. Pour obtenir des instructions, voir Remplacement des cartes GX pour
31E/2B, 31E/1C ou 31E/2C.
20. Remplacez les cartes PCI. Pour obtenir des instructions, voir Remplacement hors tension d'une carte
PCI sur le serveur 31E/2B, 31E/1C ou 31E/2C.
21. Remplacez le capot d'accès comme indiqué dans la rubrique Installation du capot d'accès d'un
système 31E/2B, 31E/1C ou 31E/2C monté en armoire.
22. Placez le système en position de fonctionnement, comme indiqué dans la rubrique Mise en position
de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire.
23. Rebranchez tous les cordons d'alimentation et d'interface.
24. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique.
25. Restaurez la connectivité réseau.
Figure 47. Connexion du câble du panneau de commande
54 Activation SAS (Serial-attached SCSI) RAID
Le processeur de service doit être connecté au réseau pour être accessible à partir des console de
gestion.
Comment l'adresse IP du processeur de service sera-t-elle gérée ?
Obtenez automatiquement une adresse IP à partir d'un
serveur DHCP (Dynamic Host Configuration Protocol)
Passez à l'étape suivante.
Configurez une adresse IP statique Passez à l'étape 27.
26. Restaurez les paramètres du processeur de service.
Activez l'accès réseau sur le processeur de service. Si la connexion réseau utilise le protocole DHCP
(Dynamic Host Configuration Protocol) pour établir une adresse IP, et utilise les Consoles de gestion
comme serveur DHCP, aucune configuration supplémentaire n'est nécessaire pour activer l'accès
réseau sur le processeur de service. Procédez comme suit :
a. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant
que le câble console de gestion est connecté au port console de gestion du fond de panier
système.
b. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : ne démarrez pas le système à ce stade.
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Passez à l'étape 29, à la page 56. Passez à l'étape 32, à la page 56.
27. Restaurez les paramètres du processeur de service.
L'interface ASMI (Advanced System Management Interface) est l'interface d'accès au processeur de
service qui est requise pour effectuer des tâches de maintenance générales et de niveau
administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des
données essentielles du produit, la configuration du processeur de service, le contrôle de
l'alimentation système et la configuration des ports réseau du processeur de service. L'interface
ASMI est également désignée comme l'ensemble des menus du processeur de service. L'interface
ASMI est accessible via HTTPS. La connexion réseau utilisant des affectations d'adresses IP statiques,
procédez comme suit.
a. Pour identifier les ports réseau du processeur de service, vous pouvez utiliser le panneau de
commande en mode manuel. Reportez-vous aux fonctions de maintenance pour savoir comment
utiliser le panneau de commande dans ce cas.
b. Connectez un client disposant d'un navigateur Web directement aux ports réseau du processeur
de service que vous avez notés précédemment. Il peut s'agir d'une URL similaire aux exemples
suivants :
v https://169.254.2.147
v https://169.254.3.147
c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : ne démarrez pas le système à ce stade.
d. Connectez-vous à l'interface ASMI à l'aide de l'ID utilisateur admin et du mot de passe par
défaut admin.
e. A l'invite, modifiez le mot de passe de l'ID administrateur et le mot de passe de l'ID utilisateur
général.
f. Pour configurer l'accès réseau, procédez comme suit.
1) Cliquez sur Configuration de réseau sous le noeud Services réseau.
2) Dans le panneau de droite, configurez les interfaces réseau eth0 et eth1, sélectionnez Statique
comme Type d'adresse IP, puis définissez un nom d'hôte, une adresse IP et un masque de
sous-réseau.
Retrait et remise en place des adaptateurs et batteries SAS RAID 55
28. Restaurez les paramètres du processeur de service.
La connexion réseau étant maintenant configurée pour utiliser des affectations d'adresses IP
statiques, essayez d'accéder aux ports réseau du processeur de service.
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Passez à l'étape suivante. Passez à l'étape 33, à la page 57.
29. Lancez l'interface ASMI.
L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service
requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la
lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du
produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface
ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à
l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI,
passez à l'étape suivante.
30. Définissez le type de boîtier et le code dispositif.
Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre
placée sur l'unité centrale. Lorsque le fond de panier système est remplacé, le numéro de série à sept
chiffres du boîtier et le code dispositif doivent être entrés manuellement et stockés dans le nouveau
fond de panier système à l'aide de l'interface ASMI (Advanced System Management Interface).
Remarque : Le code dispositif du fond de panier système 31E/2B, 31E/1C ou 31E/2C est 78AB-001.
Remarque : Pour effectuer cette opération, vous devez disposer du niveau d'autorisation Prestataire
de services agréé (ID : celogin).
Pour entrer le numéro de série du système :
a. Dans la fenêtre d'accueil de l'interface ASMI, indiquez votre ID utilisateur et votre mot de passe,
et cliquez sur Connexion.
b. Dans la zone de navigation, développez Configuration système et Données techniques
essentielles du programme.
c. Sélectionnez Boîtiers du système. Dans le panneau de droite, les boîtiers du système actuel
s'affichent.
d. Sur le système concerné, enregistrez le numéro de série du boîtier imprimé l'étiquette code barre
apposée sur l'unité centrale.
e. Entrez le numéro de série du boîtier enregistré dans la zone Numéro de série du boîtier de
l'unité centrale.
f. Entrez le numéro du code dispositif dans la zone Code dispositif/Numéro de séquence.
g. Cliquez sur Sauvegarder les paramètres pour mettre à jour et enregistrer le numéro de série du
boîtier et le code dispositif.
31. Passez à l'étape 35, à la page 57.
32. Restaurez les paramètres du processeur de service.
Si la connexion réseau utilise DHCP pour établir une adresse IP, mais n'utilise pas de en tant que
serveur DHCP, procédez comme suit :
a. Effectuez toutes les configurations de réseau requises pour permettre au serveur DHCP
d'identifier une adresse IP et de l'affecter au processeur de service.
b. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant
que le câble réseau est connecté au port réseau.
c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises.
Remarque : Ne démarrez pas le système à ce stade.
56 Activation SAS (Serial-attached SCSI) RAID
Pouvez-vous accéder au réseau de maintenance ?
Oui Non
Accédez à 29, à la page 56 Passez à l'étape suivante.
33. Restaurez les paramètres du processeur de service.
Vous avez indiqué que vous n'êtes toujours pas en mesure d'accéder au réseau de service. Vous
devrez réinitialiser le processeur de service en appuyant sur le commutateur (E) du panneau de
commande.
34. Passez à l'étape 25, à la page 54.
35. Restaurez les paramètres du processeur de service.
Modifiez le mot de passe de l'ID utilisateur d'administrateur lorsque vous y êtes invité en procédant
de la manière suivante.
a. Dans la zone de navigation de l'interface ASMI, développez Profil de connexion.
b. Sélectionnez Modifier le mot de passe sous Profil de connexion.
Figure 48. Commutateur du panneau de commande
Retrait et remise en place des adaptateurs et batteries SAS RAID 57
c. Indiquez les informations requises, puis cliquez sur Continuer.
Par mesure de sécurité, vous êtes invité à entrer votre mot de passe d'utilisateur en cours dans le
champ Mot de passe en cours pour l'ID utilisateur en cours.
Remarque : Les mots de passe peuvent être une combinaison quelconque de 64 caractères
alphanumériques au maximum. Le mot de passe par défaut pour l'ID utilisateur d'administrateur
est admin. Après votre première connexion à l'interface ASMI, le mot de passe admin doit être
modifié.
Par mesure de sécurité, vous êtes invité à entrer le mot de passe de votre ID utilisateur en cours
dans le champ Mot de passe en cours de l'utilisateur en cours.
36. Restaurez les paramètres du processeur de service.
Définissez le nom du système. Le nom du système peut être modifié pour toute chaîne ASCII
autorisée. Il ne doit pas nécessairement correspondre au format numéro de série_modèle_type de
machine. Pour modifier le nom du système, procédez comme suit.
a. Dans la zone de navigation de l'interface, développez Configuration système.
b. Sélectionnez Nom du système sous Configuration système.
c. Entrez le nom souhaité.
d. Cliquez sur Sauvegarder les paramètres pour mettre à jour le nom du système avec la nouvelle
valeur.
Le nouveau nom du système s'affiche dans le cadre d'état, zone dans laquelle le bouton
Déconnexion se trouve. Si une autre méthode, telle que la console de gestion est utilisée pour
modifier le nom du système, le cadre d'état ne reflète pas la modification.
37. Restaurez les paramètres du processeur de service.
Pour définir l'heure, procédez comme suit.
a. Dans la zone de navigation de l'interface, développez Configuration système.
b. Sélectionnez Heure locale sous Configuration système.
c. Si le système est hors tension, le volet droit affiche un formulaire qui indique la date (mois, jour
et année) et l'heure (heures, minutes, secondes).
d. Modifiez la date ou l'heure (ou les deux) et cliquez sur Sauvegarder les paramètres.
38. Restaurez les paramètres du processeur de service.
Vérifiez que le niveau du microprogramme de serveur est identique à la valeur enregistrée avant le
remplacement du fond de panier système. Si la valeur est différente, elle doit être corrigée. Pour plus
d'informations sur la correction du microprogramme, voir Obtention des correctifs de
microprogramme.
39. Restaurez les paramètres du processeur de service.
Entrez à nouveau les paramètres suivants qui ont été précédemment modifiés via l'interface ASMI
(Advanced System Management Interface), sauf si vous souhaitez utiliser les paramètres par défaut.
v Paramètre de nom de système
v Paramètres de contrôle de l'alimentation système
v Paramètres de l'aide à la maintenance système
v Paramètres de configuration système
v Paramètres des services réseau
v Paramètres des options de performances
v Paramètres de profil de connexion
v Paramètres de liste des unités d'amorçage
v Paramètres de carte Ethernet hôte
40. Restaurez les paramètres du processeur de service.
58 Activation SAS (Serial-attached SCSI) RAID
a. Redéfinissez les paramètres du processeur de service que vous aviez éventuellement définis via
les commandes du système d'exploitation. Vous avez enregistré ces paramètres au début de cette
procédure.
b. Si vous choisissez de réinitialiser le mot de passe d'accès aux Consoles de gestion, procédez
comme suit :
Pour modifier le mot de passe d'accès à la console HMC, choisissez l'une des méthodes suivantes
;
1) A partir de l'interface graphique de la console HMC (méthode recommandée) :
a) Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b) Cliquez deux fois sur le dossier Serveurs.
c) Sélectionnez un serveur dans le panneau central.
d) Sous le dossier Opérations, sélectionnez Modifier le mot de passe.
e) Indiquez les informations souhaitées et cliquez sur OK.
2) Sur la ligne de commande de la console HMC, saisissez :
/usr/hmcrbin/chsyspwd -m système_géré -t access --passwd --newpasswd
nouveau_mot_de_passe où :
v La valeur de système_géré correspond au nom du système géré du nouveau processeur de
service.
v Aucune valeur n'est indiquée pour passwd-- de façon à permettre l'authentification.
v La valeur de newpasswd est celle de nouveau_mot_de_passe.
Pour modifier le mot de passe d'accès à la console SDMC, procédez comme suit :
1) Allez dans l'onglet Resources et cliquez sur Hosts.
2) Sélectionnez un serveur dans le panneau de droite.
3) Dans le menu Actions, cliquez sur Operations > Change Password.
4) Indiquez les informations souhaitées et cliquez sur OK.
41. Récupérez les données de partition sur le processeur de service.
Les données de profil stockées sur le serveur géré ont été supprimées ou endommagées.
Pour restaurer les données de profil à l'aide de la console HMC, procédez comme suit.
a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation.
b. Cliquez deux fois sur le dossier Serveurs.
c. Dans le panneau central, sélectionnez un serveur en cochant la case correspondante.
d. Dans le dossier Configuration, si vous avez accès au dossier Gestion des données de partition,
sélectionnez Restaurer.
Pour restaurer les données de profil à l'aide de la console SDMC, procédez comme suit :
a. Allez dans l'onglet Resources et cliquez sur Hosts.
b. Sélectionnez un serveur dans le panneau de droite.
c. Dans le menu Actions, cliquez sur System Configuration > Manage Partition Data > Restore.
d. Sélectionnez l'option de restauration souhaitée et cliquez sur OK.
42. Mettez à jour les données de produit essentielles.
Si votre système exécute et n'est pas géré par les Consoles de gestion, vous devez mettre à jour la
source de chargement après avoir remplacé le fond de panier système. Pour mettre à jour la source
de chargement sans l'aide des Consoles de gestion, procédez comme suit :
a. Chargez le support optique du microcode sous licence I_BASE_01 dans l'unité qui est définie
pour le système et accédez-y pour effectuer un IPL (procédure de chargement initial).
Remarque : Cet IPL peut prendre plusieurs minutes.
Retrait et remise en place des adaptateurs et batteries SAS RAID 59
b. Une fois l'écran de sélection de groupe de langues affiché, appuyez sur la touche Entrée pour
sélectionner la langue par défaut 2924 (anglais).
Remarque : Pour modifier la langue, tapez celle qui apparaît sur le support du système
d'exploitation, puis appuyez sur la touche Entrée.
Confirmez la langue en appuyant sur la touche Entrée.
c. A partir de l'écran d'installation du microcode sous licence, sélectionnez l'option 2 (utilisation des
outils de maintenance en mode dédié) et appuyez sur la touche Entrée.
Utilisez les outils de maintenance en mode dédié pour vous assurer que les unités de disque, y
compris la source de chargement, génèrent correctement des rapports et qu'il n'y a pas d'erreurs
susceptibles d'empêcher l'exécution de l'IPL à partir du disque. Lorsque toutes les erreurs ont été
résolues et que le disque source de chargement génère correctement des rapports, passez à l'étape
suivante.
d. Dans l'écran d'utilisation des outils de maintenance en mode dédié (DST), sélectionnez l'option 2
(utilisation des unités de disque) et appuyez sur la touche Entrée.
e. Sélectionnez l'option 8 (mise à jour des données produit essentielles du système) pour identifier
l'emplacement de l'unité de disque source de chargement. Appuyez sur Entrée. L'écran de mise à
jour des données produit essentielles du système s'affiche.
f. Appuyez sur la touche Entrée pour confirmer que vous souhaitez que les données produit
essentielles soient enregistrées.
g. Appuyez deux fois sur F12 (Précédent).
h. Tapez 1 et appuyez sur la touche Entrée pour quitter l'écran des outils de maintenance en mode
dédié. Utilisez le panneau de configuration système pour retarder l'arrêt du système. Passez le
mode IPL sur B et démarrez le système pour effectuer un IPL à partir du disque.
43. Restaurez les paramètres de liste d'amorçage et de carte Ethernet hôte.
La liste des unités d'amorçage n'est pas présente sur un nouveau fond de panier système. Par
conséquent, il est nécessaire de créer les unités d'amorçage en démarrant les menus des services
SMS. Voir Sélection des options d'amorçage pour obtenir des informations sur la configuration de la
liste des unités d'amorçage.
Une fois le système d'exploitation démarré, vous pouvez utiliser la Commande bootlist pour vérifier
la liste des unités d'amorçage d'un système ou d'une partition, ou y apporter des modifications
supplémentaires.
Pour restaurer les paramètres de la carte Ethernet hôte, voir Configuration de ports physiques sur
une carte Ethernet hôte.
44. Vérifiez la réparation. Pour plus d'informations, voir Vérification d'une réparation.
Considérations relatives à la maintenant du sous-système SAS 31E/2B
Il existe plusieurs éléments à consulter avant d'effectuer la maintenance des fonctions au sein du
sous-système SAS pour le système 31E/2B.
Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des
données en cache sur la carte de P1-C18 ou P1-C13 clignote. Pour voir ce voyant, vous devez arrêter le
système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur
le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de
données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en
cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du
sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batteries de cache lorsque Yes est
affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les
systèmes Linux et IBM® i) ou sur l'écran de statut de la commande (pour les systèmes AIX).
60 Activation SAS (Serial-attached SCSI) RAID
Avant d'effectuer la maintenance, vérifiez les configurations d'E-S de chaque partition logique sur le
système. Il est important de comprendre les différences entre les diverses configurations et les
considérations qui en découlent en matière de maintenance. En outre, tenez compte des incidences
possibles sur les domaines suivants :
v Partitions logiques et leurs affectations d'adaptateur d'E-S
v Configuration des grappes de disques RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Considérations de maintenance pour le bloc de batterie de cache
Pour éviter la perte de données, suivez les procédures appropriées avant de remplacer le bloc de batterie
de cache sur la carte Cache RAID.
Remarque : La maintenance simultanée de la batterie de l'adaptateur n'est pas prise en charge. Même en
cas de batteries défaillantes ou manquantes, la carte associée à la batterie doit être retirée en premier.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS et
l'accessibilité des données du système.
Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des
données en cache sur la carte de P1-C18 ou P1-C13 clignote. Pour voir ce voyant, vous devez arrêter le
système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur
le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de
données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en
cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du
sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batteries de cache lorsque Yes est
affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les
systèmes Linux et IBM i) ou sur l'écran de statut de la commande (pour les systèmes AIX).
La figure suivante indique l'emplacement du voyant de présence de données en cache dans les
sous-systèmes SAS 31E/2B.
Retrait et remise en place des adaptateurs et batteries SAS RAID 61
Le tableau suivant décrit les états du voyant de présence des données en cache et les éléments à prendre
en considération avant de tenter une opération de maintenance.
Tableau 1. Indications du voyant et actions
Carte de batterie de
cache (Un-P1-C13)
Contrôleur de stockage
RAID/cache
(Un-P1-C18) Indication Action
désactivée désactivée
La mémoire cache n'est pas
active.
Les actions de maintenance
peuvent s'effectuer.
désactivée clignote
La mémoire cache est active sur
C18. La mémoire cache peut
avoir été vidée sur C13 en cas
de défaillance de C18.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance.
clignote désactivée
La mémoire cache est active sur
C13. La mémoire cache peut
avoir été vidée sur C18 en cas
de défaillance de C13.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance.
clignote clignote
La mémoire cache est active sur
les contrôleurs doubles.
Essayez de mettre le système
sous tension et de l'éteindre
correctement avant d'effectuer
toute action de maintenance. Si
cela ne parvient pas à effacer
les voyants de cache actifs,
contactez le support. Voir
Contacter le support et la
maintenance IBM.
Figure 49. Voyant des données en cache
62 Activation SAS (Serial-attached SCSI) RAID
Retrait de la carte d'activation RAID/cache31E/2B
Utilisez cette procédure pour retirer des cartes d'activation RAID (Redundant Array of Independent
Disks) et cache avant d'installer une carte d'activation nouvelle ou mise à niveau.
Pourquoi et quand exécuter cette tâche
Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la
carte du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page
103.
Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour
retirer la carte du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la
page 118.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer la carte
d'activation :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B,
31E/1C ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Identifiez les cartes d'activation RAID et cache à retirer.
Remarque : Si une unité de bande en option est installée, elle doit être déplacée pour pouvoir
accéder à la carte d'activation RAID.
9. Vérifiez le voyant de validité des données de cache situé sur la carte de batterie de cache de l'autre
côté du système.
Remarque : La carte de batterie alimente les données de cache placées sur la carte RAID. Si le
voyant clignote, vous risquez de perdre des données en cache lors du retrait de cette carte.
10. Appuyez sur le loquet et faites glisser la carte en dehors du système, comme illustré.
Retrait et remise en place des adaptateurs et batteries SAS RAID 63
11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
13. Rebranchez la source d'alimentation.
14. Démarrez le système, comme décrit dans Mise sous tension.
15. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Que faire ensuite
Remplacez les cartes d'activation RAID et cache. Pour obtenir des instructions, voir «Remplacement de la
carte d'activation RAID/cache31E/2B».
Remplacement de la carte d'activation RAID/cache31E/2B
Utilisez cette procédure pour remplacer des cartes d'activation RAID (Redundant Array of Independent
Disks) et cache.
Pourquoi et quand exécuter cette tâche
Vous devez avoir exécuté la procédure «Retrait de la carte d'activation RAID/cache31E/2B», à la page 63.
Pour remplacer une carte d'activation RAID/cache, procédez comme suit :
Figure 50. Retrait de cartes d'activation RAID et cache d'une unité centrale montée en armoire
64 Activation SAS (Serial-attached SCSI) RAID
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B,
31E/1C ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
7. Attachez le bracelet antistatique.
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Identifiez l'emplacement de la carte d'activation RAID/cache
Retrait et remise en place des adaptateurs et batteries SAS RAID 65
9. Alignez la carte d'activation RAID/cache sur l'emplacement dans le système.
10. Faites glisser complètement la carte d'activation RAID/cache dans le système jusqu'au verrouillage
du taquet (A).
11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
13. Rebranchez la source d'alimentation.
14. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91).
15. Vérifiez le composant installé ou réparé
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B,
31E/1C ou 31E/2C
Utilisez cette procédure pour retirer un contrôleur de stockage RAID (Redundant Array of Independent
Disks) et cache avant d'installer un contrôleur de stockage nouveau ou mis à niveau. Vous pouvez
également utiliser cette procédure pour retirer un contrôleur de stockage RAID-10.
Pourquoi et quand exécuter cette tâche
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 87.
Figure 51. Remplacement des cartes d'activation RAID et cache sur une unité centrale montée en armoire
66 Activation SAS (Serial-attached SCSI) RAID
Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC
pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour plus d'informations, voir «Retrait
d'un composant via la console HMC», à la page 103.
Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour
retirer les contrôleurs de stockage RAID et cache du serveur. Pour des instructions, voir «Retrait d'un
composant via la console SDMC», à la page 118.
Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le contrôleur de
stockage RAID/cache :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Mettez le système en position de maintenance, comme décrit dans «Mise en position de maintenance
du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué dans «Retrait du capot d'accès du système 31E/2B, 31E/1C
ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
7. Attachez le bracelet antistatique.
Retrait et remise en place des adaptateurs et batteries SAS RAID 67
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Identifiez les contrôleurs de stockage RAID et cache à retirer.
Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS sur votre
système si le voyant de la carte dans C18-C18 ou P1-C13 clignote. Ce voyant indique l'écriture de
données en cache. Pour éviter la perte de données, voir Affichage des informations relatives aux
batteries rechargeables. Vous pouvez remplacer le bloc de batterie de cache sans danger si la mention
Yes apparaît en regard du bloc de batterie.
Figure 52. Emplacement d'installation de la carte d'activation RAID (P1 à C12), du contrôleur de stockage (P1 à C18)
et de la carte de batterie de cache (P1 à C13) pour 31E/2B
68 Activation SAS (Serial-attached SCSI) RAID
9. Appuyez sur le taquet (A) et tirez sur la poignée (B) dans les directions indiquées, puis faites glisser
la carte hors du système, comme l'illustre la figure suivante.
Figure 53. Emplacement d'installation du contrôleur de stockage (P1 à C18) et de la carte de batterie de cache (P1 à
C13) pour 31E/1C ou 31E/2C
Figure 54. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire pour 31E/2B
Retrait et remise en place des adaptateurs et batteries SAS RAID 69
10. Réinstallez le capot d'accès. Voir «Installation du capot d'accès du système 31E/2B, 31E/1C ou
31E/2C», à la page 97
11. Pour un système monté en armoire, mettez le système en position de fonctionnement, comme décrit
dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en
armoire», à la page 99.
12. Rebranchez la source d'alimentation.
13. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91).
14. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Que faire ensuite
Pour remplacer le contrôleur de stockage RAID/cache, voir «Remplacement du contrôleur de stockage
RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C».
Remplacement du contrôleur de stockage RAID/cache sur le système
31E/2B, 31E/1C ou 31E/2C
Utilisez cette procédure pour remplacer des contrôleurs de stockage RAID et cache. Vous pouvez
également utiliser cette procédure pour remplacer un contrôleur de stockage RAID-10.
Pourquoi et quand exécuter cette tâche
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de
commande», à la page 87.
Figure 55. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire pour 31E/1C ou
31E/2C
70 Activation SAS (Serial-attached SCSI) RAID
Vous devez avoir exécuté la procédure «Retrait du contrôleur de stockage RAID/cache sur le système
31E/2B, 31E/1C ou 31E/2C», à la page 66.
Pour remplacer des contrôleurs de stockage RAID et cache, procédez comme suit :
Procédure
1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83.
2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93).
3. Mettez le système en position de maintenance, comme décrit dans «Mise en position de maintenance
du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
4. Retirez le capot d'accès, comme indiqué dans «Retrait du capot d'accès du système 31E/2B, 31E/1C
ou 31E/2C», à la page 96.
5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans
«Identification d'un composant défaillant», à la page 86.
6. Débranchez le système pour l'isoler de la source d'alimentation.
Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre
cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée.
(L003)
ou
7. Attachez le bracelet antistatique.
Retrait et remise en place des adaptateurs et batteries SAS RAID 71
Avertissement :
v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel
contre les risques de décharge électrostatique.
v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux
dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et
n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation
d'appareils électriques.
v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du
système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique,
d'installer ou de remplacer du matériel.
8. Identifiez l'emplacement des contrôleurs de stockage RAID et cache.
9. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système.
10. Faites glisser complètement les contrôleurs de stockage RAID et cache dans le système jusqu'au
verrouillage du loquet (A).
Figure 56. Remplacement des contrôleurs de stockage RAID/cache pour 31E/2B
72 Activation SAS (Serial-attached SCSI) RAID
11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99.
13. Rebranchez la source d'alimentation.
14. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la
page 91.
15. Vérifiez le composant installé :
v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme
décrit dans Vérification du composant installé.
Retrait et installation de la carte de batterie de cache
Pour effectuer la maintenance de la carte de batterie de cache, procédez comme suit :
1. Commencez avec le système sous tension.
Pour éviter la perte de données, la carte de batterie de cache doit être à l'état d'erreur avant d'être
retirée du système.
Si votre système est hors tension, rallumez-le avant de continuer.
2. Choisissez votre système d'exploitation :
v AIX : Passez à l'étape 3.
v Linux : Passez à l'étape 5.
v IBM i : Passez à l'étape 7.
3. Sous AIX, mettez la carte de batterie de cache à l'état d'erreur (afin d'éviter d'éventuelles pertes de
données) en procédant comme suit :
a. Accédez au gestionnaire de grappes de disques SAS IBM, comme indiqué ci-dessous :
Figure 57. Remplacement des contrôleurs de stockage RAID/cache pour 31E/1C ou 31E/2C
Retrait et remise en place des adaptateurs et batteries SAS RAID 73
1) A l'invite de commande, tapez smit, puis appuyez sur Entrée.
2) Sélectionnez Devices.
3) Sélectionnez Disk Array.
4) Sélectionnez IBM SAS Disk Array.
5) Sélectionnez IBM SAS Disk Array Manager dans le menu comprenant les options de
configuration et de gestion du contrôleur IBM SAS RAID.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez l'option Controller Rechargeable Battery Maintenance.
d. Sélectionnez l'option Force Controller Rechargeable Battery Error.
e. Sélectionnez le contrôleur SAS RAID IBM dont vous souhaitez remplacer la batterie.
Cette option met la batterie à l'état d'erreur, ce qui nécessite son remplacement.
f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir la procédure
Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer la carte
de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack
can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant
d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs
minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse
s'effectuer sans danger.
Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS sur votre
système si le voyant de la carte dans C18-C18 ou P1-C13 clignote. Ce voyant indique l'écriture de
données en cache.
g. Vérifiez que le voyant de présence de données en cache ne clignote pas.
Affichage des informations relatives aux batteries rechargeables :
a. Accédez au gestionnaire de grappes de disques SAS IBM en suivant la procédure précédente.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez l'option Controller Rechargeable Battery Maintenance.
d. Sélectionnez l'option Displaying Rechargeable Battery Information.
e. Sélectionnez le contrôleur IBM SAS RAID.
4. Passez à l'étape 8.
5. Mettez la carte de batterie de cache à l'état d'erreur sous Linux (afin d'éviter d'éventuelles pertes de
données) en procédant comme suit :
a. Exécutez l'utilitaire iprconfig en saisissant iprconfig.
b. Sélectionnez Work with disk unit recovery.
c. Sélectionnez Work with resources containing cache battery packs.
d. Sélectionnez votre carte et tapez 2. Appuyez ensuite sur Entrée pour mettre la batterie à l'état
d'erreur.
Remarque : Cette option permet de mettre la batterie à l'état d'erreur, ce qui nécessite son
remplacement.
e. Si vous êtes sûr de devoir mettre la batterie à l'état d'erreur, tapez c pour confirmer. Si vous ne
souhaitez pas mettre la batterie à l'état d'erreur, tapez q pour annuler.
f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir Affichage des
informations relatives aux batteries rechargeables.
g. Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en
regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à
plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie
rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement
de la carte de batterie de cache puisse s'effectuer sans danger.
6. Passez à l'étape 8.
74 Activation SAS (Serial-attached SCSI) RAID
7. Mettez la carte de batteries de cache à l'état d'erreur sous (afin d'éviter d'éventuelles pertes de
données) en appliquant la procédure suivante au système ou à la partition logique qui comporte
l'adaptateur :
a. Vous devez avoir ouvert une session avec au minimum les droits d'accès aux outils de
maintenance.
b. Tapez strsst sur la ligne de commande et appuyez sur Entrée.
c. Sur l'écran System Service Tools (SST) Sign On, tapez votre ID utilisateur et votre mot de passe
d'accès aux outils de maintenance. Appuyez sur Entrée.
d. Sur l'écran System Service Tools (SST), sélectionnez Start a Service Tool et appuyez sur Entrée.
e. Dans l'écran Start a Service Tool, sélectionnez Hardware Service Manager et appuyez sur la
touche Entrée.
f. Sélectionnez Work with resources containing cache battery packs dans l'écran Hardware Service
Manager.
g. Sélectionnez Force battery pack into error state pour l'adaptateur d'entrée-sortie que vous
utilisez dans l'écran Work with Resources containing cache battery packs, puis appuyez sur
Entrée.
h. Dans l'écran Force Battery Packs Into Error State, vérifiez que l'adaptateur d'entrée-sortie
approprié est sélectionné et appuyez sur la touche de fonction correspondant à votre choix pour
confirmer.
i. Revenez à l'écran Work with Resources containing cache battery packs et sélectionnez Display
battery information. Assurez-vous que la zone Safe to replace cache battery a la valeur Yes.
Cette opération peut prendre plusieurs minutes et vous devrez peut-être appuyer sur le bouton
d'actualisation pour que la zone soit mise à jour.
8. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de
maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98.
9. Retirez le capot d'accès de maintenance. Voir «Retrait du capot d'accès du système 31E/2B, 31E/1C
ou 31E/2C», à la page 96
10. Vérifiez le voyant de la batterie de cache en examinant le voyant de la carte de batterie de cache.
v Si le voyant clignote, passez à l'étape suivante.
v Si le voyant ne clignote pas, passez à l'étape 12.
Retrait et remise en place des adaptateurs et batteries SAS RAID 75
11. Arrêt du retrait de la carte de batterie de cache.
Le voyant clignotant indique que la batterie conserve les données en cache pour les cartes
d'activation RAID et cache. Ces données seront perdues si la batterie est retirée à ce stade. Effectuez
cette procédure de maintenance sans remplacer la batterie, puis effectuez-la de nouveau pour vous
assurer que toutes les données en cache sont enregistrées sur le disque avant de remplacer la
batterie. Passez à l'étape 18.
12. Retirez la carte de batterie de cache en procédant comme suit :
Avertissement : Ne retirez pas la batterie si la DEL (B) clignote ; cela signifie qu'il existe des
données en cache.
a. Enfoncez la partie métallique (A) pour retirer la carte.
b. Retirez la carte de batterie de cache des glissières de montage afin de l'extraire du contrôleur.
Figure 58. Voyant des données en cache
76 Activation SAS (Serial-attached SCSI) RAID
13. Installez la carte de batterie de cache en procédant comme suit :
Figure 59. Retrait de la carte de batterie de cache pour31E/2B
Figure 60. Retrait de la carte de batterie de cache pour 31E/1C ou 31E/2C
Retrait et remise en place des adaptateurs et batteries SAS RAID 77
Avertissement : Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes
avant de connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à l'adaptateur pour
détecter que la batterie a été remplacée.
a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce
qu'elle s'insère dans le connecteur de batterie.
b. Alignez les trous d'alignement (B) sur ceux du châssis afin que la carte s'insère correctement.
Figure 61. Installation de la carte de batterie de cache pour31E/2B
78 Activation SAS (Serial-attached SCSI) RAID
14. Redémarrez le cache d'écriture de l'adaptateur en procédant comme suit :
Choisissez votre système d'exploitation :
v AIX : Passez à l'étape 15.
v Linux : Passez à l'étape 16.
v IBM i : Passez à l'étape 17.
15. Procédez comme suit :
a. Accédez au gestionnaire de grappes de disques IBM SAS en suivant la procédure de la section
3a.
b. Sélectionnez Diagnostics and Recovery Options.
c. Sélectionnez Controller Rechargeable Battery Maintenance.
d. Sélectionnez Start Adapter Cache.
e. Sélectionnez le contrôleur dont vous venez de remplacer la batterie et appuyez sur Entrée.
f. Passez à l'étape 18.
16. Procédez comme suit :
a. Exécutez l'utilitaire iprconfig en saisissant iprconfig.
b. Sélectionnez Work with disk unit recovery.
c. Sélectionnez Work with resources containing cache battery packs.
d. Démarrez le cache de l'adaptateur d'E-S sur l'adaptateur correspondant à la batterie que vous
venez de remplacer en saisissant 3, puis en appuyant sur Entrée.
e. Passez à l'étape 18.
17. Procédez comme suit :
a. Revenez à l'écran Work with Resources containing Cache Battery Packs en utilisant les
informations obtenues à l'étape 7a à 7f et sélectionnez Start IOA Cache. Appuyez sur Entrée.
b. Vérifiez que vous obtenez bien le message Cache was started.
Figure 62. Installation de la carte de batterie de cache pour 31E/1C ou 31E/2C
Retrait et remise en place des adaptateurs et batteries SAS RAID 79
c. Passez à l'étape 18.
18. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système
31E/2B, 31E/1C ou 31E/2C», à la page 97.
19. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans
«Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à
la page 99
20. Fermez le volet arrière de l'unité dont vous effectuez la maintenance.
21. Vérifiez que la nouvelle ressource fonctionne correctement, comme indiqué dans la rubrique
«Vérification du composant installé», à la page 104.
Retrait et remplacement du bloc de batterie de cache
Cette procédure permet de remplacer le bloc de batterie de cache sur une carte de batterie.
Avant de commencer
Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin
d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de
commande.
Avertissement : Avant de poursuivre ces procédures, déterminez si vous pouvez remplacer le bloc de
batterie de cache sans risque. Voir Affichage des informations sur la batterie rechargeable. Vous pouvez
remplacer le bloc de batterie de cache sans danger si la mention Yes apparaît en regard de l'option
Battery pack can be safely replaced.
Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données,
vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS
(serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS
pour comprendre les différentes remarques sur la configuration et la maintenance SAS.
Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S
pour chaque partition logique de votre système. Il est important de comprendre les différences entre les
diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des
incidences possibles sur les domaines suivants :
v Partitions logiques et leurs affectations à l'adaptateur d'E-S
v Configurations de grappes RAID
v Emplacements de disque physique
v Chemin d'accès à votre unité d'amorçage
Pourquoi et quand exécuter cette tâche
Procédez comme suit pour remplacer un bloc de batterie n'acceptant pas la maintenance simultanée :
Procédure
1. Retirez le contrôleur du système Pour plus d'informations, voir «Retrait du contrôleur de stockage
RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C», à la page 66.
2. Placez le contrôleur sur une surface protégée des décharges électrostatiques.
3. Débranchez le connecteur de batterie (A) de son connecteur sur la carte, en tirant la languette de
verrouillage tout en tirant doucement sur la fiche. La fiche est connectée à la carte mère d'une seule
façon ; elle ne peut donc être insérée de manière incorrecte lors de la procédure de remplacement.
80 Activation SAS (Serial-attached SCSI) RAID
Remarque : Laissez le bloc de batterie de cache déconnecté pendant au moins 60 secondes avant de
connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à la carte pour détecter que la
batterie a été remplacée.
Connecteur de la batterie (A)
(B) Bloc de batterie du cache
4. La batterie est montée sur une plaque en plastique noir avec deux taquets, maintenue par des vis de
butée à l'aide de goupilles de retenue. Pour retirer le bloc de batterie, pressez le dispositif de retenue
de chaque goupille, puis soulevez le taquet de la plaque de montage noire afin de la libérer des deux
broches de retenue.
5. Installez la batterie sur les supports de façon à ce que les loquets s'enclenchent.
Figure 63. Retrait de la batterie de cache
Retrait et remise en place des adaptateurs et batteries SAS RAID 81
6. Connectez le connecteur du bloc de batteries de cache (B) à la carte. La fiche est connectée à la carte
mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte.
7. Réinstallez l'adaptateur et redémarrez le cache. Pour plus d'informations, voir les procédures
suivantes :
v «Retrait et installation de la carte de batterie de cache», à la page 73
v «Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C», à la
page 66.
Figure 64. Remplacement de la batterie de cache
82 Activation SAS (Serial-attached SCSI) RAID
Procédures communes aux dispositifs installables
Cette section contient toutes les procédures communes concernant l'installation, le retrait et le
remplacement de fonctionnalités.
Avant de commencer
Respectez ces précautions lorsque vous installez, retirez ou remplacez des dispositifs ou des composants.
Pourquoi et quand exécuter cette tâche
Ces précautions visent à créer un environnement sûr pour la maintenance de votre système, mais ne
proposent pas de procédure de maintenance du système. Les procédures d'installation, de retrait et de
remplacement fournissent les processus étape par étape pour la maintenance de votre système.
83
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les cordons d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les cordons d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
84 Activation SAS (Serial-attached SCSI) RAID
(D005a)
DANGER
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit :
Procédures communes aux dispositifs installables 85
Procédure
1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en
charge de ce nouveau dispositif.
2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des
données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de
votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données).
3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au
composant.
4. Notez la correspondance des couleurs utilisées sur le système.
La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez
utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La
couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que
le système ou la partition logique est sous tension.
5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire
de ciseaux.
6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme
suit :
v Si vous remplacez un composant, contactez votre fournisseur de services ou le support technique.
v Si vous installez une fonctionnalité, contactez l'un des services suivants :
– Le fournisseur de services ou le support technique
7. Si vous rencontrez des difficultés lors de l'installation, contactez votre fournisseur de services, ou le
support technique.
8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre
les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la
rubrique Partitionnement logique.
Identification d'un composant défaillant
Ces instructions permettent d'apprendre à localiser et identifier un composant défaillant ou à retirer, ou
encore de localiser l'emplacement d'installation d'un nouveau composant sur votre système ou unité
d'extension en utilisant une méthode adaptée à votre système.
Pourquoi et quand exécuter cette tâche
Pour les serveurs systèmes dotés du processeur POWER7, les voyants peuvent être utilisés pour identifier
ou vérifier un composant à retirer, installer ou en cours de maintenance.
La combinaison de la fonction d'identification et du voyant d'incident (couleur ambre) montre
l'emplacement d'une unité remplaçable sur site (unité FRU). Lorsque vous retirez une unité FRU, vérifiez
d'abord que vous travaillez sur l'unité FRU correcte à l'aide de la fonction d'identification dans la console
de gestion ou dans une autre interface utilisateur. Lorsque vous retirez une unité FRU dans la console de
gestion du matériel, la fonction d'identification est activée et désactivée automatiquement aux moments
opportuns.
La fonction d'identification produit le clignotement du voyant jaune. Lorsque vous désactivez la fonction
d'identification, le voyant revient à son état précédent. Pour les composants dotés d'un bouton de
maintenance bleu, la fonction d'identification définit les informations du voyant pour le bouton de
maintenance de sorte que lorsque le bouton est activé, les voyants corrects du composant clignotent.
Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes.
86 Activation SAS (Serial-attached SCSI) RAID
Voyants du panneau de commande
Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de
commande.
Le panneau de commande est doté de voyants qui indiquent les différents états du système.
v A : Bouton de mise sous tension
v B : Voyant d'alimentation
– Un voyant fixe indique que l'unité est sous alimentation système complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe.
Durant cette période de transition, le clignotement peut s'accélérer.
v C : Voyant d'identification du boîtier
– Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce.
– L'absence de lumière indique un fonctionnement normal du système.
v D: Voyant d'information système
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant allumé signifie qu'une intervention est requise sur le système.
v E : Port USB
v F : Voyant de panne de boîtier
– Un voyant fixe indique une panne dans l'unité centrale.
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
Figure 65. Panneau de commande
Procédures communes aux dispositifs installables 87
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
Concepts associés
Identification d'un composant défaillant
Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un
système ou une unité d'extension en utilisant la méthode propre à votre système.
Identification d'un composant défaillant sur un système AIX ou une
partition logique
Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le
voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX.
Localisation d'un composant défaillant sur un système AIX ou une partition
logique
Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche
d'un composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran
Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
Que faire ensuite
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. Pour plus d'informations, voir «Activation du voyant associé au composant défaillant».
Activation du voyant associé au composant défaillant
Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous
intervenez.
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
88 Activation SAS (Serial-attached SCSI) RAID
Désactivation du voyant d'un composant défaillant
Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis
appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le
code d'emplacement.
6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé
au composant défaillant.
7. Retournez à la ligne de commande.
Identification d'un composant défaillant sur un système Linux ou une
partition logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou
désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance.
Localisation d'un composant défaillant sur un système Linux ou une partition
logique
Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les
voyants pour localiser un composant.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Localisation d'un composant défaillant sur un système ou une partition logique
Linux
Pour retrouver le code d'emplacement d'un composant défaillant si vous avez oublié cette information,
utilisez la procédure décrite dans cette rubrique.
Pourquoi et quand exécuter cette tâche
Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée.
Procédures communes aux dispositifs installables 89
3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code).
4. Notez les informations d'emplacement.
Activation du voyant associé au composant défaillant
Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à
rechercher le composant à remplacer.
Pourquoi et quand exécuter cette tâche
Pour l'activer, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et
appuyez sur Entrée.
3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant.
Résultats
Désactivation du voyant d'un composant défaillant
Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du
composant défaillant.
Pourquoi et quand exécuter cette tâche
Pour le désactiver, procédez comme suit :
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez /usr/sbin/usysident -s normal -l et appuyez
sur Entrée.
Localisation d'un composant défaillant sur un système ou une
partition logique Virtual I/O Server
Vous pouvez utiliser les outils Virtual I/O Server (VIOS) avant d'activer le voyant, pour rechercher un
composant défaillant.
Procédure
1. Connectez-vous en tant qu'utilisateur root ou celogin-.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée.
4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée.
5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du
journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la
liste des événements triée dans l'ordre chronologique.
6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur
Entrée.
7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal.
8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée.
9. Retournez à la ligne de commande.
90 Activation SAS (Serial-attached SCSI) RAID
Résultats
Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce
dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server».
Identification d'un composant à l'aide du Virtual I/O Server
Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à
l'aide du Virtual I/O Server (VIOS).
Procédure
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diagmenu et appuyez sur Entrée.
3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée.
4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement.
Appuyez sur Entrée.
5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et
appuyez sur Entrée.
6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant.
7. Retournez à la ligne de commande.
Démarrage du système ou de la partition logique
La présente section explique comment démarrer un système ou une partition logique après avoir effectué
une action de maintenance ou une mise à niveau du système.
Démarrage d'un système non géré par une console HMC ou une
console SDMC
Vous pouvez utiliser le bouton d'alimentation ou l'interface ASMI pour démarrer un système qui n'est pas
géré par une console HMC (Hardware Management Console) ni par une console Systems Director
Management Console (SDMC).
Pourquoi et quand exécuter cette tâche
Pour démarrer un système non géré par une console HMC ou SDMC, procédez comme suit :
Procédure
1. Si nécessaire, ouvrez le volet avant de l'armoire.
2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est
bien relié à une source d'alimentation :
v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation.
v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante.
v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01
V=F.
3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante).
Procédures communes aux dispositifs installables 91
v A : Bouton de mise sous tension
v B : Voyant d'alimentation
– Un voyant fixe indique que l'unité est sous alimentation système complète.
– Un voyant clignotant indique que l'unité est sous alimentation de mode veille.
Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le
bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir
fixe. Durant cette période de transition, le clignotement peut s'accélérer.
v C : Voyant d'identification de boîtier
– Une lumière fixe indique l'état d'identification du boîtier ou d'une ressource du boîtier.
– Aucune lumière signifie qu'aucune ressource du boîtier n'est identifiée.
v D : Voyant d'incident
– L'absence de lumière indique un fonctionnement normal du système.
– Un voyant fixe indique qu'une intervention est requise sur le système.
v E : Port USB
v F : Voyant de panne de boîtier
– Un voyant fixe correspond à un voyant de panne actif sur le système.
– L'absence de lumière indique un fonctionnement normal du système.
v G : Ecran Fonctions/Données
v H : Bouton de décrémentation
v I : Bouton Entrée
v J : Bouton d'incrémentation
v K : Bouton de réinitialisation
4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants :
v Le voyant d'alimentation commence à clignoter plus rapidement.
v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus
rapidement.
Figure 66. Panneau de commande
92 Activation SAS (Serial-attached SCSI) RAID
v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage
du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le
système est sous tension.
Que faire ensuite
Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez
la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System
Management Interface) :
1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface
ASMI.
2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la
rubrique Mise sous tension et arrêt du système.
Démarrage d'un système ou d'une partition logique via la console
HMC
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
démarrer le système ou la partition logique une fois les câbles requis installés et connectés à une source
d'alimentation.
Pourquoi et quand exécuter cette tâche
Pour obtenir des instructions sur l'utilisation de la console HMC, voir Gestion de la console HMC
(Hardware Management Console). Pour plus d'informations sur le démarrage d'une partition logique,
reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le démarrage du
système, reportez-vous à la rubrique Mise sous tension du système géré.
L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du
système. Lorsque le voyant d'alimentation s'arrête de clignoter et reste fixe, cela signifie que le système
est sous tension.
Démarrage d'un système ou d'un serveur virtuel via la console SDMC
Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console)
pour démarrer le système ou le serveur virtuel une fois les câbles requis installés et les cordons
d'alimentation connectés à une alimentation.
Pourquoi et quand exécuter cette tâche
Pour obtenir des instructions sur l'utilisation de la console SDMC, voir Gestion et configuration de la
console SDMC. Pour des instructions sur le démarrage d'un serveur virtuel, voir Gestion de serveurs
virtuels. Pour des instructions sur l'arrêt et le redémarrage de serveurs virtuels, voir Arrêt et redémarrage
de serveurs virtuels.
L'écran du panneau de commande affiche les jauges, ou points de contrôle, pendant le démarrage du
système. Lorsque le voyant de mise sous tension du panneau de commande cesse de clignoter et reste
fixe, le système est sous tension.
Arrêt d'un système ou d'une partition logique
La présente section explique comment arrêter un système ou une partition logique dans le cadre d'une
mise à niveau du système ou d'une procédure de maintenance.
Procédures communes aux dispositifs installables 93
Pourquoi et quand exécuter cette tâche
Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC
(Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être
endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si
toutes les applications n'ont pas été fermées avant l'arrêt du système.
Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée.
Arrêt d'un système non géré par une console HMC ou une console
SDMC
Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter
le système à l'aide du bouton d'alimentation ou de l'interface ASMI.
Avant de commencer
Avant d'arrêter le système, procédez comme suit :
1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications.
2. Assurez-vous que le système d'exploitation est arrêté.
Avertissement : Si vous ne le faites pas, vous risquez de perdre des données.
3. Si une partition logique Virtual I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les
clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen.
Pourquoi et quand exécuter cette tâche
La procédure ci-dessous explique comment arrêter un système non géré par une console HMC (Hardware
Management Console).
Procédure
1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour
exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension).
2. A l'invite, tapez la commande appropriée :
v Si le système exécute le système d'exploitation AIX, tapez shutdown.
v Si votre système exécute le système d'exploitation Linux, tapez shutdown -h now.
v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est
partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite,
utilisez la commande PWRDWNSYS pour arrêter la partition principale.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
3. Sur la ligne de commande Linux, tapez shutdown -h now.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
4. Sur la ligne de commande Linux, tapez shutdown -h now.
La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de
mise sous tension commence à clignoter lentement et le système passe en veille.
5. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le
type et le mode de l'IPL.
6. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off.
7. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez
que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités
d'extension, par exemple).
94 Activation SAS (Serial-attached SCSI) RAID
Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure,
vérifiez que toutes les sources d'alimentation ont été débranchées du système.
(L003)
ou
Arrêt d'un système via la console HMC
Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour
arrêter le système ou une partition logique.
Pourquoi et quand exécuter cette tâche
Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous
arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système
géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez
utiliser cette procédure pour mettre hors tension le système géré.
Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant
de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez
de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual
I/O Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs
unités par un autre moyen.
Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
Procédures communes aux dispositifs installables 95
Procédez comme suit pour arrêter le système via la console HMC.
Procédure
1. Dans la zone de navigation, développez le dossier Gestion de systèmes.
2. Cliquez sur l'icône Serveurs.
3. Dans la zone de contenu, sélectionnez le système géré.
4. Sélectionnez Tâches, Opérations, puis Mise hors tension
5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK.
Information associée
Arrêt et redémarrage de partitions logiques
Arrêt d'un système via la console SDMC
Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console)
pour arrêter le système ou un serveur virtuel.
Pourquoi et quand exécuter cette tâche
Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous
arrêtez le dernier serveur virtuel en cours d'exécution. Si vous définissez les propriétés du système géré
sur la console SDMC de façon à empêcher la mise hors tension automatique du système, vous devez
utiliser cette procédure pour mettre hors tension le système géré.
Avertissement : Si possible, arrêtez les serveurs en cours d'exécution sur le système géré avant de
mettre ce dernier hors tension. Si vous n'arrêtez pas d'abord les serveurs virtuels, vous risquez de
provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O
Server (VIOS) assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités
par un autre moyen.
Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
Procédez comme suit pour arrêter le système via la console SDMC.
Procédure
1. Dans la zone des ressources Power Systems sélectionnez le système géré que vous voulez mettre hors
tension.
2. Dans le menu Actions, sélectionnez Opérations > Mise hors tension.
3. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK.
Retrait et remplacement des capots sur le système 31E/2B, 31E/1C ou
31E/2C
Les présentes instructions permettent de retirer, remettre en place ou installer des capots pour accéder à
des composants ou effectuer des opérations de maintenance.
Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C
Cette procédure décrit comment retirer le capot d'accès.
96 Activation SAS (Serial-attached SCSI) RAID
Procédure
1. Desserrez la vis moletée (A) située à l'arrière du capot.
2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès
n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale.
Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors
tension.
Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C
Cette procédure décrit comment installer le capot d'accès.
Procédure
1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur.
2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les
languettes du capot d'accès glissent sous le rebord du boîtier supérieur.
3. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du
système.
4. Serrez les vis moletées afin de fixer le capot d'accès.
Figure 67. Retrait du capot d'accès
Procédures communes aux dispositifs installables 97
Mise en position de maintenance ou de fonctionnement du système
31E/2B, 31E/1C ou 31E/2C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance du système 31E/2B, 31E/1C ou
31E/2C monté en armoire
Utilisez cette procédure pour placer le système monté en armoire en position de maintenance.
Avant de commencer
Remarques :
v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement
toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait
qu'une seule unité centrale en position de maintenance à la fois.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez
celle-ci de l'armoire.
v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent,
ce qui empêche l'unité de trop sortir de son emplacement.
Procédure
1. Ouvrez le volet avant de l'armoire.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
Figure 68. Installation du capot d'accès
98 Activation SAS (Serial-attached SCSI) RAID
3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez
l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et
verrouillées.
Mise en position de fonctionnement du système 31E/2B, 31E/1C ou
31E/2C monté en armoire
Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement.
Avant de commencer
Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles
situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire.
Procédure
1. Déverrouillez les taquets de sécurité bleus (A) en les levant vers le haut.
Figure 69. Placement du système en position de maintenance
Procédures communes aux dispositifs installables 99
2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité
soient en position.
Figure 70. Déverrouillage des taquets de sécurité des glissières
Figure 71. Mise en position de fonctionnement du système
100 Activation SAS (Serial-attached SCSI) RAID
3. Fermez le volet avant de l'unité dont vous effectuez la maintenance.
Déconnexion des cordons d'alimentation du système 31E/2B, 31E/1C
ou 31E/2C
Utilisez ces procédures pour déconnecter les cordons d'alimentation du système.
Procédure
1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Le système
est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de
remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système.
Connexion des cordons d'alimentation au système 31E/2B, 31E/1C ou
31E/2C
Utilisez ces procédures pour connecter les cordons d'alimentation au système.
Procédure
1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance.
2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire.
3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Vérifiez
que les cordons d'alimentation passent par les poignées.
Figure 72. Retrait des cordons d'alimentation
Procédures communes aux dispositifs installables 101
Installation d'un dispositif via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris l'installation d'un nouveau dispositif ou composant.
Avant de commencer
Pour employer l'interface utilisateur de la console HMC pour installer un dispositif ou un composant
dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré sur lequel vous allez installer le composant.
3. Dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de
matériel.
4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site).
5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans
lequel vous installez le dispositif.
6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant.
7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du dispositif.
Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif.
Dans ce cas, suivez ces instructions pour installer le dispositif.
Figure 73. Connexion des cordons d'alimentation
102 Activation SAS (Serial-attached SCSI) RAID
Retrait d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer de nombreuses
opérations de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un
composant.
Pourquoi et quand exécuter cette tâche
Pour employer l'interface utilisateur de la console HMC afin de retirer un composant dans une unité
centrale ou une unité d'extension gérée par une console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré sont vous souhaitez retirer un composant.
3. Sélectionnez l'une des options suivantes.
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des
tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel.
v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des
tâches, développez, Maintenabilité > Matériel > Retrait de FRU.
4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Retrait de
FRU.
5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de
FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant.
6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant.
7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter.
8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure
et suivez les instructions de retrait du composant.
Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant
comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant.
Remplacement d'un composant via la console HMC
Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de
maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez
ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la
version 7 ou d'une version ultérieure de la console HMC, procédez comme suit :
Procédure
1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs.
2. Sélectionnez le système géré dont vous souhaitez remplacer un composant.
3. Sélectionnez l'une des options suivantes.
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des
tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel.
v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des
tâches, développez, Maintenabilité > Matériel > Remplacement de FRU.
Procédures communes aux dispositifs installables 103
4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU,
sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant.
6. Dans le menu, sélectionnez le type du composant et cliquez sur Suivant.
7. Dans le menu, sélectionnez le code emplacement du composant et cliquez sur Ajouter.
8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'échange du dispositif.
Remarque : Il se peut que la console HMC ouvre une fenêtre d'instructions pour le remplacement du
composant. Dans ce cas, suivez les instructions pour remplacer le composant.
Vérification du composant installé
Vous pouvez vérifier un nouveau composant installé ou remplacé sur votre système, partition logique ou
unité d'extension à l'aide du système d'exploitation, de diagnostics autonomes ou de la console HMC
(Hardware Management Console).
Vérification d'un dispositif installé ou d'un composant remplacé sur un
système ou une partition logique AIX
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans
le système d'exploitation AIX pour vérifier que ce dispositif ou composant est reconnu par le système ou
la partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via AIX
v Vérification du composant de remplacement via AIX
Vérification du dispositif installé via le système d'exploitation AIX :
1. Connectez-vous en tant qu'utilisateur root.
2. A l'invite, tapez diag et appuyez sur Entrée.
3. Sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
104 Activation SAS (Serial-attached SCSI) RAID
installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide,
contactez votre fournisseur de services.
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérification du composant de remplacement via le système d'exploitation AIX :
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
procédez comme suit :
1. Avez remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de
remplacement à chaud du service de diagnostic en ligne ?
Non : Passez à l'étape 2.
Oui : Passez à l'étape 5.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, consultez la rubrique Exécution d'une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou
que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion AIX ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau
composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous
les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite
de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du
système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5.
Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez
comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? s'affiche, cliquez sur Oui et
appuyez sur Entrée.
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services.
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en procédant comme suit :
a. A l'invite, tapez diag et appuyez sur Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
Procédures communes aux dispositifs installables 105
d. Sélectionnez Toutes les ressources ou sélectionnez les programmes de diagnostic du composant
pour tester uniquement le composant remplacé ou les unités qui y raccordées et appuyez sur
Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7.
6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9.
7. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. En cas de test
réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal
des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des
erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez
comme suit.
Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant
passe à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
Oui : Passez à l'étape 8.
8. Dans le menu Action corrective sur ressource, le composant parent ou enfant de la ressource que
vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de
vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action
corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un
composant détectable par le système a été remplacé, procédez comme suit.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la
liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11, à la page 107.
Oui : Passez à l'étape 12, à la page 107.
106 Activation SAS (Serial-attached SCSI) RAID
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12.
12. Les voyants sont-ils encore allumés ?
v Non. Cela marque la fin de la procédure.
v Oui : Désactivez les voyants. Pour plus d'informations, consultez la rubrique suivante :
Modification des indicateurs de service.
Vérification d'un composant installé sur un système Linux ou une
partition logique
Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir
comment vérifier que le système reconnaît ce composant.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique
«Vérification d'un composant installé via les programmes de diagnostic autonome».
Vérification d'un composant installé via les programmes de diagnostic
autonome
Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez
utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système AIX
ou Linux, une unité d'extension ou une partition logique.
Avant de commencer
v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la
communication avec les autres serveurs est interrompue.
v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition
logique. Aucune autre activité ne peut être exécutée sur la partition.
v Les programmes de diagnostic autonomes doivent accéder à la console système.
Pourquoi et quand exécuter cette tâche
Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network
Installation Management). La procédure suivante explique comment utiliser les programmes de
diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du
serveur NIM, voir Exécution de diagnostics autonomes à partir d'un serveur NIM.
Pour utiliser les programmes de diagnostic autonomes, procédez comme suit :
Procédure
1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la
partition logique.
2. Retirez les bandes, les disquettes et les CD-ROM.
3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition
logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique
n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez,
procédez comme suit :
a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI,
reportez-vous à la rubrique Accès à l'interface ASMI.
Procédures communes aux dispositifs installables 107
b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage.
c. Cliquez sur Mettre le système sous/hors tension.
d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux
sélectionnez l'option Amorçage du mode service à partir de la liste par défaut.
e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques
optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes.
f. Passez à l'étape 5.
4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de
diagnostic autonome dans l'unité de disque optique.
5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier
voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour
indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités
d'amorçage en mode service par défaut.
6. Tapez le mot de passe requis.
7. Dans l'écran des instructions d'exécution des tests de diagnostic, appuyez sur Entrée.
Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un
autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté.
Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous
tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide.
8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection
des fonctions pour initialiser le système d'exploitation.
9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les
ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en
sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée.
12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services.
v Oui : Passez à l'étape 13.
13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
14. Si les voyants sont toujours allumés, procédez comme suit :
a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement
pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée.
b. Sélectionnez Mettre l'indicateur d'avertissement système à NORMAL, puis appuyez sur Entrée.
c. Sélectionnez Mettre tous les indicateurs d'identification à NORMAL, puis appuyez sur Entrée.
d. Sélectionnez Validation.
Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal.
e. Retournez à la ligne de commande.
Vérification d'un composant installé à l'aide de la console HMC
(Hardware Management Console)
Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management
Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une
108 Activation SAS (Serial-attached SCSI) RAID
opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou
de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure.
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant installé, procédez comme suit :
Procédure
1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le
journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique
«Affichage des événements réparables à l'aide de la console HMC», à la page 111.
2. Des événements d'action de maintenance sont-ils ouverts ?
Non : Si le voyant d'avertissement système est encore allumé, utilisez la console HMC pour le
désactiver. Consultez la rubrique «Activation/Désactivation des voyants via la console HMC». La
procédure est terminée.
Oui : Passez à l'étape suivante.
3. Notez la liste des événements d'action de maintenance ouverts.
4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils
les mêmes que ceux que vous avez collectés auparavant ?
v Non : Sélectionnez l'une des options suivantes :
– Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond
et passez à l'étape suivante.
– Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez
votre fournisseur de services.
v Oui : Passez à l'étape suivante.
5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence
l'événement d'action de maintenance.
6. Cliquez sur Fermer l'événement.
7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet
événement. Cliquez sur OK.
8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action
de maintenance ouvert ?
v Non : Sélectionnez l'option Aucune FRU remplacée pour cet événement réparable et cliquez sur
OK pour fermer l'événement d'action de maintenance.
v Oui : Procédez comme suit :
a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour.
b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes.
c. Cliquez sur OK pour refermer l'événement d'action de maintenance.
9. Si les incidents persistent, contactez votre fournisseur de services.
Activation/Désactivation des voyants via la console HMC
Utilisez cette procédure pour activer ou désactiver des voyants via le Point focal de service pour la
console HMC (Hardware Management Console).
Pourquoi et quand exécuter cette tâche
Choisissez l'une des options suivantes :
v «Désactivation d'un voyant d'avertissement système ou de partition»
v «Activation ou désactivation d'un voyant d'identification», à la page 110
Désactivation d'un voyant d'avertissement système ou de partition :
Procédures communes aux dispositifs installables 109
Pourquoi et quand exécuter cette tâche
Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par
exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement.
Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant
d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient.
Procédure
1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes.
2. Ouvrez Serveurs et sélectionnez le système approprié.
3. Dans la zone de contenu, cochez la case correspondant à la partition concernée.
4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système.
5. Sélectionnez la partition appropriée.
6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de
confirmation qui s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
v Un message indique que le système présente encore des incidents non résolus.
v Un message indique que le voyant d'avertissement système ne peut pas être activé.
7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du
voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche
fournit les informations suivantes :
v Une vérification indique que le voyant de la partition logique a été désactivé.
v Un message indique que la partition logique présente encore des incidents non résolus.
v Un message indique que le voyant de la partition logique ne peut pas être activé.
Activation ou désactivation d'un voyant d'identification :
Pourquoi et quand exécuter cette tâche
Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités
remplaçables sur site (FRU). On les appelle donc voyants d'identification.
Vous pouvez activer ou désactiver les types de voyants d'identification suivants :
v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique
(boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer
si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous
pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le
nouveau matériel.
v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un
adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier
sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports
ouverts sur plusieurs adaptateurs.
Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme
suit :
Procédure
1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes.
2. Sélectionnez Serveurs.
3. Dans la zone de contenu, cochez la case correspondant au système concerné.
4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification.
110 Activation SAS (Serial-attached SCSI) RAID
5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le
tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé
s'allume ou s'éteint.
6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le
tableau, puis sélectionnez Sélectionné > Liste des FRU.
7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
Affichage des événements réparables à l'aide de la console HMC
Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et
historique de maintenance.
Pourquoi et quand exécuter cette tâche
Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez
avoir l'un des rôles suivants :
v Administrateur central
v Technicien de maintenance
v Opérateur
v Responsable produit
v Visionneur
Pour afficher des événements réparables, procédez comme suit :
Procédure
1. Dans la zone de navigation, sélectionnez Gestion de la maintenance.
2. Sélectionnez Gestion des événements réparables.
3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre
Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables
correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur
les événements réparables.
4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez
Sélectionné > Affichage des détails La fenêtre Détails de l'événement réparable affiche des
informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles
que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables
sur site (FRU) associées à cet événement.
5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis
procédez comme suit :
a. Sélectionnez Actions > Visualisation des commentaires.
b. Pour quitter les commentaires, cliquez sur Fermeture.
c. Sélectionnez Actions > Affichage de l'historique de maintenance. La fenêtre Historique de
maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée.
d. Pour quitter l'historique de maintenance, cliquez sur Fermeture.
6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable
et Présentation de l'événement réparable pour les refermer.
Vérification du composant installé via la console SDMC
Si vous avez installé ou remplacé un composant, utilisez la console Systems Director Management
Console (SDMC) pour mettre à jour vos enregistrements SDMC lorsqu'une opération de maintenance est
terminée sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes
d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure.
Procédures communes aux dispositifs installables 111
Pourquoi et quand exécuter cette tâche
Pour vérifier un composant installé, procédez comme suit :
Procédure
1. Depuis la console SDMC, recherchez dans le journal des événements les opérations de maintenance
ouvertes. Pour plus de détails, voir «Visualisation d'événements réparables via la console de gestion
Systems Director», à la page 113.
2. Des événements d'action de maintenance sont-ils ouverts ?
Non : Si le voyant d'avertissement système est encore allumé, utilisez la console SDMC pour
l'éteindre. Pour plus d'informations, voir «Activation/Désactivation de voyants via la console de
gestion Systems Director». La procédure est terminée.
Oui : Passez à l'étape suivante.
3. Notez la liste des événements d'action de maintenance ouverts.
4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils
les mêmes que ceux que vous avez collectés auparavant ?
v Non : Sélectionnez l'une des options suivantes :
– Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond
et passez à l'étape suivante.
– Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez
votre fournisseur de services.
v Oui : Passez à l'étape suivante.
5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence
l'événement d'action de maintenance.
6. Cliquez sur Supprimer ou Ignorer.
Remarque : Ces options ne sont disponibles qu'à partir du journal des événements.
Activation/Désactivation de voyants via la console de gestion Systems Director
Utilisez cette procédure pour activer ou désactiver des voyants via la console Systems Director
Management Console.
Pourquoi et quand exécuter cette tâche
Choisissez l'une des options suivantes :
v «Désactivation d'un voyant d'avertissement système ou de partition»
v «Activation ou désactivation de voyants d'identification via la console SDMC», à la page 113
Désactivation d'un voyant d'avertissement système ou de partition :
Pourquoi et quand exécuter cette tâche
Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par
exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement.
Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant
d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient.
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Service et support > Matériel > Voyant d'avertissement système.
3. Sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui
s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
112 Activation SAS (Serial-attached SCSI) RAID
v Un message indique que le système présente encore des incidents non résolus.
v Un message indique que le voyant d'avertissement système ne peut pas être activé.
4. Sélectionnez l'un des serveurs virtuels, puis sélectionnez Désactivation du voyant d'avertissement
système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes :
v Un message de vérification indique que le voyant d'avertissement système a été désactivé.
v Un message indique que la partition logique présente encore des incidents non résolus.
v Un message indique que le voyant du serveur virtuel ne peut pas être activé.
Activation ou désactivation de voyants d'identification via la console SDMC :
Pourquoi et quand exécuter cette tâche
Le système comporte différents voyants d'identification de composants, tels que les boîtiers ou les unités
remplaçables sur site (FRU). On les appelle donc voyants d'identification.
Vous pouvez activer ou désactiver les types de voyants d'identification suivants :
v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique
(boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer
si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous
pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le
nouveau matériel.
v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un
adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier
sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports
ouverts sur plusieurs adaptateurs.
Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme
suit :
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Service et support > Matériel > Voyant d'identification.
3. Sous Voyant d'identification, sélectionnez la fenêtre Boîtier puis l'unité centrale ou le boîtier.
4. Pour activer ou désactiver un voyant d'identification, cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
5. Pour activer ou désactiver le voyant d'identification d'une unité FRU, sélectionnez un système ou un
boîtier dans le tableau, puis sélectionnez Liste des unités FRU.
6. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou
Désactivation du voyant. Le voyant associé s'allume ou s'éteint.
Visualisation d'événements réparables via la console de gestion Systems Director
Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et
historique de maintenance.
Pourquoi et quand exécuter cette tâche
Pour afficher des événements réparables, procédez comme suit :
Procédure
1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié.
2. Sélectionnez Actions > Etat et santé du système > Journal des événements.
3. Facultatif : Vous pouvez limiter les critères de recherche d'événement à l'aide du menu de filtrage des
événements.
Procédures communes aux dispositifs installables 113
4. Sélectionnez une ligne dans la fenêtre des événements puis sélectionnez Actions > Propriétés La
fenêtre Propriétés s'ouvre, affichant les informations détaillées relatives à l'événement réparables. Le
tableau indique des informations telles que le numéro d'incident, le code de référence et les unités
remplaçables sur site (FRU) associées à l'événement.
Vérification d'un dispositif installé ou d'un composant remplacé sur un
système ou une partition logique à l'aide des outils Virtual I/O Server
Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils
Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou composant est reconnu par le système ou la
partition logique.
Pourquoi et quand exécuter cette tâche
Pour vérifier le fonctionnement d'un dispositif récemment installé ou d'un composant de remplacement,
sélectionnez la procédure appropriée :
v Vérification du dispositif installé via VIOS
v Vérification du composant de remplacement via VIOS
Vérifiez le dispositif installé via VIOS :
1. Connectez-vous en tant qu'utilisateur root.
2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée.
3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée.
4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur
Entrée.
5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures
suivantes :
v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des
ressources et appuyez sur Entrée.
v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les
ressources et appuyez sur Entrée.
6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en
répondant aux invites éventuelles.
7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun
incident n’a été détecté ?
v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre
code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien
connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est
installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN
et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement
logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide,
contactez votre fournisseur de services.
v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et
revenez en mode de fonctionnement normal.
Vérifiez le composant de remplacement via VIOS :
Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé,
procédez comme suit :
1. Avez-vous remplacé le composant à l'aide de VIOS ou des opérations de remplacement à chaud du
service de diagnostic en ligne ?
Non : Passez à l'étape 2, à la page 115.
114 Activation SAS (Serial-attached SCSI) RAID
Oui : Passez à l'étape 5.
2. Le système est-il hors tension ?
Non : Passez à l'étape 4.
Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus
d'informations, reportez-vous à la rubrique Exécution d'une initialisation lente.
3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche
ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête.
Avez-vous vu l'invite de connexion VIOS ?
v Non : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou
un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier
que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident,
rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre
pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement
et de démarrage du système d'exploitation.
Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le
composant. Pour obtenir de l'aide, contactez votre fournisseur de services.
v Oui : Passez à l'étape 4.
4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite
s'affiche, passez à l'étape 5.
Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez
comme suit :
a. Sélectionnez la ressource, puis appuyez sur Entrée.
b. Sélectionnez Validation.
c. Suivez les instructions affichées.
d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur
Oui et appuyez sur Entrée.
e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si
aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services
pour obtenir de l'aide.
f. Si aucun code SRN ne s'affiche, passez à l'étape 5.
5. Testez le composant en effectuant les opérations suivantes :
a. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée.
b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et
appuyez sur Entrée.
c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et
appuyez sur Entrée.
d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester
uniquement le composant remplacé ou les périphériques qui y sont raccordées et appuyez sur
Entrée.
Le menu Action corrective sur ressource s'est-il affiché ?
Non : Passez à l'étape 6.
Oui : Passez à l'étape 7, à la page 116.
6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ?
v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est
terminée.
v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action
corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la
Procédures communes aux dispositifs installables 115
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources,
sélectionnez sysplanar0 et appuyez sur Entrée.
Conseil : Le voyant du composant passe de l'état Incident à l'état normal.
Passez à l'étape 9.
7. Dans le menu Action corrective sur ressource, sélectionnez la ressource du composant remplacé. En
cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans
le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le
journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé,
procédez comme suit.
Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant
passe à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la
réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource
correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources,
sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource
s'est-il affiché de nouveau ?
Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
Oui : Passez à l'étape 8.
8. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en
mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu
Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer
qu'un composant détectable par le système a été remplacé, procédez comme suit.
Remarque : Le voyant du composant passe de l'état Incident à l'état normal.
a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la
ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte,
sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas
dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée.
b. Sélectionnez Validation une fois les sélections effectuées.
c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9.
9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures
précédentes), restaurez la valeur initiale des paramètres.
10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ?
Non : Passez à l'étape 11.
Oui : Passez à l'étape 12.
11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu
démarrer le système d'exploitation ?
Non : Contactez votre fournisseur de services. La procédure est terminée.
Oui : Passez à l'étape 12.
12. Les voyants sont-ils encore allumés ?
v Non. La procédure est terminée.
v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une de ces rubriques :
Modification des indicateurs de service
116 Activation SAS (Serial-attached SCSI) RAID
Installation d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris l'installation d'une nouvelle unité remplaçable sur site (FRU) ou d'un
nouveau composant.
Avant de commencer
Pour utiliser l'interface utilisateur de la console SDMC pour installer un composant sur un système ou
une unité d'extension géré(e) par une console SDMC, procédez comme suit :
Procédure
1. Dans la zone des ressources Power Systems sélectionnez le système sur lequel vous voulez installer
un composant.
2. Dans le menu Actions, développez Service and Support > Matériel > Tâches de modification de
matériel > Ajout de FRU.
3. Dans l'onglet d'ajout de FRU, sélectionnez le système ou le type de boîtier.
4. Sélectionnez dans le menu le type de FRU que vous souhaitez installer et cliquez sur Suivant.
5. Sélectionnez le code d'emplacement pour l'installation du composant et cliquez sur Ajouter.
6. Une fois l'unité remplaçable sur site (FRU) placée dans la catégorie des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour l'installation du
composant. Dans ce cas, suivez les instructions pour installer le composant.
Remplacement d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un
composant.
Pourquoi et quand exécuter cette tâche
Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez
ces instructions.
Procédure
1. Dans la zone des ressources Power Systems, sélectionnez le système sur lequel vous souhaitez
remplacer un composant.
2. Sélectionnez l'une des options suivantes :
v Si vous remplacez un composant en dehors du cadre d'une opération de maintenance, depuis le
menu Actions, développez Service and Support > Matériel > Remplacement de FRU.
v Si vous procédez à un échange de composant dans le but de corriger un événement réparable, voir
Démarrage d'une réparation.
3. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
4. Dans l'onglet Remplacement de FRU, sélectionnez l'unité centrale ou d'extension sur laquelle vous
souhaitez remplacer un composant.
5. Sélectionnez dans le menu le type de composant que vous souhaitez remplacer et cliquez sur Suivant.
6. Sélectionnez dans le menu le code d'emplacement du composant que vous souhaitez remplacer et
cliquez sur Ajouter.
Procédures communes aux dispositifs installables 117
7. Une fois l'unité remplaçable sur site (FRU) placée dans la fenêtre des actions en attente, cliquez sur
Lancer la procédure et suivez les instructions d'installation du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le remplacement du
composant. Dans ce cas, suivez les instructions pour remplacer le composant.
Retrait d'un composant via la console SDMC
Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses
tâches de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant.
Pourquoi et quand exécuter cette tâche
Pour utiliser l'interface utilisateur de la console SDMC pour retirer un composant sur un système ou une
unité d'extension géré(e) par une console SDMC, procédez comme suit :
Procédure
1. Dans la zone des ressources Power Systems, sélectionnez le système duquel vous souhaitez retirer un
composant.
2. Sélectionnez l'une des options suivantes :
v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, depuis le menu Actions
développez Service and Support > Matériel > Tâches de modification de matériel.
v Si vous retirez un composant dans le cadre d'une opération de maintenance, depuis le menu
Actions développez Service and Support > Matériel > Remplacement de FRU.
3. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU.
Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez
Remplacement de FRU.
4. Dans la fenêtre d'ajout ou de remplacement de FRU, sélectionnez le système ou l'unité d'extension
duquel ou de laquelle vous souhaitez retirer le composant.
5. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant.
6. Sélectionnez l'emplacement du composant, puis cliquez sur Ajouter.
7. Une fois que le composant figure dans la fenêtre des actions en attente, cliquez sur Lancer la
procédure et suivez les instructions de retrait du composant.
Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le retrait du
composant. Dans ce cas, suivez ces instructions pour retirer le composant.
118 Activation SAS (Serial-attached SCSI) RAID
Remarques
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services du fabricant non annoncés dans ce pays.
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents
d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute
référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou
service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint
aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier
lui-même le fonctionnement de tout produit, programme ou service.
Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans
le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou
demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences,
veuillez en faire la demande par écrit au fabricant.
Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait
contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE
GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE
RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN
CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions
n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas
applicable.
Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement.
Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels
décrits dans ce document.
Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information
uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments
figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites
relève de votre seule responsabilité.
Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation
de sa part, tout ou partie des informations qui lui seront fournies.
Les données de performance indiquées dans ce document ont été déterminées dans un environnement
contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement
d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont
pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les
résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont
applicables à leur environnement d'exploitation.
Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de
ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant
n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il
ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question
concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces
produits.
119
Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être
modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif.
Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans
préavis. Les tarifs appliqués peuvent varier selon les revendeurs.
Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées
avant la mise à disposition des produits décrits.
Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans
l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de
marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et
toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite.
Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur
n'apparaissent pas à l'écran.
Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites,
même partiellement, sans l'autorisation écrite d'IBM.
Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est
exploitable dans aucun autre but.
Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou
de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension,
de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des
composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été
sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu
de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la
vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le
cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de
support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés.
Restrictions liées à l'utilisation de connexion Ethernet
Ce produit n'est pas destiné à être connecté, directement ou indirectement, par quelque moyen que ce
soit, à des interfaces de réseaux de télécommunications publiques.
Marques
IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou
dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou
d'autres sociétés. La liste actualisée de toutes les marques d'IBM est disponible sur la page Web
Copyright and trademark information à http://www.ibm.com/legal/copytrade.shtml.
Linux est une marque déposée de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays.
Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés.
Bruits radioélectriques
Lorsque vous connectez un moniteur à l'équipement, vous devez utiliser le câble fourni à cet effet, ainsi
que toute unité de suppression des interférences.
120 Activation SAS (Serial-attached SCSI) RAID
Remarques sur la classe A
Les avis de conformité de classe A suivants s'appliquent aux serveurs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques
définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité
aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques
dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il
risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du
constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et
télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. IBM ne peut pas être tenue pour responsable du
brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés
ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra
annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une
modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones commerciales et industrielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour
en éliminer les causes.
Remarques 121
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus :
Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI
(Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle,
il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures
nécessaires pour en éliminer les causes.
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - République populaire de Chine
Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de
perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer
les causes.
122 Activation SAS (Serial-attached SCSI) RAID
Avis d'interférences électromagnétiques (EMI) - Taïwan
Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus.
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer.
Avis d'interférences électromagnétiques (EMI) - Corée
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Remarques 123
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Avis d'interférences électromagnétiques (EMI) - Russie
Remarques sur la classe B
Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la
compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la
classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications
de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones
résidentielles.
Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les
communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il
n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière.
Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision,
mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet
incident en appliquant une ou plusieurs des mesures suivantes :
v Réorienter ou repositionner l'antenne de réception.
v Eloigner l'appareil du récepteur.
124 Activation SAS (Serial-attached SCSI) RAID
v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct.
v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de
l'aide.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez
votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions
radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification
non autorisée pourra annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant
d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones résidentielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Remarques 125
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - Corée
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
126 Activation SAS (Serial-attached SCSI) RAID
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Dispositions
Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes.
Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial,
sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou
publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès
du fabricant.
Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein
de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez
reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise,
ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant.
Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou
autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données,
logiciels ou droits de propriété intellectuelle contenus dans ces publications.
Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si
l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les
instructions susmentionnées n'ont pas été respectées.
Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois
et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à
l'exportation.
LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES
PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE.
LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES
INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A
L'EXECUTION D'UN TRAVAIL DONNE.
Disk drives
REFERENCE
86 A1 14FG 01
ESCALA Power7
Disk drives
This publication concerns the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
September 2010
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 14FG 01
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited
to, copying, distributing, modifying, and making derivative works.
Copyright Bull SAS 2007-2010
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this
book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors
ontained herein, or for incidental or consequential damages in connection with the use of this material.
c
Contents
Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Installing a disk drive or solid-state drive . . . . . . . . . . . . . . . . . . . . . 5
Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installing a disk drive or solid-state drive with the AIX system or logical partition powered on . . . . . . . 10
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Installing a disk drive or solid-state drive with the Linux system or logical partition powered on . . . . . . 17
Installing a disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . . . . . 19
Installing a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Installing the disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . 25
Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Verify that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 30
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Installing a disk drive or solid-state drive with the Virtual I/O Server system or logical partition powered on . . 31
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on. . . . . . . 37
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Prepare to install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Configuring a disk drive in an system or logical partition. . . . . . . . . . . . . . . . . . . 45
Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 47
Determining the IBM i disk drive protection status . . . . . . . . . . . . . . . . . . . . . 48
Installing a disk drive or solid-state drive with the system or logical partition power turned off . . . . . . . 49
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Removing a disk drive or solid-state drive . . . . . . . . . . . . . . . . . . . . 57
Removing a disk drive from a system or expansion unit controlled by IBM i . . . . . . . . . . . . . 57
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Preparing to remove a disk drive from a system or logical partition controlled by IBM i. . . . . . . . . 61
Replacing a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Replacing a disk drive with the AIX system or logical partition powered on. . . . . . . . . . . . . . 63
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
Remove a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
Replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Replacing a disk drive with the Linux system or logical partition powered on . . . . . . . . . . . . . 72
Replacing the disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . . . . 74
Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
© Copyright IBM Corp. 2010 iii
Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Replacing the disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . 83
Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
Replacing a disk drive with the IBM i system or logical partition powered on . . . . . . . . . . . . . 89
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
Replacing a disk drive using hot spare in an IBM i system or logical partition . . . . . . . . . . . . . 97
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Perform the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Replacing a disk drive with the system or logical partition powered off. . . . . . . . . . . . . . . 106
Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Complete the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Replacing a SCSI disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Perform the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Removing or installing a disk-drive filler. . . . . . . . . . . . . . . . . . . . . 121
Removing a disk-drive filler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Installing a disk-drive filler. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Disk drive locations and concurrent maintenance lights . . . . . . . . . . . . . . 125
PCIe RAID and SSD SAS adapter locations . . . . . . . . . . . . . . . . . . . . . . . . 125
Disk drive locations and concurrent maintenance lights for an 8202-E4B or 8205-E6B system . . . . . . . . 125
Disk drive locations and concurrent maintenance lights for 8231-E2B. . . . . . . . . . . . . . . . 127
Disk drive locations and concurrent maintenance lights for 8233-E8B and 8236-E8C . . . . . . . . . . . 129
Disk drive locations and concurrent maintenance lights for models 9117-MMB or 9179-MHB. . . . . . . . 130
Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure . . . . . . . . . . . 131
Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI
disk-drive enclosures. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
Disk drive locations and concurrent maintenance lights for the 5802 expansion unit. . . . . . . . . . . 135
Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 137
Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 137
Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 137
Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 137
Rebuilding data on a disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . 138
Rebuilding data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . 139
Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 140
Configuring a disk drive in an system or logical partition . . . . . . . . . . . . . . . . . . 141
Preparing to remove a disk drive from a system or logical partition controlled by IBM i . . . . . . . . 143
Determining the IBM i disk drive protection status. . . . . . . . . . . . . . . . . . . . . 144
Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . 145
Internal disk drive sharing on model 8202-E4B or 8205-E6B system. . . . . . . . . 147
Internal disk drive sharing on an 8233-E8B or 8236-E8C . . . . . . . . . . . . . . 149
iv Disk drives
Internal disk drive sharing in model 9117-MMB or 9179-MHB . . . . . . . . . . . . 151
Installing the external SAS port on model 8202-E4B or 8205-E6B . . . . . . . . . . 153
Installing the external SAS port on model 8231-E2B. . . . . . . . . . . . . . . . 155
Installing the external SAS port on an 8233-E8B or 8236-E8C. . . . . . . . . . . . 157
Installing the external SAS port on a 9117-MMB or 9179-MHB . . . . . . . . . . . 165
Replacing a SAS conduit card in a 5802 expansion unit with power off . . . . . . . 167
Removing a SAS conduit card from a 5802 expansion unit with power off . . . . . . 175
Replacing a SAS expander card in a 5802 expansion unit with power off . . . . . . 183
Removing a SAS expander card from a 5802 expansion unit with power off . . . . . 185
Common procedures for installable features . . . . . . . . . . . . . . . . . 187
Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
Identifying a failing part. . . . . . . . . . . . . . . . . . . . . . . . . . . . 193
Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193
Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 194
Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 194
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 194
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 195
Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 195
Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 195
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 196
Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 196
Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 196
Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 197
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 197
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 197
Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 198
Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 198
Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 199
Starting a system that is not managed by a Hardware Management Console . . . . . . . . . . . . . 199
Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 200
Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 201
Stopping a system that is not managed by a Hardware Management Console. . . . . . . . . . . . . 201
Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 202
Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 205
Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 205
Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 205
System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
Removing and replacing covers for the 8202-E4B or 8205-E6B . . . . . . . . . . . . . . . . . . 207
Removing the front cover on a rack-mounted 8202-E4B or 8205-E6B system . . . . . . . . . . . . 207
Removing the front cover on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . . . . . . 207
Installing the front cover on a rack-mounted 8202-E4B or 8205-E6B system. . . . . . . . . . . . . 208
Contents v
Installing the front cover and front door on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . 209
Removing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system . . . . . . . . . 210
Removing the service access cover on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . . . 211
Installing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system. . . . . . . . . . 213
Installing the service access cover on a stand-alone 8202-E4B or 8205-E6B system. . . . . . . . . . . 213
Removing and replacing covers for the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . 215
Removing the service access cover for the 8231-E2B . . . . . . . . . . . . . . . . . . . . 215
Installing the service access cover for the 8231-E2B. . . . . . . . . . . . . . . . . . . . . 216
Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 217
Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 217
Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 218
Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 219
Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 220
Removing and replacing covers for the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . 221
Removing the front cover from the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . 221
Installing the front cover on the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . 222
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or
operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225
Placing a rack-mounted 8231-E2B system into the service position. . . . . . . . . . . . . . . . . 225
Placing the rack-mounted 8231-E2B system into the operating position . . . . . . . . . . . . . . . 226
Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position . . . 227
Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position 229
Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 231
Placing the 8202-E4B or 8205-E6B system into the service or operating position . . . . . . . . . . . . 231
Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position . . . . . . . . . . . 231
Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position . . . . . . . . . . 232
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 234
Placing a rack-mounted 8231-E2B system into the service position. . . . . . . . . . . . . . . . 234
Placing the rack-mounted 8231-E2B system into the operating position . . . . . . . . . . . . . . 235
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 236
Placing a 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 236
Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 239
Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241
Disconnecting the power cords from the 8202-E4B or 8205-E6B system . . . . . . . . . . . . . . . 241
Connecting the power cords to the 8202-E4B or 8205-E6B system . . . . . . . . . . . . . . . . . 242
Disconnecting the power cords from the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . 243
Connecting the power cords to the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . . . 244
Installing a feature by using the Hardware Management Console . . . . . . . . . . 247
Removing a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 247
Exchanging a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 247
Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 249
Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 249
Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 251
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 252
Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 252
Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 253
Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 254
Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . 255
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 255
Activating or deactivating identify LED . . . . . . . . . . . . . . . . . . . . . . . 255
Viewing serviceable events. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256
Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 256
vi Disk drives
Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 265
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
Verifying the repair from the HMC . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 273
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 277
Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 283
Deactivating a system attention LED or partition LED using the HMC . . . . . . . . . . . . . . . 283
Activating or deactivating an identify LED using the HMC . . . . . . . . . . . . . . . . . . . 284
Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 284
Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 285
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 292
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295
Contents vii
viii Disk drives
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010 ix
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
x Disk drives
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Safety notices xi
CAUTION:
Removing components from the upper positions in the rack cabinet improves rack stability during
relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a
room or building:
v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack
cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you
received it. If this configuration is not known, you must observe the following precautions:
– Remove all devices in the 32U position and above.
– Ensure that the heaviest devices are installed in the bottom of the rack cabinet.
– Ensure that there are no empty U-levels between devices installed in the rack cabinet below the
32U level.
v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from
the suite.
v Inspect the route that you plan to take to eliminate potential hazards.
v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the
documentation that comes with your rack cabinet for the weight of a loaded rack cabinet.
v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.).
v Ensure that all devices, shelves, drawers, doors, and cables are secure.
v Ensure that the four leveling pads are raised to their highest position.
v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement.
v Do not use a ramp inclined at more than 10 degrees.
v When the rack cabinet is in the new location, complete the following steps:
– Lower the four leveling pads.
– Install stabilizer brackets on the rack cabinet.
– If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest
position to the highest position.
v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack
cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent.
Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the
pallet.
(R002)
(L001)
(L002)
xii Disk drives
(L003)
or
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser
product. Consult the label on each part for laser certification numbers and approval information.
CAUTION:
This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive,
DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information:
v Do not remove the covers. Removing the covers of the laser product could result in exposure to
hazardous laser radiation. There are no serviceable parts inside the device.
v Use of the controls or adjustments or performance of procedures other than those specified herein
might result in hazardous radiation exposure.
(C026)
Safety notices xiii
CAUTION:
Data processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical
fiber cable or open receptacle. (C027)
CAUTION:
This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
CAUTION:
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following
information: laser radiation when open. Do not stare into the beam, do not view directly with optical
instruments, and avoid direct exposure to the beam. (C030)
Power and cabling information for NEBS (Network Equipment-Building System)
GR-1089-CORE
The following comments apply to the IBM servers that have been designated as conforming to NEBS
(Network Equipment-Building System) GR-1089-CORE:
The equipment is suitable for installation in the following:
v Network telecommunications facilities
v Locations where the NEC (National Electrical Code) applies
The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed
wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the
interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as
intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation
from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect
these interfaces metallically to OSP wiring.
Note: All Ethernet cables must be shielded and grounded at both ends.
The ac-powered system does not require the use of an external surge protection device (SPD).
The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal
shall not be connected to the chassis or frame ground.
xiv Disk drives
Disk drives
Learn about installing, removing, and replacing a disk drive.
© Copyright IBM Corp. 2010 1
2 Disk drives
What's new in Disk drives
Read about new or changed information in Disk drives since the previous update of this topic collection.
September 2010
The following updates are made to the content:
v Added information for Internal disk drive sharing on model 8231-E2B .
v Added information for Internal disk drive sharing on model 8202-E4B or 8205-E6B .
v Added information for “Disk drive locations and concurrent maintenance lights for 8231-E2B” on page
127.
v Added information for “Disk drive locations and concurrent maintenance lights for an 8202-E4B or
8205-E6B system” on page 125.
v Added information for “PCIe RAID and SSD SAS adapter locations” on page 125.
v Added a procedure for “Installing the external SAS port on model 8231-E2B” on page 155.
v Added a procedure for “Installing the external SAS port on model 8202-E4B or 8205-E6B” on page 153.
March 2010
The following updates are made to the content:
v Added information for “Disk drive locations and concurrent maintenance lights for models 9117-MMB
or 9179-MHB” on page 130.
v Added a procedure for “Internal disk drive sharing in model 9117-MMB or 9179-MHB” on page 151.
February 2010
v Added information for servers that contain the POWER7™ processor.
© Copyright IBM Corp. 2010 3
4 Disk drives
Installing a disk drive or solid-state drive
You can install a Small Computer System Interface (SCSI), serial-attached SCSI (SAS) disk drive, or
solid-state drive (SSD) in the server or expansion unit. Use the procedures in this topic to perform this
task.
The system or logical partition can either be turned off or turned on when you install the disk drive.
Identify the system for performing the installation or an upgrade. Turn on the System Identify (BLUE)
LED.
Solid-state drives
To install and configure solid-state drives (SSD), you must review the supported machine type model,
adapter, and configuration details.
Solid®-state drives, also known as flash drives, follow many of the same rules as a regular hard disk drive
(HDD). For example, SSDs physically resemble, install in the same manner, and in most of the same slots
as HDDs. However, some restrictions and configuration rules that apply to SSD are different from HDD.
The following tables can help you determine which operating system, adapter, machine type model, and
disk drive configuration rules apply to your system.
After you have reviewed the configuration rules, you can install the drive as you would any other disk
drive. For more information, see “Installing a disk drive or solid-state drive.”
Restriction: Solid-state drives must be part of a RAID array or System Mirroring (IBM i).
Select the operating system for the server or partition where you want to install drives from the following
list:
v Solid-state drive rules for the AIX® operating system
v Solid-state drive rules for the Linux® operating system
v Solid-state drive rules for the IBM i operating system
Attention: Before installing SSD into your system, you must ensure that you have all the prerequisite
updates for both the SSD and the adapters. To check the prerequisites, go to the IBM Prerequisite website:
http://www-912.ibm.com/e_dir/eServerPrereq.nsf.
Table 1. Solid-state drive rules for the AIX operating system
Machine type model Adapter Mixing rules
5886 v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5906 and 5908) - PCI-X DDR
1.5GB Cache SAS RAID Adapter
(BSC)
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v Solid-state drives and hard disk
drives cannot be mixed in this
drive enclosure.
v A maximum of eight drives are
supported in this configuration.
v Only one 5886 can be attached to
the adapter, or pair of adapters,
when solid-state drives are
installed.
v Same rules apply when using dual
SAS Raid adapters connected to
5886.
© Copyright IBM Corp. 2010 5
Table 1. Solid-state drive rules for the AIX operating system (continued)
Machine type model Adapter Mixing rules
8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement:
internal adapter
v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD can never be mixed
in the same RAID array or
mirroring group.
v SSD and HDD can be mixed in the
system disk drive backplane if the
backplane is not split.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD.
Restriction: Split backplane for
these models is not supported with
dual SAS Raid adapters.
Remember: SSD must be on one side
of the split backplane and HDD must
be on the other side of the split
backplane. SSD and HDD drive types
cannot be mixed within a parity set.
If you are mixing SSD and HDD in a
system, ensure that you have enough
of each drive type to manage your
protection scheme.
9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane
v (5662) - 175 MB Cache RAID -
Dual IOA Enablement Card
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same RAID array or mirroring
group.
v SSD and HDD can be mixed in the
system disk-drive backplane.
Remember: You cannot mix SSD
and HDD drive types within a
parity set. If you are mixing SSD
and HDD in a system, ensure that
you have enough of each drive
type to manage your protection
scheme.
5802 (5903) - PCIe 380 MB Cache Dual - x4
3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same system disk drive
backplane.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD. SSD must be on one side of
the split backplane and HDD must
be on the other side of the split
backplane.
v SSD and HDD cannot be mixed on
a pair of 5903 adapters.
v A maximum of nine SSDs are
supported on a pair of 5903
adapters.
6 Disk drives
Table 2. Solid-state drive rules for the Linux operating system
Machine type model Adapter Mixing rules
5886 v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5906 and 5908) - PCI-X DDR
1.5GB Cache SAS RAID Adapter
(BSC)
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v Solid-state drives and hard disk
drives cannot be mixed in this
drive enclosure.
v A maximum of eight drives are
supported in this configuration.
v Only one 5886 can be attached to
the adapter, or pair of adapters,
when solid-state drives are
installed.
v Same rules apply when using dual
SAS Raid adapters connected to
5886.
8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement:
internal adapter
v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD can never be mixed
in the same RAID array or
mirroring group.
v SSD and HDD can be mixed in the
system disk drive backplane if the
backplane is not split.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD.
Restriction: Split backplane for
these models is not supported with
dual SAS Raid adapters.
Remember: SSD must be on one side
of the split backplane and HDD must
be on the other side of the split
backplane. SSD and HDD drive types
cannot be mixed within a parity set.
If you are mixing SSD and HDD in a
system, ensure that you have enough
of each drive type to manage your
protection scheme.
9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane
v (5662) - 175 MB Cache RAID -
Dual IOA Enablement Card
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same RAID array or mirroring
group.
v SSD and HDD can be mixed in the
system disk-drive backplane.
Remember: You cannot mix SSD
and HDD drive types within a
parity set. If you are mixing SSD
and HDD in a system, ensure that
you have enough of each drive
type to manage your protection
scheme.
Installing a disk drive or solid-state drive 7
Table 2. Solid-state drive rules for the Linux operating system (continued)
Machine type model Adapter Mixing rules
5802 (5903) - PCIe 380 MB Cache Dual - x4
3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same system disk drive
backplane.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD. SSD must be on one side of
the split backplane and HDD must
be on the other side of the split
backplane.
v SSD and HDD cannot be mixed on
a pair of 5903 adapters.
v A maximum of nine SSDs are
supported on a pair of 5903
adapters.
Table 3. Solid-state drive rules for the IBM i operating system
Machine type model Adapter Mixing rules
5886 v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5906 and 5908) - PCI-X DDR
1.5GB Cache SAS RAID Adapter
(BSC)
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v Solid-state drives and hard disk
drives cannot be mixed in this
drive enclosure.
v A maximum of eight drives are
supported in this configuration.
v Only one 5886 can be attached to
the adapter, or pair of adapters,
when solid-state drives are
installed.
v Same rules apply when using dual
SAS Raid adapters connected to
5886.
8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement:
internal adapter
v (5904) - PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same RAID array or mirroring
group.
v SSD and HDD can be mixed in the
system disk-drive backplane.
Remember: You cannot mix SSD
and HDD drive types within a
parity set. If you are mixing SSD
and HDD in a system, ensure that
you have enough of each drive
type to manage your protection
scheme.
8 Disk drives
Table 3. Solid-state drive rules for the IBM i operating system (continued)
Machine type model Adapter Mixing rules
9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane
v (5662) - 175 MB Cache RAID -
Dual IOA Enablement Card
v (5903) - PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same RAID array or mirroring
group.
v SSD and HDD can be mixed in the
system disk-drive backplane.
Remember: You cannot mix SSD
and HDD drive types within a
parity set. If you are mixing SSD
and HDD in a system, ensure that
you have enough of each drive
type to manage your protection
scheme.
5802 (5903) - PCIe 380 MB Cache Dual - x4
3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same system disk drive
backplane.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD. SSD must be on one side of
the split backplane and HDD must
be on the other side of the split
backplane.
v SSD and HDD cannot be mixed on
a pair of 5903 adapters.
v A maximum of nine SSDs are
supported on a pair of 5903
adapters.
Installing a disk drive or solid-state drive 9
Related tasks
“Installing a disk drive or solid-state drive with the AIX system or logical partition powered on”
You can install a disk drive or solid-state drive in a system or expansion unit while the AIX operating
system or logical partition that controls the disk drive location is powered on. Use the procedure in this
section to perform this task.
“Installing a disk drive or solid-state drive with the Linux system or logical partition powered on” on
page 17
You might need to install a disk drive or solid-state drive in a system or expansion unit while the Linux
operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this section to perform this task.
“Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on” on
page 37
Learn how to install a disk drive or solid-state drive in a system or expansion unit while the IBM i
operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this information to perform this task.
“Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page
49
Learn how to install a disk drive in a system or logical partition that controls the disk drive location with
the power turned off. Use the procedure in this information to perform this task.
Related information
Cabling the 5802 disk subsystem
Installing a disk drive or solid-state drive with the AIX system or
logical partition powered on
You can install a disk drive or solid-state drive in a system or expansion unit while the AIX operating
system or logical partition that controls the disk drive location is powered on. Use the procedure in this
section to perform this task.
Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with
the AIX system or logical partition powered on” on page 63. This procedure is intended for installing a
new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following Web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For information about using the HMC to install a feature, see “Installing a feature by using the
Hardware Management Console” on page 247. If you do not have an HMC, complete the following steps
to install a disk drive from the system or partition that controls the disk drive.
If you are installing Solid State Drives, review the configuration rules and then return here. For more
information, see “Solid-state drives” on page 5.
10 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: The powered-on procedure is a complex set of steps that must be performed precisely as
instructed to avoid potential data loss or system damage. You might also consider the powered-off
procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions,
see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on
page 49.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. If applicable, remove the front cover.
4. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and concurrent maintenance lights” on page 125.
Installing a disk drive or solid-state drive 11
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
5. Record the position where the new disk drive will be installed. For example, the next available disk
drive slot might be P3-D4.
6. Perform the following steps to install a disk drive.
Install the disk drive
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Press Enter to continue.
4. On the Function Selection display, select Task Selection.
5. On the Tasks Selection display, select Hot Plug Task.
6. Select SCSI and SCSI RAID Hot Plug Manager.
7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot
swap enclosure device is shown.
8. Select the slot where you want to install the disk drive and press Enter.
The disk drive slot enters the Identify state. The concurrent maintenance light for the slot begins
flashing. To view the locations of the concurrent maintenance lights, select the model into which you
are installing a disk drive. For a list of models, see “Disk drive locations and concurrent maintenance
lights” on page 125.
9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it,
then move on to step Insert the disk drive.
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Important: .
Pull the disk-drive filler from the slot that you want to use, as shown in Figure 1 on page 13
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
12 Disk drives
10. If the slot you want to use contains a disk-drive filler, complete the following step.
Remove the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure.
11. If the slot you want to use contains a disk-drive filler, complete the following step.
Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the
filler.
Figure 1. Removing a disk-drive filler from a model .
Figure 2. Removing a disk-drive bezel from a disk-drive filler
Installing a disk drive or solid-state drive 13
12. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
13. Remove the disk drive from the static-protective package.
14. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive will not slide into the system or expansion unit.
15. If the disk drive you are installing does not have a bezel preinstalled, then complete the following
step.
Attach the disk-drive bezel (A) to the disk drive, as shown in the following figure.
16. If the disk drive you are installing does not have a bezel preinstalled, then complete the following
step.
Press in and then tighten the thumbscrew (B) to secure the bezel (A) to the disk drive.
17. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not
hold the disk drive by the handle.
18. Slide the disk drive into the system unit until it stops, as shown in Figure 4 on page 15.
19. Push the disk drive handle in until it locks. The light on the disk drive will turn on.
Figure 3. Installing a disk-drive bezel on a disk drive
14 Disk drives
Figure 4. Installing a disk drive in a model
Figure 5. Installing a small-form factor disk drive
Installing a disk drive or solid-state drive 15
20. Press Enter.
21. If you are installing more than one disk drive, repeat step 1 on page 12 through step 20 until all disk
drives are installed.
22. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu.
23. Select Configure Added/Replaced Devices.
24. Exit to the command line.
Complete the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit, or rack door.
3. Verify the installed part. For instructions, see “Verifying an installed feature or replaced part in an AIX
system or logical partition” on page 249.
4. To prepare the newly installed disk drive for use, go to “Preparing a disk drive for use in an AIX
system or logical partition” on page 137.
Figure 6. Installing a small-form factor disk drive in the 5803 expansion unit
16 Disk drives
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems™ servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Installing a disk drive or solid-state drive with the Linux system or
logical partition powered on
You might need to install a disk drive or solid-state drive in a system or expansion unit while the Linux
operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this section to perform this task.
Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with
the Linux system or logical partition powered on” on page 72. This procedure is intended for installing a
new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following Web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For information about using the HMC to install a feature, see “Installing a feature by using the
Hardware Management Console” on page 247. If you do not have an HMC, complete the following steps
to install a disk drive from the system or partition that controls the disk drive.
If you are installing Solid State Drives, review the configuration rules and then return here. For more
information, see “Solid-state drives” on page 5.
Installing a disk drive or solid-state drive 17
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: Be sure you read these instructions carefully and understand the complexity of the
powered-on procedure before you begin.
If any of the following steps apply to you, go to “Installing a disk drive or solid-state drive with the
system or logical partition power turned off” on page 49.
v You are installing a disk drive that will be connected with the 6203 or 6204 SCSI adapter.
v You do not feel comfortable using the powered-on procedure.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform the prerequisite tasks as described in “Before you begin” on page 189.
2. If applicable, remove the system unit door, expansion unit door, or the rack door.
3. If applicable, remove the front cover.
v If you have a front cover from any other models, remove the front cover.
18 Disk drives
4. Look at the front of the system or expansion unit and determine the next available disk drive
position. For information, see “Disk drive locations and concurrent maintenance lights” on page 125.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
5. Record the position where the new disk drive will be installed. For example, the next available disk
drive position might be P3-D4.
Install a SCSI disk drive in a system or expansion unit using Linux kernel 2.4 or Linux kernel 2.6.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Installing a disk drive using Linux kernel 2.4
If you are installing a disk drive in a location that is controlled by Linux kernel 2.4, follow the procedure
in this section to perform this task.
Installing a disk drive
1. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it,
then move on to step 7 on page 21.
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
a. Pull the disk-drive filler from the slot, as shown in Figure 7 on page 20.
Installing a disk drive or solid-state drive 19
b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following
figure.
c. Loosen, and then pull out on the thumbscrew (C) to release the disk-drive bezel (B) from the
front edge of the filler.
2. Find the package that contains the new disk drive.
Figure 7. Removing a disk-drive filler from model
Figure 8. Removing the disk-drive bezel
20 Disk drives
Attention: Disk drives are fragile. Handle with care.
3. Remove the disk drive from the static-protective package. Record the serial number (use the last 4
digits) of the disk drive.
4. Unlock the disk drive handle by squeezing and pulling it out toward you. If the handle is not all the
way out, the disk drive will not slide into the system or expansion unit.
5. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel
on the new disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
6. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
7. Slide the disk drive into the system.
Slide the disk drive all the way in to the system or expansion unit and then lock it in place by
pushing in the disk drive handle, as shown in Figure 10 on page 22.
Figure 9. Installing a disk-drive bezel
Installing a disk drive or solid-state drive 21
Figure 10. Installing a disk drive in a model
Figure 11. Installing a small-form factor disk drive
22 Disk drives
Note: The concurrent maintenance light stops blinking and remains on when you have installed the
disk drive correctly.
To view the locations of the concurrent maintenance lights, select the appropriate model from the
following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131
v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135
8. If you are installing more than one disk drive, repeat step 1 on page 19 through step 7 on page 21,
until all disk drives are installed.
9. Log in as root user.
10. Wait 30 seconds after installing the disk drive, and then type iprconfig on the command line of the
Linux session. Press Enter. The Work with Disk Units display is shown.
11. Select Display disk hardware status from the Work with Disk Units display. Press Enter.
A Display Disk Hardware Status display is shown, similar to the following figure.
Figure 12. Installing a small-form factor disk drive in the 5803 expansion unit
Installing a disk drive or solid-state drive 23
12. Locate the disk drive that you installed. The serial number might be 0.
If the disk drive is displayed as a model 50 disk, you are done. If not, follow these steps:
a. Find the /dev/sdx name associated with the disk drive that you installed.
b. Exit to the command line.
c. Type sg_map on the command line and press Enter.
d. Using the output from running the sg_map command, correlate the /dev/sdx device to a
/dev/sgx device.
e. Type sg_inq /dev/sgx on the command line using the /dev/sgx value from the previous step.
Press Enter.
f. Type iprconfig on the command line and press Enter.
g. Select Display disk hardware status from the Work with Disk Units display. Press Enter.
h. The serial number for the newly installed disk drive will be displayed on the Display Hardware
Status display.
13. Record the resource name, when you configure the disk drive.
Perform the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit or rack front door.
3. Configure the newly installed disk drive for your environment. For information, see the PCI-X SCSI
RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
Display Disk Hardware Status
Serial Vendor Product Resource Hardware
Number ID ID Model Name Status
03132041 IBM 2780001 001 /dev/ipr3 Operational
000015A5 IBM ST373453LC 079 DPY/Active
E3V1WEAB IBM IC35L036UCDY10-0 070 DPY/Active
E3V0J55B IBM IC35L036UCDY10-0 079 DPY/Active
E3V1W1UB IBM IC35L036UCDY10-0 070 DPY/Active
04B81193 IBM 3C98D203 210 /dev/sdi Operational
000227CE IBM ST336605LC 020 /dev/sdh Operational
00000001 IBM 570B001 001 /dev/ipr2 Operational
00006719 IBM ST336607LC 020 /dev/sdd Operational
E3V0E77B IBM IC35L036UCDY10-0 020 /dev/sde Operational
0007E0C4 IBM ST373453LC 020 /dev/sdf Operational
00000001 IBM 5702001 001 /dev/ipr1 Operational
03060038 IBM 5703001 001 /dev/ipr0 Operational
More...
Press Enter to continue.
e=Exit q=Cancel r=Refresh f=PageDn b=PageUp
d=Display disk unit details p=Display device parity status
Figure 13. Display Disk Hardware Status display
24 Disk drives
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Installing the disk drive using Linux kernel 2.6
If you are installing a disk drive in a location that is controlled by Linux kernel 2.6, follow the procedure
in this section to perform this task.
Install the disk drive
1. Log in as root user.
2. Type lsslot on the command line of the Linux session and press Enter.
A display like the following figure, is shown.
3. Note the Linux Name for the slot that contains the adapter under which you want to install the disk
drive.
4. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
5. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press
Enter.
6. Select Concurrent add device from the Work with Disk Unit Recovery display. Press Enter.
A Concurrent Device Add display, like the following figure, is shown.
7. Type option 1 next to the location into which you want to install the disk drive and press Enter.
The Verify Device Concurrent Add display is shown.
x0537p1:~ # lsslot
# Slot Description Linux Name Device(s)
U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet
U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci
U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166
U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166
U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180
U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet
U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb
U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi
U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide
U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty
Figure 14. Find the Linux Name for the location for which you want to install the disk drive.
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 15. Example Concurrent Device Remove display
Installing a disk drive or solid-state drive 25
A concurrent maintenance light indicates the slot where the disk drive can be installed.
8. Press Enter on the Verify Device Concurrent Add display to verify the location.
9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it,
then move on to step 16 on page 28.
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
a. Pull the disk-drive filler from the slot as shown in Figure 16.
b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following
figure.
Figure 16. Removing a disk-drive filler from model
26 Disk drives
c. Loosen, and then pull out on the thumbscrew (C) to release the disk-drive bezel (B) from the
front edge of the filler.
10. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
11. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
12. Unlock the disk drive handle by squeezing and pulling it out toward you. If the handle is not all the
way out, the disk drive cannot slide into the system or expansion unit.
13. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel
on the new disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
Figure 17. Removing a disk-drive bezel from the disk drive.
Installing a disk drive or solid-state drive 27
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
14. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
15. The Complete Device Concurrent Add display is shown.
16. Slide the disk drive into the system.
Slide the disk drive all the way in into the system or expansion unit and then lock it in place by
pushing in the disk drive handle, as shown in Figure 19 on page 29.
Figure 18. Attaching the disk-drive bezel to the disk drive.
28 Disk drives
Figure 19. Installing a disk drive in a model
Figure 20. Installing a small-form factor disk drive
Installing a disk drive or solid-state drive 29
17. Press Enter. The concurrent maintenance light stops flashing and remains on when you have
installed the disk drive properly.
To view the locations of the concurrent maintenance lights, select the appropriate model from the
following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131
v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135
18. If you are installing more than one disk drive, repeat step 2 on page 25 through step 17 until all disk
drives are installed.
Verify that the new disk drive is operational
To verify that the new disk drive is operational, follow these steps:
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
3. Select Display hardware status.
The Display Hardware Status display is shown, like the following figure.
Figure 21. Installing a small-form factor disk drive in the 5803 expansion unit
30 Disk drives
4. Verify that the disk drive that you installed is shown on this display.
Perform the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit, or rack front door.
3. Configure the newly installed disk drive for your environment. For information, go to the PCI-X SCSI
RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Installing a disk drive or solid-state drive with the Virtual I/O Server
system or logical partition powered on
You can install a disk drive or solid-state drive in a system or expansion unit while the Virtual I/O
Server operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this information to perform this task.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following websites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 22. Example Display Hardware Status
Installing a disk drive or solid-state drive 31
If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the
server. For information about using the HMC to install a feature, see “Installing a feature by using the
Hardware Management Console” on page 247. If you do not have an HMC, perform the following steps
to install a disk drive from the system or partition that controls the disk drive.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: The powered-on procedure is a complex set of steps that must be performed precisely as
instructed to avoid potential data loss or system damage. You might also consider the powered-off
procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions,
see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on
page 49.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. Remove the front covers. For instructions, see “System covers” on page 207
32 Disk drives
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and concurrent maintenance lights” on page 125.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can select a different disk drive placement depending on your data protection strategy.
4. Record the position where the new disk drive can be installed. For example, the next available disk
drive slot might be P3-D4.
5. Perform the following steps to install a disk drive.
Install the disk drive
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. Press Enter to continue.
4. On the Function Selection display, select Task Selection.
5. On the Tasks Selection display, select Hot Plug Task.
6. Select SCSI and SCSI RAID Hot Plug Manager.
7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot
swap enclosure device is shown.
8. Select the slot where you want to install the disk drive and press Enter.
The disk drive slot enters the Identify state. The concurrent maintenance light for the slot begins
flashing. To view the locations of the concurrent maintenance lights, select the model you are
installing a disk drive in to. For a list of models see, “Disk drive locations and concurrent
maintenance lights” on page 125.
9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it,
then move on to step Insert the disk drive.
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Important: .
Pull the disk-drive filler from the slot that you want to use, as shown in Figure 23 on page 34
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
Installing a disk drive or solid-state drive 33
10. If the slot you want to use contains a disk-drive filler, perform the following step.
Remove the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure.
11. If the slot you want to use contains a disk-drive filler, perform the following step.
Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the
filler.
Figure 23. Removing a disk-drive filler from a model
Figure 24. Removing a disk-drive bezel from a disk-drive filler
34 Disk drives
12. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
13. Remove the disk drive from the static-protective package.
14. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
15. If the disk drive you are installing does not have a bezel preinstalled, then perform the following
step.
Attach the disk-drive bezel (A) to the disk drive, as shown in the following figure.
16. If the disk drive you are installing does not have a bezel preinstalled, then perform the following
step.
Press in and then tighten the thumbscrew (B) to secure the bezel (A) to the disk drive.
17. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not
hold the disk drive by the handle.
18. Slide the disk drive into the system unit until it stops, as shown in Figure 26 on page 36.
19. Push the disk drive handle in until it locks. The light on the disk drive turns on.
Figure 25. Installing a disk-drive bezel on a disk drive
Installing a disk drive or solid-state drive 35
Figure 26. Installing a disk drive in a model
Figure 27. Installing a small-form factor disk drive
36 Disk drives
20. Press Enter.
21. If you are installing more than one disk drive, repeat step 1 on page 33 through step 20 until all disk
drives are installed.
22. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu.
23. Select Configure Added/Replaced Devices.
24. Exit to the command line.
Perform the procedure
1. Replace the front cover. For instructions on replacing the cover, see “System covers” on page 207.
2. Verify the installed part. For instructions, see “Verifying the installed part” on page 249.
Installing a disk drive or solid-state drive with the IBM i system or
logical partition powered on
Learn how to install a disk drive or solid-state drive in a system or expansion unit while the IBM i
operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this information to perform this task.
Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with
the IBM i system or logical partition powered on” on page 89. This procedure is intended for installing a
new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following websites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
Figure 28. Installing a small-form factor disk drive in the 5803 expansion unit
Installing a disk drive or solid-state drive 37
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the
server. For information about using the HMC to install a feature, see “Installing a feature by using the
Hardware Management Console” on page 247. If you do not have an HMC, perform the following steps
to install a disk drive from the system or partition that controls the disk drive.
If you are installing Solid State Drives, review the configuration rules and then return here. For more
information, see “Solid-state drives” on page 5.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid
potential data loss or system damage. You might also consider the powered-off procedure, which is a
simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk
drive or solid-state drive with the system or logical partition power turned off” on page 49.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
38 Disk drives
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. Determine the next available disk drive position for your system or expansion unit. For information,
see “Disk drive locations and concurrent maintenance lights” on page 125.
3. Record the position where the new disk drive can be installed. For example, the next available disk
drive slot might be P3-D4.
4. If your system is managed by a Hardware Management Console (HMC), use the HMC to perform the
steps for installing the disk drive. For instructions, see “Installing a feature by using the Hardware
Management Console” on page 247. If you do not have an HMC, do the following steps to install a
disk drive.
Prepare to install the disk drive
1. Record the physical location where the new disk drive can be installed.
The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of
the disk drive in the system unit or expansion unit. Find the U787A.001.AAAXXXX location information
from the control panel display on the system unit or expansion unit.
If you cannot find the location information from the control panel, find the feature code (FC) and
sequence number (SN or SEQ) on the label that is shown on the front of the system unit or expansion
unit, as shown in the following figure.
In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label,
AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and
P3–D4 is the slot where you are installing the disk drive as shown in “Disk drive locations and
concurrent maintenance lights” on page 125.
In the example, U5074.001.AAAXXXX-DB3-D32, you would install a disk drive in the expansion unit with
type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
2. Sign on to service level authority.
3. Type strsst on the command line of the IBM i session and press Enter.
4. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
5. Select Start a service tool from the System Service Tools (SST) display. Press Enter.
6. Select Hardware service manager from the Start a Service Tool display. Press Enter.
7. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then
move on to step “Install the disk drive” on page 41.
Figure 29. Example of a system or expansion unit label
Installing a disk drive or solid-state drive 39
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
a. Pull the disk-drive filler from the slot, as shown in the following figure.
b. Pull off the disk drive bezel latch (A) from the front of the filler, as shown in the following figure.
Figure 30. Removing a disk-drive filler
40 Disk drives
c. Loosen, and then pull out the thumbscrew (C) to release the disk-drive bezel (B) from the front
edge of the filler.
Install the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
2. Remove the disk drive from the protective package. Record the serial number (use the last four
digits) of the disk drive.
3. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does does not slide into the system or expansion unit.
4. Follow these steps to install the disk-drive bezel on the new disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
Figure 31. Removing a disk-drive bezel
Installing a disk drive or solid-state drive 41
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
5. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
6. Slide the disk drive halfway into the system or expansion unit.
7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter.
The Device Concurrent Maintenance is displayed, as shown in the following figure.
8. In the Specify either physical location or resource name field, type the physical location that you
had previously recorded in this procedure, as shown in Figure 33.
9. Select option 2 (Install device) for the Action to be performed.
Figure 32. Installing a disk-drive bezel on a disk drive
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 2
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 33. Example Device Concurrent Maintenance display
42 Disk drives
10. Set the time delay for one minute - 01.
Important: Do not press Enter at this time.
11. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you
are installing. To view the locations of the concurrent maintenance lights, select the appropriate
model from the following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131
v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135
12. Press Enter on the console.
Important: When you press Enter, after a one minute delay, the light comes on and begins to flash
rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by
pushing in the disk drive handle.
13. When the concurrent maintenance light starts blinking, slide the disk drive all the way in and lock it
in place by closing the latch, as shown in the following figure.
Note: The light stops blinking and remains on when you have installed the disk drive properly.
Figure 34. Sliding the disk drive in to the system unit
Installing a disk drive or solid-state drive 43
14. If you are installing more than one disk drive, perform step 1 on page 39 through 13 on page 43
until all disk drives are installed.
Figure 35. Installing the small-form factor disk drive into the system
Figure 36. Installing a small-form factor disk drive in the 5803 expansion unit
44 Disk drives
Perform the procedure
1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press
Enter.
2. When the Hardware Service Manager display is shown, press F12 twice and then press Enter.
3. To perform this procedure, go to “Configuring a disk drive in an system or logical partition.”
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Configuring a disk drive in an system or logical partition
To configure a newly installed disk drive into an existing device parity set, follow these steps:
If you want to start a new device parity set or change the protection on your disks, see Manage device
parity protection or Start device parity protection in the Disk Management section of the IBM i
Information Center website at http://www.ibm.com/systems/i/infocenter .
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Restriction: The service tools password is case sensitive.
3. Select Work with Disk Units on the System Service Tools display. Press Enter.
4. Select Display disk configuration on the Work with Disk Units display. Press Enter.
5. Select Display non-configured units on the Display disk configuration display. Press Enter.
6. The disk drive that you installed is listed. The serial number should match the serial number you
recorded (the last 4 digits).
Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not
listed, ensure that the disk drive was properly installed.
7. Press F12 twice.
8. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
9. Are you adding a disk drive to an existing device parity set?
v Yes: Continue with the next step.
v No: Go to step 23 on page 46 for mirroring or no protection, go to 1 on page 47 for hot spare.
Tip: Select No for mirroring, hot spare or no protection.
10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter.
11. Select Include unit in device parity protection on the Work with Device Parity Protection display.
Press Enter.
12. Did the Include Device Parity Protection Failed display appear?
v Yes: Continue with the next step.
v No: Go to step 15 on page 46.
13. The following conditions must be met before you can start device parity protection.
Installing a disk drive or solid-state drive 45
v If there are enough disk drives available to create a new parity set, the drives are eligible for Start
Device Parity Protection operation and not for the Include Device Parity Protection operation.
v All disk drives in a parity set must be the same capacity with a minimum number of three or four
disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the
disk drive type.
v Not all disk drives attached to an advanced function input output processor have reported to the
system. Retry the operation.
v The type or model of the disk drives cannot be used for the requested operation.
When these conditions are met, press F12 to return to the Work with Disk Configuration display.
14. Select Include unit in device parity protection on the Work with disk configuration display. Press
Enter.
15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included
in Device Parity Protection. Type a 1 in the Option column. Press Enter.
16. Press Enter to confirm your choice. The configuration starts and might take 30 to 60 minutes to
complete.
17. When the device parity protection is complete, the message “Selected drives have been included
successfully” is shown on the Work with device parity protection display. Press F12.
18. Select Display disk configuration on the Work with disk configuration display. Press Enter.
19. Select Display disk configuration status on the Display disk configuration display.
20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to
add the disk drive.
21. Press F12 twice to get to the Work with Disk Units display.
22. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
23. Select the option to add units to ASPs and balance data on the Work with disk configuration display.
Press Enter.
24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1.
Press Enter.
Notes:
a. If ASP is mirror protected, disk drives must be added in pairs of the same size.
b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become
mirror protected.
c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information
Center.
To access the IBM i Information Center Web site, go to http://www.ibm.com/systems/i/
infocenter and select from the following options:
v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup
and recovery > Recovering your system
v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and
recovery > Backing up and recovering your system > Recovering your system
25. The Confirm Add Units display is shown showing the configuration your system will have when the
add operation completes.
Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to
continue. The add process will take several minutes to complete.
26. When the message Selected units have been added successfully is shown, press F3 three times
and press Enter to return to the Main Menu display.
27. If you have a printer, print the configuration list. A service provider might refer to the configuration
list in the future.
a. You need to sign on with at least service level authority
46 Disk drives
b. Type strsst on the Main Menu command line. Press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case sensitive.
d. Select Start a service tool on the System Service Tools (SST) display. Press Enter.
e. Select Hardware service manager on the Start a Service Tool display. Press Enter.
f. Press F6 (print the configuration) from the Hardware Service Manager display.
g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter.
h. Place the configuration list where you can refer to it in the future.
28. If you are a service provider performing this task, return to the procedure that sent you here.
1. Select Work with hot spare protection from the Work with disk configuration menu.
2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection -
RAID 6 with hot spare depending on the level of parity protection that is desired.
3. Return to the procedure that sent you here.
Rebuilding data on a replacement disk drive using IBM i
Learn how to rebuild data on replacement disk.
To rebuild data on the replacement disk drive, complete the following steps:
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case sensitive.
3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter.
4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter.
5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these
steps to restore device parity protection (RAID) on the disk drive:
a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter.
b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you
removed) on the Rebuild Disk Unit Data display. Press Enter.
c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several
minutes to complete.
d. Press F5 to refresh the display until the Percent complete shows 5%.
e. When the display shows at least 5% complete, you can either continue to monitor this display to
completion, or press F3 (Exit) to return to the Work with disk units display.
f. Press F3 (Exit) to return to the System service tools display.
g. Press F3 (Exit) to return to the Exit SST display and press Enter.
h. Return to the procedure that sent you here.
6. If the disk drive that you replaced had mirrored protection, follow these steps:
a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter.
b. Select the configured disk drive that you are exchanging (suspended drive) on the Select
Configured Unit to Replace display. Press Enter.
c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has
a non-configured status.
Installing a disk drive or solid-state drive 47
Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these
steps until the new drive is shown.
Press Enter.
d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for
replacement.
The replacement process might take several minutes to complete. When the process is complete,
the Work with Disk unit recovery display is shown.
e. Press F3 (Exit) to return to the Work with disk units display.
f. Select Display disk configuration on the Work with disk units display.
g. Select Display disk configuration status on the Display Disk Configuration display.
Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process
might take several minutes to complete. You can either monitor this display to completion, or
press F3 (Exit) three times, and then press Enter to return to the IBM i main menu.
Determining the IBM i disk drive protection status
To determine the protection status of the disk drives in your IBM i system or logical partition, follow
these steps from the System Service Tools (SST) display:
Tip: If you do not know the protection status of the disk drives and you cannot get to the System Service
Tools display, contact your next level of support.
1. Sign on with at least service level authority.
2. Type strsst on the command line of the IBM i session and press Enter.
3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case-sensitive.
4. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter.
A list of each auxiliary storage pool (ASP) displays, showing the disk drives that are members of the
ASP. The Status column shows one of the following protection statuses for the ASP:
v Mirrored. These ASPs are configured for mirrored protection. If the failing disk drive has mirrored
protection, record the status of both disk drives in the mirrored pair. You need this information in
the recovery procedure.
v Unprotected. An ASP with a status of Unprotected might contain disk drives that are device parity
protected. Record the status of the failing disk drive as it is shown in the Status column of the
Display Disk Configuration Status display.
A disk drive with one of the following statuses has device parity protection only if all other disk
drives in the array are operational:
– DPY/Active
– DPY/Failed
– DPY/HDW Failure
– DPY/Degraded
– DPY/Power Loss
– DPY/Not Ready
– DPY/Unknown
A disk drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have
device parity protection.
Examples:
48 Disk drives
– Suppose one disk drive has a status of DPY/Failed while all other disk drives have a status of
DPY/Unprotected. In this case, only the disk drive with the DPY/Failed status is protected.
Servicing the failed disk drive does not result in data loss. Removing any of the unprotected disk
drives might result in data loss.
– All disk drives have a status of DPY/Active or DPY/Degraded, therefore, all disk drives are
protected. In this case, exchanging any disk drive does not result in data loss.
Tip: For more information about the meaning of a status for any disk drive, use the Help key on
the display.
Installing a disk drive or solid-state drive with the system or logical
partition power turned off
Learn how to install a disk drive in a system or logical partition that controls the disk drive location with
the power turned off. Use the procedure in this information to perform this task.
Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with
the system or logical partition powered off” on page 106. This procedure is intended for installing a new
or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following websites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Microcode downloads.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the
server. For information about using the HMC to install a feature, see http://publib.boulder.ibm.com/
infocenter/powersys/v3r1m5/topic/p7haj/hmcinstall.htm. If you do not have an HMC, perform the
following steps to install a disk drive from the system or partition that controls the disk drive.
If you are installing Solid State Drives, review the configuration rules and then return here. For more
information, see “Solid-state drives” on page 5.
Installing a disk drive or solid-state drive 49
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
Note: Do not disconnect the power source when you stop the system or logical partition to install a
disk drive or solid-state drive.
2. If applicable, remove the system unit door or expansion unit door, or open the rack.
3. If applicable, remove the front cover.
4. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and concurrent maintenance lights” on page 125.
Note: When you have a system with an expansion unit, you must fill the slot positions in the system
unit first. However, you can select a different disk drive placement depending on your data protection
strategy.
5. Stop the system or the logical partition that controls the disk drive location. For instructions, see
http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/crustopsys.htm.
50 Disk drives
Install the disk drive
1. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then
move on to step Insert the disk.
Note: The disk drive filler is a single piece that does not attach to the new disk drive.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
a. Pull the disk-drive filler from the slot you want to use, as shown in Figure 37.
b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure.
Figure 37. Removing a disk-drive filler from a model
Installing a disk drive or solid-state drive 51
c. Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of
the filler.
2. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
3. Remove the disk drive from its protective package. Record the serial number (use the last four digits)
of the disk drive.
4. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion
unit.
5. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel
on the new disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
Figure 38. Removing a disk-drive bezel
52 Disk drives
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
6. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
7. Slide the disk drive into the system or expansion unit until it stops.
Note: When installing a disk drive, the drive must be fully seated and all the way into the system.
8. Push the disk drive handle in until it locks, as shown in Figure 40 on page 54.
Figure 39. Installing a disk-drive bezel
Installing a disk drive or solid-state drive 53
Figure 40. Installing a disk drive in the
Figure 41. Installing a small-form factor disk drive
54 Disk drives
9. If you are installing more than one disk drive, repeat step 1 on page 51 through step 8 on page 53
until all disk drives are installed.
Perform the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit, or rack front door.
3. If the system power cables were removed, reconnect them.
4. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on
page 199.
5. To configure the newly installed disk drive, select the procedure for the operating system that controls
the disk drive.
v “Preparing a disk drive for use in an AIX system or logical partition” on page 137
v For information about configuring the newly installed disk drive for a Linux environment, see the
PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Figure 42. Installing a small-form factor disk drive in the 5803 expansion unit
Installing a disk drive or solid-state drive 55
56 Disk drives
Removing a disk drive or solid-state drive
You might need to remove a disk drive or solid-state drive from the sever or expansion unit. This section
includes information about the disk-drive enclosure and procedures so that you can perform these tasks.
To remove and replace a failed disk drive, go to “Replacing a disk drive” on page 63. If your system is
managed by the Hardware Management Console (HMC), use the HMC to complete the steps for
removing a disk drive. For instructions, see “Removing a part by using the Hardware Management
Console” on page 247.
Remove a disk drive from a system or expansion unit that is controlled by different operating systems.
“Replacing a disk drive with the AIX system or logical partition powered on” on page 63
“Replacing a disk drive with the Linux system or logical partition powered on” on page 72
“Replacing a disk drive with the IBM i system or logical partition powered on” on page 89
Removing a disk drive from a system or expansion unit controlled by
IBM i
Learn how to remove a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
from a system or expansion unit with an IBM i operating system or a logical partition that controls the
disk drive location. Use the procedure in this information to perform this task.
Attention: If you are removing a disk drive in order to install a new or upgraded disk drive, see
“Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on” on
page 37. If you are removing a disk drive as part of a service procedure, continue to use the following
procedure.
If your system is managed by the Hardware Management Console (HMC), use the HMC to perform the
steps for removing disk drives from the server. For information about using the HMC to remove memory
modules, see “Removing a part by using the Hardware Management Console” on page 247. If you do not
have an HMC, perform the following steps to remove the disk drive.
© Copyright IBM Corp. 2010 57
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
1. Before removing a disk drive, back up all the data from the disk drive or the array that the drive is a
part of, and then move the data from the disk drive that you want to remove. For instructions, choose
one of the following options:
v Move and remove disk units
v Recovering your system in the IBM i Information Center
To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/infocenter
and select from the following options:
– If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and
recovery > Recovering your system
– If you select V5R4, expand i5/OS Information Center > Systems management > Backup and
recovery > Backing up and recovering your system > Recovering your system
2. Prepare to remove the disk drive from the system configuration as described in “Preparing to remove
a disk drive from a system or logical partition controlled by IBM i” on page 61.
3. Locate the disk drive to be removed. See “Disk drive locations and concurrent maintenance lights” on
page 125.
4. Stop the system or logical partition that controls the disk drive location. For instructions, see
“Stopping a system or logical partition” on page 201.
58 Disk drives
Prepare the system
1. If applicable, remove the system unit door, expansion unit door, or open the rack.
2. If applicable, remove the front cover.
v If you have a front cover from any other models, remove the front cover.
Remove the disk drive
1. Unlock the disk drive handle by squeezing it and pulling it out toward you.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
Figure 43. Removing a disk drive
Removing a disk drive or solid-state drive 59
2. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
3. If a disk-drive filler is available, follow these steps:
Figure 44. Removing a small-form factor disk drive
Figure 45. Removing a small-form factor disk drive from the 5803 expansion unit
60 Disk drives
Note: Refill the disk drive slots with either another disk drive or a disk-drive filler. Filling the disk
drive slot can help ensure proper airflow for cooling and help maintain optimal EMI compliance.
a. Attach the disk-drive bezel to the filler.
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
c. Install the disk-drive filler in the system or expansion unit.
Perform the procedure
1. Install or close the front door of the system, expansion unit, or rack.
2. Reconnect all system power cables.
3. Reconnect the system to the power supply by plugging it into the system.
4. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on
page 199.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Preparing to remove a disk drive from a system or logical partition
controlled by IBM i
Learn the actions you need to take to properly remove a disk drive.
Before removing a disk drive from a system or expansion unit that is controlled by IBM i, follow these
steps:
1. Determine the protection status of the disk drive that you are replacing. For instructions, see
“Determining the IBM i disk drive protection status” on page 48.
2. The service provider must perform one of the following tasks:
v Save data and initialize the disk drive.
v Initialize the disk drive and copy to disk.
3. For mirrored disk drives on an IBM i system or logical partition, follow these steps to verify that the
disk drive you are replacing has been suspended:
a. Sign on with service level authority.
b. Type strsst on the command line of the IBM i session and press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case sensitive.
d. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press
Enter.
g. Does the disk drive that is mirrored to the disk drive that you are replacing (same drive number)
have a status of Active?
v No: The replacement must be performed by a service provider.
v Yes: Does the disk drive that you are replacing have a status of Suspended?
– Yes: Continue with the next step.
Removing a disk drive or solid-state drive 61
– No: Follow these steps to suspend mirrored protection on the disk drive that you are
replacing:
1) Press F3 from the Display Disk Configuration display to return to the Work with Disk
Units display.
2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press
Enter.
3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display.
Press Enter.
4) Select the option to suspend the disk drive that you are replacing from the Suspend
Mirrored Protection display. Press Enter.
h. Press F3 repeatedly to exit System Service Tools and return to the main menu.
62 Disk drives
Replacing a disk drive
You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk
drive in the server or expansion unit. Use the procedures in this section to perform this task.
Note: These instructions (the first 3 sections) can also be used if you are replacing an SAS disk drive.
To remove and replace a disk drive in a different enclosure, use the information for that enclosure.
Use this procedure to remove a failed disk drive and replace it with the same type of disk drive. If you
plan to remove a disk drive and install a different type of disk drive, remove the existing disk drive as
described in “Removing a disk drive or solid-state drive” on page 57, then install the new disk drive as
described in “Installing a disk drive or solid-state drive” on page 5. If your system is managed by the
Hardware Management Console (HMC), use the HMC to complete the steps for replacing a disk drive.
For instructions, see “Exchanging a part by using the Hardware Management Console” on page 247.
Your system or logical partition can be powered off or powered on when you replace the disk drive. If
the disk drive that is being replaced is in the AIX or Linux root volume group (rootvg) and does not have
disk protection replace your disk with the powered-off procedure.
Replacing a disk drive with the AIX system or logical partition powered
on
You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk
drive in a system or expansion unit while the AIX operating system or logical partition that controls the
disk drive location is powered on. Use the procedure in this section to perform this task.
Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state
drive with the AIX system or logical partition powered on” on page 10. If you are removing a disk drive
as part of a service procedure, continue to use the following procedure.
© Copyright IBM Corp. 2010 63
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid
potential data loss or system damage. You might also consider the powered-off procedure, which is a
simpler task that involves less risk of data loss or system damage. If the system or logical partition that
controls the disk drive you are replacing is at the AIX command line, you can use this powered-on
procedure. If the disk drive that you are replacing is in the root volume group (rootvg) and it is not
protected by either Redundant Array of Independent Disks (RAID) or mirroring, or to use the
powered-off procedure, go to “Replacing a disk drive with the system or logical partition powered off”
on page 106.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. Prepare to remove the disk drive from the system or expansion unit as described in “Preparing to
remove a disk drive from a system or expansion unit controlled by AIX” on page 137.
3. If applicable, remove the system unit door, expansion unit door, or the rack door.
64 Disk drives
4. If applicable, remove the front cover.
v If you have a front cover from any other models, remove the front cover.
To replace a Just a Bunch of Disks (JBOD) disk drive attached to any SCSI or SCSI RAID adapter, see the
procedure in “Replacing a SCSI disk drive” on page 114.
Remove a disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. Log in as root user.
3. At the command line, type diag and press Enter.
4. Press Enter to continue.
5. On the Function Selection display, select Task Selection.
6. Select Hot Plug Task.
7. Select SCSI and SCSI RAID Hot Plug Manager or RAID Hot Plug Devices, depending on the
adapter that your drive is attached to.
Tip: One or both of these options might be available, depending on the configuration of your
system. If the drive you want to replace is not listed when you follow the steps for the SCSI and
SCSI RAID Hot Plug Manager, go to step 8.
v If you have a RAID disk drive attached to a 2498 PCI SCSI RAID adapter, continue with step 8.
v If you have a JBOD disk drive attached to any SCSI or SCSI RAID adapter, go to “Replacing a
SCSI disk drive” on page 114.
v If you have RAID drives attached to any other adapter, go to the PCI-X SCSI RAID Controller
Reference Guide for AIX. This guide is available from the SCSI PCI Adapters Web page at
http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/
hardware_docs/scsipciadapters.htm.
8. Select RAID Hot Plug Devices.
9. On the Resource Selection List, select the adapter that is connected to the array that contains the disk
drive you want to replace. Press Enter.
10. Select Commit.
11. On the Identify and Remove Resources menu, select Remove. Press Enter.
A list of the disk drives that you can remove is shown. If the disk drive you want to remove does
not appear in the list, ask your system administrator to put the disk drive in the Failed state before
continuing with this procedure. For information, see AIX System Management Guide: Operating System
and Devices. This guide is available at the System p® and AIX Information Center
(http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp).
12. Select the drive that you want to remove and press Enter. The concurrent maintenance light for the
slot will flash rapidly.
Attention: If the light does not flash rapidly, do not remove the disk drive.
To view the locations of the concurrent maintenance lights, go to “Disk drive locations and
concurrent maintenance lights” on page 125.
13. Unlock the disk drive handle by squeezing it and pulling it out toward you as shown in Figure 46
on page 66.
Replacing a disk drive 65
Figure 46. Removing a disk drive from a model
Figure 47. Removing a small-form factor disk drive
66 Disk drives
14. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
The concurrent maintenance light for the slot turns off when you remove the disk drive.
15. Does your new disk drive have a bezel preinstalled?
v No: Continue with the procedure from here.
v Yes: Go to step 6 on page 69 of the replacement procedure.
Important: If the disk drive bezel does not have a thumbscrew, it is not removable and is a
serial-attached SCSI (SAS) disk drive. The replacement drive should come with an attached bezel.
Loosen and pull out on the thumbscrew (B) to release the bezel (C) from the disk drive, as shown in
the following figure.
Figure 48. Removing a small-form factor disk drive from the 5803 expansion unit
Replacing a disk drive 67
Replace a disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
2. Remove the disk drive from its protective package.
3. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the
handle is not all the way out, the disk drive cannot slide into the system.
4. If your new disk drive does not have a bezel preinstalled, then perform the following step.
Attach the disk-drive bezel (A) to the replacement disk drive, as shown in the following figure.
5. If your new disk drive does not have a bezel preinstalled, then perform the following step.
Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
Figure 49. Removing a disk-drive bezel
68 Disk drives
6. Install the disk drive as shown in Figure 51 on page 70.
a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do
not hold the disk drive by the handle.
b. Slide the disk drive into the system until it stops.
Important: Ensure that when installing a disk drive, that the drive is fully seated and all the way
into the system.
c. Push the disk drive handle in until it locks.
Figure 50. Installing a disk-drive bezel
Replacing a disk drive 69
Figure 51. Installing a disk drive in a model
Figure 52. Installing a small-form factor disk drive
70 Disk drives
7. Press Enter. The disk drive slot exits the Remove state and enters the Normal state.
8. Exit to the RAID Hot-Plug Devices menu. Press F3 to return.
9. Exit to the Task selection display.
10. Select Log Repair Action.
11. Select the disk drive that you replaced and then press Enter.
12. Select Commit after you have made your selection and then press Enter.
13. Exit to the command line.
Complete the procedure
1. Replace the front cover.
2. Install or close the front door of the system, expansion unit or rack.
3. To prepare the disk drive for use, go to “Rebuilding data on a replacement disk drive using AIX” on
page 137.
Figure 53. Installing a small-form factor disk drive in the 5803 expansion unit
Replacing a disk drive 71
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a disk drive with the Linux system or logical partition
powered on
You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk
drive in a system or expansion unit while the Linux operating system or logical partition that controls the
disk drive location is powered on. Use the procedure in this section to perform this task.
Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state
drive with the Linux system or logical partition powered on” on page 17. If you are removing a disk
drive as part of a service procedure, continue to use the following procedure.
72 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: Read these instructions carefully and understand the complexity of the powered-on
procedure before you begin. If the system or logical partition that controls the disk drive you are
replacing is at the Linux command line, you can use this powered-on procedure.
Go to “Replacing a disk drive with the system or logical partition powered off” on page 106, if:
v You are installing a disk drive that will be connected with the 6203 or 6204 SCSI adapter
v You do not feel comfortable using the powered-on procedure
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) 0 array or is the
disk drive you are replacing in a failed RAID level 5 or 10 array?
v If yes, you must delete the disk array before replacing the disk. For instructions, see PCI-X SCSI
RAID Controller Reference Guide for Linux.
Replacing a disk drive 73
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded
array. Proceed to the next step.
3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk
array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before
replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
4. If applicable, remove the system unit door or expansion unit door or open the rack front door.
5. If applicable, remove the front cover.
v If you have a front cover from any other models, remove the front cover.
To replace a disk drive in a location with the Linux system or logical partition that controls the disk drive
location powered on, select the appropriate procedure from the following list:
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing the disk drive using Linux kernel 2.4
Learn how to replace a disk drive in a location that is controlled by a system or logical partition that is
running Linux kernel 2.4.
Prepare to remove the disk drive
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The Work with Disk
Units display is shown.
3. From the Work with Disk Units display, select Analyze log. Press Enter. The Kernel Messages Log
display is shown, like the following display.
4. From the Kernel Messages Log display, select Use vi to view most recent IBM Storage error
messages. Press Enter.
5. Find the device error log entry for the disk drive you want to replace.
Kernel Messages Log
Select one of the following:
1. Use vi to view most recent IBM Storage error messages
2. Use vi to view IBM Storage error messages
3. Use vi to view all kernel error messages
4. Use vi to view sisconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. Use vi to view IBM Storage boot time messages
Selection:
e=Exit
Figure 54. Kernel Messages Log
74 Disk drives
6. Record the location information for the disk drive. The location information has the form of
P1–I2/Z1–A5.
7. Exit to the Work with Disk Units display.
8. Select Display disk hardware status from the Work with Disk Units display. Press Enter.
9. Record the hardware status of the disk drive you want to replace.
10. Type d (Display disk unit details) next to the disk drive you want to replace.
11. On the Display Disk Unit Details display, type 5 (Display hardware resource information details)
next to the disk drive you want to replace.
The Disk Unit Hardware Resource Information Details display is shown, like the following display.
12. Record the physical location information.
13. Exit to the Work with Disk Units display.
14. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive
before continuing with this procedure.
For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
Remove the disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. From the Work with Disk Units display, select Work with disk unit recovery. Press Enter.
3. From the Work with Disk Unit Recovery display, select Device Concurrent maintenance. Press Enter.
A Device Concurrent Maintenance display is shown, like the following display.
Disk Unit Hardware Resource Information Details
Manufacturer. . . . . . . . : IBM
Machine Type and Model. . . : IC35L073UCDY10-0
Firmware Version. . . . . . : 53323846
Model . . . . . . . . . . . : 020
Serial Number . . . . . . . : E6VKUNPD
Capacity. . . . . . . . . . : 73.41 GB
Resource Name . . . . . . . : /dev/sda
Physical location:
Location . . . . . . . : U1.9-P1/Z1-A8
PCI Bus. . . . . . . . . . : 257
PCI Device . . . . . . . . : 1
SCSI Host Number . . . . . : 0
SCSI Channel. . . . . . . : 0
SCSI Id. . . . . . . . . . : 8
SCSI Lun. . . . . . . . . : 0
Press Enter to continue.
e=Exit q=Cancel v=Display Extended VPD
Figure 55. Disk Unit Hardware Resource Information Details display
Replacing a disk drive 75
4. Enter either the physical location information or the PCI or SCSI location for the disk drive you want
to replace. Press Enter.
5. Specify 1 (Remove device) for the action to be performed.
6. Enter a time value in minutes, that is the time required by you to remove the disk drive.
The concurrent maintenance light indicates that you can remove the disk drive. To view the locations
of the concurrent maintenance lights, select the appropriate model from the following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
7. Press Enter.
A Device Concurrent Maintenance Action display is shown, like the following display.
8. Enter c to confirm that you want to remove the disk drive at the specified location.
A Device Concurrent Maintenance Action In Progress display is shown, like the following display.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either Physical Location or PCI/SCSI location.
Physical Location: U0.1-P1-I4/Z1-A8
OR
PCI/SCSI . . . PCI Bus: PCI Device:
SCSI Channel: SCSI Id: SCSI Lun:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . . . : 1
Enter a time value between 01 and 19.
Time needed in minutes . . . . . . . . . : 1
Or leave blank and press Enter to cancel
Figure 56. Example Device Concurrent Maintenance display
Device Concurrent Maintenance Action
Removal of device:
Location: U0.1-P1-I4/Z1-A8
You have 1 minute to perform the operation
During this time, your system can seem unresponsive
c=Confirm q=Cancel
Figure 57. Example Device Concurrent Maintenance Action display
Device Concurrent Maintenance Action In Progress
Please remove the device located in:
Location: U0.1-P1-I4/Z1-A8
Figure 58. Example Device Concurrent Maintenance Action In Progress display
76 Disk drives
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
9. Squeeze or lift the handle of the disk drive out toward you before you remove the disk drive, as
shown in Figure 59. If the handle is not all the way out, the disk drive cannot slide out of the system
or expansion unit.
Figure 59. Removing a disk drive from a model or 7311-D20
Replacing a disk drive 77
10. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by
the handle.
Figure 60. Removing a small-form factor disk drive
Figure 61. Removing a small-form factor disk drive from the 5803 expansion unit
78 Disk drives
Note: The concurrent maintenance light for the slot stops flashing and remains off when you remove
the disk drive.
11. Press Enter.
12. Does your disk drive have a bezel preinstalled?
v No: Continue with the procedure from here.
v Yes: Move on to step 5 on page 80 of the replace procedure.
Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive,
as shown in the following figure.
Replace the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
2. Remove the disk drive from its protective package.
3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion
unit.
4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install
the disk-drive bezel on the replacement disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
Figure 62. Removing a disk-drive bezel
Replacing a disk drive 79
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
5. Select Device Concurrent maintenance from the Work with Disk Unit Recovery display. Press Enter.
6. Enter either the physical location code or the PCI/SCSI location from which you removed the disk
drive. Press Enter.
7. Specify action as 2 (Install device).
8. Enter a time value in minutes, that is time required by you to install the new disk drive.
The concurrent maintenance light for the slot indicates that you can install the disk drive.
9. Press Enter. A Device Concurrent Maintenance Action display is shown, like the following display.
10. Enter c to confirm that you want to install the disk drive in the specified location.
A Device Concurrent Maintenance Action In Progress display like the following sample is shown.
Figure 63. Attaching a bezel to the disk drive
Device Concurrent Maintenance Action
Installation of device:
Location: U0.1-P1-I4/Z1-A8
You have 1 minute to perform the operation
During this time, your system can seem unresponsive
c=Confirm q=Cancel
Figure 64. Example Device Concurrent Action Maintenance display
80 Disk drives
11. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
12. Slide the disk drive all the way in into the system or expansion unit and then lock it in place by
pushing in the disk drive handle as shown in Figure 66.
Device Concurrent Maintenance Action In Progress
Please insert the device located in:
Location: U0.1-P1-I4/Z1-A8
Figure 65. Example Device Concurrent Maintenance Action In Progress display
Figure 66. Installing a disk drive in a model or 7311-D20
Replacing a disk drive 81
The concurrent maintenance light for the slot stops flashing and remains on when you have installed
the disk drive properly.
Figure 67. Installing a small-form factor disk drive
Figure 68. Installing a small-form factor disk drive in the 5803 expansion unit
82 Disk drives
Perform the procedure
1. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuilding data on a
replacement disk drive using Linux” on page 137.
2. Replace the front cover.
3. Install or close the system, expansion unit, or rack front door.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing the disk drive using Linux kernel 2.6
Learn how to replace a disk drive in a location that is controlled by a system or logical partition that is
running Linux kernel 2.6.
Prepare to remove the disk drive
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter.
The IBM Power RAID Configuration Utility display is shown.
3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown.
4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter.
5. Find the entry in the log for the disk drive you want to replace.
6. Record the location information for the disk drive.
Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0
is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit).
7. Return to the command line.
8. Type the following command:
ls -ld /sys/class/scsi_host/host#/device
Where # is the SCSI host number. Press Enter.
Kernel Messages Log
Select one of the following:
1. View most recent ipr error messages
2. View ipr error messages
3. View all kernel error messages
4. View iprconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. View ipr boot time messages
Selection:
e=Exit
Figure 69. Kernel Messages Log
Replacing a disk drive 83
9. Record the PCI location information.
Note: The PCI location information has the form of 61:01:0:2.
10. Type iprconfig on the command line and press Enter.
The IBM Power RAID Configuration Utility display is shown.
11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter.
The Display Hardware Status display is shown.
12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status.
13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive
before continuing with this procedure.
For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
14. Type option 1 (Display hardware resource information details) next to the disk drive you want to
replace. Press Enter.
The Disk Hardware Resource Information Details is displayed like the shown in the following
sample.
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 70. Example Display Hardware Status
84 Disk drives
15. Record the physical location information.
16. Return to the IBM Power RAID Configuration Utility display.
Remove the disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
3. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A
Concurrent Device Remove display is shown, like the following sample.
4. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter.
5. The Verify Device Concurrent Remove display is shown. The concurrent maintenance light turns on
for that disk drive slot. To view the locations of the concurrent maintenance lights, select the
appropriate model from the following list:
Disk Unit Hardware Resource Information Details
Manufacturer . . . . . . . . . . . . . . : IBM
Product ID . . . . . . . . . . . . . . . : ST336607LC
Firmware Version . . . . . . . . . . . . : 43353048 (C50H)
Serial Number. . . . . . . . . . . . . . : 00006719
Capacity . . . . . . . . . . . . . . . . : 36.40 GB
Resource Name. . . . . . . . . . . . . . : /dev/sdd
Physical location
PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0
SCSI Host Number . . . . . . . . . . . . : 2
SCSI Channel . . . . . . . . . . . . . . : 0
SCSI Id. . . . . . . . . . . . . . . . . : 4
SCSI Lun . . . . . . . . . . . . . . . . : 0
Extended Details
FRU Number . . . . . . . . . . . . . . . : 00P2676
More...
Press Enter to Continue
e=Exit q=Cancel f=PageDn b=PageUp
Figure 71. Example Disk Hardware Resource Information Details display
Concurrent Device Remove
Choose a single location for remove operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
0000:41:01.0.1/0:5: Empty
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
0000:41:01.0.1/1:8: Empty
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
0001:61:01.0.2/0:8: Empty
sdh 0001:61:01.0.2/0:9:0 Physical Disk Active
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
Figure 72. Example Concurrent Device Remove display
Replacing a disk drive 85
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
6. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive
you want to replace, then press Enter. The identify light turns on for the disk drive.
Note: Ensure that the disk drive is not in use to prevent loss of data.
7. The Device Concurrent Remove display is shown.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
8. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as
shown in Figure 73. If the handle is not all the way out, the disk drive does not slide out of the
system or expansion unit.
9. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
10. If your replacement disk drive does not have a bezel preinstalled, then perform the following step.
Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive,
as shown in the following figure.
Figure 73. Removing a disk drive from a model or 7311-D20
86 Disk drives
11. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the
disk drive.
Replace the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle them with care.
2. Remove the disk drive from its protective package.
3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion
unit.
4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install
the disk-drive bezel on the replacement disk drive:
a. Attach the bezel (A) to the disk drive, as shown in Figure 75 on page 88.
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
Figure 74. Removing a disk-drive bezel
Replacing a disk drive 87
5. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
6. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter.
A Concurrent Device Add display like the following sample is shown.
7. Type option1 (Select) next to the location from which you removed the disk drive.
The Verify Device Concurrent Add display is shown.
8. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add
display is shown.
9. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit and then lock it in place by
pushing in the disk drive handle, as shown in Figure 77 on page 89.
Figure 75. Attaching a bezel to the disk drive
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 76. Example Concurrent Device Add display
88 Disk drives
11. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is
installed.
Perform the procedure
1. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuilding data on a
replacement disk drive using Linux” on page 137.
2. Replace the front cover.
3. Install or close the system, expansion unit, or rack front door.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a disk drive with the IBM i system or logical partition
powered on
Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or expansion unit while the IBM i operating system or logical partition that controls the disk
drive location is powered on. Use the procedure in this information to perform this task.
Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state
drive with the IBM i system or logical partition powered on” on page 37. If you are removing a disk
drive as part of a service procedure, continue to use the following procedure.
Figure 77. Installing a disk drive in a model or 7311-D20
Replacing a disk drive 89
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid
potential data loss or system damage. You might also consider the powered-off procedure, which is a
simpler task that involves less risk of data loss or system damage. For instructions, see “Replacing a disk
drive with the system or logical partition powered off” on page 106.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. If applicable, remove the system unit door, expansion unit door, or the rack door.
3. If applicable, remove the front cover.
v If you have a front cover from any other models, remove the front cover.
90 Disk drives
Remove the disk drive
1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and perform that procedure, then return here and perform the following
steps.
2. Locate the disk drive to be replaced and record the location information, then use the location
information to activate the identify light for that disk drive. For instructions, see “Identifying a
failing part in an IBM i system or logical partition” on page 195.
3. Determine if the disk drive to be removed is a non-configured unit by performing the following
steps:
a. Type strsst on the command line of the IBM i session and press Enter.
b. Type your service tools user ID and service tools password and press Enter.
c. Select Work with disk units and press Enter.
d. Select Display Disk Configuration and press Enter.
e. Select Display non-configured units and press Enter.
Is the failed drive a non-configured unit?
v No: Continue to Verify disk is suspended.
v Yes: Continue to next step.
f. Determine if the disk drive to be removed was replaced by a hot spare when it failed. An
xxxx9031 error logged close to the same time of the original disk failure indicates the automatic
Rebuild of the data to a hot spare device.
4. Verify that the disk drive you are replacing has been suspended before continuing with this
procedure. For instructions, see “Preparing to remove a disk drive from a system or logical partition
controlled by IBM i” on page 61.
5. Did you find the service action log entry related to this problem?
v No: Exit the service action log and go to step 7.
v Yes: Continue with the next step.
6. Select the concurrent maintenance option. This option fills in the disk drive location on the
concurrent maintenance display. Go to step 9 on page 92.
7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter.
The Device Concurrent Maintenance display is shown, as shown in the following figure.
8. If the physical location does not appear in the Device Concurrent Maintenance display, type the
physical location where you are replacing the disk drive as shown in the preceding example,
Figure 78.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 1
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 78. Example Device Concurrent Maintenance display
Replacing a disk drive 91
The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position
of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location
information from the control panel display on the system unit or expansion unit.
If you cannot get the information from the control panel, find the feature code (FC) and sequence
number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as
shown in the following figure.
In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label,
AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and
P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and
concurrent maintenance lights” on page 125.
In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit
with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
9. Select option 1 (Remove device) for the Action to be performed.
10. Set the time delay for one minute: 01.
Important: Do not press Enter at this time.
11. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you
are replacing. To view the locations of the concurrent maintenance lights, select the appropriate
model from the following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
Important: When you press Enter, after a 1-minute delay, this light comes on and begins to flash
rapidly. You then have nine seconds to remove the disk drive.
12. Press Enter on the console.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
13. When the concurrent maintenance light starts blinking, squeeze and pull the handle of the disk drive
out toward you before you remove the disk drive, as shown in the following figure. If the handle is
not all the way out, the disk drive cannot slide out of the system or expansion unit.
Figure 79. Example system or expansion unit label
92 Disk drives
14. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by
the handle.
Note: The concurrent maintenance light will stop blinking and remain off when the disk drive is
removed.
15. Loosen and pull out on the thumbscrew B to release the disk-drive bezel C from the disk drive, as
shown in the following figure.
Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk
drive. The replacement drive should come with an attached bezel.
Figure 80. Removing a disk drive
Replacing a disk drive 93
Replace the disk drive
1. Return to the console and wait until the Concurrent Maintenance Results display is shown.
Press F12.
2. The physical locations you entered in step 8 on page 91 of this procedure might still appear on the
display. If not, retype the physical location where you will be installing the new disk drive.
In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with
type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4.
In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit
with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32.
3. Select option 2 (Install device) for the Action to be performed.
4. Set the time delay for one minute:01.
Important: Do not press Enter at this time.
5. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you
are replacing.
Important: When you press Enter, after a one minute delay, this light comes on and begins to blink
rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by
pushing in the disk drive handle.
6. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
7. Remove the disk drive from its protective package.
Figure 81. Removing a disk-drive bezel
94 Disk drives
8. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive can not slide into the system or expansion
unit.
9. Follow these steps to install the disk-drive bezel on the replacement disk drive:
a. Attach the bezel A to the disk drive, as shown in the following figure.
b. Press in and then tighten the thumbscrew B to secure the bezel to the disk drive.
10. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
11. Slide the disk drive halfway into the system or expansion unit.
12. Press Enter on the console.
13. When the light starts blinking, slide the disk drive all the way in. Then, lock the disk drive in place
by pushing in the disk drive handle, as shown in the following figure.
Note: The light stops blinking and remains on when you have installed the disk drive properly.
Figure 82. Installing a disk-drive bezel
Replacing a disk drive 95
Complete the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit or rack front door.
3. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press
Enter.
4. If you return to the Service Action Log display, exit the service action log.
5. When the Hardware Service Manager display is shown, press F3.
6. To complete this procedure, go to “Rebuilding data on a replacement disk drive using IBM i” on page
47.
Figure 83. Installing a disk drive
96 Disk drives
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a disk drive using hot spare in an IBM i system or logical
partition
You can use the hot spare function to replace a disk drive in a system or expansion unit while the IBM i
operating system or logical partition that controls the disk drive location is powered on. Use the
procedure in this section to perform this task.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
Replacing a disk drive 97
that you have. However, the steps to perform the task are the same.
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. Remove the front cover and door, if you need to, to access the disk drives. For instructions on
removing the cover, see “System covers” on page 207.
3. Locate the disk drive to be replaced and record the location information, then use the location
information to activate the identify light for that disk drive. For instructions, see “Identifying a failing
part in an IBM i system or logical partition” on page 195
Remove the disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. Type strsst on the command line of the IBM i session and press Enter.
3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display, and press Enter.
4. Select Work with disk units from the System Service Tools display, press Enter.
5. Select Display Disk Configuration from the Work with Disk Units display, and press Enter.
6. Select Display non-configured units from the Display Disk Configuration display, and press Enter.
7. Is the failed drive a non-configured unit?
v No: Go to “Replacing a disk drive with the IBM i system or logical partition powered on” on page
89.
v Yes: Determine if the disk drive to be removed was replaced by a hot spare when it failed. A
xxxx9031 error logged close to the same time of the original disk failure indicates the automatic
Rebuild of the data to a hot spare device, then go on to the next step.
8. Is the disk drive to be removed controlled by the load source adapter?
v No: Go to step 11.
v Yes: Go to the next step.
9. Is the disk drive to be removed located in a valid load source position?
v No: Go to “Configuring a disk drive on a load source adapter for hot spare using the IBM i
operating system” on page 145.
v Yes: Go to step 11.
10. Choose the concurrent maintenance option. This option fills in the disk drive location on the
concurrent maintenance display. Go to step 13 on page 99.
11. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter.
The Device Concurrent Maintenance display is shown, as shown in the following figure.
98 Disk drives
12. If the physical location does not appear in the Device Concurrent Maintenance display, type the
physical location where you will be replacing the disk drive as shown in the preceding example,
Figure 84.
The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position
of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location
information from the control panel display on the system unit or expansion unit.
If you cannot get the information from the control panel, find the feature code (FC) and sequence
number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as
shown in the following figure.
In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label,
AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and
P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and
concurrent maintenance lights” on page 125.
In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit
with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
13. Select option 1 (Remove device) for the Action to be performed.
14. Set the time delay for one minute: 01.
Important: Do not press Enter at this time.
15. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you
are replacing. To view the locations of the concurrent maintenance lights, select the appropriate
model from the following list:
v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and
7031-T24 SCSI disk-drive enclosures” on page 133
Important: When you press Enter, after a one minute delay, this light comes on and begins to blink
rapidly. You then have nine seconds to remove the disk drive.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 1
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 84. Example Device Concurrent Maintenance display
Figure 85. Example system or expansion unit label
Replacing a disk drive 99
16. Press Enter on the console.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
17. When the concurrent maintenance light starts blinking, squeeze and pull the handle of the disk drive
out toward you before you remove the disk drive, as shown in the following figure. If the handle is
not all the way out, the disk drive cannot slide out of the system or expansion unit.
Figure 86. Removing a disk drive
100 Disk drives
18. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by
the handle.
Note: The concurrent maintenance light will stop blinking and remain off when the disk drive is
removed.
19. Loosen and pull out on the thumbscrew B to release the disk-drive bezel C from the disk drive, as
shown in the following figure.
Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk
drive. The replacement drive should come with an attached bezel.
Figure 87. Removing a small-form factor disk drive from the 5803 expansion unit
Replacing a disk drive 101
Replace the disk drive
1. Return to the console and wait until the Concurrent Maintenance Results display is shown.
Press F12.
2. The physical locations you entered in step 12 on page 99 of this procedure might still appear on the
display. If not, retype the physical location where you will be installing the new disk drive.
In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with
type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4.
In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit
with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32.
3. Select option 2 (Install device) for the Action to be performed.
4. Set the time delay for one minute:01.
Important: Do not press Enter at this time.
5. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you
are replacing.
Important: When you press Enter, after a one minute delay, this light comes on and begins to blink
rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by
pushing in the disk drive handle.
6. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
7. Remove the disk drive from its protective package.
Figure 88. Removing a disk-drive bezel
102 Disk drives
8. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive can not slide into the system or expansion
unit.
9. Follow these steps to install the disk-drive bezel on the replacement disk drive:
a. Attach the bezel A to the disk drive, as shown in the following figure.
b. Press in and then tighten the thumbscrew B to secure the bezel to the disk drive.
10. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
11. Slide the disk drive halfway into the system or expansion unit.
12. Press Enter on the console.
13. When the light starts blinking, slide the disk drive all the way in. Then, lock the disk drive in place
by pushing in the disk drive handle, as shown in the following figure.
Note: The light stops blinking and remains on when you have installed the disk drive properly.
Figure 89. Installing a disk-drive bezel
Replacing a disk drive 103
Perform the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit or rack front door.
3. Return to the console and wait until the Concurrent Maintenance Results display is shown, and
press Enter if you used the concurrent maintenance process above, otherwise continue with the next
step.
Figure 90. Installing a disk drive
Figure 91. Installing a small-form factor disk drive in the 5803 expansion unit
104 Disk drives
4. If you return to the Hardware Service Manager (HSM), exit the HSM.
5. Select Work with disk units from the Start System Service Tools display.
6. Select Non-configured disk units from the Work with Disk Units display, and press Enter.
7. Is the new disk unit displayed with a status of non-configured?
Note: It may take several minutes for the new disk drive to display.
v No: Contact your next level of support.
v Yes: Continue with the next step.
8. Press F3 to return to the Work with Disk Units display.
9. Select Work with disk unit recovery from the Work with Disk Units display, and press Enter.
10. Select Disk unit problem recovery procedure, and press Enter.
11. Select Initialize and format disk unit, and press Enter.
12. Select the new disk and press Enter.
Note: This will take several minutes to complete.
13. When the disk drive is initialized and formatted, press F3 to return to the Work with Disk Units
display.
14. Select Start hot spare, and press Enter.
15. Select the IOA with the new disk, then press Enter and press Enter again.
16. Does the new hot spare disk drive have a smaller capacity than previous hot spare disk drive? Refer
to the resource name identified in the xxxx9031 error which was logged close to the same time of the
original disk failure.
v No: The procedure is complete. Press F3 to return to the System service tools display.
v Yes: Go to the next step.
17. Select Display Device Parity Status, and press Enter. If the new disk drive is not active and hot
spare protected, do not continue. Call your next level of support.
Restriction: You must complete the following steps to ensure your data is protected. The following
steps will allow you to exchange the two disk drives so that the larger disk drive will become the
hot spare, and the smaller disk drive will become a parity set member.
18. Fail the previous hot spare disk drive by unlocking the disk drive handle by squeezing it and
pulling it out toward you.
19. Pull the drive out about one inch and wait ten seconds.
20. Push the disk drive back into the slot and close the handle.
21. Return to step 3 on page 104 and start hot spare on the larger drive.
Replacing a disk drive 105
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a disk drive with the system or logical partition powered off
You might need to install a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk
drive in a system or logical partition that controls the disk drive location with the power off. Use the
procedure in this section to perform this task.
Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state
drive with the system or logical partition power turned off” on page 49. If you are removing a disk drive
as part of a service procedure, continue to use the following procedure.
106 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Prepare the system
1. Perform prerequisite tasks as described in “Before you begin” on page 189.
2. If you cannot start the system or logical partition that controls the disk drive location, or if the system
or logical partition are not powered on, go to step 6 on page 108.
3. Identify the disk drive to be replaced and record the location information. For information, see
“Identifying a failing part” on page 193.
4. Prepare to remove the disk drive from the system or expansion unit as follows:
v If the system or logical partition that controls the disk drive location is running AIX, follow the
steps in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on
page 137.
v If the system or logical partition that controls the disk drive location is running IBM i, follow the
steps in “Preparing to remove a disk drive from a system or logical partition controlled by IBM i”
on page 61.
Replacing a disk drive 107
v If the system or logical partition that controls the disk drive location is running Linux and the disk
drive you want to replace is unprotected or in use, move the data from the disk drive before
continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide
for Linux.
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
5. Stop the system or logical partition that controls the disk drive location. For instructions, see
“Stopping a system or logical partition” on page 201.
6. If you stopped the logical partition, do not disconnect the power source. If you stopped the system,
disconnect the power source from the system by unplugging the system.
Note: This system might be equipped with a second power supply. Before continuing with this
procedure, ensure that all power sources to the system have been completely disconnected.
(L003)
or
7. Remove the system unit door or expansion unit door or open the rack front door. For instructions,
select the appropriate procedure from the following list:
v “Removing and replacing the expansion unit cover or door” on page 205
8. Remove the front cover.
Remove the disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID
and SSD SAS adapter and complete that procedure, then return here and perform the following steps.
2. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 92 on
page 109.
108 Disk drives
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
Figure 92. Removing a disk drive from a model or 7311-D20
Replacing a disk drive 109
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
4. If your replacement disk drive does not have a bezel preinstalled, then perform the following step.
Loosen and then pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk
drive, as shown in the following figure.
Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk
drive. The replacement drive should come with an attached bezel.
Figure 93. Removing a small-form factor disk drive from the 5803 expansion unit
110 Disk drives
Replace the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
2. Remove the disk drive from its protective package.
3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive will not slide into the system or expansion
unit.
4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install the
disk-drive bezel on the replacement disk drive:
a. Attach the bezel (A) to the disk drive, as shown in the following figure.
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
Figure 94. Removing a disk-drive bezel
Replacing a disk drive 111
5. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
6. Slide the disk drive into the system or expansion unit until it stops.
Note: It is important to ensure that when installing a disk drive, that the drive is fully seated and all
the way into the system.
7. Push the disk drive handle in until it locks, as shown in Figure 96 on page 113.
Figure 95. Attaching a disk-drive bezel
112 Disk drives
Complete the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit or rack front door.
3. Reconnect all system power cables.
4. Reconnect the system to the power supply by plugging it in.
Figure 96. Installing a disk drive in a model or 7311-D20
Figure 97. Installing a small-form factor disk drive in the 5803 expansion unit
Replacing a disk drive 113
5. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on
page 199.
6. If the system or logical partition that controls the disk drive you replaced is running AIX, verify that
the disk drive is working as described in “Verifying an installed feature or replaced part in an AIX
system or logical partition” on page 249. If it is not possible to run online diagnostics, for example if
you replaced the root volume group (rootvg) disk drive, use standalone diagnostics as described in
“Verifying an installed part by using stand-alone diagnostics” on page 253.
7. To rebuild the data on the replacement disk drive, select the operating system for the system or
logical partition that controls the disk drive:
v “Rebuilding data on a replacement disk drive using AIX” on page 137
v “Rebuilding data on a replacement disk drive using IBM i” on page 47
v “Rebuilding data on a replacement disk drive using Linux” on page 137
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a SCSI disk drive
Learn how to replace a Small Computer System Interface (SCSI) disk drive in a system or expansion unit.
Use the procedure in this information to perform this task.
Remove the disk drive
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Press Enter to continue.
4. On the Function Selection display, select Task Selection.
5. On the Task Selection display, select Hot Plug Task.
6. Select SCSI and SCSI RAID Hot Plug Manager.
7. Select Identify a Device Attached to a SCSI Hot Swap Enclosure Device.
8. Select the disk drive you want to replace and press Enter.
The disk drive slot enters the Identify state, indicated by the flashing concurrent maintenance light.
To view the locations of the concurrent maintenance lights, go to “Disk drive locations and
concurrent maintenance lights” on page 125.
9. Exit to the SCSI and SCSI RAID Hot Plug Manager display.
10. On the SCSI and SCSI RAID Hot Plug Manager display, select Replace/Remove a Device Attached
to a SCSI Hot Swap Enclosure Device. A list of all slots from which you can remove a disk drive is
shown.
11. Select the slot that contains the disk drive you want to remove. The disk drive slot enters the
Remove state and power is removed from the slot. The concurrent maintenance light for the slot
begins flashing rapidly.
12. Unlock the disk drive. Unlock the disk drive handle by squeezing it and pulling it out toward you
as shown in the following figures.
114 Disk drives
13. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
Note: The concurrent maintenance light for the slot turns off when you remove the disk drive.
14. Loosen and pull out on the thumbscrew B to release the bezel C from the disk drive.
Figure 98. Removing a disk drive from a model or 7311-D20
Figure 99. Removing a small-form factor disk drive from the 5803 expansion unit
Replacing a disk drive 115
Replace the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
2. Remove the disk drive from its protective package.
3. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the
handle is not all the way out, the disk drive cannot slide into the system.
4. Attach disk-drive bezel A to the replacement disk drive, as shown in the following figure.
Figure 100. Removing a disk-drive bezel
116 Disk drives
5. Press in and then tighten thumbscrew B to secure bezel A to the disk drive.
6. Install the disk drive as follows:
a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do
not hold the disk drive by the handle.
b. Slide the disk drive into the system until it stops.
Note: It is important to ensure that when installing a disk drive, that the drive is fully seated
and all the way into the system.
c. Push the disk drive handle in until it locks, as shown in Figure 102 on page 118.
Figure 101. Attaching a disk-drive bezel
Replacing a disk drive 117
7. Press Enter.
8. Exit to the SCSI and SCSI RAID Hot-Plug Manager menu.
9. Select Configure Added/Replaced Devices. Press Enter.
10. Select the disk drive that you replaced and then press Enter.
The disk drive is now configured.
11. Exit to the Task Selection display.
Figure 102. Installing a disk drive in a model a model or 7311-D20
Figure 103. Installing a small-form factor disk drive in the 5803 expansion unit
118 Disk drives
12. Select Log Repair Action.
13. Select the disk drive that you replaced and then press Enter.
14. Select Commit after you have made your selection and then press Enter.
15. Exit to the command line.
Perform the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit, or rack front door.
3. Verify that the disk drive is working. For instructions, see “Verifying an installed feature or replaced
part in an AIX system or logical partition” on page 249. If it is not possible to run online AIX
diagnostics, for example if you replaced the root volume group (rootvg) disk drive, use stand-alone
diagnostics as described in “Verifying an installed part by using stand-alone diagnostics” on page 253.
4. To prepare the disk drive for use, go to “Rebuilding data on a replacement disk drive using AIX” on
page 137.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Replacing a disk drive 119
120 Disk drives
Removing or installing a disk-drive filler
You might need to remove or install a Small Computer System Interface (SCSI) disk-drive filler in the
server or expansion unit. This section includes procedures so that you can perform these tasks.
Note: Refill the disk drive slots with either another disk drive or a disk-drive filler. Filling the disk drive
slot will help ensure proper air flow for cooling and help maintain optimal EMI compliance.
Removing a disk-drive filler
Learn how to remove a disk-drive filler from the server or expansion unit. Use the procedure in this
information to perform this task.
Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that
you have. However, the steps to perform the task are the same.
1. Remove the system unit door, expansion unit door, or open the rack front door. For instructions, see
“Removing and replacing the expansion unit cover or door” on page 205.
2. Remove the front cover. For instructions on removing the cover, see “System covers” on page 207. If
you have a front cover from any other models, remove the front cover.
3. Identify the disk-drive filler you want to remove.
4. Pull the disk-drive filler from the slot. Pull the disk-drive filler from the slot that you want to use, as
shown in Figure 104 on page 122 and Figure 105 on page 122.
Attention:
v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5
seconds.
© Copyright IBM Corp. 2010 121
Figure 104. Removing a disk-drive filler from model 8233-E8B or 8236-E8C
Figure 105. Removing a disk-drive filler from model 9117-MMB or 9179-MHB
122 Disk drives
Installing a disk-drive filler
You might need to install a disk-drive filler in the server or expansion unit. Use the procedure in this
section to perform this task.
Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that
you have. However, the steps to perform the task are the same.
1. Remove the system unit door, expansion unit door, or open the rack front door. For instructions, see
“Removing and replacing the expansion unit cover or door” on page 205.
2. Remove the front cover. For instructions on removing the cover, see “System covers” on page 207. If
you have a front cover from any other models, remove the front cover.
3. Install the disk-drive filler in the system or expansion unit, as shown in the following figures.
Figure 106. Installing a disk-drive filler for model 8233-E8B or 8236-E8C
Removing or installing a disk-drive filler 123
4. Replace the front cover.
5. Install or close the system, expansion unit or rack front door.
Related concepts
Removing and replacing covers and doors
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Related reference
“Removing and replacing the expansion unit cover or door” on page 205
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Figure 107. Installing a disk-drive filler for model 9117-MMB or 9179-MHB
124 Disk drives
Disk drive locations and concurrent maintenance lights
Find out where the disk drives and concurrent maintenance lights are located on different models.
Locate your system or expansion unit to find the disk drive locations and where the concurrent
maintenance lights are located.
PCIe RAID and SSD SAS adapter locations
Learn about the PCIe RAID and SSD SAS adapter locations and the location of the concurrent
maintenance lights on the PCIe RAID and SSD SAS adapter.
The figure Figure 108 shows the PCIe RAID and SSD SAS adapter locations.
Disk drive locations and concurrent maintenance lights for an
8202-E4B or 8205-E6B system
Learn about the disk drive locations and understand where the concurrent maintenance lights are located
for your system.
The following figures show the disk drive locations and concurrent maintenance lights for the 8202-E4B
or 8205-E6B. Concurrent maintenance lights are located above the latch handle on the disk drives.
Figure 108. PCIe RAID and SSD SAS adapter locations
© Copyright IBM Corp. 2010 125
Figure 109. Disk drive locations for the rack model
Figure 110. Disk drive locations for the tower model
126 Disk drives
Disk drive locations and concurrent maintenance lights for 8231-E2B
Learn about the disk drive locations and the location of the concurrent maintenance lights on the system.
The following figures show the disk drive locations and concurrent maintenance lights for the 8231-E2B.
Concurrent maintenance lights are located above the latch handle on the disk drives.
Figure 111. Concurrent maintenance light locations for the rack model
Figure 112. Concurrent maintenance light locations for the tower model
Disk drive locations and concurrent maintenance lights 127
Figure 113. Disk drive locations for base model with six SFF bays
Figure 114. Disk drive locations for model with three SFF bays (optional)
Figure 115. Concurrent maintenance light locations for base model with six SFF bays
128 Disk drives
Disk drive locations and concurrent maintenance lights for 8233-E8B
and 8236-E8C
Learn the disk drive locations and where the concurrent maintenance lights are located for your system.
The following figures show the disk drive locations and concurrent maintenance lights for the 8233-E8B
and 8236-E8C. Concurrent maintenance lights are located above the latch handle on the disk drives.
Figure 116. Concurrent maintenance light locations for model with three SFF bays (optional)
Figure 117. Disk drive locations
Disk drive locations and concurrent maintenance lights 129
Disk drive locations and concurrent maintenance lights for models
9117-MMB or 9179-MHB
Learn the disk drive locations for your system.
The following figures show the disk drive locations and concurrent maintenance lights for the IBM model
9117-MMB or 9179-MHB.
Figure 118. Concurrent maintenance light locations
Figure 119. Disk drive locations
130 Disk drives
Disk drive locations and concurrent maintenance lights for the 5886
disk enclosure
Learn the disk drive locations and where the concurrent maintenance lights are located for your disk
enclosure.
The following figure shows the disk drive locations for the 5886 disk enclosure.
Figure 120. Concurrent maintenance light locations
Disk drive locations and concurrent maintenance lights 131
The following figure shows the locations of the disk-drive concurrent maintenance lights for the 5886
disk-drive enclosure.
Figure 121. Disk drive locations for the 5886 disk enclosure
Figure 122. Front view of disk-drive enclosure concurrent maintenance lights for model 5886
132 Disk drives
Disk drive locations and concurrent maintenance lights for the 5786,
5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures
Learn the disk drive locations and where the concurrent maintenance lights are located for your
disk-drive enclosure.
The availability of disk drive locations depends on the number of SCSI interface cards installed in the
SCSI disk-drive enclosure. If the next available disk drive position is the first slot in a disk-drive
enclosure, such as D21, D26, or D46, look at the disk-drive controller to verify that the enclosure is
connected to a SCSI bus port. The controller end of the cable is clearly labeled with the disk-drive
enclosure to which it connects. If the enclosure is not connected to a disk-drive controller, you must either
install the disk drive in a different disk-drive enclosure or connect the enclosure to a SCSI bus port on a
disk-drive controller. You might need to add a disk-drive controller before you can add disk drives to the
expansion unit.
The following figures show the disk drive locations for the front and back of the 5786, 5787, 7031-D24,
and 7031-T24 SCSI disk-drive enclosures.
Figure 123. Front view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures
Disk drive locations and concurrent maintenance lights 133
The following figures show the locations of the disk-drive concurrent maintenance light A for the models
5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures.
Figure 124. Back view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures
Figure 125. Front view of SCSI disk-drive enclosure concurrent maintenance lights for model 5786, 5787, 7031-D24,
and 7031-T24
Figure 126. Back view of SCSI disk-drive enclosure concurrent maintenance lights for model 5786, 5787, 7031-D24,
and 7031-T24
134 Disk drives
Disk drive locations and concurrent maintenance lights for the 5802
expansion unit
Learn the disk drive locations and where the concurrent maintenance lights are located for your
expansion unit.
The following figure shows the disk drive locations for the 5802.
Concurrent maintenance lights are located at the bottom of the latch handle on each disk drive.
Related information
Cabling the 5802 disk subsystem
Figure 127. Disk drive locations
Disk drive locations and concurrent maintenance lights 135
136 Disk drives
Related information
Locate additional disk-drive related information and instructions you might need.
Preparing a disk drive for use in an AIX system or logical partition
Configure the newly installed disk drive for your environment.
If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference
Guide for AIX. If the disk drive is attached to another controller, see the AIX System Management Guide:
Operating System and Devices. These guides are available at the System p and AIX Information Center.
(http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp)
Preparing to remove a disk drive from a system or expansion unit
controlled by AIX
Learn the actions you need to take to properly remove a disk drive.
Before removing a disk drive from a system or expansion unit that is controlled by AIX, ensure that all
data on that disk drive or the array that contains the drive is backed up and removed from the disk drive
and that the disk drive is in the Defined state.
If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference
Guide for AIX. If the disk drive is attached to another controller, see the information for that controller or
the AIX System Management Guide: Operating System and Devices. These guides are available at the System
p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp)
Rebuilding data on a replacement disk drive using AIX
To rebuild data on the replacement disk drive, refer to the information for the controller to which the
disk drive is attached.
v If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller
Reference Guide for AIX.
v If the disk drive is attached to a PCI SCSI RAID adapter (feature code 2498), see the PCI 4–Channel
Ultra3 SCSI RAID Adapter Reference Guide.
For more information, see the AIX System Management Guide: Operating System and Devices.These guides
are available at the System p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/
pseries/v5r3/index.jsp)
Rebuilding data on a replacement disk drive using Linux
Use this procedure to rebuild data on the replacement disk drive.
To rebuild data on the replacement disk drive, complete the steps below.
For an unprotected disk drive
If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID
Level 10 disk array, perform these tasks:
1. Re-create the disk array.
© Copyright IBM Corp. 2010 137
2. Re-create the file systems on the disk array.
3. Copy the data back to the restored disk array from your backup media.
For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is
available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/infocenter/pseries/
v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
For a disk drive in a RAID Level 5 or RAID Level 10 disk array, select the appropriate procedure:
Rebuilding data on a disk drive using Linux kernel 2.4
Complete the steps listed here to rebuild your data.
If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you
can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID
Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
If a hot spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly
installed disk drive. To initiate the rebuild, follow these steps:
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The Work with Disk Units
display is shown.
3. From the Work with Disk Units display, select Work with disk unit recovery. The Work with Disk
Unit Recovery display is shown.
4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data.
A Rebuild Disk Unit Data display similar to the following figure is shown below.
5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter.
Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive.
A Confirm Rebuild Disk Unit Data display similar to the following figure shown below.
Rebuild Disk Unit Data
Select the units to be rebuilt
Type choice, press Enter.
1=Rebuild
Vendor Product Serial PCI PCI SCSI SCSI SCSI
Option ID ID Number Bus Dev Bus ID Lun
IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0
e=Exit q=Cancel f=PageDn b=PageUp
Figure 128. Example Rebuild Disk Unit Data display
138 Disk drives
6. To confirm that you want to rebuild the data on the disk drive, press Enter.
The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery
display. The rebuild process might take several minutes to complete.
Rebuilding data on a disk drive using Linux kernel 2.6
Learn the steps to rebuild your data on a disk drive.
If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you
can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID
Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
If a hot-spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly
installed disk drive. To initiate the rebuild, follow these steps:
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
3. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press
Enter. The Work with Disk Unit Recovery display is shown.
4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data.
A Rebuild Disk Unit Data display similar to the following is shown.
Confirm Rebuild Disk Unit Data
Rebuilding the disk unit data could take several minutes for
each unit selected.
Press Enter to confirm having the data rebuilt.
Press q=Cancel to return and change your choice.
Vendor Product Serial PCI PCI SCSI SCSI SCSI
Option ID ID Number Bus Dev Bus ID Lun
1 IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0
q=Cancel f=PageDn b=PageUp
Figure 129. Example Confirm Rebuild Disk Unit Data display
Related information 139
5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter.
Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive.
A Confirm Rebuild Disk Unit Data display similar to the following is shown.
6. To confirm that you want to rebuild the data on the disk drive, press Enter.
The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery
display. The rebuild process might take several minutes to complete.
Rebuilding data on a replacement disk drive using IBM i
Learn how to rebuild data on replacement disk.
To rebuild data on the replacement disk drive, complete the following steps:
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case sensitive.
3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter.
4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter.
Rebuild Disk Unit Data
Select the disks to be rebuilt
Type choice, press Enter.
1=Rebuild
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
e=Exit q=Cancel t=Toggle
Figure 130. Example Rebuild Disk Unit Data display
Confirm Rebuild Disk Unit Data
Rebuilding the disk unit data might take several minutes for each disk
selected.
Press Enter to confirm having the data rebuilt.
q=Cancel to return and change your choice.
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ---------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
q=Cancel t=Toggle
Figure 131. Example Confirm Rebuild Disk Unit Data display
140 Disk drives
5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these
steps to restore device parity protection (RAID) on the disk drive:
a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter.
b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you
removed) on the Rebuild Disk Unit Data display. Press Enter.
c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several
minutes to complete.
d. Press F5 to refresh the display until the Percent complete shows 5%.
e. When the display shows at least 5% complete, you can either continue to monitor this display to
completion, or press F3 (Exit) to return to the Work with disk units display.
f. Press F3 (Exit) to return to the System service tools display.
g. Press F3 (Exit) to return to the Exit SST display and press Enter.
h. Return to the procedure that sent you here.
6. If the disk drive that you replaced had mirrored protection, follow these steps:
a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter.
b. Select the configured disk drive that you are exchanging (suspended drive) on the Select
Configured Unit to Replace display. Press Enter.
c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has
a non-configured status.
Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these
steps until the new drive is shown.
Press Enter.
d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for
replacement.
The replacement process might take several minutes to complete. When the process is complete,
the Work with Disk unit recovery display is shown.
e. Press F3 (Exit) to return to the Work with disk units display.
f. Select Display disk configuration on the Work with disk units display.
g. Select Display disk configuration status on the Display Disk Configuration display.
Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process
might take several minutes to complete. You can either monitor this display to completion, or
press F3 (Exit) three times, and then press Enter to return to the IBM i main menu.
Configuring a disk drive in an system or logical partition
To configure a newly installed disk drive into an existing device parity set, follow these steps:
If you want to start a new device parity set or change the protection on your disks, see Manage device
parity protection or Start device parity protection in the Disk Management section of the IBM i
Information Center website at http://www.ibm.com/systems/i/infocenter .
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Restriction: The service tools password is case sensitive.
3. Select Work with Disk Units on the System Service Tools display. Press Enter.
4. Select Display disk configuration on the Work with Disk Units display. Press Enter.
5. Select Display non-configured units on the Display disk configuration display. Press Enter.
Related information 141
6. The disk drive that you installed is listed. The serial number should match the serial number you
recorded (the last 4 digits).
Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not
listed, ensure that the disk drive was properly installed.
7. Press F12 twice.
8. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
9. Are you adding a disk drive to an existing device parity set?
v Yes: Continue with the next step.
v No: Go to step 23 on page 46 for mirroring or no protection, go to 1 on page 47 for hot spare.
Tip: Select No for mirroring, hot spare or no protection.
10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter.
11. Select Include unit in device parity protection on the Work with Device Parity Protection display.
Press Enter.
12. Did the Include Device Parity Protection Failed display appear?
v Yes: Continue with the next step.
v No: Go to step 15 on page 46.
13. The following conditions must be met before you can start device parity protection.
v If there are enough disk drives available to create a new parity set, the drives are eligible for Start
Device Parity Protection operation and not for the Include Device Parity Protection operation.
v All disk drives in a parity set must be the same capacity with a minimum number of three or four
disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the
disk drive type.
v Not all disk drives attached to an advanced function input output processor have reported to the
system. Retry the operation.
v The type or model of the disk drives cannot be used for the requested operation.
When these conditions are met, press F12 to return to the Work with Disk Configuration display.
14. Select Include unit in device parity protection on the Work with disk configuration display. Press
Enter.
15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included
in Device Parity Protection. Type a 1 in the Option column. Press Enter.
16. Press Enter to confirm your choice. The configuration starts and might take 30 to 60 minutes to
complete.
17. When the device parity protection is complete, the message “Selected drives have been included
successfully” is shown on the Work with device parity protection display. Press F12.
18. Select Display disk configuration on the Work with disk configuration display. Press Enter.
19. Select Display disk configuration status on the Display disk configuration display.
20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to
add the disk drive.
21. Press F12 twice to get to the Work with Disk Units display.
22. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
23. Select the option to add units to ASPs and balance data on the Work with disk configuration display.
Press Enter.
24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1.
Press Enter.
Notes:
a. If ASP is mirror protected, disk drives must be added in pairs of the same size.
142 Disk drives
b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become
mirror protected.
c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information
Center.
To access the IBM i Information Center Web site, go to http://www.ibm.com/systems/i/
infocenter and select from the following options:
v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup
and recovery > Recovering your system
v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and
recovery > Backing up and recovering your system > Recovering your system
25. The Confirm Add Units display is shown showing the configuration your system will have when the
add operation completes.
Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to
continue. The add process will take several minutes to complete.
26. When the message Selected units have been added successfully is shown, press F3 three times
and press Enter to return to the Main Menu display.
27. If you have a printer, print the configuration list. A service provider might refer to the configuration
list in the future.
a. You need to sign on with at least service level authority
b. Type strsst on the Main Menu command line. Press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case sensitive.
d. Select Start a service tool on the System Service Tools (SST) display. Press Enter.
e. Select Hardware service manager on the Start a Service Tool display. Press Enter.
f. Press F6 (print the configuration) from the Hardware Service Manager display.
g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter.
h. Place the configuration list where you can refer to it in the future.
28. If you are a service provider performing this task, return to the procedure that sent you here.
1. Select Work with hot spare protection from the Work with disk configuration menu.
2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection -
RAID 6 with hot spare depending on the level of parity protection that is desired.
3. Return to the procedure that sent you here.
Preparing to remove a disk drive from a system or logical partition
controlled by IBM i
Learn the actions you need to take to properly remove a disk drive.
Before removing a disk drive from a system or expansion unit that is controlled by IBM i, follow these
steps:
1. Determine the protection status of the disk drive that you are replacing. For instructions, see
“Determining the IBM i disk drive protection status” on page 48.
2. The service provider must perform one of the following tasks:
v Save data and initialize the disk drive.
v Initialize the disk drive and copy to disk.
3. For mirrored disk drives on an IBM i system or logical partition, follow these steps to verify that the
disk drive you are replacing has been suspended:
Related information 143
a. Sign on with service level authority.
b. Type strsst on the command line of the IBM i session and press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case sensitive.
d. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press
Enter.
g. Does the disk drive that is mirrored to the disk drive that you are replacing (same drive number)
have a status of Active?
v No: The replacement must be performed by a service provider.
v Yes: Does the disk drive that you are replacing have a status of Suspended?
– Yes: Continue with the next step.
– No: Follow these steps to suspend mirrored protection on the disk drive that you are
replacing:
1) Press F3 from the Display Disk Configuration display to return to the Work with Disk
Units display.
2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press
Enter.
3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display.
Press Enter.
4) Select the option to suspend the disk drive that you are replacing from the Suspend
Mirrored Protection display. Press Enter.
h. Press F3 repeatedly to exit System Service Tools and return to the main menu.
Determining the IBM i disk drive protection status
To determine the protection status of the disk drives in your IBM i system or logical partition, follow
these steps from the System Service Tools (SST) display:
Tip: If you do not know the protection status of the disk drives and you cannot get to the System Service
Tools display, contact your next level of support.
1. Sign on with at least service level authority.
2. Type strsst on the command line of the IBM i session and press Enter.
3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case-sensitive.
4. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter.
A list of each auxiliary storage pool (ASP) displays, showing the disk drives that are members of the
ASP. The Status column shows one of the following protection statuses for the ASP:
v Mirrored. These ASPs are configured for mirrored protection. If the failing disk drive has mirrored
protection, record the status of both disk drives in the mirrored pair. You need this information in
the recovery procedure.
144 Disk drives
v Unprotected. An ASP with a status of Unprotected might contain disk drives that are device parity
protected. Record the status of the failing disk drive as it is shown in the Status column of the
Display Disk Configuration Status display.
A disk drive with one of the following statuses has device parity protection only if all other disk
drives in the array are operational:
– DPY/Active
– DPY/Failed
– DPY/HDW Failure
– DPY/Degraded
– DPY/Power Loss
– DPY/Not Ready
– DPY/Unknown
A disk drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have
device parity protection.
Examples:
– Suppose one disk drive has a status of DPY/Failed while all other disk drives have a status of
DPY/Unprotected. In this case, only the disk drive with the DPY/Failed status is protected.
Servicing the failed disk drive does not result in data loss. Removing any of the unprotected disk
drives might result in data loss.
– All disk drives have a status of DPY/Active or DPY/Degraded, therefore, all disk drives are
protected. In this case, exchanging any disk drive does not result in data loss.
Tip: For more information about the meaning of a status for any disk drive, use the Help key on
the display.
Configuring a disk drive on a load source adapter for hot spare using
the IBM i operating system
To start hot spare protection on your load source adapter you will need to determine the status of the
disk drive in the load source slot and configure or exchange that disk drive, which might require you to
power down the system.
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case sensitive.
3. Determine the protection status of all disks controlled by the load source adapter. Do all disk drives
that are a member of a parity sets or mirror sets have a status of active?
v No: Replace the disk using the following procedure in, “Replacing a disk drive with the IBM i
system or logical partition powered on” on page 89.
v Yes: Continue to the next step.
Restriction: There must be a disk drive of the same or larger capacity located in a load source
capable slot to act as a hot spare for the active load source disk drive. The next steps will have you
change the disk drive in the desired slot if needed.
Attention: If the disk drive acting as hot spare for the load source is of smaller capacity you can
suffer data loss.
4. Determine whether a non-configured disk drive of the same or larger capacity is located in a load
source capable slot but is not the active load source. Only parallel SCSI places a restriction on the
location of the load source. Any SAS, Fibre Channel, or virtual disk drive does not have this
restriction.
Related information 145
v No non-configured drive is located in a load source capable slot. Continue with the next step
v Yes a non-configured drive is located in a load source capable slot. Return to the procedure that
sent you here, and continue to the next step of that procedure.
5. Choose the disk drive to be installed or exchanged with the drive in the load source slot by doing the
following:
a. Select a disk drive in a parity set located in a load source capable slot, but is not the active load
source drive, and record the location of this disk drive.
b. Select a non-configured disk drive of the same or larger capacity from either a non-load source
slot or a failed drive you are replacing with a non-configured disk drive.
6. Stop the system. For instructions, see “Stopping a system or logical partition” on page 201.
7. Exchange the disk drives from the two locations you recorded in step 5 by removing both disk drives
and placing them into the other disk drives slot.
8. Start the system. For instructions, see “Starting the system or logical partition” on page 199.
9. Return to the procedure that sent you here and continue with the next step in that procedure.
146 Disk drives
Internal disk drive sharing on model 8202-E4B or 8205-E6B
system
Learn how to split the internal disks in the system unit enclosure into groups, which you can manage
separately. Use the following information to perform the task.
DASD backplane supports Split DASD mode. FC 5631 split DASD enablement card is used to put DASD
backplane in the Split DASD mode in which 3 SFF DASD on left (from front view) are assigned to the
split DASD enablement card (CCIN 2BE0), and 3 SFF DASD on right are assigned to the integrated
storage controller (CCIN 572C). A PCIe or PCIX SAS adapter (such as FC 5901, FC5904, or FC5912) can
access the 3 SFF DASD through an external AI SAS cable, as shown in Figure 132 on page 148.
Important: This configuration is not supported with the 5630 backplane feature code.
Use the following information after the adapter is installed. For more information, see PCI adapters.
Perform the prerequisite tasks as described in “Before you begin” on page 189.
Note: Internal cable FC 3670 for an 8202-E4B or 8205-E6B must be installed to enable the split DASD
configuration. For installation instructions, see “Installing the external SAS port on model 8202-E4B or
8205-E6B” on page 153.
Perform the following tasks to split the internal disks in the system unit enclosure into groups, which
you can manage separately.
1. Stop and turn off the power to the system, see Stopping the system or logical partition.
2. Cable the system by connecting the SAS cable to the adapter and then to the internal SAS connection,
as shown in Figure 132 on page 148.
© Copyright IBM Corp. 2010 147
3. Start the system, see “Starting the system or logical partition” on page 199.
4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249.
After performing this procedure, three disks in the system enclosure is managed by the split DASD
enablement card and three disks is managed by the internal system.
You can now manage the disks as you would manage any other disks.
Notes: The second connector on the adapter can be used to attach a disk expansion or media expansion drawer.
Figure 132. SAS adapter to internal SAS disk slots through the system external SAS port
148 Disk drives
Internal disk drive sharing on an 8233-E8B or 8236-E8C
You might need to cable and understand the function of this feature. Use the following information to
complete these tasks.
DASD backplane supports Split DASD mode. FC 3669 internal SAS cable for Split DASD mode is used to
put DASD backplane in the Split DASD mode in which 4 SFF DASD on left (from front view) are
assigned to the integrated SAS controller, and 4 SFF DASD on right are assigned to the external rear SAS
port. A PCIe or PCIX SAS adapter (such as FC 5901, FC5904, or FC5912) can access the right 4 SFF DASD
via an external AI SAS cable, as shown in the figure below.
The following information is for use after the adapter is installed. Install the adapter and then return
here. See the PCI adapters topic for more information, PCI adapters.
You should also review the tasks in “Before you begin” on page 189 if you have not already done so.
Note: Internal cable FC3669 for model 8233-E8B or 8236-E8C must be installed to enable this
configuration. For installation instructions, see Installing the external SAS port on a model 8233-E8B or
8236-E8C
This feature allows you to split the internal disks in the system unit enclosure into groups which you can
manage separately.
1. Stop and power off the system if it is not already, see “Stopping a system or logical partition” on page
201.
2. Cable the system by connecting the SAS cable to the adapter and then to the internal SAS connection,
as shown in the following graphic.
© Copyright IBM Corp. 2010 149
3. Start the system, see “Starting the system or logical partition” on page 199.
4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249.
Once you have completed the procedure, four of the disks in the system enclosure will be managed by
the PCI SAS adapter and four will be managed by the integrated SAS controller in the system unit.
Now, you can manage the disks as you would manage any other disks.
Related information
SAS RAID controllers for AIX on the 8233-E8B and 8236-E8C
Notes:
v The second connector on the adapter can be used to attach a disk expansion or media expansion drawer.
Figure 133. SAS adapter to internal SAS disk slots through the system external SAS port
150 Disk drives
Internal disk drive sharing in model 9117-MMB or 9179-MHB
You might need to cable and understand the function of this feature. Use the following information to
complete these tasks.
Review the disk drive modes and supported disk drive locations, as shown in Table 1. The following
figure shows the disk drive locations for the IBM model 9117-MMB or 9179-MHB.
Table 4. Disk drive modes and supported disk drive locations
Disk drive mode SAS Port Card Adapter SAS Cable Disk drive location
RAID internal drives None Integrated Dual RAID
controller
None D1, D2, D3, D4, D5,
and D6
RAID internal and
external drives
SAS External Port
feature (FC1819) -
SAS Cable Assembly
for SAS Port
Integrated Dual RAID
controller
v YI Cable (FC3686) -
System to SAS
Enclosure, Single
Controller/Dual
Path 1.5 meter
v YI Cable (FC3687) -
System to SAS
Enclosure, Single
Controller/Dual
Path 3 meter
D1, D2, D3, D4, D5,
and D6
Dual Split Backplane None v Integrated SAS
controller (SAS A)
v Integrated SAS
controller (SAS B)
None v D1, D2, and D3
v D4, D5, and D6
Triple Split Backplane SAS Internal Port
feature (FC1815) -
SAS Cable for triple
split disk drive
backplane
v Integrated SAS
controller (SAS A)
v Integrated SAS
controller (SAS B)
v (FC5901) - PCIe
Dual-x4 SAS
Adapter
AI Cable (FC3679) -
Adapter to Internal
drive 1 meter
v D1 and D4
v D2 and D5
v D3 and D6
Figure 134. Disk drive locations
© Copyright IBM Corp. 2010 151
The following information is for use after the FC5901 SAS Storage Adapter is installed. Install the adapter
and then return here. See the PCI adapters topic for more information, PCI adapters.
You should also review the tasks in “Before you begin” on page 189 if you have not already done so.
This feature allows you to split the internal disks in the system unit enclosure into groups which you can
manage separately.
1. Stop and power off the system if it is not already, see “Stopping a system or logical partition” on page
201.
2. Cable a single system unit enclosure by doing the following:
a. Attach the cable to the SAS port on the rear bulkhead of the system unit enclosure to the top port
in the SAS Storage Adapter, as shown in the following figure.
Restriction: Internal disk drive sharing is only available when internal cable feature FC1815 is
installed from the DASD backplane to the read bulkhead of the system unit enclosure. Also,
feature FC5662 175 MB cache RAID - dual IOA enablement card must not be installed. The SAS
Storage Adapter can be in any of the other slots that support it.
b. Secure any extra cable.
3. Start the system, see “Starting the system or logical partition” on page 199.
4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249.
With this function installed, 2 of the 6 of the disks in the system enclosure can be managed by the SAS
Storage Adapter.
Note: The Removable Media device is always controlled by a separate embedded SAS controller on the
system planar.
Now, you can manage the disks as you would manage any other disks.
152 Disk drives
Installing the external SAS port on model 8202-E4B or
8205-E6B
Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to
enable internal disk sharing.
Perform the prerequisite tasks as described in “Before you begin” on page 189.
Perform the following steps to install an external SAS port into the server:
1. Stop the system if it is running. For instructions on stopping the system, see “Stopping a system or
logical partition” on page 201.
2. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette.
3. If necessary, remove the existing SAS cable or filler, see Figure 135.
a. Press the side tab and pull out the SAS shroud (A) from the SAS connection on the system
backplane.
b. Open the cable clamp (B) on the side wall. The cable clamp opens from the top.
c. Pull the SAS cable (D) out of the system.
4. Install the new SAS cable, see Figure 136 on page 154.
a. Connect the external port to the rear of the system until it snaps in place.
b. Open the cable clamp (C) on the side wall, placing the cable (A) through the clamp. The cable
clamp opens from the top.
Figure 135. Removing the SAS cable
© Copyright IBM Corp. 2010 153
c. Close the cable clamp (C) and continue to slide the cable towards the SAS connection on the
system backplane.
d. Push the SAS shroud (D) into the SAS connection on the system backplane until it snaps in place.
5. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette.
6. Start the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/
crustartsys.htm
You can now attach other supported SAS features to the system.
Figure 136. Replacing the SAS cable
154 Disk drives
Installing the external SAS port on model 8231-E2B
Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to
enable internal disk sharing.
Perform the prerequisite tasks as described in “Before you begin” on page 189.
Perform the following steps to install the external SAS port into the server:
1. Stop the system if it is running. For instructions on stopping the system, see “Stopping a system or
logical partition” on page 201.
2. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette.
3. If necessary, remove the existing SAS cable or filler, see Figure 137.
a. Press the side tab and pull the SAS shroud out of the SAS connection on the system backplane.
b. Open the cable clamp on the side wall and pull out the cable (B). The cable clamp opens from the
top.
c. Push the external SAS port down (A) and then pull the SAS cable out of the system.
4. Install the new SAS cable. See Figure 138 on page 156.
a. Connect the external port to the back of the system (B) until it snaps in place.
b. If necessary, open the cable clamp on the side wall, placing the cable through the clamp. The cable
clamp opens from the top.
c. Close the cable clamp and continue to slide the cable (A) back to the SAS connection on the
system backplane.
d. Push the SAS shroud into the SAS connection (B) on the system backplane until it snaps in place.
Figure 137. Removing the SAS cable
© Copyright IBM Corp. 2010 155
5. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette.
6. Start the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/
crustartsys.htm
You can now attach other supported SAS features to the system.
Figure 138. Replacing the SAS cable
156 Disk drives
Installing the external SAS port on an 8233-E8B or 8236-E8C
You might need to install an external SAS port as a part of upgrading your disk-drive backplane, or to
enable internal disk sharing. Use the procedure in this section to perform this task.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps to install the SAS feature 3699 into the server. For instructions, see “Removing a part by using the
Hardware Management Console” on page 247.
If you do not have an HMC, complete the following steps to install SAS feature 3699 into the server:
1. Perform the prerequisite tasks described in “Before you begin” on page 189.
2. Stop the system. For instructions, see “Stopping a system or logical partition” on page 201.
3. If the system is rack-mounted, place the system into the service position. For instructions, see
“Placing a 8233-E8B or 8236-E8C in the service position” on page 236.
4. Remove the service access cover. For instructions, see “Removing the service access cover from an
8233-E8B or 8236-E8C system” on page 219.
5. Remove the system processor assembly or filler in position P1-C16 by lifting the handle (A) and
lifting the assembly out of the system, as shown in the following figure.
Note: You only need to remove the one assembly or filler in P1-C16. To remove a filler, grasp the
filler in the center and lift the filler out of the system.
6. Remove the fans and fan cage from the system by doing the following steps:
a. To remove the fans, squeeze the tab (A) and lift the fan out of the fan cage as shown in the
following figure.
© Copyright IBM Corp. 2010 157
b. To remove the air baffles, grasp and pull the air baffle out of the fan cage, as shown in the
following figure.
Figure 139. Removing the fan from the 8233-E8B or 8236-E8C fan cage
Figure 140. Removing the air baffles from the 8233-E8B or 8236-E8C fan cage
158 Disk drives
c. To remove the fan cage, loosen the two thumbscrews (A) and lift the fan cage (B) out of the
system as shown in the following figure.
7. Slide the front of the cable, with the male connector, though the opening between the processor cage
and the side of the system.
Tip: Refer to Figure 142 on page 160 to complete the next few steps.
8. Insert the pins (A) on the side of the system into the slots (B) on the SAS cable assembly.
9. Slide the cable assembly back until the pins (A) are seated and the latch (C) snaps in place.
Tip: The port on the back of the cable assembly should be seated in the slot at the back of the
system.
10. Attach the cable bracket (H) by placing the notch on the bracket over the post on the side of the
system, aligning the screw hole (I) with the hole on the side of the system, and securing the cable
bracket with the screw.
Note: The bracket should fit snugly over the SAS cable assembly.
Figure 141. Removing the fan cage from the 8233-E8B or 8236-E8C system
Installing the external SAS port on an 8233-E8B or 8236-E8C 159
11. Depending on the disk-drive backplane you have, complete one of the following choices:
Tip: Refer to Figure 143 on page 161 to complete this step.
v If you can see the SAS connection in the center of the disk-drive backplane (D), attach the cable to
the connection and continue to step 12 on page 161.
v If you can see the SAS cable shroud (E) extending from the disk-drive backplane, continue to step
11a.
Figure 142. Attaching the SAS cable
160 Disk drives
a. Locate the SAS cable shroud (E) on the disk-drive backplane and pull the back of the shroud
away from the side of the system to unseat the shroud.
b. Pull the shroud out of the SAS port.
c. Attach the shroud to the male SAS connector (F) on the cable assembly by sliding the shroud
over the top of the cable connector (F) from behind. The SAS connector on the cable should fit
into the two slots at the front of the shroud, and the connector should extend away from the
shroud so it can be seated in the port.
d. Slide the cable and shroud (G) into the SAS port until it is firmly seated in place.
12. Replace the fans and the fan cage, by doing the following steps:
a. To install the fan cage (A) into the system, align the locator pins through the holes, and tighten
the two thumbscrews (B), as shown in the following figure.
Figure 143. SAS cable connections
Installing the external SAS port on an 8233-E8B or 8236-E8C 161
b. To install the air baffles, insert the air baffles into their location in the fan cage, and push down
until they lock into place, as shown in the following figure.
Figure 144. Installing the fan cage in the 8233-E8B or 8236-E8C system
Figure 145. Installing the air baffles in the 8233-E8B or 8236-E8C fan cage
162 Disk drives
c. To install the fans, insert the fans into their location in the fan cage, and push down on the fan
until it locks into place, as shown in the following figure.
13. Install the system processor (B) and secure both latches (C), as shown in the following figure.
14. Replace the service access cover. For instructions, see “Installing the service access cover on an
8233-E8B or 8236-E8C system” on page 220.
Figure 146. Installing the fans in the 8233-E8B or 8236-E8C fan cage
Installing the external SAS port on an 8233-E8B or 8236-E8C 163
15. Place the system back into the operating position. For instructions, see “Placing the 8233-E8B or
8236-E8C in the operating position” on page 239.
You may now attach supported SAS features to your system. Return to the instructions that sent you here
or start your system. For instructions, see “Starting the system or logical partition” on page 199.
164 Disk drives
Installing the external SAS port on a 9117-MMB or 9179-MHB
Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to
enable internal disk sharing.
Complete the following steps to install the external SAS port into the server:
1. Perform the prerequisite tasks described in “Before you begin” on page 189.
2. Stop the system. See “Stopping a system or logical partition” on page 201.
3. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth generation cassette.
4. If necessary, remove the existing SAS cable or filler.
a. Press the side tab and pull the SAS shroud (A) out of the SAS connection on the system
backplane.
b. Open the cable clamp on the side wall and pull the cable out. The cable clamp opens from the top.
c. Push the external SAS port down (B) and then pull the SAS cable out of the system.
5. Install the new SAS cable.
a. Connect the external port to the back of the system (B) until it snaps in place.
b. If necessary, open the cable clamp on the side wall, placing the cable through the clamp. The cable
clamp opens from the top.
c. Close the cable clamp and continue to slide the cable back to the SAS connection on the system
backplane.
d. Push the SAS shroud (A) into the SAS connection on the system backplane until it snaps in place.
Figure 147. Removing the SAS cable
© Copyright IBM Corp. 2010 165
6. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth generation cassette.
7. Start the system. See “Starting the system or logical partition” on page 199
You may now attach other supported SAS features to your system.
Figure 148. Replacing the SAS cable
166 Disk drives
Replacing a SAS conduit card in a 5802 expansion unit with
power off
Learn how to replace a SAS conduit card in a 5802 expansion unit with power off.
Review the requirements in “Before you begin” on page 189.
If your system is managed by the Hardware Management Console (HMC), use the HMC to replace the
card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247
1. Stop the system if it is running. For instructions see, “Stopping a system or logical partition” on
page 201.
2. Disconnect the power source from the system by unplugging the system.
Attention: This system might be equipped with a second power supply. You must disconnect the
power source from the system by disconnecting all power cords to prevent system damage during
this procedure.
(L003)
or
3. Place the card connector (C) onto the on the midplane connection (D) using the guide track (B) to
ensure a good connection. Then lift the handle (A) to seat and secure the SAS conduit card to the
midplane as shown in the following figure.
© Copyright IBM Corp. 2010 167
4. Place the midplane and SAS conduit card back into the system by doing the following steps:
a. Align the midplane with the track and ensure the slot (A) on the midplane attaches to the post
(B) in the system.
b. Secure the thumbscrews (C) and (D) on the SAS conduit card.
c. Secure the thumbscrews (E) on the sides of the midplane.
Figure 149. Installing the SAS conduit card into the midplane
168 Disk drives
5. Attach the power cord cover and secure the thumbscrew (A) to the system.
Figure 150. Installing the SAS conduit card and the midplane
Figure 151. Installing the power cord plate
Replacing a SAS conduit card in a 5802 expansion unit with power off 169
6. Carefully slide the enclosure management controller into the system and close the latch (A) as shown
in the following figure.
7. Carefully lift and align the I/O planar with the slot at the back of the expansion unit.
8. Insert the I/O planar firmly into the server, as shown in the following figure.
9. Secure the I/O planar with the locking tabs (A) as shown in the following figure.
Figure 152. Replacing the enclosure management controller
170 Disk drives
10. To replace the power supply, with the locking handle (A) in the open position, push the power
supply into the expansion unit as shown in the following figure.
11. Close the locking handle (A) until the power supply locks into position.
Figure 153. Install the I/O planar on a rack mounted model
Replacing a SAS conduit card in a 5802 expansion unit with power off 171
12. To replace the disk drive backplane, lift the backplane along the two sides and align with the guide
rails in the expansion unit.
CAUTION:
The backplane may be heavy. Ensure that you can safely perform this task before you begin.
13. Using your hand to support the bottom of the backplane, slide it into the expansion unit as shown
in the following figure.
14. Secure the backplane in place by moving the locking tabs from the open position (B) to the locked
position (A).
Figure 154. Replacing a power supply
172 Disk drives
15. Replace all disk drive units and fillers, if you removed them earlier.
a. Support the bottom of the disk drive unit as you align it with the guide rails in the backplane.
b. Slide the disk drive unit all the way into the backplane.
c. Push the handle (A) toward the disk drive unit to lock it into place.
Figure 155. Installing the backplane on a rack-mounted model
Replacing a SAS conduit card in a 5802 expansion unit with power off 173
16. Replace all expander cards and fillers, if you removed them earlier.
a. Supporting the bottom of the expander card with your hand, align it with the guide rails in the
backplane.
b. Slide the expander card all the way into the backplane.
c. Push the handle (A) toward the card to lock it into place.
17. Start the system. For instructions see, “Starting the system or logical partition” on page 199.
Continue with any other service actions you need to perform.
v For information on connecting the system see Connecting your expansion units
v For information an installing the PCI adapters into the planar, see Model 5802 and 5877 expansion
units, PCI adapters, and cassettes.
174 Disk drives
Removing a SAS conduit card from a 5802 expansion unit
with power off
Learn how to remove a SAS conduit card from a 5802 (rack-mounted) expansion unit if you need to
replace a failing part with power off.
If your system is managed by the Hardware Management Console (HMC) use the HMC to remove the
SAS conduit card. For instructions see, “Removing a part by using the Hardware Management Console”
on page 247.
1. Perform the prerequisite tasks as described in “Before you begin” on page 189.
2. Identify the card you are going to remove. For instructions see, “Identifying a failing part” on page
193.
3. Stop the system. For instructions see, “Stopping a system or logical partition” on page 201.
4. Remove the disk drives and fillers as shown in the following figure.
a. Squeeze the latch of the disk drive and pull the handle (A) toward you to release the drives.
b. Supporting the bottom of the disk drive with your hand, slide it out of the backplane.
5. Remove the port cards and fillers as shown in the following figure.
a. Grasp the end of the latching handle (A) and pull it upward to unlock the port card.
b. Using your hand to support the bottom of the port card, slide it out of the backplane.
c. Place the port card on an electrostatic discharge (ESD) surface.
Figure 156. Removing the disk drives from the backplane
© Copyright IBM Corp. 2010 175
6. Remove the backplane as shown in the following figure.
CAUTION:
The backplane might be heavy. Ensure that you can safely complete the procedure.
a. Squeeze the blue latches (A) to unlock the handles.
b. Pull the handles (B) outward in the direction shown to release the backplane.
c. Slide the backplane out of the enclosure, using your hand to support the bottom of the
backplane.
Figure 157. Removing the port card from the backplane
176 Disk drives
7. Disconnect both offline converter assembly (OCA) power cords by doing the following:
a. On the front of both OCAs, slide the power cord holders (A) to the left.
b. Label and disconnect both OCA power cords (B) from each OCA.
Figure 158. Removing the backplane from the enclosure
Removing a SAS conduit card from a 5802 expansion unit with power off 177
8. Remove the offline converter assemblies by doing the following steps:
a. Pull the OCA handle (A) all the way down in the direction shown to unseat the OCA.
b. Grasp both sides of the OCA and pull it out of the enclosure.
Note: Use 2 hands to support the OCA as it is pulled out of the enclosure.
c. Repeat these steps to remove both OCAs.
Figure 159. Disconnecting the OCA power cords
178 Disk drives
9. Remove the enclosure management controller by doing the following steps:
a. Label and disconnect any cables connected to the enclosure management controller card.
b. Pinch the latch (A) to unlock the handle (B).
c. Pull the handle outward in the direction shown to unseat the EMC card.
d. Slide the enclosure management controller card straight out of the enclosure.
Figure 160. Remove the offline converter assemblies
Removing a SAS conduit card from a 5802 expansion unit with power off 179
10. Remove the power cord plate by turning the thumbscrew (A) and pulling the plate away from the
system.
11. Disconnect the cables from the back of the I/O planar.
Figure 161. Remove the enclosure management controller
Figure 162. Removing the power cord plate
180 Disk drives
12. Squeeze the release latches (A) and pull the levers (B) out away from the system, as shown in the
following figure.
13. Slide the planar out of the system.
14. Remove the SAS conduit card and midplane from the enclosure by doing the following steps:
a. Pull the spring plungers (A) up and turn them one quarter to the left.
b. Turn the captive screws (B) and (C) counterclockwise until they are completely loosened.
c. Pull the SAS conduit and midplane component out of the enclosure.
Figure 163. Removing the I/O planar
Removing a SAS conduit card from a 5802 expansion unit with power off 181
15. Push the handles (A) on the conduit card down to disengage the card from the midplane, and
remove the card.
Continue with replacing the SAS expander card or return to the procedure that sent you here.
Figure 164. Removing the SAS conduit card and midplane
Figure 165. Removing the SAS conduit card from the midplane
182 Disk drives
Replacing a SAS expander card in a 5802 expansion unit with
power off
Learn how to replace a SAS expander card in a 5802 expansion unit with power off.
Review the requirements in “Before you begin” on page 189.
If your system is managed by the Hardware Management Console (HMC) use the HMC to replace the
card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247
1. Before installing or reinstalling a SAS expander card, inspect the signal connector pins to ensure that
the pins are not bent or damaged.
Attention: Bent pins damage the backplane receptacle. Be careful not to touch or damage the signal
connector pins.
2. Slide the card completely into the slot and push the lever (A) down to secure the card as shown in
the following figure.
3. Start the system. For instructions see, “Starting the system or logical partition” on page 199.
4. Verify the presence of the newly installed card. For instructions see, “Verifying the installed part” on
page 249.
Continue with any other service actions you need to perform.
Figure 166. Installing the SAS expander card
© Copyright IBM Corp. 2010 183
184 Disk drives
Removing a SAS expander card from a 5802 expansion unit
with power off
Learn how to remove a SAS expander card from a 5802 (rack-mounted) expansion unit if you need to
replace a failing part with power off.
If your system is managed by the Hardware Management Console (HMC) use the HMC to remove the
SAS expander card. For instructions see, “Removing a part by using the Hardware Management Console”
on page 247.
1. Perform the prerequisite tasks as described in “Before you begin” on page 189.
2. Identify the card you are going to remove. For instructions see, “Identifying a failing part” on page
193.
3. Lift the handle (A) and pull the card out of the system, as shown in the following figure.
Continue with replacing the SAS expander card or return to the procedure that sent you here.
Figure 167. Removing the SAS expander card.
© Copyright IBM Corp. 2010 185
186 Disk drives
Common procedures for installable features
This section contains all the common procedures that are related to installing, removing, and replacing
features.
© Copyright IBM Corp. 2010 187
188 Disk drives
Before you begin
Observe these precautions when you are installing, removing, or replacing features and parts.
These precautions are intended to create a safe environment to service your system and do not provide
steps for servicing your system. The installation, removal and replacement procedures provide the
step-by-step processes required to service your system.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
© Copyright IBM Corp. 2010 189
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Before you begin a replacement or installation procedure, perform these tasks:
1. If you are installing a new feature, ensure that you have the software required to support the new
feature.
To do this, go to the following Web site: IBM Prerequisite
2. If you are performing an installation or replacement procedure that might put your data at risk,
ensure, wherever possible, that you have a current backup of your system or logical partition
(including operating systems, licensed programs, and data).
3. Review the installation or replacement procedure for the feature or part.
4. Note the significance of color on your system.
190 Disk drives
Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware
to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also
indicate that the part can be removed and replaced with the system or logical partition power on.
5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of
scissors.
6. If parts are incorrect, missing, or visibly damaged, do the following:
v If you are replacing a part, contact the provider of your parts or next level of support.
v If you are installing a feature, contact one of the following service organizations:
– The provider of your parts or next level of support.
– In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL)
at 1–800–300–8751.
In countries and regions outside of the United States, use the following Web site to locate your service
and support telephone numbers:
http://www.ibm.com/planetwide
7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller,
or your next level of support.
8. If you are installing new hardware in a logical partition, you need to understand and plan for the
implications of partitioning your system. For information, see Logical Partitioning.
Before you begin 191
192 Disk drives
Identifying a failing part
Use these instructions to learn how to locate and identify a failing part on your system or expansion unit
using the appropriate method for your system.
For IBM Power Systems servers that contain the POWER7 processor, the light-emitting diodes (LEDs) can
be used to identify or verify a part that you are servicing. The fault (amber) LED indicates an error and
corresponds to the location code in the system reference code (SRC). The LED is activated and
deactivated automatically.
If you need to use the identify function, use the following procedures.
Control panel LEDs
Use this information as a guide to the control panel LEDs and buttons.
The control panel has LEDs that indicate various system status.
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
Figure 168. Control panel
© Copyright IBM Corp. 2010 193
– A constant light indicates the identify state, which is used to identify a part.
– No light indicates that the system is operating normally.
v D: System information light
– No light indicates that the system is operating normally.
– Light on indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault in the system unit.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
Related concepts
Identifying a failing part
Use these instructions to learn how to locate and identify a failing part on your system or expansion unit
using the appropriate method for your system.
Identifying a failing part in an AIX system or logical partition
Use these instructions to learn how to locate a failing part, and then activate the indicator light for that
part on a system or logical partition running the AIX operating system.
Locating a failing part in an AIX system or logical partition
You might need to use AIX tools, before activating the indicator light, to locate a part that is failing.
1. Log in as root user or celogin-.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The
Display Diagnostic Log display shows a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Select Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. “Activating the indicator light for the failing part.”
Activating the indicator light for the failing part
Use these instructions to help physically identify the location of a part you are servicing.
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
194 Disk drives
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter. When a light is
activated for a failing part, an I character precedes the location code.
6. Select Commit. This turns off the system attention and indicator light for the failing part.
7. Exit to the command line.
Identifying a failing part in an IBM i system or logical partition
You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part.
Activating the failing-part indicator light
You can search the service action log for an entry that matches the time, reference code, or resource of a
problem, and then activate the indicator light for a failing part.
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point™ utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. If location information is available, select option 6 (Indicator on) to turn on the failing part's
indicator light.
Identifying a failing part 195
Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is
activated. For example, the indicator light for the backplane or unit that contains the failing part
might be lit. In this case, use the location information to locate the actual failing part.
12. Look for the enclosure indicator light to locate the enclosure that contains the failing part.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
Identifying a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you can activate or deactivate the
indicator lights to locate a part or compete a service action.
Locating a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you need to activate the indicator
lights to locate a part.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
196 Disk drives
3. Look for the system attention light to identify the enclosure that contains the failing part.
Related information
Service and productivity tools for Linux on POWER systems
IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux
operating systems on IBM Power systems.
Finding the location code of a failing part in a Linux system or logical
partition
To retrieve the location code of the failing part, if you do not know the location code, use the procedure
in this topic.
To locate the failing part in a system or logical partition follow these steps:
1. Log in as root user.
2. At the command line, type grep diagela /var/log/platform and press Enter.
3. Look for the most recent entry that contains a system reference code (SRC).
4. Record the location information.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on
POWER® systems
Activating the indicator light for the failing part
If you know the location code of the failing part, activate the indicator light to help you locate which part
to replace.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
3. Look for the system attention light to identify the enclosure that contains the failing part.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on POWER
systems
Deactivating the failing-part indicator light
After you complete a removal and replacement procedure, you must deactivate the failing-part indicator
light.
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on POWER
systems
Identifying a failing part 197
Locating a failing part in a Virtual I/O Server system or logical partition
You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is
failing.
1. Log in as root user or celogin-.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A
Display Diagnostic Log display appears. This display contains a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Choose Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. For instructions, see “Identifying a part by using the Virtual I/O Server.”
Identifying a part by using the Virtual I/O Server
Use these instructions to turn on the indicator light to help you physically locate a part by using the
Virtual I/O Server (VIOS).
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection. Press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
198 Disk drives
Starting the system or logical partition
Learn how to start a system or logical partition after performing a service action or system upgrade.
Starting a system that is not managed by a Hardware Management
Console
You can use the power button or the Advanced System Management Interface to start a system that is
not managed by a Hardware Management Console.
To start a system that is not managed by a Hardware Management Console (HMC), follow these steps:
1. Open the front rack door, if necessary.
2. Before you press the power button on the control panel, ensure that power is connected to the system
unit as follows:
v All system power cables are connected to a power source.
v The power-on light, as shown in the following figure, is slowly blinking.
v The top of the display, as shown in the following figure, shows 01 V=F.
3. Press the power button (A), as shown in the following figure, on the control panel.
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Figure 169. Control panel
© Copyright IBM Corp. 2010 199
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
– A constant light indicates the identify state, which is used to identify a part.
– No light indicates that the system is operating normally.
v D: Attention light
– No light indicates that the system is operating normally.
– A blinking light indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault in the system.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
4. Observe the following after pressing the power button:
v The power-on light begins to blink faster.
v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to
operating speed.
v Progress indicators, also referred to as checkpoints, appear on the control panel display while the
system is being started. The power-on light on the control panel stops blinking and remains on,
indicating that system power is on.
Tip: If pressing the power button does not start the system, do the following steps to start the system
using the Advanced System Management Interface (ASMI):
1. Access the ASMI. For instructions, see Accessing the ASMI.
2. Start the system using the ASMI. For instructions, see Powering the system on and off.
Starting a system or logical partition by using the Hardware
Management Console
You can use the Hardware Management Console (HMC) user interface to start the system or logical
partition after the required cables are installed and the power cables are connected to a power source.
For instructions on working with the HMC, see Managing the Hardware Management Console. For
instructions on starting a logical partition, see Logical partitioning. For instructions on starting the
system, see Powering on the managed system.
Progress indicators, also referred to as checkpoints, appear on the control panel display while the system
is being started. When the power-on light on the control panel stops blinking and remains on, the system
power is on.
200 Disk drives
Stopping a system or logical partition
Learn how to stop a system or logical partition as a part of a system upgrade or service action.
Attention: Using either the power-on button on the control panel or entering commands at the
Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data
files. Also, the next time you start the system, it might take longer if all applications are not ended before
stopping the system.
To stop the system or logical partition, select the appropriate procedure.
Related tasks
Shutting down logical partitions using the Integrated Virtualization Manager
Use this procedure to shut down a logical partition or the entire managed system that is managed using
the Integrated Virtualization Manager.
Stopping a system that is not managed by a Hardware Management
Console
You might need to stop the system to perform another task. Use these instructions to stop the system
using the power button or Advanced System Management Interface.
Before you stop the system, follow these steps:
1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options.
2. Ensure that all jobs are completed and end all applications.
3. Ensure that the operating system is stopped.
Attention: Failure to do so can result in the loss of data.
4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that
the clients have access to their devices using an alternate method.
The following procedure describes how to stop a system that is not managed by a Hardware
Management Console (HMC).
1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down
System) command.
2. At the command line, enter one of the following commands:
v If your system is running the AIX operating system, type shutdown.
v If your system is running the Linux operating system, type shutdown -h now.
v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned,
use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the
PWRDWNSYS command to power down the primary partition.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
3. Record the IPL type and the IPL mode from the control panel display to help you return the system
to this state when the installation or replacement procedure is completed.
4. Set the power switches of any devices connected to the system to off.
5. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug
power cables from peripheral devices, such as printers and expansion units.
Important: The system is equipped with a second power supply. Before continuing with this
procedure, ensure that all power sources to the system have been disconnected.
© Copyright IBM Corp. 2010 201
(L003)
or
Stopping a system by using the Hardware Management Console
You can use the Hardware Management Console (HMC) user interface to stop the system or a logical
partition. Use the following steps to accomplish this task.
By default, the managed system is set to power off automatically when you shut down the last running
logical partition on the managed system. If you set the managed system properties on the HMC so that
the managed system does not power off automatically, you must use this procedure to power off your
managed system.
Attention: If possible, shut down the running logical partitions on the managed system before powering
off the managed system. Powering off the managed system without shutting down the logical partitions
first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual
I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to
their devices using an alternate method.
To power off a managed system, you must be a member of one of the following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
1. In the Navigation area, expand the Systems Management folder.
2. Click the Servers icon.
3. In the Contents area, select the managed system.
202 Disk drives
4. Select Tasks, then Operations, and then Power Off
5. Select the appropriate power-off mode and click OK.
Related information
Shutting down and restarting logical partitions
Stop the system 203
204 Disk drives
Removing and replacing the expansion unit cover or door
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Removing the front cover on the 7314-G30 or 5796
You might need to remove the front cover to perform service to the system.
To remove the front cover, follow these steps:
1. Remove the two thumbscrews (B) located on the left and right of the cover.
2. Press in on the two latches (C) located left and right of the cover to release the cover.
3. Lift out the cover (A) and away from the chassis.
Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you
are populating that side of the chassis.
Installing the front cover on the 7314-G30 or 5796
Use this procedure to install the front cover after installing the system or performing a service action.
If you only have one side of the chassis populated, ensure that the airflow block is present on the
unpopulated side.
1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure.
Figure 170. Remove the front cover for model 7314-G30 or 5796
© Copyright IBM Corp. 2010 205
2. Press in on the two latches (C) on the right and left of the front cover.
3. Press the cover onto the chassis and release the latches. The cover securely snaps into place.
4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover.
Figure 171. Front cover on a model 7314-G30 or 5796
206 Disk drives
System covers
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Removing and replacing covers for the 8202-E4B or 8205-E6B
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover on a rack-mounted 8202-E4B or 8205-E6B
system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two screws that secure the system to the rack.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Removing the front cover on a stand-alone 8202-E4B or 8205-E6B
system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Open the front door (A) by swinging it towards the left as shown in the following figure.
Figure 172. Removing the front cover
© Copyright IBM Corp. 2010 207
2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can
hold onto it more easily.
Installing the front cover on a rack-mounted 8202-E4B or 8205-E6B
system
Use this procedure to install the cover to access components or perform service.
To install the front cover, follow these steps:
1. Push the cover onto the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as
shown in the following figure. The cover snaps into place and has an indentation where you can hold
onto it more easily.
Figure 173. Removing the front cover
208 Disk drives
Installing the front cover and front door on a stand-alone 8202-E4B or
8205-E6B system
Use this procedure to install the cover to access components or perform service.
To install the front cover and front door, complete the following steps.
1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown
in the following figure.
2. Rotate the cover up and towards the system until the release latch is seated into its respective slot.
The cover has an indentation where you can hold onto it more easily.
Figure 174. Removing the front cover
System covers 209
3. Hold the front door at an approximate 120 degree angle to the system as show in the following figure.
Engage the latches.
Removing the service access cover on a rack-mounted 8202-E4B or
8205-E6B system
Use this procedure to remove the service access cover.
1. Loosen the two thumbscrews (A) located at the back of the cover.
Figure 175. Installing the front cover
Figure 176. Installing the front door
210 Disk drives
2. Slide the cover toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Attention: For proper cooling and air flow, re-install the cover before turning on the system.
Operating the system without the cover for more than 30 minutes could damage the system
components.
Removing the service access cover on a stand-alone 8202-E4B or
8205-E6B system
Use this procedure to remove the service access cover.
1. Loosen the thumbscrew (A) located at the back of the outside cover.
2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 177. Removing the service access cover
System covers 211
3. Loosen the two thumbscrews (A) located at the back of the inside cover.
4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 178. Installing the outside service access cover
Figure 179. Removing the inside service access cover
212 Disk drives
Installing the service access cover on a rack-mounted 8202-E4B or
8205-E6B system
Use this procedure to install the service access cover.
1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Installing the service access cover on a stand-alone 8202-E4B or
8205-E6B system
Use this procedure to install the service access cover.
1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper
chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Figure 180. Installing the service access cover
System covers 213
5. Place the outside service access cover (A) on top of the system.
6. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of
the system chassis.
8. Tighten the thumbscrew to secure the service access cover.
Figure 181. Installing the inside service access cover
214 Disk drives
Removing and replacing covers for the 8231-E2B
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the service access cover for the 8231-E2B
Use this procedure to remove the service access cover.
1. Loosen the thumbscrew (A) located at the back of the cover.
2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 182. Installing the outside service access cover
System covers 215
Attention: For proper cooling and air flow, re-install the cover before turning on the system.
Operating the system without the cover for more than 30 minutes could damage the system
components.
Installing the service access cover for the 8231-E2B
Use this procedure to install the service access cover.
1. Place the service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis
ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the thumbscrew (B) located on the back of the service access cover with the two holes on the
back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Figure 183. Removing the service access cover
216 Disk drives
Removing and replacing covers on the 8233-E8B or 8236-E8C system
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover from an 8233-E8B or 8236-E8C system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Figure 184. Installing the service access cover
System covers 217
Installing the front cover on an 8233-E8B or 8236-E8C system
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Align the cover with the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B)
on the system.
3. Replace the two screws (C) that secure the system to the rack (A).
Figure 185. Removing the front cover
218 Disk drives
Removing the service access cover from an 8233-E8B or 8236-E8C
system
Use this procedure to remove the service access cover to perform service or to gain access to internal
components.
To remove the service access cover, follow these steps:
1. Place the system into the service position. For instructions, see “Placing a 8233-E8B or 8236-E8C in the
service position” on page 236.
2. Loosen the two thumbscrews (A) located at the back of the cover.
3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover
clears the upper frame ledge, lift the cover up and off the system unit.
Attention: For proper cooling and airflow, install the cover before starting the system. Operating the
system without the cover for more than 30 minutes could damage the system components.
Figure 186. Installing the front cover
System covers 219
Installing the service access cover on an 8233-E8B or 8236-E8C
system
Use this procedure to install the service access cover after performing service or accessing internal
components.
To install the service access cover, follow these steps:
1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the
front of the system unit.
2. Hold the service access cover against the system unit, and slide it toward the front of the system.
The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align
with the screw holes at the back of the system unit.
Important: Ensure that the fan LED cables do not get caught on the front edge of the service access
cover as you move it forward.
3. Tighten the screws (B) located at the back of the cover.
Figure 187. Removing the service access cover
220 Disk drives
Removing and replacing covers for the 9117-MMB or 9179-MHB
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover from the 9117-MMB or 9179-MHB
Use this procedure to remove the cover to access components or perform service.
To remove the front cover follow these steps:
1. If necessary, open the front rack door.
2. Pull out the latches (A) located on both sides of the cover as shown in the following figure.
Figure 188. Installing the service access cover
System covers 221
3. Pull off the cover (B) to remove it from the system unit.
Installing the front cover on the 9117-MMB or 9179-MHB
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Position the cover (A) on the front of the system unit so that the four pins on the system match the
four holes on the back of the cover.
Figure 189. Removing the front cover
222 Disk drives
2. Press the tabs (B) to snap the cover into position.
3. Close the front rack door.
Figure 190. Installing the front cover
System covers 223
224 Disk drives
Placing the rack-mounted 8233-E8B or 8236-E8C system into
the service position or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing a rack-mounted 8231-E2B system into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. While holding the system unit release latches down on both the left and right sides, pull the system
unit out from the rack until the rails are fully extended and locked.
Figure 191. Placing the system in the service position
© Copyright IBM Corp. 2010 225
Placing the rack-mounted 8231-E2B system into the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the rail safety latches by sliding them toward the front of the system.
2. Push the system unit back into the rack until both system-unit release latches have locked into
position.
3. Close the front rack door of the unit you are servicing.
Figure 192. Placing the system into the operating position
226 Disk drives
Placing a rack-mounted 8202-E4B or 8205-E6B system into
the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
4. While holding the system unit release latches (A) down on both the left and right sides, pull the
system unit (B) out from the rack until the rails are fully extended and locked.
Figure 193. Placing the system in the service position
© Copyright IBM Corp. 2010 227
228 Disk drives
Placing the rack-mounted 8202-E4B or 8205-E6B system into
the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the rail safety latches (A) by sliding them upward from the system. The rail safety latches are
located near the middle of the system.
2. Push the system unit (B) back into the rack until both system-unit release latches have locked into
position.
3. Replace and tighten the two screws (A) that secure the system unit (B) to the rack.
4. Close the front rack door of the unit you are servicing.
Figure 194. Placing the system into the operating position
© Copyright IBM Corp. 2010 229
Figure 195. Placing the system into the operating position
230 Disk drives
Service and operating positions
Use these procedures to place IBM Power Systems servers into the service or operating position.
Placing the 8202-E4B or 8205-E6B system into the service or operating
position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing a rack-mounted 8202-E4B or 8205-E6B system into the service
position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
© Copyright IBM Corp. 2010 231
4. While holding the system unit release latches (A) down on both the left and right sides, pull the
system unit (B) out from the rack until the rails are fully extended and locked.
Placing the rack-mounted 8202-E4B or 8205-E6B system into the
operating position
Use this procedure to place the rack-mounted system into the operating position.
Figure 196. Placing the system in the service position
232 Disk drives
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the rail safety latches (A) by sliding them upward from the system. The rail safety latches are
located near the middle of the system.
2. Push the system unit (B) back into the rack until both system-unit release latches have locked into
position.
3. Replace and tighten the two screws (A) that secure the system unit (B) to the rack.
4. Close the front rack door of the unit you are servicing.
Figure 197. Placing the system into the operating position
Service and operating positions 233
Placing the rack-mounted 8233-E8B or 8236-E8C system into the
service position or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing a rack-mounted 8231-E2B system into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. While holding the system unit release latches down on both the left and right sides, pull the system
unit out from the rack until the rails are fully extended and locked.
Figure 198. Placing the system into the operating position
234 Disk drives
Placing the rack-mounted 8231-E2B system into the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the rail safety latches by sliding them toward the front of the system.
2. Push the system unit back into the rack until both system-unit release latches have locked into
position.
Figure 199. Placing the system in the service position
Service and operating positions 235
3. Close the front rack door of the unit you are servicing.
Placing the rack-mounted 8233-E8B or 8236-E8C system into the
service position or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing a 8233-E8B or 8236-E8C in the service position
Use this procedure to perform service or gain access to internal components by placing the rack-mounted
system or expansion unit in the service position.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Figure 200. Placing the system into the operating position
236 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
Service and operating positions 237
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
To place the rack-mounted system or expansion unit into the service position, follow these steps:
1. If necessary, open the front rack door.
2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure.
3. Release the rack latches (B) on both the left and right sides as shown in the following figure.
238 Disk drives
4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit
out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or
bind as you pull the unit out from the rack.
5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and
locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety
latches lock into place. This action prevents the system or expansion unit from being pulled out too
far.
Placing the 8233-E8B or 8236-E8C in the operating position
Use this procedure to place the system or expansion unit in the operating position to make the unit
available for use.
To place the system into the operating position follow these steps:
Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the
system or expansion unit into the rack as shown in the following figure.
Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you
push the unit back into the rack.
Figure 201. Releasing the rack latches
Service and operating positions 239
2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack
as shown in the following figure.
3. Close the front rack door.
Figure 202. Releasing the rail safety latches
Figure 203. Replacing the thumbscrews
240 Disk drives
Power cords
Use these procedures to disconnect and connect the power cords on IBM Power Systems servers that
contain the POWER7 processor.
Disconnecting the power cords from the 8202-E4B or 8205-E6B system
Use these procedures to disconnect the power cords from the 8202-E4B or 8205-E6B system.
1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the
system unit you are servicing in the rack.
2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit
you are servicing. This system might be equipped with two power supplies. Before continuing with
any remove and replace procedures that require the power off, ensure that both power sources to the
system have been completely disconnected.
Figure 204. Removing the power cords for a rack-mounted system
© Copyright IBM Corp. 2010 241
Connecting the power cords to the 8202-E4B or 8205-E6B system
Use this procedure to connect the power cords.
1. Identify the system unit you are servicing in the rack.
2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the
handles.
Figure 205. Removing the power cords for a stand-alone system
242 Disk drives
3. Close the back rack door.
Disconnecting the power cords from the 8231-E2B
Use these procedures to disconnect the power cords from the8231-E2B system.
1. Open the back rack door on the unit you are servicing.
2. Identify the system unit you are servicing in the rack.
3. Disconnect all power cords from the unit you are servicing. This system might be equipped with two
power supplies. Before continuing with any remove and replace procedures that require the power
off, ensure that both power sources to the system have been completely disconnected.
Figure 206. Connecting the power cords
Power cords 243
Connecting the power cords to the 8231-E2B
Use these procedures to connect the power cords to the8231-E2B system.
1. Open the back rack door on the unit you are servicing.
2. Identify the system unit you are servicing in the rack.
3. Disconnect all power cords from the unit you are servicing. Ensure that the cords are threaded
through the handles.
Figure 207. Removing the power cords
244 Disk drives
Figure 208. Connecting the power cords
Power cords 245
246 Disk drives
Installing a feature by using the Hardware Management
Console
You can use the Hardware Management Console to perform many service actions, including the
installation of a new feature or part.
To use the Hardware Management Console user interface to install a feature or part into a system or
expansion unit that is managed by an HMC, follow these steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system you will install the part in.
3. In the Tasks area expand Serviceability ? Hardware.
4. Select Add FRU (field replaceable unit).
5. In the Add/Install/Remove Hardware window select the system or enclosure into which you are
installing the feature.
6. Select the type of feature you are installing from the menu and click Next
7. Select the location code for where you will install the feature, and click Add.
8. After the FRU is placed in the pending actions category click Launch Procedure and follow the
instructions to install the feature.
Note: The HMC might open external instructions for installing the feature. If so, follow those
instructions to install the feature.
Removing a part by using the Hardware Management Console
You can use the Hardware Management Console (HMC) to perform many service actions, including the
removal of a field replaceable unit (FRU) or part.
To use the Hardware Management Console user interface to remove a part in a system or expansion unit
that is managed by an HMC, follow these steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system from which you are removing a part.
3. In the Tasks area, expand Serviceability ? Hardware.
4. Select Remove FRU.
5. In the Add/Install/Remove Hardware - Remove FRU, Select FRU Type window, select the system or
enclosure from which you are removing the part.
6. Select the type of part you are removing from the menu and click Next.
7. Select the location of the part you are removing and click Add.
8. After the part is listed in the Pending Actions display click Launch Procedure and follow the
instructions to remove the part.
Note: The HMC might open the information center instructions for removing the part. If so, follow
those instructions to remove the part.
Exchanging a part by using the Hardware Management Console
You can use the Hardware Management Console (HMC) to perform many service actions, including
exchanging a field replaceable unit (FRU) or part.
© Copyright IBM Corp. 2010 247
If you are exchanging a part to repair a serviceable event follow those instructions. If you are exchanging
a part as a part of any other procedure using HMC Version 7 or later use these steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system in which you are exchanging a part.
3. In the Tasks area expand Serviceability ? Hardware.
4. Select Exchange FRU.
5. In the Replace Hardware - Replace FRU, Select FRU Type window select the system or enclosure in
which you will exchange a part from the list.
6. Select the type of part you will exchange from the menu and click Next.
7. Select the location code of the part you will exchange from the menu and click Add.
8. After the FRU is placed in the pending actions category click Launch Procedure and follow the
instructions to exchange the feature.
Note: The HMC might open external instructions for replacing the part. If so, follow those
instructions to replace the part.
248 Disk drives
Verifying the installed part
You can verify a newly installed or replaced part on your system, logical partition, or expansion unit
using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC).
Verifying an installed feature or replaced part in an AIX system or
logical partition
If you installed feature or replaced a part, you might want to use the tools in AIX to verify that the
feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using AIX
v Verifying the replaced part using AIX
Verify the installed feature by using AIX:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in logical partitioning (LPAR) mode, note the
logical partition in which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using AIX:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you use either the AIX operating system or the online diagnostics service aid concurrent
(hot-swap) service to replace the part?
No: Go to step 2.
Yes: Go to step 5 on page 250.
2. Is the system powered off?
No: Go to step 4 on page 250.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
© Copyright IBM Corp. 2010 249
3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent
system activity on the operator panel or display has stopped.
Did the AIX login prompt display?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose
adapter or cable connection. Review the procedures for the part that you replaced to ensure that
the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other
reference code information that you see. If the system does not start or you have no login prompt,
see: Problems with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4.
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If the Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to step 5.
5. Test the part by doing the following steps:
a. At the command line, type diag and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the resource list, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9 on page 251.
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with an indicator light for the failing part, this action changes the indicator light
to the normal state.
250 Disk drives
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the resource list, select sysplanar0
and press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This action changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the resource list,
select sysplanar0 and press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9.
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11.
Yes: Go to step 12.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See the following for instructions: Changing service indicators.
Verifying the installed part in an IBM i system or logical partition
If you have installed a new feature or part, verify that the system recognizes the feature or part by using
the IBM i system service tools.
To verify the installed part, follow these steps:
1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator
light” on page 196.
2. Sign on with at least service level authority.
3. On the command line of the IBM i session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service
Focal Point Utilities to get to the Dedicated Service Tools (DST) display.
4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On
display and press Enter.
Verifying the installed part 251
Note: The service tools password is case-sensitive.
5. Select Start a service tool from the System Service Tools (SST) display and press Enter.
6. Select Hardware service manager from the Start a Service Tool display and press Enter.
7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager
display and press Enter. This option allows you to display and work with logical resources. Logical
hardware resources are the functional resources of the system used by the operating system.
With the Logical Hardware Resources display, you can show logical hardware resource status or
information, and associated packaging hardware resources. Use the online Help information to better
understand specific functions, fields, or symbols.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
Verifying the installed part in a Linux system or logical partition
If you have installed a new part, learn how to verify that the system recognizes the part.
To verify the newly installed or replaced part, continue with “Verifying an installed part by using
stand-alone diagnostics” on page 253.
252 Disk drives
Verifying an installed part by using stand-alone diagnostics
If you have installed or replaced a part, verify that the system recognizes the new part. You can use
stand-alone diagnostics to verify an installed part in a Linux system, expansion unit, or logical partition.
v If this server is directly attached to another server or attached to a network, ensure communications
with the other servers have stopped.
v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be
running on the logical partition.
v The stand-alone diagnostics require access to the system console.
You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM)
server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running
diagnostics from the Network Installation Management (NIM) server, see Running stand-alone
diagnostics from a Network Installation Management server.
To use stand-alone diagnostics, follow these steps:
1. Stop all jobs and applications and then stop the operating system on the system or logical partition.
2. Remove all tapes, diskettes, and CD-ROM.
3. Turn off the system unit power. The next step boots the server or logical partition from the
stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the
server or logical partition on which you are working, follow these steps:
a. Access the ASMI. See Accessing the ASMI for information on using the ASMI.
b. On the ASMI main menu, click on Power/Restart Control.
c. Click Power On/Off System.
d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition
mode boot drop-down menu.
e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone
diagnostic CD-ROM.
f. Go to step 5.
4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical
drive.
5. After the keyboard POST indicator displays on the system console and before the last POST
indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service
mode boot should be initiated using the default-service mode boot list.
6. Enter any requested password.
7. At the Diagnostic Operating Instructions display, press Enter.
Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection.
Note: If you received an SRN or any other reference code when you attempted to start the system,
contact your service provider for assistance..
8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection
menu to initialize the operating system.
9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
10. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part
you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics
for the individual part and press Enter.
12. Did the Testing Complete, No trouble was found message appear?
Verifying the installed part 253
v No: There is still a problem. Contact your service provider.
v Yes: Go to step 13.
13. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the value they had prior to servicing the system.
14. If the indicator lights are still on, follow these steps:
a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system
attention and indicator lights and press Enter.
b. Select Set System Attention Indicator to NORMAL and press Enter.
c. Select Set All Identify Indicators to NORMAL and press Enter.
d. Choose Commit.
Note: This changes the system attention and identify indicators from the Fault state to the Normal
state.
e. Exit to the command line.
Verifying the installed part by using Hardware Management Console
If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your
HMC records after you have completed a service action on your server. If you have reference codes,
symptoms, or location codes that you used during the service action, locate the records for use during
this procedure.
To verify the installed part, complete these steps:
1. At the HMC, examine the service action event log for any open service action events. See “Viewing
serviceable events” on page 256 for details.
2. Are there any service action events that are open?
No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and
deactivating LEDs” on page 255. This ends the procedure.
Yes: Continue with the next step.
3. Record the list of open service action events.
4. Examine the details of the open service action event. Is the error code associated with this service
action event the same as you gathered earlier.
v No: Select one of the following options:
– Review the other serviceable events, find one that does match, and continue with the next step.
– If the log does not match what you had gathered earlier, contact your service provider.
v Yes: Continue with the next step.
5. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
6. Click Close Event.
7. Add comments for the serviceable event. Include any unique additional information. Click OK.
8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event?
v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the
service action event.
v Yes: Perform the following steps:
a. From the FRU list, select a FRU that you need to update.
b. Double-click the FRU and update the FRU information.
c. Click OK to close the service action event.
9. If you continue to have problems, contact your service provider.
254 Disk drives
Activating and deactivating LEDs
Use this procedure to activate or deactivate LEDs using Service Focal Point for the HMC.
Choose from the following:
v “Deactivating a system attention LED or partition LED”
v “Activating or deactivating identify LED”
Deactivating a system attention LED or partition LED
You can deactivate a system attention LED or a logical partition LED. For example, you might determine
that a problem is not a high priority and decide to repair the problem at a later time. However, you want
to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be
activated again if another problem occurs.
1. In the navigation area, open Systems Management.
2. Open Servers and select the appropriate system.
3. In the content area, check the box for the appropriate Partition.
4. Select Tasks, then Operations, and then Manage Attention LED.
5. Select the appropriate Partition.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
Activating or deactivating identify LED
The system provides several LEDs that help identify various components, such as enclosures or field
replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. In the navigation area, open Systems Management.
2. Select Servers.
3. In the content area, check the box for the appropriate System.
4. Select Tasks, then Operations, then LED Status, and then Identify LED.
Verifying the installed part 255
5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected
? List FRUs.
7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Viewing serviceable events
Use this procedure to view a serviceable event, including details, comments, and service history.
To view serviceable events and other information about the events, you must be a member of one of the
following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
v Viewer
To view serviceable events, follow these steps:
1. In the navigation area, select Service Management.
2. Select Manage Serviceable Events.
3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable
Event Overview window opens. The list shows all serviceable events that match your selection
criteria. You can use the menu options to perform actions on the serviceable events.
4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The
Serviceable Event Details window opens, showing detailed information about the serviceable event.
The upper table shows information, such as problem number and reference code. The lower table
shows the field replaceable units (FRUs) associated with this event.
5. Select the error for which you want to view comments and history, and follow these steps:
a. Select Actions ? View Comments.
b. When you are finished viewing the comments, click Close.
c. Select Actions ? View Service History. The Service History window opens, showing service
history associated with the selected error.
d. When you are finished viewing the service history, click Close.
6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the
Serviceable Event Overview window.
Verifying an installed feature or replaced part on a system or logical
partition by using Virtual I/O Server tools
If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS)
to verify that the feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using VIOS
v Verifying the replaced part using VIOS
Verify the installed feature by using VIOS:
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
256 Disk drives
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in LPAR mode, note the logical partition in
which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using VIOS:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent
(hot-swap) service operation?
No: Go to step 2.
Yes: Go to step 5 on page 258.
2. Is the system powered off?
No: Go to step 4.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
3. Start the system and wait until the VIOS operating system login prompt displays or until apparent
system activity on the operator panel or display has stopped.
Did the VIOS login prompt display?
v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection.
Review the procedures for the part that you replaced to ensure that the new part is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system does not start or you have no login prompt, see: Problems
with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5 on page 258.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If a Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
Verifying the installed part 257
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to 5.
5. Test the part by doing the following:
a. At the command line, type diagmenu and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced, and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the Resource List, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9 on page 259
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with a indicator light for the failing part, this changes the indicator light to the
normal state.
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the Resource List, select sysplanar0.
Press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9 on page 259
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the Resource
List, select sysplanar0. Press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9 on page 259.
258 Disk drives
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11.
Yes: Go to step 12.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators
Verifying the installed part 259
260 Disk drives
Verifying a repair
Use these procedures to verify hardware operation after making repairs to the system.
Choose from the following options:
v To verify the repair of a system that is currently powered off, go to step 1.
v To verify the repair of a system that is currently powered on with no operating system loaded, go to
step 3.
v To verify the repair of a system that is currently powered on and that has an operating system loaded,
go to step 5.
1. Power on the server and all attached I/O enclosures.
Did all the enclosures power on?
Yes: Go to step 3.? Go to step 3.
No: Continue with the next step.
2. Choose from the following options:
v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate
and replace the next field-replaceable unit (FRU).
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
3. Load the operating system.
Did the operating system load successfully?
Yes: Go to step 5.
No: Continue with the next step.
4. Choose from the following options:
v If the original problem was a failing disk drive that contained the operating system software, go to step 5.
v If the original problem was that the operating system would not load and you have another FRU to replace, go to
your FRU locations section to locate the next FRU.
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that the operating system would not load and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that the operating system would not load and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
5. Choose from the following options:
v “Verifying the repair in AIX” on page 262
v “Verifying the repair in Linux” on page 267
v “Verifying a repair using an IBM i system or logical partition” on page 265
© Copyright IBM Corp. 2010 261
Verifying the repair in AIX
You can use this procedure to verify that a repair is complete using the AIX operating system.
Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed.
1. Did you replace a disk drive in the root volume group?
No Go to step 3.
Yes Continue with the next step.
2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM)
server.
Did you encounter any problems?
No Reinstall the operating system and continue with step 5.
Yes If the original problem still exists, replace the
field-replaceable unit (FRU) or perform the isolation
procedure that is next in the FRU list. If you have reached
the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
3. Did you replace a FRU with the power turned on and concurrently with system operations?
No Go to step 5.
Yes Continue with the next step.
4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
Yes Go to step 6.
No Go to step 7 on page 263.
Note: The AIX diagnostic service aid was
used if a resource was removed using the Hot
Plug task.
5.
Note: If any FRUs have been removed that should be reinstalled, reinstall them now.
Perform the following steps:
1. If the system is not powered on, power it on now.
2. Perform a slow boot.
3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or
display apparently has stopped.
4. Did you encounter any problems?
No Continue at step 6.
Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU
list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis.
6. If the Resource Repair Action menu is already displayed, go to step 9 on page 263; otherwise,
perform the following steps:
262 Disk drives
1. Log into the operating system either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an SRN displays, suspect a
loose card or connection. If no instructions display, no resources
were detected as missing. Continue with the next step.
7. Perform the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select System verification.
5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged,
and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual
FRU.
Did the Resource Repair Action menu (801015) display?
No Continue with the next step.
Yes Go to step 9.
8. Did the Testing Complete, no trouble was found menu (801010) display?
Yes Use the Log Repair Action option, if not previously logged,
in the TASK SELECTION menu to update the AIX error log.
If the repair action was reseating a cable or adapter, select
the resource associated with that repair action.
If the resource associated with your action is not displayed
on the resource list, select sysplanar0.
Note: If the system attention indicator is on, this action will
set it back to the normal state.
Go to step 11 on page 264.
No If the original problem still exists, replace the FRU or
perform the isolation procedure that is next in the FRU list.
If you have reached the end of the FRU list, contact your
next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on that resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
Verifying a repair 263
1. Select the resource that has been replaced from the Resource
Repair Action menu. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If
the resource associated with your action is not displayed on the
resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
No If the No Trouble Found menu displays, go to step 11.
Yes Continue with the next step.
10. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on the resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
1. From the RESOURCE REPAIR ACTION menu, select the parent
or child of the resource that has been replaced. If the repair action
was reseating a cable or adapter, select the resource associated
with that repair action. If the resource associated with your action
is not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
3. If the No Trouble Found menu displays, continue with the next
step.
11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter
cache card or changing the configuration?
Note: This information does not apply to the PCI-X RAID adapter or cache.
No Go to the close of call procedure.
Yes Continue with the next step.
12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the
following steps:
264 Disk drives
1. On the PCI SCSI Disk Array Manager display, select Recovery
options.
2. If a previous configuration exists on the replacement adapter, this
must be cleared. Select Clear PCI SCSI Adapter Configuration
and press F3.
3. On the Recovery Options screen, select Resolve PCI SCSI RAID
Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration screen,
select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the
adapter that you changed.
6. On the next screen, press Enter.
7. When you see the Are You Sure selection menu, press Enter to
continue.
8. If you see a Failed status message, verify that you selected the
correct adapter, and then repeat this procedure. When the
recovery is complete, exit the operating system.
9. Go to the close of call procedure.
Verifying a repair using an IBM i system or logical partition
Use this procedure to verify a repair using the IBM i operating system.
1. Was the system powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 3.
2. Perform the following tasks:
a. Verify that the power cable is plugged into the power outlet.
b. Verify that power is available at the customer's power outlet.
3. Was the partition powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 6.
4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type
mode and speed options in the Service functions).
5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the
system complete the IPL?
Yes: Continue with the next step.
No: This might be a new problem. Go to the Start of call procedure. This ends the procedure.
6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O
adapter, or storage device replaced?
Yes: Continue with step 10.
No: Continue with the next step.
7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look
for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any
reference codes that are related to this IPL?
Yes: Continue with the next step.
No: If the problem was related to removable media or communications, perform the verification
procedures in the Service functions to verify that the problem is corrected. Then return the
system to the customer and have the customer verify the system date and time. This ends the
procedure.
Verifying a repair 265
8. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure. This ends the
procedure.
9. Are there any other failing items that remain to be replaced?
Yes: Replace the next failing item listed for this reference code. This ends the procedure.
No: Contact your next level of support for assistance. This ends the procedure.
10. Was concurrent maintenance performed on an optical storage unit?
Yes: The product activity log and service action log, in most cases, contain a reference code for
the optical storage unit when concurrent maintenance is performed. You can ignore this reference
code. Perform the following:
- Perform the verification procedures in the Service functions topic to verify that the problem is
corrected.
- Return the system to the customer and have the customer verify the system date and time. This
ends the procedure.
No: Continue with the next step.
11. Use the service action log to look for any new reference codes (see Using the Service Action Log).
Are there any new reference codes?
Yes: Continue with the next step.
No: Go to step 14.
12. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure to determine the
cause of the problem. This ends the procedure.
13. Are there any other failing items that need to be replaced?
Yes: Replace the next failing item listed for the reference code. This ends the procedure..
No: Contact your next level of support for assistance. This ends the procedure.
14. Are you working with a tape device?
Yes: Perform the verification procedures in the Service functions to verify that the problem is
corrected. After the verification test is complete, the tape device description will be set to the
failed state because a resource change was detected. Perform the following tasks:
- Vary the tape device description off and then on.
- Return the system to the customer and have the customer verify the system date and time.
Then go to Verifying the repair from the HMC. This ends the procedure.
No: Continue with the next step.
15. Are you working with an IOP or an IOA?
Yes: Use the display hardware configuration service function to check for any missing or failed
hardware:
- On the command line, enter the STRSST (Start System Service Tools command). If you cannot get
to SST, select DST. Do not IPL the system or partition to get to DST.
- On the Start Service Tools Sign On display, enter the user ID with the service authority and
password.
- Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System
bus resources.
- Select the function key for Include nonreporting resources.
- If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has
not been fixed. Continue to the next failing item in the failing item list. This ends the procedure.
266 Disk drives
No: Perform the verification procedures in the Service functions topics to verify that the problem
is corrected. Resources that usually vary on automatically during an IPL, or that were previously
varied on manually, might need to be varied on again after the verification procedures are
complete. Return the system to the customer and have the customer verify the system date and
time. This ends the procedure.
Verifying the repair in Linux
You can use this procedure to verify that a repair is complete using the Linux operating system.
1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM)
server. See Running the stand-alone diagnostics from CD-ROM.
Did you encounter any problems?
No Reboot the operating system and continue with the close of call procedure.
Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure
that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis and repair the new problem.
Verifying the repair from the HMC
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer using the hardware management console (HMC).
Follow this checklist before performing the procedures:
v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Server verification has been performed and there are no problems that require additional service
actions.
v If the repair was done using the HMC online repair procedures, ensure that the original serviceable
event is now closed.
1. Is an HMC used to manage the server that you are servicing?
v Yes: Continue with the next step.
v No: Return to the “Verifying a repair” on page 261. This ends the procedure.
2. Are you closing a service event that was a repair on the HMC personal computer?
v Yes: Continue with the next step.
v No: Go to step 4 on page 268.
3. Power on the HMC. Did the power-on process complete without errors?
v Yes: Ensure that the HMC can be used to perform server management tasks, and return the HMC to normal
operations. Go to “Closing a service call” on page 269. This ends the procedure.
v No: Go to the HMC isolation procedures. This ends the procedure.
Verifying a repair 267
4. Log into the HMC as the service representative. If invalid user or invalid password displays, get the
correct login information from the system administer.
1. If logged on the System Manager, select Exit from the Console, located on the System Manager window.
2. Login to the System Manager with the following: User identification service Password service mode
5. View Serviceable Event Details.
1. In the navigation area, select the Service Applications icon.
2. In the navigation area, select the Service Focal Point icon.
3. In the Contents area, select Manage Serviceable Events.
4. Designate the set of Serviceable Events you want to view. When
you are finished, click OK. The Service Event Overview window
opens.
Note: Only the events that match all of the
criteria you specify are shown.
6. Close open or delayed events.
1. Select the problem to close, on the Service Event Overview window.
2. Select the menu Selected, located on the menu bar.
3. Click Close Event.
4. Enter your comments in the Serviceable Event Comments window, and click Close Event.
5. Close all events associated with the problem on which you were working.
7. Did the Service Event Overview window contain the event or events on which you were working?
v Yes: Return the HMC to normal operations. Go to “Closing a service call” on page 269. This ends the procedure.
v No: Go to Detecting problems. This ends the procedure.
268 Disk drives
Closing a service call
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other serviceable
events might have been opened. Close all serviceable events that were opened as a result of your
service activity.
v Ensure that server verification has been performed and no problems require additional service actions.
v If the repair was done using the Hardware Management Console (HMC) online repair procedures,
ensure that the original serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference. Is the server managed by an HMC?
v Yes: Continue with the next step.
v No: Do one of the following steps:
– If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using
Integrated Virtualization Manager” on page 277.
– If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call
using AIX or Linux” on page 273.
2. On the HMC, open Manage Serviceable Events and examine the service action event log for any
open service action events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 283. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32
on page 272 for each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
© Copyright IBM Corp. 2010 269
Is the error code that is associated with this service action event the same as recorded in Step 1 on page 269?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 269?
v Yes: Go to step 11.
v No: Continue with the next step.
10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 269 in the list of
FRUs for this service action event?
v Yes: Continue with the next step.
v No: Go to step 32 on page 272.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17 on page 271.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16.
15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from
processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of
partitions obtained in step 11 in future steps, the list is empty. Go to step 17 on page 271.
270 Disk drives
17. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK. The
following steps will add or update FRU information.
20. Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU
information. Go to step 23.
22. Select the No FRU Replaced for this Serviceable Event option.
23. Click OK to close the service action event.
24. Is the list of all partitions you recorded in step 11 on page 270 empty?
v Yes: Go to step 32 on page 272.
v No: Continue with the next step.
25. Does the list of all partitions you recorded in step 11 on page 270 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 32 on page 272.
26. Is the error class recorded in step 25 AIX?
v Yes: Continue with the next step.
v No: Go to step 32 on page 272.
27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
270, except the partition you were using to debug the original problem.
28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the
AIX command prompt.
Closing a service call 271
29. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
30. Exit from diagnostics in this partition and return to the AIX prompt.
31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 270 been
processed?
v Yes: Continue with the next step.
v No: Go to step 24 on page 271 to process the next partition in the list you recorded in step 11 on page 270.
32. Have all the serviceable events recorded in step 4 on page 269 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 269 and process the next service action event in the list of serviceable events recorded in
step 4 on page 269.
33. While processing all service action events, were you directed to step 14 on page 270?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 283. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
34. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 270, except the partition you were using to debug the original problem.
35. From the HMC virtual terminal window of a partition in the list of Axx partitions, type diag at the
AIX command prompt.
36. When the diagnostic operating instructions are displayed, do the following steps:
272 Disk drives
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
37. Exit from diagnostics in this partition and return to the AIX prompt.
38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 270 been
processed?
v Yes: Continue with the next step.
v No: Go to step 34 on page 272 to process the next partition in the list you recorded in step 14 on page 270.
39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 283. This completes the repair. Return the system to the customer.
Note: If during the processing of the list of open service action events, some service action events
remained open, further service actions might be required to complete the repair.
Closing a service call using AIX or Linux
If the server is not connected to an Hardware Management Console (HMC) and not using Integrated
Virtualization Manager (IVM), perform these procedures to close problem numbers, clear hardware
messages, and prepare the server to return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and that no problems require additional service
actions.
v If the repair was done using the IVM online repair procedures, ensure that the original serviceable
event is now closed.
1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
v Yes: Go to step 4 on page 274
v No: Continue with the next step.
Closing a service call 273
2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that
were removed during problem analysis that you want to put back into the system?
Note: If the system planar or battery has been replaced and you are loading diagnostics from a
server over a network, it might be necessary for the customer to set the network boot information
for this system before diagnostics can be loaded. The system time and date information should also
be set when the repair is completed.
v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3
v No: Continue with the next step.
3. Is the system or logical partition that you are performing a repair action on running the AIX
operating system?
v Yes: Continue with the next step.
v No: Go to step 5.
4. Does the system or logical partition you are performing a repair action on have AIX installed?
Note: Answer no to this question if you have just replaced a hard disk in the root volume group.
v Yes: Go to step 7.
v No: Continue with the next step.
5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a
Network Installation Management (NIM) server.
Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to
Running the stand-alone diagnostics from CD on a server without an HMC attached.
For instructions on running stand-alone diagnostics from a NIM server, go to Running the
stand-alone diagnostics from a Network Installation Management server.
Did you encounter any problems?
v Yes: Go to problem analysis.
v No: Continue with the next step.
6. The system hardware is functioning correctly.
If the system attention LED is still on, turn off the LED as described
in “Activating and deactivating LEDs” on page 283.
This completes the repair.
Note: If, during the processing of the list of open service action
events, some service action events remained open, further service
actions might be required to complete the repair.
Return the server to the state that the customer normally uses, such
as IPL type, IPL mode, and the way the system is configured or
partitioned. This might require you to reboot the operating system.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
7. Complete the following steps:
274 Disk drives
1. If the system supports slow boot (see Performing a slow boot), do
a slow boot on the system. If the system does not support slow
boot, do a normal boot.
2. Power on the system.
3. Wait until the AIX operating system login prompt displays or
until system activity on the operator panel or display apparently
has stopped.
Did the AIX Login Prompt display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
8. If the Resource Repair Action menu is already displayed, go to 12 on page 276; otherwise, do the
following steps:
1. Log into the operating system, either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an system request number
(SRN) displays, suspect a loose card or connection. If no
instructions display, no resources were detected as missing.
Continue with 9.
9. Complete the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select
Problem determination.
5. When the Advanced Diagnostic Selection menu displays, select
the All Resources option, or test the FRUs you exchanged, and
any devices that are attached to the FRUs you exchanged, by
selecting the diagnostics for the individual FRU.
Did the Resource Repair Action menu (801015) display?
v Yes: Go to step 13 on page 276.
v No: Continue with the next step.
10. Did the TESTING COMPLETE, no trouble was found menu (801010) display?
v Yes: Continue with the next step.
v No: There is still a problem. Go to problem analysis.
11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to
update the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action.
Closing a service call 275
If the resource associated with your action is not displayed on the
resource list, select sysplanar0.
Note: If the system attention indicator is on, this will set it back to
the normal state. Go to step 14.
12. When a test is run on a resource in system verification mode, and that resource has an entry in the
AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This
updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Do the following steps:
1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was
reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with
your action is not displayed on the resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
v Yes: Continue with the next step.
v No: If the No Trouble Found menu displays, go to step 14.
13. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, and
that resource has an entry in the AIX error log, if the test on the
resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU
from the Resource Repair Action menu. This updates the AIX error
log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it
back to the normal state.
Do the following steps:
1. From the Resource Repair Action menu, select the parent or child
of the resource that has been replaced. If the repair action was
reseating a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action is
not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
Did the No Trouble Found menu display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
276 Disk drives
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing
the configuration?
Note: This does not refer to the PCI-X RAID adapter or cache.
v Yes: Continue with the next step.
v No: Go to step 16.
15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following
steps:
1. On the PCI SCSI Disk Array Manager dialog, select Recovery options.
2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI
Adapter Configuration and press F3.
3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed.
6. On the next dialog, press Enter.
7. When you see the Are You Sure selection menu, press Enter to continue.
8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that
you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating
system.
9. Go to 16.
16. The system hardware is functioning correctly. Return the server to the state that the customer
normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
Closing a service call using Integrated Virtualization Manager
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and there are no problems that require additional
service actions.
v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures,
ensure that the original serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference.
Closing a service call 277
2. On the IVM, open Manage Serviceable Events and look at existing serviceable events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 283. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 280 for
each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code associated with this service action event the same as
recorded in step 1 on page 277?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 277?
v Yes: Go to step 11.
v No: Continue with the next step.
10. Is the FRU you replaced and recorded in step 1 on page 277 in the list of FRUs for this service action
event?
v Yes: Continue with the next step.
v No: Go to step 30 on page 280.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
278 Disk drives
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 278.
Go to step 16.
15. Add the partition list obtained in step 11 on page 278 to the existing list of Axx partitions obtained
from processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11 on page 278. If you are referred
to the list of partitions obtained in step 11 on page 278 in future steps, the list is empty. Go to step
17.
17. Select and highlight the service action event from the Manage Serviceable Events window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK.
20. Add or update FRU information:
Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. Click OK to close the service action event.
22. Is the list of all partitions you recorded in step 11 on page 278 empty?
v Yes: Go to step 30 on page 280.
v No: Continue with the next step.
Closing a service call 279
23. Does the list of all partitions you recorded in step 11 on page 278 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 30.
24. Is the error class recorded in step 23?
v Yes: Continue with the next step.
v No: Go to step 30.
25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
278, except the partition you were using to debug the original problem.
26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX
command prompt.
27. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
28. Exit from diagnostics in this partition and return to the AIX prompt.
29. Have all the partitions in the list of all partitions you recorded in step 11 on page 278 been
processed?
v Yes: Continue with the next step.
v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 278.
30. Have all the serviceable events recorded in step 4 on page 278 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 278 and process the next service action event in the list of serviceable events recorded in
step 4 on page 278.
31. While processing all service action events, were you directed to step 14 on page 279?
280 Disk drives
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 283. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
32. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 279, except the partition you were using to debug the original problem.
33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the
AIX command prompt.
34. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
35. Exit from diagnostics in this partition and return to the AIX prompt.
36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 279 been
processed?
v Yes: Continue with the next step.
v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 279.
37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 283. This completes the repair. Return the system to the customer.
Note: If, during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
Closing a service call 281
282 Disk drives
Activating and deactivating LEDs
You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the Hardware
Management Console (HMC) or the Advanced System Management Interface (ASMI).
For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or
verify a part that you are servicing. The error and identify function (amber) LED indicates an error and
corresponds to the location code in the system reference code (SRC). The LED is activated and
deactivated automatically.
Additionally, the following procedures can also be used to activate and deactivate LEDs.
v “Deactivating a system attention LED or partition LED using the HMC”
v “Activating or deactivating an identify LED using the HMC” on page 284
v “Deactivating a system attention LED or logical partition LED using the Advanced System
Management Interface” on page 284
v “Activating or deactivating an identify LED using the Advanced System Management Interface” on
page 285
Deactivating a system attention LED or partition LED using the HMC
You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not
a high priority and you decide to repair the problem at a later time.
If you want to be alerted if another problem occurs, you must deactivate the system attention LED so
that it can be activated again if another problem occurs.
To deactivate a system attention LED using the HMC, complete the following steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on by checking the box next to its name.
3. Open Operations.
4. Open LED Status.
5. Select View System Attention. The system attention LED window opens. The selected system and its
LED state are displayed in the upper part of the window. The logical partition and its LED state are
displayed in the lower part of the window. From the system attention LED window, you can
deactivate both the system attention LED and the logical partition LED.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
© Copyright IBM Corp. 2010 283
Activating or deactivating an identify LED using the HMC
You can activate or deactivate an identify LED for components attached to the system.
The system provides several LEDs that help identify various components, such as enclosures or
field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on.
3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens.
4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select
Selected ? List FRUs.
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Deactivating a system attention LED or logical partition LED using the
Advanced System Management Interface
You can deactivate a system attention LED or a logical partition LED using the Advanced System
Management Interface (ASMI).
The system attention indicator provides a visual signal that the system as a whole requires attention or
service. Each system has a single system attention indicator. When an event occurs that either needs your
intervention or that of service and support, the system attention indicator lights continuously. The system
attention indicator is turned on when an entry is made in the service processor error log. The error entry
is transmitted to the system error log and the operating system error log.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To turn off the system attention indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention
Indicator.
3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error
message is displayed.
284 Disk drives
Activating or deactivating an identify LED using the Advanced System
Management Interface
You can activate or deactivate an identify LED using the Advanced System Management Interface
(ASMI).
You can specify the location code of any indicator to view or modify its current state. If you provide the
wrong location code, the advanced system manager attempts to go to the next higher level of the location
code.
The next level is the base-level location code for that field replaceable unit (FRU). For example, a user
types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If
the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt
to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no
other level is available.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To change the current state of an indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location
code.
3. In the right pane, enter the location code of the FRU and click Continue.
4. Select the preferred state from the list.
5. Click Save settings.
Activating and deactivating LEDs 285
286 Disk drives
Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer's representative for information on the products and services
currently available in your area. Any reference to the manufacturer's product, program, or service is not
intended to state or imply that only that product, program, or service may be used. Any functionally
equivalent product, program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not grant you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
The manufacturer may make improvements and/or changes in the product(s) and/or the program(s)
described in this publication at any time without notice.
Any references in this information to Web sites not owned by the manufacturer are provided for
convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at
those Web sites are not part of the materials for this product and use of those Web sites is at your own
risk.
The manufacturer may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Any performance data contained herein was determined in a controlled environment. Therefore, the
results obtained in other operating environments may vary significantly. Some measurements may have
been made on development-level systems and there is no guarantee that these measurements will be the
same on generally available systems. Furthermore, some measurements may have been estimated through
extrapolation. Actual results may vary. Users of this document should verify the applicable data for their
specific environment.
Information concerning products not produced by this manufacturer was obtained from the suppliers of
those products, their published announcements or other publicly available sources. This manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products not produced by this manufacturer. Questions on the capabilities of products
not produced by this manufacturer should be addressed to the suppliers of those products.
All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal
without notice, and represent goals and objectives only.
© Copyright IBM Corp. 2010 287
The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are
subject to change without notice. Dealer prices may vary.
This information is for planning purposes only. The information herein is subject to change before the
products described become available.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
If you are viewing this information in softcopy, the photographs and color illustrations may not appear.
The drawings and specifications contained herein shall not be reproduced in whole or in part without the
written permission of the manufacturer.
The manufacturer has prepared this information for use with the specific machines indicated. The
manufacturer makes no representations that it is suitable for any other purpose.
The manufacturer's computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience
unplanned outages, system failures, power fluctuations or outages, or component failures must verify the
accuracy of operations performed and data saved or transmitted by the system at or near the time of the
outage or failure. In addition, users must establish procedures to ensure that there is independent data
verification before relying on such data in sensitive or critical operations. Users should periodically check
the manufacturer's support websites for updated information and fixes applicable to the system and
related software.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Other company, product, or service names may be trademarks or service marks of others.
Electronic emission notices
Class A Notices
The following Class A statements apply to the IBM servers that contain the POWER7 processor and its
features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
Federal Communications Commission (FCC) statement
Note: This equipment has been tested and found to comply with the limits for a Class A digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference when the equipment is operated in a commercial environment. This equipment
generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, may cause harmful interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the user will be
required to correct the interference at his own expense.
288 Disk drives
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. IBM is not responsible for any radio or television interference caused by using other than
recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class A digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for
commercial and industrial environments to provide reasonable protection against interference with
licensed communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Warning: This is a Class A product. In a domestic environment, this product may cause radio
interference, in which case the user may be required to take adequate measures.
VCCI Statement - Japan
The following is a summary of the VCCI Japanese statement in the box above:
This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a
domestic environment, radio interference may occur, in which case, the user may be required to take
corrective actions.
Notices 289
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Electromagnetic Interference (EMI) Statement - People's Republic of China
Declaration: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may need to perform practical action.
Electromagnetic Interference (EMI) Statement - Taiwan
The following is a summary of the EMI Taiwan statement above.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
IBM Taiwan Contact Information:
290 Disk drives
Electromagnetic Interference (EMI) Statement - Korea
Please note that this equipment has obtained EMC registration for commercial use. In the event that it
has been mistakenly sold or purchased, please exchange it for equipment certified for home use.
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
Notices 291
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Electromagnetic Interference (EMI) Statement - Russia
Class B Notices
The following Class B statements apply to features designated as electromagnetic compatibility (EMC)
Class B in the feature installation information.
Federal Communications Commission (FCC) statement
This equipment has been tested and found to comply with the limits for a Class B digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference in a residential installation.
This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in
accordance with the instructions, may cause harmful interference to radio communications. However,
there is no guarantee that interference will not occur in a particular installation.
If this equipment does cause harmful interference to radio or television reception, which can be
determined by turning the equipment off and on, the user is encouraged to try to correct the interference
by one or more of the following measures:
v Reorient or relocate the receiving antenna.
v Increase the separation between the equipment and receiver.
v Connect the equipment into an outlet on a circuit different from that to which the receiver is
connected.
v Consult an IBM-authorized dealer or service representative for help.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for
any radio or television interference caused by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate this equipment.
292 Disk drives
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class B digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class B Information Technology
Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for
typical residential environments to provide reasonable protection against interference with licensed
communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
VCCI Statement - Japan
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Notices 293
IBM Taiwan Contact Information
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
294 Disk drives
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Terms and conditions
Permissions for the use of these publications is granted subject to the following terms and conditions.
Personal Use: You may reproduce these publications for your personal, noncommercial use provided that
all proprietary notices are preserved. You may not distribute, display or make derivative works of these
publications, or any portion thereof, without the express consent of the manufacturer.
Commercial Use: You may reproduce, distribute and display these publications solely within your
enterprise provided that all proprietary notices are preserved. You may not make derivative works of
these publications, or reproduce, distribute or display these publications or any portion thereof outside
your enterprise, without the express consent of the manufacturer.
Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either
express or implied, to the publications or any information, data, software or other intellectual property
contained therein.
The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its
discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer,
the above instructions are not being properly followed.
You may not download, export or re-export this information except in full compliance with all applicable
laws and regulations, including all United States export laws and regulations.
THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF
ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED
WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR
PURPOSE.
Bull
2104 Model DU3 Expandable Storage Plus
Operator’s Guide
86 A1 16EF 01
ORDER REFERENCE
Bull
2104 Model DU3 Expandable Storage Plus
Operator’s Guide
Hardware
December 2000
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 16EF 01
ORDER REFERENCE
The following copyright notice protects this book under the Copyright laws of the United States of America
and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and
making derivative works.
Copyright Bull S.A. 1992, 2000
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of
this book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
AIX is a registered trademark of International Business Machines Corporation, and is being used under
licence.
UNIX is a registered trademark in the United States of America and other countries licensed exclusively through
the Open Group.
Year 2000
The product documented in this manual is Year 2000 Ready.
The information in this document is subject to change without notice. Groupe Bull will not be liable for errors
contained herein, or for incidental or consequential damages in connection with the use of this material.
Contents
Communications Statements . . . . . . . . . . . . . . . . . . . v
Federal Communications Commission (FCC) Statement . . . . . . . . . . v
Japanese Voluntary Control Council for Interference (VCCI) Statement. . . . . . v
Korean Government Ministry of Communication (MOC) Statement . . . . . . . v
New Zealand Compliance Statement . . . . . . . . . . . . . . . . . v
International Electrotechnical Commission (IEC) Statement . . . . . . . . . vi
Avis de conformité à la réglementation d’Industrie Canada . . . . . . . . . vi
Industry Canada Compliance Statement. . . . . . . . . . . . . . . . vi
United Kingdom Telecommunications Requirements. . . . . . . . . . . . vi
European Union (EU) Statement . . . . . . . . . . . . . . . . . . vi
Radio Protection for Germany . . . . . . . . . . . . . . . . . . . vi
Taiwan Class A Compliance Statement . . . . . . . . . . . . . . . . vii
Safety Notices. . . . . . . . . . . . . . . . . . . . . . . . ix
Definitions of Safety Notices. . . . . . . . . . . . . . . . . . . . ix
Acoustic Noise Declaration . . . . . . . . . . . . . . . . . . . . ix
Electrostatic Discharge (ESD) . . . . . . . . . . . . . . . . . . xi
About This Book . . . . . . . . . . . . . . . . . . . . . . xiii
If You Need More Information . . . . . . . . . . . . . . . . . . . xiii
Trademarks. . . . . . . . . . . . . . . . . . . . . . . . . xiii
Numbering Convention . . . . . . . . . . . . . . . . . . . . . xiv
Chapter 1. Using the 2104 Model DU3 or TU3 . . . . . . . . . . . . . 1
2104 Model DU3 . . . . . . . . . . . . . . . . . . . . . . . 2
2104 Model TU3. . . . . . . . . . . . . . . . . . . . . . . . 5
Controls and Lights. . . . . . . . . . . . . . . . . . . . . . . 7
Disk Drive Module Lights. . . . . . . . . . . . . . . . . . . . 8
Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch . . . . 11
SCSI Interface Card Lights. . . . . . . . . . . . . . . . . . . 14
Switch Card. . . . . . . . . . . . . . . . . . . . . . . . 16
Identifying 2104 Disk Enclosures. . . . . . . . . . . . . . . . . . 20
Identifying Disk Drive Modules. . . . . . . . . . . . . . . . . . . 21
Security . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Ordering Cover Lock Keys . . . . . . . . . . . . . . . . . . . 23
Chapter 2. Adding Disk Drive Modules . . . . . . . . . . . . . . . 25
Before Adding a Disk Drive Module . . . . . . . . . . . . . . . . . 27
To Add a Disk Drive Module . . . . . . . . . . . . . . . . . . . 29
Chapter 3. Exchanging FRUs . . . . . . . . . . . . . . . . . . 35
Exchanging Disk Drive Modules . . . . . . . . . . . . . . . . . . 36
Before Exchanging a Disk Drive Module . . . . . . . . . . . . . . 37
To Exchange a Disk Drive Module . . . . . . . . . . . . . . . . 38
Exchanging a Fan-and-Power-Supply Assembly. . . . . . . . . . . . . 45
Before Exchanging a Fan-and-Power-Supply Assembly . . . . . . . . . 45
iii
To Exchange a Fan-and-Power-Supply Assembly . . . . . . . . . . . 46
Exchanging a Fan Assembly . . . . . . . . . . . . . . . . . . . 55
Before Exchanging a Fan Assembly. . . . . . . . . . . . . . . . 55
To Exchange a Fan Assembly. . . . . . . . . . . . . . . . . . 56
Exchanging a SCSI Interface Card Assembly. . . . . . . . . . . . . . 62
Before Exchanging the SCSI Interface Card Assembly. . . . . . . . . . 63
To Exchange a SCSI Interface Card Assembly . . . . . . . . . . . . 64
Exchanging the Switch Card Assembly. . . . . . . . . . . . . . . . 73
Before Exchanging the Switch Card Assembly . . . . . . . . . . . . 73
To Exchange the Switch Card Assembly . . . . . . . . . . . . . . 74
Reporting Problems . . . . . . . . . . . . . . . . . . . . . . 80
Appendix A. Operating with RISC Systems . . . . . . . . . . . . . 81
SCSI Adapters and Cables. . . . . . . . . . . . . . . . . . . . 81
SCSI Service Aids. . . . . . . . . . . . . . . . . . . . . . . 81
Identifying a 2104 and its Disk Drive Modules . . . . . . . . . . . . . 81
Problem Determination . . . . . . . . . . . . . . . . . . . . . 82
Checking the SCSI Error Log . . . . . . . . . . . . . . . . . . . 82
Configuring a 2104 to the using system . . . . . . . . . . . . . . . 82
Unconfiguring a 2104 from the using system . . . . . . . . . . . . . . 82
Web Support Page . . . . . . . . . . . . . . . . . . . . . . 82
Related Publications . . . . . . . . . . . . . . . . . . . . . . 82
Appendix B. Translated Safety Notices . . . . . . . . . . . . . . . 83
Index . . . . . . . . . . . . . . . . . . . . . . . . . . 117
iv 2104 Models DU3 and TU3 Operator’s Guide
Communications Statements
The following statements apply to this product. The statements for other products
intended for use with this product appear in their accompanying manuals.
Federal Communications Commission (FCC) Statement
This equipment has been tested and found to comply with the limits for a Class A
digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to
provide reasonable protection against harmful interference when the equipment is
operated in a commercial environment. This equipment generates, uses, and can
radiate radio frequency energy and, if not installed and used in accordance with the
instruction manual, may cause interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the
user will be required to correct the interference at his own expense.
Properly shielded and grounded cables and connectors must be used in order to meet
FCC emission limits. Neither the provider nor the manufacturer is responsible for any
radio or television interference caused by using other than recommended cables and
connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user’s authority to operate the
equipment.
This device complies with Part 15 of FCC Rules. Operation is subject to the following
two conditions: (1) this device may not cause harmful interference, and (2) this device
must accept any interference received, including interference that may cause undesired
operation.
Japanese Voluntary Control Council for Interference (VCCI) Statement
This product is a Class A Information Technology Equipment and conforms to the
standards set by the Voluntary Control Council for Interference by Information
Technology Equipment (VCCI). In a domestic environment, this product might cause
radio interference, in which event the user might be required to take adequate
measures.
Korean Government Ministry of Communication (MOC) Statement
Please note that this device has been approved for business purposes with regard to
electromagnetic interference. If you find that this device is not suitable for your use, you
can exchange it for one that is approved for non-business purposes.
New Zealand Compliance Statement
This is a Class A product. In a domestic environment this product might cause radio
interference, in which event the user might be required to take adequate measures
v
International Electrotechnical Commission (IEC) Statement
This product has been designed and built to comply with (IEC) Standard 60950.
Avis de conformité à la réglementation d’Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
Industry Canada Compliance Statement
This Class A digital apparatus complies with IECS-003.
United Kingdom Telecommunications Requirements
This apparatus is manufactured to the International Safety Standard EN60950 and as
such is approved in the U.K. under approval number NS/G/1234/J/100003 for indirect
connection to public telecommunications systems in the United Kingdom.
European Union (EU) Statement
This product is in conformity with the protection requirements of EU council directive
89/336/EEC on the approximation of the laws of the Member States relating to
electromagnetic compatibility. Neither the provider nor the manufacturer can accept
responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of option cards not
supplied by the manufacturer.
This product is in conformity with the EU council directive 73/23/EEC on the
approximation of the laws of the Member States relating to electrical equipment
designed for use within certain voltage limits. This conformity is based on compliance
with the following harmonized standard: EN60950.
Radio Protection for Germany
Zulassungsbescheinigung laut dem Deutschen Gesetz über die
elektromagnetische Verträglichkeit von Geräten (EMVG) vom 30. August 1995
(bzw. der EMC EG Richtlinie 89/336):
Dieses Gerät ist berechtigt in Übereinstimmung mit dem Deutschen EMVG das
EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Konformitätserklärung
nach Paragraph 5 des EMVG ist die:
IBM Deutschland Informationssysteme GmbH, 70548 Stuttgart.
Informationen in Hinsicht EMVG Paragraph 3 Abs. (2) :
Das Gerät erfüllt die Schutzanforderungen nach EN 50082-1 und EN 55022 Klasse A.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
?Warnung: dies ist eine Einrichtung der Klasse A. Diese Einrichtung kann im
vi 2104 Models DU3 and TU3 Operator’s Guide
Wohnbereich Funkstörungen verursachen; in diesem Fall kann vom Betreiber verlangt
werden, angemessene Massnahmen durchzuführen und dafür aufzukommen.?
EN 50082-1 Hinweis:
?Wird dieses Gerät in einer industriellen Umgebung betrieben (wie in EN 50082-2
festgelegt), dann kann es dabei eventuell gestört werden. In solch einem Fall ist der
Abstand bzw. die Abschirmung zu der industriellen Störquelle zu vergrössern.?
Anmerkung:
Um die Einhaltung des EMVG sicherzustellen sind die Geräte, wie in den Handbüchern
angegeben, zu installieren und zu betreiben.
Taiwan Class A Compliance Statement
Communications Statements vii
viii 2104 Models DU3 and TU3 Operator’s Guide
Safety Notices
For a translation of the danger and caution notices contained in this book, see
“Appendix B. Translated Safety Notices” on page 83.
Definitions of Safety Notices
A danger notice indicates the presence of a hazard that has the potential of causing
death or serious injury.
A danger notice appears on page 50.
A caution notice indicates the presence of a hazard that has the potential of causing
moderate or minor personal injury.
Caution notices appear on pages 47, 49, 58, 67, 69, and 76.
An attention notice indicates an action that could cause damage to a program, device,
system, or data.
Acoustic Noise Declaration
The equivalent continuous A-weighted sound pressure level at the bystander’s position
measured for a 2104 Model DU3 or TU3 does not exceed 68 dB(A). This level has
been measured under the following conditions:
v Rack-mounted unit:
– The 2104 Model DU3 had fourteen disk drive modules installed.
– The 2104 Model DU3 was installed in a 7202 Rack with the bottom of the 2104
Model DU3 system at EIA position 17.
– The remainder of the rack was filled with units to which power was not connected.
v Deskside unit:
– The 2104 Model TU3 had fourteen disk drive modules installed.
These levels were measured using a procedure in accordance with ISO standard
DIS7779, Measurement of Airborne Noise Emitted by Computer and Business
Equipment. The equipment was installed and operated as described in Appendix C.8 of
that standard.
ix
x 2104 Models DU3 and TU3 Operator’s Guide
Electrostatic Discharge (ESD)
Attention:
When you handle field-replaceable units (FRUs) and other computer parts, take
these precautions to avoid static damage:
v Limit your movement. Movement can cause static electricity to build up around
you.
v Always touch computer parts carefully. Hold adapters and memory-modules by
their edges. Never touch any exposed circuits.
v Prevent people who are not correctly grounded from touching computer parts.
v Before you install a new part, touch the static-protective package that contains
the part against an unpainted metal part of the 2104 or using system for at
least two seconds. This action reduces static electricity in the package and in
your body.
v Remove the part from its package and, if possible, install it directly into the
2104 without putting the part down. If you need to put the part down, first place
the static-protective package that contained the part onto a smooth, level
surface, then place the part onto the package. Do not place the part onto any
metal surface.
xi
xii 2104 Models DU3 and TU3 Operator’s Guide
About This Book
This book is for people who operate a system that has one or more 2104 Model DU3 or
TU3 connected to it.
Throughout this book, service representative means a person who has been authorized
by your organization to maintain your 2104 Model DU3 or TU3.
Chapter 1 describes the 2104 Models DU3 and TU3, their controls, and how to use
them.
Chapter 2 describes how to add disk drive modules to a 2104 Model DU3 or TU3 that is
already installed as part of your system.
Chapter 3 describes what to do if you have a problem with your 2104 Model DU3 or
TU3. It tells you how to replace particular failed components, such as disk drive
modules, and what to report if you need to call your service representative for
assistance.
Appendix A gives additional information about using a 2104 Model DU3 or TU3 that is
attached to a RISC system.
Appendix B gives a translation of all danger and caution notices contained in this book.
An index is provided at the back of this book.
If You Need More Information
Other books that you might need are:
v Expandable Storage Plus : 2104 Model DU3 Installation Guide, 86 A1 15EF
v Expandable Storage Plus : 2104 Model TU3 Installation Guide,
v Expandable Storage Plus : 2104 Models DU3 and TU3 Service Guide, 86 A1 17EF
v The Operator Guide for your system
v The User’s Guide for your using-system SCSI attachment (for example, your SCSI
adapter)
v The Site and Hardware Planning Information for your system
v The Problem Solving Guide and Reference for your system.
Trademarks
The following items are trademarks of International Business Machines Corporation in
the United States, or other countries, or both:
v AIX
v IBM
v RS/6000
xiii
v TM
Numbering Convention
In this book:
KB means 1 000 bytes.
MB means 1 000 000 bytes.
GB means 1 000 000 000 bytes.
xiv 2104 Models DU3 and TU3 Operator’s Guide
Chapter 1. Using the 2104 Model DU3 or TU3
Important: The installation of this product is a customer responsibility.
Attention: Before continuing with any of the actions described in this book,
please refer to “Safety Notices” on page ix.
This section describes the Expandable Storage Plus: 2104 Models DU3 and TU3, their
controls, and how to use them. A 2104 Model DU3 or TU3 can be attached to any
supported RS/6000™ or IBM
server
TM
computer that provides support for any of the Small Computer System Interface (SCSI)
adapters listed in “SCSI Adapters and Cables” on page 81.
The 2104 Models DU3 and TU3 look like this:
1
2104 Model DU3
The 2104 Model DU3 is a rack-mounted SCSI disk enclosure that can be installed into
a standard Electrical Industries Association (EIA) 19-inch rack.
A fully configured 2104 Model DU3 looks like this:
Figure 1. 2104 Model TU3 (Left) and 2104 Model DU3 (Right)
2 2104 Models DU3 and TU3 Operator’s Guide
Figure 2. 2104 Model DU3 from the Front (Top) and Back (Bottom)
Chapter 1. Using the 2104 Model DU3 or TU3 3
At the front of a 2104 Model DU3 there are 14 device slots 1. Each slot must contain
either a SCSI disk drive module, or a dummy disk module. Each disk drive module
connects to a backplane which is mounted vertically about halfway between the front
and back of the 2104 Model DU3.
Note: To run I/O at least one SCSI disk drive module must be present in each 2104
Model DU3.
The 14 device slots are divided into two groups of seven slots. Between the two groups
of seven slots there is a SCSI bridge card2. The SCSI bridge card assigns the disk
drive modules to a SCSI bus, in accordance with the setting of the SCSI bus split
switch on the switch card 5.
On the back of the backplane are connectors for two fan-and-power-supply
assemblies3, two SCSI interface cards4, and a switch card5.
The 2104 Model DU3 has either two fan-and-power-supply assemblies or one
fan-and-power-supply assembly and one fan assembly. Either option provides all the
necessary power and cooling for the 2104 Model DU3.
It is better, however, to have two fan-and-power-supply assemblies, rather than one
fan-and-power-supply assembly and one fan assembly, because then the amount of
power required from each power supply is reduced, and, if one power supply fails,
power for the 2104 disk enclosure continues to be supplied by the other power supply.
The SCSI interface cards4 are used to connect host machines to the 2104 Model
DU3. These cards contain logic that provides information about what is happening in
the 2104 Model DU3 and the status of components within it.
The switch card5 contains switches that control which SCSI bus mode is selected,
how the 2104 Model DU3 is supplied with power, and what enclosure services are
enabled. It also contains a rotary switch that sets the ID of the 2104 Model DU3.
Options for configuring a 2104 Model DU3 are described in the Expandable Storage
Plus: 2104 Model DU3 Installation Guide.
4 2104 Models DU3 and TU3 Operator’s Guide
2104 Model TU3
The 2104 Model TU3 is a deskside SCSI disk enclosure.
A fully configured 2104 Model TU3 looks like this:
Figure 3. 2104 Model TU3 from the Front (Left) and Back (Right)
Chapter 1. Using the 2104 Model DU3 or TU3 5
At the front of a 2104 Model TU3 are 14 device slots 1. Each slot must contain either
an SCSI disk drive module, or a dummy module. Each disk drive module connects to a
backplane that is mounted vertically about halfway between the front and back of the
2104 Model TU3.
Note: To run I/O at least one SCSI disk drive module must be present in each 2104
Model TU3.
The 14 device slots are split into two groups of seven slots. Between the two groups of
slots, there is a SCSI bus bridge card 2. The SCSI bus bridge card assigns the disk
drive modules to a SCSI bus, in accordance with the setting of the SCSI bus split
switch on the switch card5.
On the back of the backplane are connectors for two fan-and-power-supply
assemblies3, two SCSI interface cards4, and a switch card5.
The 2104 Model TU3 can have either two fan-and-power-supply assemblies or one
fan-and-power-supply assembly and one fan assembly. Either option provides all the
necessary power and cooling for the 2104 Model TU3.
It is better, however, to have two fan-and-power-supply assemblies, rather than one
fan-and-power-supply assembly and one fan assembly, because then the amount of
power required from each power supply is reduced, and, if one power supply fails,
power for the 2104 disk enclosure continues to be supplied by the other power supply.
The SCSI interface cards4 are used to connect host machines to the 2104 Model
TU3. These cards contain logic that provides information about what is happening in the
2104 Model TU3, and controls the operation of the subsystem.
The switch card5 contains switches that control which SCSI bus mode is selected,
how the 2104 Model TU3 is supplied with power, and what enclosure services are
enabled. It also contains a rotary switch that is used to set the ID of the 2104 Model
TU3.
Options for configuring a 2104 Model TU3 are described in the Expandable Storage
Plus: 2104 Model TU3 Installation Guide.
6 2104 Models DU3 and TU3 Operator’s Guide
Controls and Lights
This section describes the switches and lights on a 2104 Model DU3 or TU3.
The 2104 has no main power switch. However, each fan-and-power-supply assembly
has an On/Standby switch.
You can see the 2104 lights at the front of the 2104 Model DU3 or TU3.
Power light 1
The Power light, which is green, comes on and stays on continuously when
power is supplied to the 2104 by one or both of the fan-and-power-supply
assemblies within the 2104.
Check light 2
The Check light, which is amber, comes on if a failure occurs in the 2104.
Note: The 2104 might be able to continue operating correctly even when the
failure of a single part has been detected.
Figure 4. Disk Enclosure Lights on a 2104 Model TU3 (Left) and a 2104 Model DU3
(Right)
Chapter 1. Using the 2104 Model DU3 or TU3 7
Disk Drive Module Lights
The lights on the disk drive modules that are installed in a 2104 Model DU3 are visible
at the front of the 2104 Model DU3.
However, to see the lights on the disk drive modules that are installed in a 2104 Model
TU3, open the front cover:
1. If necessary, unlock the cover by using the key provided.
2. Grip the right-hand side of the front cover, and pivot the cover to the left.
Figure 5. 2104 Model TU3 with Cover Opened
8 2104 Models DU3 and TU3 Operator’s Guide
Each disk drive module has two lights:
1Activity light
The Activity light, which is green, shows the following conditions:
Off The SCSI link to the disk drive is not active.
Flashing
This disk drive module is active and a command is in progress.
2Check light
The Check light, which is amber, shows the following conditions:
Off Normal operating condition
Permanently On
One of the following conditions:
v The service aid has set Remove for this disk drive module (See
“Appendix A. Operating with RISC Systems” on page 81).
v The disk drive has reported a Predictive Failure Analysis (PFA)
error; this error indicates that the disk drive has had an excessive
number of internally recovered errors.
v The disk drive is faulty; this is a Conner/Intel SCSI-Accessed
Fault-Tolerant Enclosures (SAF-TE) function.
Flashing (2 seconds on, 2 seconds off)
The Check light has been set by a SCSI service aid to identify the
position of this disk drive module.
Figure 6. Disk Drive Module Lights
Chapter 1. Using the 2104 Model DU3 or TU3 9
Flashing (0.25 seconds on, 0.25 seconds off)
The disk drive is part of an array that is being rebuilt (this is a
SAF-TE function).
Note:SAF-TE services are not used with RS/6000 or IBM
server
pSeries computers.
10 2104 Models DU3 and TU3 Operator’s Guide
Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch
Either one or two fan-and-power-supply assemblies 1 are installed in the back of a
2104 Model DU3 or TU3.
The power switch and lights are visible at the back of the 2104.
Figure 7. Fan-and-Power-Supply Assemblies in a 2104 Model TU3 (Left) and 2104
Model DU3 (Right)
Chapter 1. Using the 2104 Model DU3 or TU3 11
A fan-and-power-supply assembly has the following lights and switch:
AC PWR light 1
The AC PWR light, which is green, comes on when the mainline power supply
is connected to the fan-and-power-supply assembly.
DC PWR light 2
The DC PWR light, which is green, comes on when this fan-and-power-supply
assembly is supplying power to the 2104.
CHK light 3
The CHK light, which is amber, indicates either that there is a failure in the
fan-and-power-supply assembly or that the DC On/Standby switch is set to
Standby.
Note: This light flashes when the failure in the fan-and-power-supply assembly
is a fan failure.
DC On/Standby switch 4
The DC On/Standby switch connects dc electrical power from the
fan-and-power-supply assembly to the disk drives and other components in the
2104. This switch must be set to on for the fan-and-power-supply unit to start.
If the DC On/Standby switch on either of the two fan-and-power-supply
assemblies is set to on, power in a 2104 Model DU3 or TU3 unit is switched
on automatically if all of the following conditions exist:
v Input power to the 2104 is present.
v The fan-and-power-supply assembly is fully home in its slot.
Figure 8. Fan-and-Power-Supply Assembly Lights and Switch
12 2104 Models DU3 and TU3 Operator’s Guide
v Either the power control switch on the switch card is set to on, or terminator
power is active in an external SCSI connection.
A fan assembly has one light:
CHK light 1
The CHK light, which is amber, flashes when the fan fails.
Figure 9. Fan Assembly
Chapter 1. Using the 2104 Model DU3 or TU3 13
SCSI Interface Card Lights
Up to two SCSI interface 1cards can be installed in the back of a 2104.
Figure 10. SCSI Interface Cards in a 2104 Model TU3 (Left) and 2104 Model DU3
(Right)
14 2104 Models DU3 and TU3 Operator’s Guide
An SCSI interface card has the following lights:
TERM POWER light 1
The TERM POWER light, which is green, is lit when an active SCSI
connection is present.
LVD/SE light 2
The LVD/SE light indicates the type of host SCSI cable. It is green, and when
lit indicates an active Low Voltage Differential (LVD) SCSI connection. If this
light is off and the TERM POWER light is on, there is an active single-ended
(SE) SCSI connection.
ACTIVE light 3
The ACTIVE light, which is green, is lit when the using system is
communicating with the SCSI Enclosure Services (SES) processor or the disk
drives.
RESET light 4
The RESET light, which is green, comes on when a Power On Reset (POR) or
a SCSI Bus Reset occurs. This light is switched off by the SES processor.
FAULT light 5
The FAULT light, which is amber, comes on if the SCSI interface card fails.
Figure 11. SCSI Interface Card Assembly Lights
Chapter 1. Using the 2104 Model DU3 or TU3 15
Switch Card
A Switch Card 1 is installed in the back of a 2104.
Figure 12. Switch Card in a 2104 Model TU3 (Left) and 2104 Model DU3 (Right)
16 2104 Models DU3 and TU3 Operator’s Guide
Refer to Figure 13 to identify the switches on the switch card.
The switch card contains the following switches that are accessible to the operator.
Note: The ?default logical setting? of a switch is the setting that is used if the switch
card assembly is not present when the 2104 is switched on, or when a reset
operation occurs.
Power Control switch 1
When this switch is set to Off, the 2104 automatically switches off or on when
the using system is switched off or on.
When this switch is set to On, the 2104 is powered up or down by the DC
On/Standby switch on a fan-and-power-supply assembly.
The default logical setting for this switch is On.
Drive Autostart switch 2
The disk drive motors are controlled by the combination of this switch and the
Delay Motor Start Switch 5.
Figure 13. Switch Card
Chapter 1. Using the 2104 Model DU3 or TU3 17
When this switch is set to On, and the Delay Motor Start Switch 5 is set to
Off, the disk drive motors do not start until a START MOTOR command is
issued. The timing sequence of disk motor startup is under the control of the
using system software.
When this switch is set to Off, and the Delay Motor Start Switch 5 is set to
On, the disk drives are set to Delay Motor Start mode. The disk motor startup
delay time is different for each disk drive and is usually computed by
multiplying 12 seconds by its SCSI ID. For example, the disk drive motor of a
disk drive having a SCSI ID equal to two will start 24 (2 x 12) seconds after
power is applied to the 2104.
When this switch is set to Off, and the Delay Motor Start Switch 5 is set to
Off, the disk drives are set to Normal Start mode. The disk drive motors will
start when power is applied to the 2104.
The effect of both this switch and the Delay Motor Start Switch 5 being set
to On is undefined.
The default logical setting for this switch is On.
Enable Enclosure Services switch 3
When this switch is set to On, the enclosure services can operate.
When the switch is to Off, no response occurs to any request to use the
enclosure services.
The default logical setting for this switch is On.
Select Enclosure Services switch 4
This switch selects which enclosure services can be used — SAF-TE or SES.
When set to Off, it selects SAF-TE.
When set to On, it selects SES.
The default logical setting for this switch is On.Note:SAF-TE services are not
used with RS/6000 or IBM
server
pSeries computers.
Delay Motor Start switch 5
Refer to the explanation of the Drive Autostart Switch 2.
The default logical setting for this switch is Off.
Box ID switch 6
This 10-position rotary switch indicates the ID of the 2104. The operator
selects the required setting which can be used by the SES Inquiry command
and the SAF-TE buffer 1 command.
The default logical setting for this switch is 0.Note:SAF-TE services are not
used with RS/6000 or IBM
server
pSeries computers.
18 2104 Models DU3 and TU3 Operator’s Guide
The switch card has other switches that are accessible only when the switch card is
removed from the 2104. They should only be changed when the 2104 is being
converted from a Model DU3 to a Model TU3, or from a Model TU3 to a Model DU3.
SCSI Address switch 7
When this switch is set to Off, which is its default setting in a 2104 Model DU3,
the SCSI addresses of the disk drives are 0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12,
13, 14, from left to right. When the switch is set to On, these addresses are
reversed.
Note: This setting, On, is not supported in a 2104 Model TU3 or 2104 Model
DU3.
Orientation switch 8
This switch should be set to Off in a 2104 Model DU3, and to On in a 2104
Model TU3. Its setting determines the position of the Power and Check lights
on the 2104.
SCSI Bus Split switch 9
This switch controls the SCSI bus configuration. When the switch is set to Off,
the 2104 is configured as a single SCSI bus. When the switch is set to On, the
2104 is configured as a dual (or split) SCSI bus. In a 2104 Model DU3, with
this switch set to On and the SCSI address switch set to Off, the SCSI
addresses of the disk drives are 0, 1, 2, 3, 4, 5, 6, from left to center, on one
SCSI bus, and 8, 9, 10, 11, 12, 13, 14, from center to right, on the other SCSI
bus.
The default logical setting for this switch is Off.
Reserved 10
This switch should be set to Off.
Chapter 1. Using the 2104 Model DU3 or TU3 19
Identifying 2104 Disk Enclosures
The Box ID switch on the switch card is a 10-position rotary switch that indicates the ID
of the 2104. Its setting is used by the SES Inquiry command or the SAF-TE buffer 1
command.
Labels are supplied with the 2104 to enable the operator to identify each 2104. It is
recommended that the label that represents the ID set by the Box ID switch is attached
to the front of the 2104.
20 2104 Models DU3 and TU3 Operator’s Guide
Identifying Disk Drive Modules
You can identify a disk drive module by the serial number that appears on a label 1
on the front of the module:
This label also shows the size of the disk drive.
Each disk drive module also has a SCSI address that is related to its position in the
2104.
The addresses used are 0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13 and 14. These
addresses are usually assigned, in sequence, from left to right in the 2104 Model DU3
and from top to bottom in the 2104 Model TU3.
Figure 14. Disk Drive Module Label
Chapter 1. Using the 2104 Model DU3 or TU3 21
Security
On the 2104 Model TU3, a lock 1 on the front cover provides physical security for the
disk drives. You do not need to remove this cover during normal operations. If you lock
the cover, it cannot be opened for servicing without the use of the key. You do not need
to lock the cover for it to stay closed. The 2104 Model DU3 has no cover or lock.
Figure 15. 2104 Model TU3 Lock
22 2104 Models DU3 and TU3 Operator’s Guide
Ordering Cover Lock Keys
For protection against unauthorized key duplication, the cover lock is a high-security
lock. Keys (see Figure 16) for these locks are a factory-restricted series and duplicate
keys are not available through normal commercial channels. The serial number of the
lock is stamped on each key. Make a note of this number. The additional key supplied,
and your note of its number, should be stored in separate secure areas.
If you lose the key, or it becomes faulty, a replacement key can be purchased from the
Illinois Lock Company.
Their address is:
Illinois Lock Company,
301, W. Hintz Road,
Wheeling, Illinois 60090.
Phone:(800) 299-5880 (inside USA)
001-847-537-1800 (from outside USA)
FAX:001-847-537-1881
e-mail:illock@aol.com
website:www.illinoislock.com
Figure 16. Cover Lock Keys
Chapter 1. Using the 2104 Model DU3 or TU3 23
24 2104 Models DU3 and TU3 Operator’s Guide
Chapter 2. Adding Disk Drive Modules
This chapter describes how to add disk drive modules to a 2104 that is already installed
as part of your system.
You can do this only if:
1. You are authorized by your organization.
2. You have the correct disk drive module. Refer to Figure 17. This disk drive module
has, at the front, a handle 1 that can be pulled upward to open. The disk drive
module should also be the correct size. The size of a disk drive module is shown on
a label 2 at the front of the disk drive module. The label is shown in more detail in
Figure 14 on page 21.
Attention:
v Disk drive modules are fragile. Handle them with care. Keep them well away from
strong magnetic fields.
v Any slot that has no disk drive module installed must contain a dummy disk drive
module. The dummy module ensures that the correct airflow is maintained around
the disk drive modules in the other slots. If a slot remains empty, overheating might
occur.
You do not need to remove power from the 2104 when adding a disk drive module.
Figure 17. Disk Drive Module Handle and Label
25
After you have added a disk drive module to a 2104, you must add it into your system
software configuration by using the system programs. Refer to “Appendix A. Operating
with RISC Systems” on page 81.
26 2104 Models DU3 and TU3 Operator’s Guide
Before Adding a Disk Drive Module
1. Identify the slot for the new disk drive module by using the configuration planning
information for your system. A dummy disk drive module should be in the slot. Refer
to Figure 18.
Use the SCSI Device Identification and Removal service aid to set the disk drive
slot to Insert. If you then remove the dummy disk drive module from the identified
slot, an amber light on the backplane should be visible. This verifies that you have
selected the correct slot into which to add the new disk drive module. This light
goes off when a disk drive module is added. If it does not, it is visible as the amber
Check light on the disk drive module that you have added.
Figure 18. Dummy Disk Drive Module
Chapter 2. Adding Disk Drive Modules 27
2. If you are adding a disk drive module into a 2104 Model TU3, open the front cover
of the unit:
a. If necessary, unlock the cover using the key provided.
b. Grip the right-hand side of the cover and pivot the cover to the left.
Figure 19. Disk Drive Modules in a 2104 Model TU3
28 2104 Models DU3 and TU3 Operator’s Guide
To Add a Disk Drive Module
The diagrams in the following instructions show a 2104 Model DU3. The steps for
adding a disk drive module into a 2104 Model TU3 are the same, but the parts are
turned through 90 degrees.
1. Pull the dummy module out of the 2104.
Attention: Do not leave the slot empty for more than 30 minutes. If you do, the
2104 and its disk drive modules might overheat and be damaged.
Figure 20. Removing a Dummy Module
Chapter 2. Adding Disk Drive Modules 29
2. With one hand supporting the base of the disk drive module and the other hand
holding the handle 1, insert the module and push it into the slot.
When the handle touches the front surface of the enclosure, the module stops.
However, the module is not yet fully home.
Figure 21. Adding a Disk Drive Module
30 2104 Models DU3 and TU3 Operator’s Guide
3. While continuing to push the disk drive module into the slot, slowly close the handle
1 until it stops with a click. This action pushes the module fully home.
Attention: The fans might decrease in speed.
4. Verify that the disk drive module that you have just installed is aligned with the sides
of the 2104, and that there is no gap between this module and the modules that are
next to it.
Verify also that the front edge of this disk drive module aligns with the front edges of
the modules next to it.
If the disk drive module is not correctly aligned, remove it (see step 2 on page 39)
and reinstall it (steps 2 through 3 above).
Figure 22. Closing the Handle of a Disk Drive Module
Chapter 2. Adding Disk Drive Modules 31
5. If the amber check light 2 comes on, first verify that the new disk drive module
has been correctly installed, and then run diagnostics in Problem Determination
mode to check the error log. See “Appendix A. Operating with RISC Systems” on
page 81.
Note: The green activity light 1 does not come on until disk activity begins.
Figure 23. Disk Drive Module Lights
32 2104 Models DU3 and TU3 Operator’s Guide
6. If you have just added a disk drive module to a 2104 Model TU3, close the cover
and, if required, lock it with the key provided.
7. Add the new disk drive module into your system software configuration, using the
system programs. Refer to “Appendix A. Operating with RISC Systems” on page 81.
Figure 24. Closing the Cover of a 2104 Model TU3
Chapter 2. Adding Disk Drive Modules 33
34 2104 Models DU3 and TU3 Operator’s Guide
Chapter 3. Exchanging FRUs
This chapter describes what to do if you need to exchange a Field Replaceable Unit
(FRU) of your 2104 Model DU3 or TU3. A disk drive module, a fan-and-power-supply
assembly, a fan assembly, a SCSI interface card, a switch card, and a SCSI bus bridge
card are all FRUs in the 2104 Model DU3 or TU3.
If you need to exchange the SCSI bus bridge card, please refer to Expandable Storage
Plus: 2104 Models DU3 and TU3 Service Guide
35
Exchanging Disk Drive Modules
Sometimes, you can exchange a faulty disk drive module without the assistance of a
service representative. You can do this only if:
v You are authorized by your organization.
v You have the correct disk drive module. Refer to Figure 25. This disk drive module
has, at the front, a handle 1 that can be pulled upward to open. The disk drive
module should also be the correct size. The size of a disk drive module is shown on
a label 2 at the front of the disk drive module. The label is shown in more detail in
Figure 14 on page 21.
Attention:
v Disk drive modules are fragile. Handle them with care. Keep them well away from
strong magnetic fields.
v Any slot that has no disk drive module installed must contain a dummy disk drive
module. The dummy module ensures that the correct airflow is maintained around
the disk drive modules in the other slots. If a slot remains empty, overheating might
occur.
You do not need to remove power from the 2104 when exchanging a disk drive module.
After you have exchanged a disk drive module in a 2104, you must restore it into your
system by using the system programs.
Figure 25. Disk Drive Module Handle and Capacity
36 2104 Models DU3 and TU3 Operator’s Guide
Before Exchanging a Disk Drive Module
1. Use the system error log to identify the faulty disk drive module. “Appendix A.
Operating with RISC Systems” on page 81 describes how to do this if you have a
RISC-based system.
2. Find the failing disk drive module by using the location code supplied in the error
log.
3. Use the SCSI Device Identification and Removal service aid to set Remove on the
failing disk drive module. (“Appendix A. Operating with RISC Systems” on page 81
describes how to do this if you have a RISC-based system.)
4. Check the lights on this disk drive module. (“Disk Drive Module Lights” on page 8
describes where these are, and how to see them.)
If the Check light is on, exchange the disk drive module for a new one using the
procedure described in the following section.
Chapter 3. Exchanging FRUs 37
To Exchange a Disk Drive Module
1. If the failing disk drive module is in a 2104 Model TU3, open the front cover of the
2104:
a. If necessary, unlock the cover, using the key provided.
b. Grip the cover at the right and pivot it to the left.
Figure 26. Disk Drive Modules in a 2104 Model TU3
38 2104 Models DU3 and TU3 Operator’s Guide
The diagrams in the following instructions show a 2104 Model DU3. The steps for
exchanging a disk drive module in a 2104 Model TU3 are the same, but the parts
are turned through 90 degrees.
2. Pull upward on the trigger 1 and pull outward the handle 2 of the failing disk
drive module. This action moves the module partly out of the slot. Wait for about 20
seconds to allow the disk drive motor time to spin down.
Figure 27. Opening the Disk Drive Module Handle
Chapter 3. Exchanging FRUs 39
3. Carefully pull the module out of the 2104. Support the module with your other hand
as you take it from the 2104.
Attention:The fan speed might increase. Do not leave the slot empty for more than
30 minutes. If you do, the 2104, and its disk drive modules, might overheat and be
damaged.
Figure 28. Removing a Disk Drive Module
40 2104 Models DU3 and TU3 Operator’s Guide
4. With one hand supporting the base of the replacement module and the other hand
holding the handle 1, insert the module and push it into the slot.
When the handle touches the front surface of the enclosure, the module stops. Note
that the disk drive module is not yet fully home.
Figure 29. Inserting a Disk Drive Module
Chapter 3. Exchanging FRUs 41
5. While continuing to push the disk drive module into the slot, slowly close the
handle 1 until it stops with a click. This action pushes the module fully home.
Attention:The fans might decrease their speed.
6. Verify that the disk drive module that you have just installed is aligned with the sides
of the 2104 and there is no gap between this module and the modules that are next
to it.
Verify also that the front edge of this disk drive module aligns with the front edges of
the modules next to it.
If the disk drive module is not correctly aligned, remove it (see step 2 on page 39)
and reinstall it (steps 4 through 6 above).
Figure 30. Closing the Handle
42 2104 Models DU3 and TU3 Operator’s Guide
7. If the amber check light 2 comes on, first verify that the new disk drive module
has been correctly installed, then run diagnostics in Problem Determination mode to
check the error log. See “Appendix A. Operating with RISC Systems” on page 81.
Note: The green activity light 1 comes on for about one second after the disk
drive module has been installed. The light then goes out and comes on again
only when disk I/O activity begins.
Figure 31. Disk Drive Module Lights
Chapter 3. Exchanging FRUs 43
8. If you have exchanged a disk drive module in a 2104 Model TU3, close the cover
and, if required, lock it with the key provided.
9. Restore the disk drive module into your system, using the system programs. Refer
to “Appendix A. Operating with RISC Systems” on page 81.
Figure 32. Closing the Cover of a 2104 Model TU3
44 2104 Models DU3 and TU3 Operator’s Guide
Exchanging a Fan-and-Power-Supply Assembly
You can exchange a faulty fan-and-power-supply assembly if:
v You have a correct spare fan-and-power-supply assembly or fan assembly.
v Another fan-and-power-supply assembly is still present in the 2104.
Note: If your 2104 contains just one fan-and-power-supply assembly (and one fan
assembly), refer to Expandable Storage Plus: 2104 Models DU3 and TU3
Service Guide for more guidance about replacing the fan-and-power-supply
assembly.
You do not need to remove power from the 2104 when exchanging a
fan-and-power-supply assembly.
Before Exchanging a Fan-and-Power-Supply Assembly
v Check the lights on this fan-and-power-supply assembly (“Fan-and-Power-Supply
Assembly and Fan Assembly Lights and Switch” on page 11 describes these lights
and how to see them.). Ensure that the DC On/Standby switch is set to On.
If the CHK light is on, exchange the fan-and-power-supply assembly by using the
procedure in the following section.
Figure 33. A Fan-and-Power-Supply Assembly
Chapter 3. Exchanging FRUs 45
To Exchange a Fan-and-Power-Supply Assembly
Note: The illustrations in the following instructions show a 2104 Model DU3. The steps
for exchanging a fan-and-power-supply assembly in a 2104 Model TU3 are the
same, but the parts are turned through 90 degrees.
1. Set the DC On/Standby switch 1 on the fan-and-power-supply assembly to
Standby.
Note: The DC PWR light might stay on for a short time. Wait until it is off before
you go to the next step.
Figure 34. Fan-and-Power-Supply DC On/Standby Switch
46 2104 Models DU3 and TU3 Operator’s Guide
2. Unplug the mainline power cable from the failing fan-and-power-supply assembly.
CAUTION:
Ensure that the mainline power cable has been removed from the failing
fan-and-power-supply before continuing.
Figure 35. Unplugging the Mainline Power Cable
Chapter 3. Exchanging FRUs 47
3. Unscrew the two thumbscrews on the fan-and-power-supply assembly.
Figure 36. Unscrewing the Thumbscrews on the Fan-and-Power-Supply Assembly
48 2104 Models DU3 and TU3 Operator’s Guide
4. Pull the fan-and-power-supply assembly from the 2104. The speed of the other fan
in the 2104 might increase.
Attention: Do not leave the space empty for more than 30 minutes; if you do, the
2104, and the disk drive modules, might overheat and be damaged.
CAUTION:
Do not insert hands or tools into the empty space where the
fan-and-power-supply assembly was.
Figure 37. Removing the Fan-and-Power-Supply Assembly
Chapter 3. Exchanging FRUs 49
5. Push the replacement fan-and-power-supply assembly fully into the 2104.
DANGER
Do not plug a power cable into the fan-and-power-supply assembly until
the assembly is fully home and its thumbscrews are fully tightened.
Figure 38. Inserting a Replacement Fan-and-Power-Supply Assembly
50 2104 Models DU3 and TU3 Operator’s Guide
6. Tighten the two thumbscrews by turning them clockwise.
Figure 39. Tightening the Screws on a Fan-and-Power-Supply Assembly
Chapter 3. Exchanging FRUs 51
7. Plug the mainline power cable into the new fan-and-power-supply assembly. Verify
that the green AC PWR light 1 comes on immediately.
Figure 40. Plugging in the Mainline Power Cable
52 2104 Models DU3 and TU3 Operator’s Guide
8. Set the DC On/Standby switch 1 on the new fan-and-power-supply assembly to
On.
Note: The fan speeds might decrease.
Figure 41. Fan-and-Power-Supply DC On/Standby Switch
Chapter 3. Exchanging FRUs 53
9. Verify that the green DC PWR light 1 comes on within five seconds.
If it does not, or the amber CHK light 2 comes on, verify that the new
fan-and-power-supply assembly is correctly installed.
If it is, report the problem to your service representative.
Figure 42. Fan-and-Power-Supply Lights
54 2104 Models DU3 and TU3 Operator’s Guide
Exchanging a Fan Assembly
You can exchange a faulty fan assembly if:
v You have a correct spare fan assembly.
v Another fan-and-power-supply assembly is present in the 2104.
You do not need to remove power from the 2104 when exchanging a fan assembly.
Before Exchanging a Fan Assembly
v Look at the Check light on this fan assembly (see “Fan-and-Power-Supply Assembly
and Fan Assembly Lights and Switch” on page 11).
If the Check light is flashing, exchange the fan assembly by using the procedure in
the following section.
Figure 43. Fan Assembly
Chapter 3. Exchanging FRUs 55
To Exchange a Fan Assembly
The diagrams in the following instructions show a 2104 Model DU3. The steps for
exchanging a fan assembly in a 2104 Model TU3 are the same, but the parts are
turned through 90 degrees.
1. Verify that the amber CHK light 1 is flashing.
Figure 44. Fan Assembly Light
56 2104 Models DU3 and TU3 Operator’s Guide
2. Unscrew the two thumbscrews.
Figure 45. Unscrewing the Thumbscrews on a Fan Assembly
Chapter 3. Exchanging FRUs 57
3. Pull the fan assembly from the 2104.
Note: The speed of the other fan in the 2104 might increase.
Attention: Do not leave the space empty for more than 30 minutes; if you do, the
2104, and its disk drive modules, might overheat and be damaged.
CAUTION:
Do not insert hands or tools into the empty fan assembly opening.
Figure 46. Pulling out a Fan Assembly
58 2104 Models DU3 and TU3 Operator’s Guide
4. Push the replacement fan assembly fully into the 2104.
Note: The speed of the other fan in the 2104 might decrease.
Figure 47. Inserting a Fan Assembly
Chapter 3. Exchanging FRUs 59
5. Tighten the two thumbscrews.
Figure 48. Tightening the Thumbscrews
60 2104 Models DU3 and TU3 Operator’s Guide
6. Verify that the amber CHK light 1 is off.
Figure 49. Fan Assembly Light
Chapter 3. Exchanging FRUs 61
Exchanging a SCSI Interface Card Assembly
Attention: The SCSI interface card is sensitive to electrostatic discharge (ESD). Use
the tools and procedures defined by your organization to protect such parts.
You can exchange a faulty SCSI interface card assembly if:
v You are authorized by your organization.
v You have a correct spare SCSI interface card assembly.
You do not need to remove power from the 2104 when exchanging an SCSI interface
card assembly.
Figure 50. SCSI Interface Card Assembly
62 2104 Models DU3 and TU3 Operator’s Guide
Before Exchanging the SCSI Interface Card Assembly
Check the lights on the SCSI interface card.
If the FAULT light is on, exchange the SCSI interface card assembly using the
procedure in the following section.
Figure 51. FAULT Light on an SCSI Interface Card Assembly
Chapter 3. Exchanging FRUs 63
To Exchange a SCSI Interface Card Assembly
The diagrams in the following instructions show a 2104 Model DU3. The steps for
exchanging a SCSI interface card assembly in a 2104 Model TU3 are the same, but the
parts are turned through 90 degrees.
1. Unscrew and unplug the SCSI cable.
Figure 52. Removing the SCSI Cable
64 2104 Models DU3 and TU3 Operator’s Guide
2. Unscrew the thumbscrew 1 on the failing SCSI interface card.
Figure 53. Unscrewing the SCSI Interface Card
Chapter 3. Exchanging FRUs 65
3. Open the lever fully by moving it to the right. This action unplugs the card assembly
from the backplane.
Figure 54. Opening the Lever
66 2104 Models DU3 and TU3 Operator’s Guide
4. Pull the SCSI interface card assembly out of the 2104.
CAUTION:
Do not insert hands or tools into the empty space above the
fan-and-power-supply assembly.
Figure 55. Pulling the SCSI Interface Card Out
Chapter 3. Exchanging FRUs 67
5. Push the replacement SCSI interface card assembly into the 2104.
Figure 56. Pushing the Replacement SCSI Interface Card In
68 2104 Models DU3 and TU3 Operator’s Guide
6. Move the lever to the left.
CAUTION:
As you push the assembly fully home, the lever automatically moves towards
its closed position. Ensure that your fingers do not become pinched between
the lever and the assembly.
Figure 57. Closing the Lever
Chapter 3. Exchanging FRUs 69
7. Tighten the thumbscrew.
Figure 58. Tightening the Thumbscrew
70 2104 Models DU3 and TU3 Operator’s Guide
8. Plug the SCSI connector into the SCSI interface card, and tighten the retaining
screws.
Figure 59. Plugging in the SCSI Cable
Chapter 3. Exchanging FRUs 71
9. Check that the green TERM POWER light 1 comes on within five seconds. It
indicates that the TERM POWER from a host bus adapter is activated. If it does
not, check that the new SCSI interface card has been correctly installed.
Figure 60. SCSI Interface Card Assembly TERM POWER Light
72 2104 Models DU3 and TU3 Operator’s Guide
Exchanging the Switch Card Assembly
Attention: The switch card is sensitive to electrostatic discharge (ESD). Use the tools
and procedures defined by your organization to protect such parts.
You can exchange a faulty switch card assembly if:
v You are authorized by your organization.
v You have a correct spare switch card assembly.
You do not need to remove power from the 2104 when exchanging the switch card.
Before Exchanging the Switch Card Assembly
Notes:
1. The settings of the switches on the replacement switch card must be the same as
those on the original card. See “Switch Card” on page 16 for details of the switch
settings.
2. If the switch card is removed while the 2104 is on, the 2104 might be powered off.
Figure 61. Switch Card
Chapter 3. Exchanging FRUs 73
To Exchange the Switch Card Assembly
The diagrams in the following instructions show a 2104 Model DU3. The steps for
exchanging the switch card assembly in a 2104 Model TU3 are the same, but the parts
are turned through 90 degrees.
1. Unscrew the thumbscrew1 on the failing switch card assembly.
Figure 62. Unscrewing the Thumbscrew on the Switch Card Assembly
74 2104 Models DU3 and TU3 Operator’s Guide
2. Pull the lever up. This action unplugs the switch card assembly from the backplane.
Note: If the 2104 is switched on, it might switch off when you unplug the switch
card assembly.
Figure 63. Opening the Lever
Chapter 3. Exchanging FRUs 75
3. Pull the switch card assembly out of the 2104.
CAUTION:
Do not insert hands or tools into the empty space between the power supply
assemblies.
Figure 64. Remove the Switch Card
76 2104 Models DU3 and TU3 Operator’s Guide
4. Push the replacement switch card assembly into the disk enclosure.
Figure 65. Pushing the Switch Card Assembly into the 2104
Chapter 3. Exchanging FRUs 77
5. Push the lever down.
Figure 66. Closing the Lever
78 2104 Models DU3 and TU3 Operator’s Guide
6. Tighten the thumbscrew.
Figure 67. Tightening the Thumbscrew
Chapter 3. Exchanging FRUs 79
Reporting Problems
When you report a problem that has occurred in a 2104, it is important that you report
the following information, in addition to the error information (the SRN) given to you by
your operating system:
The machine type number 2104
The machine model number DU3 or TU3
The machine serial number xx-xxxxx
The serial number can be read from the label at the bottom right of the front of the
2104, or from the label on the back of the 2104.
80 2104 Models DU3 and TU3 Operator’s Guide
Appendix A. Operating with RISC Systems
This appendix contains information that is specific to Expandable Storage Plus disk
enclosures that are attached to RISC systems.
SCSI Adapters and Cables
An RS/6000 or IBM
server
pSeries computer uses one of the following SCSI adapters to connect to the 2104
Model DU3 or 2104 Model TU3:
v PCI 3-Channel Ultra2 SCSI RAID Adapter (Feature Code 2494 Type Number 4-T)
v PCI Dual Channel Ultra2 SCSI Adapter (Feature Code 6205 Type Number 4-R)
v PCI Single-Ended Ultra SCSI Adapter (Feature Code 6206 Type Number 4-K)
v PCI Single-Ended Plus SCSI RAID Adapter (Feature Code 6208 Type Number 4_A)
SCSI cables supported by these adapters are:
Adapter (Feature
Code)
Cable Length Feature Code Part Number
6205 or 2494 20 meters 9320 09L3307
10 meters 9310 09L3305
5 meters 9305 09L3303
3 meters 9303 09L3301
1 meter 9301 09L3299
6206 3 meters 9313 09L3309
2415 3 meters 9323 09L3311
SCSI Service Aids
Service aids are available on the using system to help you service the 2104. Service
aids described in the Expandable Storage Plus: 2104 Models DU3 and TU3 Service
Guide are:
v Format Media
v Certify Media
v SCSI Device Identification and Removal
v Download Microcode
Identifying a 2104 and its Disk Drive Modules
You can identify a 2104 and the disk drive modules installed in it either by the location
code contained in system messages that refer to that unit, or by using the SCSI Device
Identification and Removal service aid.
81
See “Location Codes” in the operator guide for your system for general information
about location codes.
Problem Determination
For some problems, your system diagnostics tell you to check the lights on the system
to determine which part, if any, has a fault. “Controls and Lights” on page 7 of this book
shows the position of the lights. The diagnostic programs tell you what to report to your
service representative. (“Reporting Problems” on page 80 describes other information
you should report.)
Checking the SCSI Error Log
You can check the error log by running diagnostics in Problem Determination mode.
They check the error log and present messages about any problems that have been
logged. Your system manuals describe how to run the diagnostic programs.
Configuring a 2104 to the using system
Use the cfgmgr command to configure or reconfigure a 2104 to the using system.
Unconfiguring a 2104 from the using system
To remove a 2104 from the using system, use the following command:
rmdev -l [enclosurenumber] -d
where [enclosurenumber] is the enclosure device that was generated by the cfgmgr
command (for example, [ses0], [ses1], [ses2]).
Remove the ses_healthcheck job from the system cron table.
Web Support Page
You can find the web support page at:
http://www.rs6000.ibm.com/support
Related Publications
v Diagnostic Information for Micro Channel Bus Systems, SA23-2765
v Diagnostic Information for Multiple Bus Systems, SA38-0509
v Site and Hardware Planning Information,SA38-0508
v Adapters, Devices and Cable Information for Micro Channel Bus Systems,
SA23-2764
v Adapters, Devices and Cable Information for Multiple Bus Systems, SA38-0516
82 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices
This appendix contains the danger and caution notices that are used in the various
books relating to the Expandable Storage Plus: 2104 Models DU3 and TU3.
The notices are shown in English and in various other languages.
Danger notice A danger notice calls attention to a situation that is potentially lethal or extremely
hazardous to people.
Caution notice A caution notice calls attention to a situation that is potentially hazardous to people
because of some existing condition.
Always use safe working procedures whenever you work on machines. Use your own
judgment to identify safety conditions that these notices do not describe.
Danger Notices
DANGER
In the following step you are going to remove the power cables.
These cables are live if the rack power distribution unit or
uninterruptable power supply (UPS) unit is still switched on.
DANGER
Do not try to open the covers of the fan-and-power-supply
assembly.
DANGER
Do not plug a power cable into the fan-and-power-supply
assembly until the assembly is fully home and its thumbscrews
are fully tightened.
DANGER
An electrical outlet that is not correctly wired could place
hazardous voltage on metal parts of the system or the devices
that attach to that system.
It is the customers’s responsibility to ensure that the outlet is
correctly wired and grounded to prevent an electrical shock.
During an electrical storm, do not disconnect cables for display
stations, printers, telephones, or station protectors for
communication lines.
83
Caution Notices
CAUTION:
This unit may have two linecords. To remove all power,
disconnect both linecords.
CAUTION:
This unit weighs 38.5 kg.
CAUTION:
Do not remove cover, do not service, no serviceable parts.
CAUTION:
Double pole/neutral fusing
CAUTION:
A ?Standby? condition is indicated by the symbol to the right of
?DC? directly above the switch, SW1. When SW1 is toggled to
the right position directly under the ?Standby? symbol, the
unit’s AC-power is not shut off.
CAUTION:
The stabilizer must be correctly attached to the bottom front of
the rack to prevent the rack from tipping forward while the 2104
Model DU3 is being removed from the rack.
Do not pull out or install any unit if a stabilizer is not attached
to the rack.
CAUTION:
The stabilizer must be correctly attached to the bottom front of
the rack to prevent the rack from tipping forward while the 2104
Model DU3 is being installed into the rack.
Do not remove or install any unit if a stabilizer is not attached
to the rack.
CAUTION:
A 2104 Model DU3 weighs up to 38.5 kg (85 lb) with the
maximum number of disk drive modules installed. Do not
attempt to lift the 2104 into the rack unless all the disk drive
modules have been removed.
CAUTION:
A 2104 Model DU3 weighs up to 38.5 kg (85 lb) with the
maximum number of disk drive modules installed. Do not
attempt to remove the 2104 from the rack unless all the disk
drive modules have been removed.
CAUTION:
Do not attempt to lift the 2104 by yourself. Ask another person
for aid.
CAUTION:
Do not insert hands or tools into the empty space that
contained the fan assembly.
84 2104 Models DU3 and TU3 Operator’s Guide
CAUTION:
Do not insert hands or tools into the empty space that
contained the fan-and-power-supply assembly.
CAUTION:
Do not insert hands or tools into the empty space that
contained the switch card assembly.
CAUTION:
Do not insert hands or tools into the empty space that
contained the SCSI interface card assembly.
CAUTION:
This product is equipped with a 3-wire power cable and plug for
the user’s safety. Use this power cable in conjunction with a
correctly grounded electrical outlet to avoid an electrical shock.
CAUTION:
Do not touch the power outlet or the power outlet face plate
with anything other than test probes before you have completed
this safety check.
CAUTION:
If the reading is not infinity, do not proceed. Make the necessary
corrections to the wiring before you continue. Do not switch on
the branch circuit CB until all the above steps are satisfactorily
completed.
CAUTION:
A 2104 Model TU3 can weigh up to 54.5 kg (120 lb) with the
maximum number of disk drive modules installed. Do not
attempt to lift one without help from a second person.
CAUTION:
Do not use the handles of the fan or fan-and-power-supply
assemblies to carry the 2104. These handles are not intended to
support the weight of the unit.
CAUTION:
As you push the assembly fully home, the lever automatically
moves toward its closed position. Ensure that your fingers do
not become pinched between the lever and the assembly.
Appendix B. Translated Safety Notices 85
86 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 87
88 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 89
90 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 91
92 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 93
94 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 95
96 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 97
98 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 99
100 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 101
102 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 103
104 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 105
106 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 107
108 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 109
110 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 111
112 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 113
114 2104 Models DU3 and TU3 Operator’s Guide
Appendix B. Translated Safety Notices 115
116 2104 Models DU3 and TU3 Operator’s Guide
Index
A
AC PWR light, fan-and-power- supply assembly 12
Active light
SCSI interface card 15
Active light, SCSI interface card 15
activity light, disk drive module 9
B
Box ID switch
interface card 18
C
check light
disk drive module 9
disk enclosure 7
fan assembly 13
power supply assembly 12
check light, power supply assembly 12
configurations
deskside 5
rack-mounted 2
controls
disk drive module 8
Subsystem 7
cover locks 22
D
DC On/Standby switch
fan-and-power-supply assembly 12
DC PWR light, fan-and-power- supply assembly 12
Delay Motor Start switch
switch card 18
Delay Motor Start switch, interface card 18
disk drive module
adding 25
exchanging 36
identifying 21
lights 8
Drive Autostart switch
switch card 17
E
Enable Enclosure Services switch
switch card 18
Enable Enclosure Services switch, interface card 18
F
fan-and-power- supply assembly
lights 11
Power light 12
fan assembly
exchanging 55
lights 11
Fault light
SCSI interface card 15
Fault light, SCSI interface card 15
I
identifying disk drive modules 21
interface card
lights 14
K
keys, reordering 23
L
lights
2104 power 7
disk drive module 8
enclosure check 7
fan-and-power- supply assembly 11
fan assembly 11
interface card 14
Subsystem 7
switch card 16
locks
deskside unit covers 22
LVD/SE light
SCSI interface card 15
LVD/SE light, SCSI interface card 15
M
model number, machine 80
N
numbers, machine 80
O
Orientation switch
switch card 19
P
Power Control switch
switch card 17
power light
2104 7
Power light
fan-and-power-supply assembly 12
Power–Good light
fan-and-power- supply assembly 12
power supply assembly 12
power-supply assembly
exchanging 45
power switch
fan-and-power- supply assembly 12
problems
dealing with 35
117
problems (continued)
reporting 80
R
Reset light
SCSI interface card 15
Reset light, SCSI interface card 15
Rotary switch
interface card 18
S
SCSI Address switch
switch card 19
SCSI Bus Split switch
switch card 19
SCSI Cable type
SCSI interface card 15
SCSI Interface Card
exchanging 62
security 22
Select Enclosure Services switch
switch card 18
Select Enclosure Services switch, interface card 18
serial number, machine 80
Subsystem
configuration 2, 5
controls 7
lights 7
power control 7
turning power on and off 7
switch card
lights 16
Switch Card
exchanging 73
system
identifying 80
T
TERM POWER light
SCSI interface card 15
TERM POWER light, SCSI interface card 15
turning power on and off 7
type number, machine 80
118 2104 Models DU3 and TU3 Operator’s Guide
Vos remarques sur ce document / Technical publication remark form
Titre / Title : Bull 2104 Model DU3 Expandable Storage Plus Operator’s Guide
Nº Reférence / Reference Nº : 86 A1 16EF 01 Daté / Dated : December 2000
ERREURS DETECTEES / ERRORS IN PUBLICATION
AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION
Vos remarques et suggestions seront examinées attentivement.
Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète.
Your comments will be promptly investigated by qualified technical personnel and action will be taken as required.
If you require a written reply, please furnish your complete mailing address below.
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
Remettez cet imprimé à un responsable BULL ou envoyez-le directement à :
Please give this technical publication remark form to your BULL representative or mail to:
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Technical Publications Ordering Form
Bon de Commande de Documents Techniques
To order additional publications, please fill up a copy of this form and send it via mail to:
Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à :
BULL CEDOC
ATTN / MME DUMOULIN
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Managers / Gestionnaires :
Mrs. / Mme : C. DUMOULIN +33 (0) 2 41 73 76 65
Mr. / M : L. CHERUBIN +33 (0) 2 41 73 63 96
FAX : +33 (0) 2 41 73 60 19
E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr
Or visit our web sites at: / Ou visitez nos sites web à:
http://www.logistics.bull.net/cedoc
http://www–frec.bull.com http://www.bull.com
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
[ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
PHONE / TELEPHONE : FAX :
E–MAIL :
For Bull Subsidiaries / Pour les Filiales Bull :
Identification:
For Bull Affiliated Customers / Pour les Clients Affiliés Bull :
Customer Code / Code Client :
For Bull Internal Customers / Pour les Clients Internes Bull :
Budgetary Section / Section Budgétaire :
For Others / Pour les Autres :
Please ask your Bull representative. / Merci de demander à votre contact Bull.
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 16EF 01
ORDER REFERENCE
PLACE BAR CODE IN LOWER
LEFT CORNER
Utiliser les marques de découpe pour obtenir les étiquettes.
Use the cut marks to get the labels.
86 A1 16EF 01
2104 Model DU3
Expandable
Storage Plus
Operator’s Guide
86 A1 16EF 01
2104 Model DU3
Expandable
Storage Plus
Operator’s Guide
86 A1 16EF 01
Disk drives for the E2-700, E2-700T, E2-
705, E2-705T, E4-700, E4-700T or E4-
705
REFERENCE
86 A1 02FH 01
ESCALA Power7
Disk drives for the E2-700, E2-700T, E2-705, E2-705T,
E4-700, E4-700T or E4-705
The ESCALA Power7 publications concern the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 02FH 01
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying,
distributing, modifying, and making derivative works.
Copyright Bull SAS 2011
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this book are
invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you are invited
to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or
f or incidental or consequential damages in connection with the use of this material.
Contents
Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installing a disk drive or solid-state drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or
logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 10
Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 10
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 11
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 12
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the Linux
system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 13
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 15
Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 16
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 17
Verifying that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 18
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the Virtual I/O
Server system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . 20
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 22
Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 22
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 23
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 24
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the IBM i system
or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 27
Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 28
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 29
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 30
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or
logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 33
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 35
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Removing a disk drive or solid-state drive from the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Removing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or logical partition
power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 38
Preparing the system for removing a disk drive by using the HSM. . . . . . . . . . . . . . . . 39
© Copyright IBM Corp. 2010, 2011 iii
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 40
Removing the disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C using Linux kernel 2.6 with the Linux
system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 41
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 43
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 46
Removing a disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C or an expansion unit controlled by
IBMi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 48
Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 49
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 51
Removing a disk drive using hot-spare in an IBM i system or logical partition from an 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 52
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 54
Removing a SCSI disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . 55
Preparing to remove the SCSI disk drive by using the HSM . . . . . . . . . . . . . . . . . . 55
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 56
Removing a disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or logical partition
power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 60
Replacing a disk drive or solid state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or logical partition
power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 65
Completing the replacing the disk drive by using the HSM . . . . . . . . . . . . . . . . . . 66
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C by using Linux kernel 2.6, with the
power to the Linux system or logical partition turned on . . . . . . . . . . . . . . . . . . . . 67
Preparing the system to replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . 68
Preparing to replace a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 69
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 69
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 70
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the IBM i system or logical
partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 72
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 73
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 74
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C using hot-spare in an IBM i system or
logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 75
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 75
Completing the procedure by using the HSM . . . . . . . . . . . . . . . . . . . . . . . 76
Replacing a SCSI disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . 77
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 77
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 78
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or logical partition
power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 80
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
iv Disk drives
Installing the external SAS port in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Removing the external SAS port from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Replacing the external SAS port in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
Internal disk drive sharing on an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system 93
Removing or installing a disk drive filler . . . . . . . . . . . . . . . . . . . . . 95
Removing a disk drive filler from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . 95
Installing a disk drive filler in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . 96
Removing or installing a disk drive bezel. . . . . . . . . . . . . . . . . . . . . 99
Removing a disk drive bezel from a disk drive or filler. . . . . . . . . . . . . . . . . . . . . 99
Installing a disk drive bezel in a disk drive or filler . . . . . . . . . . . . . . . . . . . . . 100
Disk drive locations and service indicators . . . . . . . . . . . . . . . . . . . 103
PCIe RAID and SSD SAS adapter locations . . . . . . . . . . . . . . . . . . . . . . . . 103
Disk drive locations and service indicators for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . 103
Disk drive locations and service indicators for the 5886 disk-drive enclosure . . . . . . . . . . . . . 105
Disk-drive locations and service indicators for the 5887 disk-drive enclosure . . . . . . . . . . . . . 107
Disk drive locations and service indicators for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures 108
Disk-drive locations and service indicators for the 5802 expansion unit . . . . . . . . . . . . . . . 110
Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 113
Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 113
Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 113
Removing a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . . 113
Preparing to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Removing the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Replacing the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 122
Rebuild data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . 122
Preparing to remove a disk-drive from a system or logical partition controlled by IBM i . . . . . . . . . 124
Determining the IBM i disk-drive protection status. . . . . . . . . . . . . . . . . . . . . . 125
Configuring a disk drive in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 126
Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . . 128
Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 129
Replacing a SAS conduit card in a 5802 expansion unit with the power turned off . . . . . . . . . . . 130
Removing a SAS conduit card from a 5802 expansion unit with the power turned off . . . . . . . . . . 137
Replacing a SAS expander card in a 5802 expansion unit with the power turned off. . . . . . . . . . . 146
Removing a SAS expander card from a 5802 expansion unit with power turned off . . . . . . . . . . . 146
Common procedures for installable features . . . . . . . . . . . . . . . . . 149
Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151
Identifying a part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155
Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155
Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 156
Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 156
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 157
Contents v
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 157
Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 157
Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 157
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 158
Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 159
Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 159
Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 159
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 159
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 159
Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 160
Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 160
Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 161
Starting a system that is not managed by a Hardware Management Console or or a Systems Director
Management Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 162
Starting a system or virtual server with the Systems Director Management Console. . . . . . . . . . . 163
Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 165
Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . . 165
Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 166
Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . . 167
Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 169
Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 169
Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 169
System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 171
Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 171
Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . 172
Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 172
Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. . . . . 173
Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 174
Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . 175
Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 177
Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 177
Removing and replacing covers for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 179
Removing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . 179
Installing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . 180
Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 181
Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 181
Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 182
Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 183
Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 184
Removing and replacing covers for the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC. . . . . . . . . 185
Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . 185
Removing the front cover with the cover removal tool. . . . . . . . . . . . . . . . . . . . 186
Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . . 191
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating
position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . . 193
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . . 194
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the service position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197
vi Disk drives
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199
Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 201
Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 201
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 201
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 202
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position . . . . . . . . 204
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . 204
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . 205
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 207
Placing the 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 207
Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 209
Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 211
Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 212
Disconnecting the power cords from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 213
Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . . . . 214
Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 217
Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 217
Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 219
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 220
Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 220
Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 221
Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 222
Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . . 223
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 223
Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . . 223
Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . . 224
Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . . 224
Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . . 225
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 225
Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . . 225
Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . . 226
Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 226
Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232
Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 235
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237
Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . . 237
Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 243
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 247
Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 253
Deactivating a system attention LED or partition LED using the management console . . . . . . . . . . 253
Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . . 254
Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 254
Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 255
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
Contents vii
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265
viii Disk drives
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010, 2011 ix
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
x Disk drives
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
Serial-Attached SCSI RAID Enablement in
E2-700, E2-700T, E2-705, E2-705T, E4-
700, E4-700T or E4-705
REFERENCE
86 A1 90FF 03
ESCALA Power7
Serial-Attached SCSI RAID Enablement in E2-700, E2-
700T, E2-705, E2-705T, E4-700, E4-700T or E4-705
The ESCALA Power7 publications concern the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 90FF 03
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying,
distributing, modifying, and making derivative works.
Copyright Bull SAS 2011
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this book are
invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you are invited
to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or
f or incidental or consequential damages in connection with the use of this material.
Contents
Safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
SAS RAID enablement and cache battery pack for the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
What's new in SAS RAID enablement . . . . . . . . . . . . . . . . . . . . . . . 3
Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC 5618). . . . . . . . . . 5
Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) . . . . . 9
Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC
EJ01) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Removing and replacing SAS RAID adapters and batteries . . . . . . . . . . . . . 23
Removing and replacing the disk drive backplane . . . . . . . . . . . . . . . . . . . . . . 23
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane . . . . . . . . . . . 24
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane . . . . . . . . . . . 29
Removing and replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane. . . . . . . . . 33
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane . . . . . . . . . . . . 33
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane. . . . . . . . . . . . . 44
8202-E4B or 8205-E6B SAS subsystem service considerations . . . . . . . . . . . . . . . . . . . 59
Service considerations for the cache battery pack. . . . . . . . . . . . . . . . . . . . . . 60
Removing the 8202-E4B or 8205-E6B RAID enablement card . . . . . . . . . . . . . . . . . . . 61
Replacing the 8202-E4B or 8205-E6B RAID enablement card . . . . . . . . . . . . . . . . . . . 64
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller . . . . . . . . . 66
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller . . . . . . . . . 69
Removing and replacing the cache battery card . . . . . . . . . . . . . . . . . . . . . . . 70
Removing and replacing a cache battery pack . . . . . . . . . . . . . . . . . . . . . . . . 75
Common procedures for installable features . . . . . . . . . . . . . . . . . . . 79
Before you begin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Identifying a part . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Control panel LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . 83
Locating a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . 83
Activating the indicator light for the failing part . . . . . . . . . . . . . . . . . . . . . 83
Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 83
Identifying a failing part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 84
Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 84
Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 84
Identifying a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 85
Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 85
Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . 85
Activating the indicator light for the failing part . . . . . . . . . . . . . . . . . . . . . 86
Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 86
Locating a failing part in a Virtual I/O Server system or logical partition. . . . . . . . . . . . . . 86
Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . 86
Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Starting a system that is not managed by a Hardware Management Console or or a Systems Director
Management Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . 88
Starting a system or virtual server with the Systems Director Management Console . . . . . . . . . . 89
© Copyright IBM Corp. 2010, 2011 iii
Stopping a system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . 89
Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . 89
Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . 90
Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . 91
Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 91
Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 92
Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . . 92
Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . 93
Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . 94
Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . 95
Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 96
Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . 98
Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 98
Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 100
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 100
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 101
Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 103
Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 105
Installing a feature by using the Hardware Management Console . . . . . . . . . . . . . . . . . 105
Removing a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 106
Replacing a part by using the Hardware Management Console. . . . . . . . . . . . . . . . . . 106
Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . 107
Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . 109
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . 110
Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . 110
Verifying an installed part by using stand-alone diagnostics . . . . . . . . . . . . . . . . . . 111
Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . 112
Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . 113
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . 113
Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . 113
Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . 114
Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . 114
Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . 115
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . 115
Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . 115
Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . 116
Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server
tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Verifying a repair . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . 123
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . 125
Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . 131
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . 135
Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
Deactivating a system attention LED or partition LED using the management console . . . . . . . . . 139
Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . 140
Deactivating a system attention LED or logical partition LED using the Advanced System Management
Interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . 141
Installing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 142
Replacing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 142
Removing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 143
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
iv Power Systems: Serial-attached SCSI RAID enablement
Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153
Contents v
vi Power Systems: Serial-attached SCSI RAID enablement
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010, 2011 vii
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
viii Power Systems: Serial-attached SCSI RAID enablement
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Safety notices ix
CAUTION:
Removing components from the upper positions in the rack cabinet improves rack stability during
relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a
room or building:
v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack
cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you
received it. If this configuration is not known, you must observe the following precautions:
– Remove all devices in the 32U position and above.
– Ensure that the heaviest devices are installed in the bottom of the rack cabinet.
– Ensure that there are no empty U-levels between devices installed in the rack cabinet below the
32U level.
v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from
the suite.
v Inspect the route that you plan to take to eliminate potential hazards.
v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the
documentation that comes with your rack cabinet for the weight of a loaded rack cabinet.
v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.).
v Ensure that all devices, shelves, drawers, doors, and cables are secure.
v Ensure that the four leveling pads are raised to their highest position.
v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement.
v Do not use a ramp inclined at more than 10 degrees.
v When the rack cabinet is in the new location, complete the following steps:
– Lower the four leveling pads.
– Install stabilizer brackets on the rack cabinet.
– If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest
position to the highest position.
v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack
cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent.
Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the
pallet.
(R002)
(L001)
(L002)
x Power Systems: Serial-attached SCSI RAID enablement
(L003)
or
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser
product. Consult the label on each part for laser certification numbers and approval information.
CAUTION:
This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive,
DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information:
v Do not remove the covers. Removing the covers of the laser product could result in exposure to
hazardous laser radiation. There are no serviceable parts inside the device.
v Use of the controls or adjustments or performance of procedures other than those specified herein
might result in hazardous radiation exposure.
(C026)
Safety notices xi
CAUTION:
Data processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical
fiber cable or open receptacle. (C027)
CAUTION:
This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
CAUTION:
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following
information: laser radiation when open. Do not stare into the beam, do not view directly with optical
instruments, and avoid direct exposure to the beam. (C030)
CAUTION:
The battery contains lithium. To avoid possible explosion, do not burn or charge the battery.
Do Not:
v ___ Throw or immerse into water
v ___ Heat to more than 100°C (212°F)
v ___ Repair or disassemble
Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local
regulations. In the United States, IBM has a process for the collection of this battery. For information,
call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003)
Power and cabling information for NEBS (Network Equipment-Building System)
GR-1089-CORE
The following comments apply to the IBM servers that have been designated as conforming to NEBS
(Network Equipment-Building System) GR-1089-CORE:
The equipment is suitable for installation in the following:
v Network telecommunications facilities
v Locations where the NEC (National Electrical Code) applies
The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed
wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the
interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as
intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation
from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect
these interfaces metallically to OSP wiring.
Note: All Ethernet cables must be shielded and grounded at both ends.
The ac-powered system does not require the use of an external surge protection device (SPD).
The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal
shall not be connected to the chassis or frame ground.
xii Power Systems: Serial-attached SCSI RAID enablement
SAS RAID enablement and cache battery pack for the
8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
You might need to configure a redundant array of independent disks (RAID) on your serial-attached SCSI
(SAS) disk drives in your system enclosure. Use these instructions to install features to enable RAID on
your system.
The following instructions will enable you to install SAS RAID enablement cards to prepare your system,
to use SAS disk drives in a RAID configuration.
To find information on installing adapters or configuring RAID on your system, see the following topics:
v To view the PCI adapters, see PCI adapters
v To view the SAS RAID controllers for AIX® operating system, see SAS RAID controllers for AIX
v To view the SAS RAID controllers for IBM i operating system, see SAS RAID controllers for IBM i and
the IBM i Information Center web site at http://www.ibm.com/systems/i/infocenter .
v To view the SAS RAID controllers for Linux operating system, see SAS RAID controllers for Linux
© Copyright IBM Corp. 2010, 2011 1
2 Power Systems: Serial-attached SCSI RAID enablement
What's new in SAS RAID enablement
Read about new or significantly changed information in SAS RAID enablement since the previous update
of this topic collection.
October 2011
v Added information for 8202-E4C and 8205-E6C servers.
© Copyright IBM Corp. 2010, 2011 3
4 Power Systems: Serial-attached SCSI RAID enablement
Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC
5618)
Use this procedure to install the parts included with feature Storage Backplane - 6 SFF Bays/SATA DVD.
This procedure is intended for installing new features.
Attention: If you are servicing a failing part, see the following service procedure:
v For the RAID storage controller, see “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
RAID/cache storage controller” on page 66 and “Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C RAID/cache storage controller” on page 69.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following websites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Fix Central.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For instructions, see “Installing a feature by using the Hardware Management Console” on page
105.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using
the Systems Director Management Console” on page 142.
If you do not have an HMC or an SDMC, complete the following procedure.
1. Complete the prerequisite tasks described in “Before you begin” on page 79.
2. Stop the system, as described in “Stopping a system or logical partition” on page 89.
3. Remove the front cover.
v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 92.
v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system” on page 92.
© Copyright IBM Corp. 2010, 2011 5
4. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
5. Remove the service access cover.
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
6. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
7. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
8. Install the disk drive backplane. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C disk drive backplane.
9. Install the disk drives.
6 Power Systems: Serial-attached SCSI RAID enablement
v For internal disk drive, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C system.
v For solid-state drive, see Installing a disk drive or solid-state drive.
10. Install the SATA DVD drive. For instructions, see Installing a Slimline media device in the 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C.
11. Replace the service access cover.
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
12. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
13. Replace the front cover.
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
14. Reconnect the power source.
15. Start the system as described in “Starting the system or logical partition” on page 87.
16. Verify the installed part:
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC 5618) 7
8 Power Systems: Serial-attached SCSI RAID enablement
Installing the 3x3 split-drive bays no RAID 5/6 support (FC
5631 and FC EJ02)
Use this procedure to install the parts included with feature 3x3 split-drive bays No RAID 5/6 support.
Note: The FC 5631 is for the 8202-E4B or 8205-E6B, and the FC EJ02 is for the 8202-E4C or 8205-E6C.
Attention: This procedure is intended for installing a split-drive bay capability.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following websites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Fix Central.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For instructions, see “Installing a feature by using the Hardware Management Console” on page
105
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using
the Systems Director Management Console” on page 142.
If you do not have an HMC or an SDMC, complete the following procedure.
1. Complete the prerequisite tasks described in “Before you begin” on page 79.
2. Stop the system, as described in “Stopping a system or logical partition” on page 89.
3. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
4. Remove the service access cover.
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
© Copyright IBM Corp. 2010, 2011 9
5. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in
“Identifying a part” on page 81.
6. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
7. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
8. Locate the RAID-10 storage controller slots.
10 Power Systems: Serial-attached SCSI RAID enablement
9. Align the RAID-10 storage controller (A) with the slot in the system.
Figure 1. Slot locations of RAID-10 storage controller (P1-C19) for 8202-E4B or 8205-E6B
Figure 2. Slot locations of RAID-10 storage controller (P1-C19) for 8202-E4C or 8205-E6C
Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) 11
10. Slide the RAID-10 storage controller into the system until the tab (B) locks into place as shown in the
following figure.
11. Install the disk drive backplane. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C disk drive backplane.
12. Install the disk drives.
v For internal disk drive, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C system.
v For solid-state drive, see Installing a disk drive or solid-state drive.
13. Install the SATA DVD drive. For instructions, see Installing a Slimline media device in the 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C.
14. Install the external SAS port. For instructions, see Installing the external SAS port on an 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system.
15. Replace the service access cover.
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
16. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
17. Replace the front cover.
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
Figure 3. Rack-mounted model RAID-10 storage controller replacement
12 Power Systems: Serial-attached SCSI RAID enablement
18. Reconnect the power source.
19. Start the system as described in “Starting the system or logical partition” on page 87.
20. Verify the installed part or repair.
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) 13
14 Power Systems: Serial-attached SCSI RAID enablement
Installing the Storage Backplane - 8 SFF Bays/175MB
RAID/Dual IOA (FC 5630 and FC EJ01)
Use this procedure to install the parts included with feature Storage Backplane - 8 SFF Bays/175MB
RAID/Dual IOA. This procedure is intended for installing new features.
Note: The FC 5630 is for the 8202-E4B or 8205-E6B, and the FC EJ01 is for the 8202-E4C or 8205-E6C.
Attention: If you are servicing a failing part, see the following service procedures:
v For the enablement card, see Removing enablement card and Replacing enablement card.
v For the RAID/cache storage controller, see Removing storage controller and Replacing storage
controller.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM Prerequisite. If the
required software is not installed, see the following web sites to download it, and then install it before
continuing:
v To download firmware and software updates and fixes, see Fix Central.
v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management
Console Support and downloads.
If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the
server. For instructions, see “Installing a feature by using the Hardware Management Console” on page
105.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using
the Systems Director Management Console” on page 142.
If you do not have an HMC or an SDMC, complete the following procedure.
1. Complete the prerequisite tasks described in “Before you begin” on page 79.
2. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in
“Identifying a part” on page 81.
3. Stop the system, as described in “Stopping a system or logical partition” on page 89.
4. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
© Copyright IBM Corp. 2010, 2011 15
5. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
6. Remove the service access cover.
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
7. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
8. For 8202-E4B or 8205-E6B system, locate the RAID enablement card slot (P1–C13), RAID/cache
storage controller slot (P1-C19), and cache battery card (P1-C14), and go to step 10 on page 17.
16 Power Systems: Serial-attached SCSI RAID enablement
9. For 8202-E4C or 8205-E6C system, locate the RAID/cache storage controller slot (P1-C19) and cache
battery card (P1-C14), and go to step 11 on page 18.
10. Install RAID enablement card for 8202-E4B or 8205-E6B, as follows:
Note: The 8202-E4C or 8205-E6C does not have the enablement card.
a. Align the RAID enablement card (A) with the slot (B) in the system.
Figure 4. Slot location for RAID enablement card, RAID/cache storage controller, cache battery card
Figure 5. Slot location for RAID/cache storage controller, cache battery card
Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 17
b. Slide the RAID enablement card into the system until the tab locks into place as shown in the
following figure.
11. Install RAID/cache storage controller as follows:
a. Align the RAID/cache storage controller (A) with the slot in the system.
b. Slide the RAID/cache storage controller into the system until the tab (B) locks into place as
shown in the following figure.
Figure 6. Rack-mounted model RAID enablement card replacement
18 Power Systems: Serial-attached SCSI RAID enablement
12. Install the cache battery card as follows:
a. Slide the cache battery card (A) into the mounting guides on the controller until it is seated in
the battery connector.
b. After the battery is seated in the connector, move the lever to the latched position to fully seat
the battery into the connector.
Figure 7. Rack-mounted model RAID/cache storage controller replacement
Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 19
13. Remove the disk drives.
v For internal disk drives, see Removing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C system.
v For solid-state drives, see Removing a disk drive or solid-state drive.
14. Remove the disk drives backplane [CCIN 2BD5]. For instructions, see Removing the 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane.
15. Install the high-function disk drive backplane [CCIN 2BD6]. For instructions, see Replacing the
8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane.
16. Replace the disk drives in to the high-function disk drive backplane.
v For internal disk drives, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C system.
v For solid-state drives, see Installing a disk drive or solid-state drive.
17. Replace the service access cover. See
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
18. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
19. Replace the front cover.
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
20. Reconnect the power source.
Figure 8. Rack-mounted model RAID cache battery card replacement
20 Power Systems: Serial-attached SCSI RAID enablement
21. Start the system as described in “Starting the system or logical partition” on page 87.
22. Verify the installed part:
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 21
22 Power Systems: Serial-attached SCSI RAID enablement
Removing and replacing SAS RAID adapters and batteries
You might need to remove or replace the SAS RAID adapters and batteries. This section includes
procedures for these tasks.
Note: Replacing this feature is a customer task. You can perform this task yourself, or contact an
authorized service provider to perform the task for you. You might be charged a fee by the authorized
service provider for this service.
Removing and replacing the disk drive backplane
Use these procedures to remove, replace, or install the disk drive backplane in the IBM Power 720
Express (8202-E4B and 8202-E4C) and IBM Power 740 Express (8205-E6B and 8205-E6C) servers.
The following figures show the disk drive backplanes of a rack-mounted system and a stand-alone
system.
Figure 9. Disk drive backplane of a rack-mounted system
© Copyright IBM Corp. 2010, 2011 23
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive
backplane
Use this procedure to remove a disk drive backplane from the server.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for removing the disk drive backplane from the server. For information about using the HMC to
remove the disk drive backplane, see Removing a part using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for removing the disk drive backplane from the server. For instructions, see Removing
a part using the Systems Director Management Console.
If you do not have an HMC or SDMC, complete the following steps to remove the disk drive backplane:
Attention: Ensure that you remove the control panel before you remove the disk drive backplane to
prevent damage to the control panel cables.
1. Identify the system that you will be working on by using the identify the system process to turn on
the system locate (blue) indicator. For more information, see Control panel LEDs.
2. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in
“Identifying a part” on page 81.
3. Complete the prerequisite tasks as described in “Before you begin” on page 79.
4. Stop the system, as described in “Stopping a system or logical partition” on page 89.
5. Remove the front cover:
v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 92.
Figure 10. Disk drive backplane of a stand-alone system
24 Power Systems: Serial-attached SCSI RAID enablement
v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system” on page 92.
6. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
7. Disconnect the power source from the system by unplugging the system, as described in
“Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on
page 103.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
8. Remove the service access cover:
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
9. Attach the wrist strap.
Removing and replacing SAS RAID adapters and batteries 25
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
10. If you need to replace the backplane because of an upgrade or system failure, complete the following
steps to remove the disk drives, depending on your backplane.
Note: If you remove a backplane that is populated with disk drives, the backplane will be heavy.
Ensure that you can safely complete the procedure.
a. To remove the disk drive from the backplane, perform the following steps:
Note: Record the location of each disk drive you remove so that the disk drive can be replaced
in the same location again.
1) Unlock the disk drive handle (A) and pull it toward you to unseat the disk drive.
2) Place one hand under the disk drive and carefully pull the drive out of the system as shown
in Figure 11.
11. Remove the media device by pressing in on the latch (A) and sliding the media device out of the
system, as shown in the following figure.
For information about removing the different types of media devices, see Removing and replacing
media devices in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C.
Figure 11. Removing the disk drive from the backplane
26 Power Systems: Serial-attached SCSI RAID enablement
12. Disconnect the cables from the back of the disk drive backplane. If you have connected an external
serial-attached SCSI (SAS) cable, ensure that you disconnect the SAS cable from the disk drive
backplane. For more information, see Installing the external SAS port on model 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C.
13. Unlock the disk drive backplane by pulling the handles (A) open from the center of the backplane as
shown in Figure 13 on page 28 and Figure 14 on page 28.
Figure 12. Removing the media device from the disk drive backplane
Removing and replacing SAS RAID adapters and batteries 27
14. Pull the backplane out of the server.
Figure 13. Removing the backplane that is populated with disk drives
Figure 14. Removing the backplane that is not populated disk drives
28 Power Systems: Serial-attached SCSI RAID enablement
If you removed the backplane as a part of another procedure, continue with that procedure. To replace
the backplane or install a new backplane, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
disk drive backplane.
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive
backplane
Use this procedure to replace a disk drive backplane.
If you are replacing an existing disk drive backplane as part of a service procedure and want to remove
the disk drive backplane, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive
backplane.
If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the
steps for replacing the disk drive backplane in the server. For instructions, see Exchanging a part using
the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
complete the steps for replacing the disk drive backplane in the server. For instructions, see Replacing a
part using the Systems Director Management Console.
If you do not have an HMC or SDMC, complete the following steps to replace the disk drive backplane:
1. Check for a protective cover that might be present on the connector pins, in the lower-middle part at
the rear of the backplane. If a protective cover is present, remove the cover.
2. Check for the locking tabs (A) shown in the following figures. If the tabs are in locked position, open
the tabs.
3. Carefully lift the backplane along two sides and align it with the slot in the server.
4. Insert the backplane firmly into the server.
5. Secure the backplane with the locking tabs (A), as shown in the following figures.
Removing and replacing SAS RAID adapters and batteries 29
6. Replace the disk drives, if you removed them earlier. Use the information that you recorded when
you removed the drives to install them in the correct locations. To replace the disk drives, perform
the following steps:
Figure 15. Installing the backplane that is not populated with disk drives in a rack-mounted model
Figure 16. Installing the backplane that is populated with disk drives in a rack-mounted model
30 Power Systems: Serial-attached SCSI RAID enablement
a. Hold the disk drive by the top and bottom edges as you position the drive and insert it into the
expansion unit. Do not hold the disk drive by the handle.
b. Slide the disk drive all the way into the system or expansion unit, and push the disk drive
handle in until it locks, as shown in the following figure.
7. Replace the Slimline media device by inserting it in the slot and pushing it in until the lever (B)
locks in place, as shown in the following figure.
For information about replacing the different types of media devices, see Removing and replacing
media devices in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C.
Figure 17. Installing the backplane disk drives
Removing and replacing SAS RAID adapters and batteries 31
8. If required, connect the cable to the backplane. For more information, see Installing the external SAS
port on an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C.
9. If required, install the RAID enablement card. For more information, see SAS RAID enablement and
cache battery pack for the model 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C.
10. Replace the service access cover:
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
11. For a rack-mounted system, place the system into the operating position as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
12. Connect the power cords to the system, as described in Connecting the power cords.
13. Replace the front cover:
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
14. Start the system or logical partition. See “Starting the system or logical partition” on page 87.
15. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part as described in Verifying a
repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Figure 18. Installing the media device
32 Power Systems: Serial-attached SCSI RAID enablement
Removing and replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system backplane
Learn how to remove or replace the system backplane.
The following procedures describe the removal and replacement of the system backplane.
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system
backplane
Learn how to remove the system backplane from a server to replace a failing part or as part of another
service procedure.
Notes:
v Before you begin this procedure, ensure that you have authority to log in as an authorized service
provider. The different authority levels are described in ASMI authority levels.
v Identify the system that you will be working on by using the identify the system process to turn on the
system locate (blue) indicator. For more information, see Control panel LEDs.
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the
system backplane. For instructions, see Exchanging a part using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the system backplane. For instructions, see Exchanging a part using the Systems Director
Management Console.
If you do not have an HMC or an SDMC, complete the following steps to remove the system backplane:
1. Access the Advanced System Management Interface (ASMI).
The Advanced System Management Interface (ASMI) is the interface to the service processor that is
required to perform general and administrator-level service tasks, such as reading service processor
error logs, reading vital product data, setting up the service processor, and controlling the system
power. The ASMI might also be referred to as the service processor menus. To access the ASMI, refer
to Accessing the Advanced System Management Interface. After you have accessed the ASMI,
continue with the next step.
2. Check and record the System Enclosure Type and Feature Code.
The enclosure serial number is a seven-digit number printed on a bar-coded label on the system
unit.
Note: The feature code number for 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane is
78AA-001.
3. Check and record the server Host Ethernet Adapter (HEA) device settings.
v For HMC:
a. Expand the Systems Management folder in the navigation tree pane.
b. Double click the Servers folder.
c. Select a server in the central panel.
d. On the Tasks menu, click Hardware Information > Adapters > Host Ethernet. The Host
Ethernet Adapters window opens.
e. Select the required physical port and click Configure. The HEA Physical Port Configuration
window opens.
f. Record the value in the Pending Port Group Multi-Core Scaling value field.
v For SDMC:
a. Go to the Resources tab and click Hosts.
Removing and replacing SAS RAID adapters and batteries 33
b. Select a server in the right pane.
c. On the Actions menu, click Hardware Information > Adapters > Host Ethernet. The Host
Ethernet Adapters window opens.
d. Select the required physical port and click Configure. The HEA Physical Port Configuration
window opens.
e. Record the value in the Pending Port Group Multi-Core Scaling value field.
4. Check and record the server settings.
Prior to replacing this field-replaceable unit (FRU), check and record all server settings since you
might need to update the system after you replace this FRU. This can be done through the
Advanced System Management Interface (ASMI).
a. Check and record the server firmware level indicated on the right edge of the status frame, the
area where the Log out button is located. You might need to update the system to the latest
server firmware code level after you replace this FRU.
b. Check and record the service processor settings you previously set using the ASMI. You will need
to reset these service processor settings after you update this FRU or the default settings will be
used. Settings to record include the following:
v System Name setting
v System Power Control settings
v System Service Aids settings
v System Configuration settings
v Network Services settings
v Performance Option settings
v Login Profile settings settings
v Boot List settings
v Host Ethernet Adapter settings
c. Record any service processor settings you might have set using the operating system commands.
5. Stop the system or logical partition. For instructions, see Stopping a system or logical partition.
6. Disconnect the power source from the system by unplugging the system.
Note: This system is optionally equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
(L003)
or
34 Power Systems: Serial-attached SCSI RAID enablement
7. Label and disconnect the external cables connected to the field-replaceable unit (FRU) you are
servicing.
8. Place the system into the service position. For instructions, see Placing a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system into the service position..
9. Remove the service access cover:
v For a rack-mounted system, see Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system.
v For a stand-alone system, see Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system.
10. Remove the memory riser. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C memory riser.
11. Remove the GX adapters. For instructions, see Removing GX adapters for the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C.
12. Remove the PCI adapters. For instructions, see Removing a PCI adapter from the 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C server with the power off.
13. Remove the Host Ethernet Adapter (8202-E4B or 8205-E6B). For instructions, see Removing the
8202-E4B or 8205-E6B Host Ethernet Adapter with the power off.
14. Remove all the voltage regulator modules. For instructions, see Removing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C voltage regulator module.
15. Remove the middle fans. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C middle fans.
16. Remove the power supplies. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C power supply with the power off or Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C power supply with the power on.
17. Disconnect the internal cables from backplane.
Remove the SPCN cable from the chassis rear (8202-E4C or 8205-E6C).
Removing and replacing SAS RAID adapters and batteries 35
18. Remove the disk drive cage:
a. Grasp the disk drive latches and pull then outward.
b. Grasp the handles and pull the disk drive cage toward you.
Figure 19. Disconnecting the internal cables
36 Power Systems: Serial-attached SCSI RAID enablement
19. Remove the PCI-riser air divider. Firmly grasp the PCI-riser air divider and lift it upward.
20. Remove the PCI riser (optional):
Figure 20. Removing the disk drive cage
Figure 21. Removing the PCI riser air divider
Removing and replacing SAS RAID adapters and batteries 37
a. Loosen the two thumbscrews (A) by turning them counterclockwise.
b. Unlock the PCI riser latches (B) by pressing the end of the latches and lifting them upward.
c. Remove the PCI riser by pulling it upward.
21. Remove the disk-drive cage shield and voltage regulator guide and air divider.
a. Remove the five screws that secure the disk-drive cage shield and the voltage regulator guide
and air divider.
b. Remove the disk drive cage shield and the voltage regulator guide and air divider by lifting
them upward.
Figure 22. Removing the PCI riser
38 Power Systems: Serial-attached SCSI RAID enablement
22. Remove the dual inline memory module (DIMM) cage:
a. Remove the four screws that secure the DIMM cage (View A).
Figure 23. Removing the disk-drive cage shield (8202-E4B or 8205-E6B)
Figure 24. Removing the disk-drive cage shield (8202-E4C or 8205-E6C)
Removing and replacing SAS RAID adapters and batteries 39
b. Remove the DIMM cage by lifting it upward.
23. Disconnect the front fan cable.
Figure 25. Removing the DIMM cage
40 Power Systems: Serial-attached SCSI RAID enablement
24. Remove the middle fan cage:
a. Remove the screws (B) that secure the fan cage.
b. Remove the two screws (A) that secure the fan cage on the outside of the system chassis.
c. Remove the fan cage by lifting it upward.
Figure 26. Disconnecting the front fan cable
Removing and replacing SAS RAID adapters and batteries 41
25. Remove the power interlock switch. For instructions, see Removing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C power interlock switch with power off.
26. Remove the system processor module. For instructions, see Removing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system processor module.
27. Remove the metal guide bracket for the voltage regulator module.
a. Locate the three screws securing the bracket to the system sidewall.
b. Remove the three screws.
28. Remove the system backplane.
Note: Use care when removing the system backplane. Do not lift the system backplane by any of
the attached modules.
a. Label and disconnect all the Ethernet cables that are present.
b. Remove the four screws (A).
Note: Do not remove any of the Torx screws. Remove only the four hex and slot combination
screws.
Figure 27. Remove the middle fan cage
42 Power Systems: Serial-attached SCSI RAID enablement
c. Grip the system backplane by both heat-sink load arms (B).
Figure 28. Removing the system backplane (8202-E4B or 8205-E6B)
Figure 29. Removing the system backplane
Removing and replacing SAS RAID adapters and batteries 43
d. Lift the front of the system backplane up to clear the front alignment pins (approximately 6.35
mm (1/4 in.)).
e. Slide the system backplane slightly toward the front of the system so that the connectors on the
system backplane clear the rear chassis.
f. After the connectors are clear of the chassis, lift the rear of the system backplane and remove the
system backplane at an angle.
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system
backplane
Learn how to replace the system backplane that is part of a service action.
To complete this procedure, you must have already completed the removal procedure. For the removal
procedure instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane.
If your system is managed by the Hardware Management Console (HMC), use the HMC to replace the
system backplane. For instructions, see Exchanging a part using the Hardware Management Console.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
replace the system backplane. For instructions, see Exchanging a part using the Systems Director
Management Console.
If you do not have an HMC or an SDMC, complete the following steps to replace the system backplane:
Attention:
Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge
from damaging your hardware.
When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on electrical
equipment.
If you do not have a wrist strap, prior to removing the product from ESD packaging and installing or
replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.
1. Replace the system backplane.
Note: Use care when you replace the system backplane.
a. Carefully grasp the replacement system backplane by both of the heat-sink load arms (A).
b. Lower the system backplane at an angle toward the front of the system chassis.
c. Lower the rear of the system backplane, and then slide it toward the rear of the system until the
connectors clear the openings in the chassis (see inset).
d. Lower the front of the system backplane onto the alignment pins (B).
44 Power Systems: Serial-attached SCSI RAID enablement
e. Tighten the four screws (C) that secure the system backplane to the system chassis.
f. Reconnect all of the Ethernet cables that were previously removed.
Figure 30. Installing the system backplane (8202-E4B or 8205-E6B)
Figure 31. Installing the system backplane
Removing and replacing SAS RAID adapters and batteries 45
2. Install the metal guide bracket for the voltage regulator module.
a. Align the three holes on the bracket to the three holes along the system sidewall.
b. Tighten the three screws.
3. Replace the system processor module. For instructions, see Replacing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system processor module.
4. Replace the power interlock switch. For instructions, see Replacing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C power interlock switch with the power off.
5. Replace the fan cage:
a. Lower the fan cage into position ensuring that alignment groove (A) and alignment pin (B) are
correctly aligned.
b. Tighten the two screws (D) that secure the fan cage to the system chassis.
c. Tighten the screws (C) that secure the fan cage.
6. Connect the front fan cable.
Figure 32. Installing the fan cage
46 Power Systems: Serial-attached SCSI RAID enablement
7. Replace the dual inline memory module (DIMM) cage:
a. Lower the DIMM cage into position.
b. Insert and tighten the four screws that secure the DIMM cage (View A).
Figure 33. Connecting the front fan cable
Removing and replacing SAS RAID adapters and batteries 47
8. Replace the plastic voltage regulator module guide (A).
a. Lower the plastic guide into position.
b. Insert and tighten the three screws that secure the guide.
9. Replace the disk-drive cage shield (B).
a. Lower the disk-drive cage shield into position.
b. Insert and tighten the two screws that secure the disk-drive cage shield.
Figure 34. Replacing the DIMM cage
48 Power Systems: Serial-attached SCSI RAID enablement
10. Replace the PCI riser:
a. Lower the PCI riser into position.
Figure 35. Replacing the disk-drive cage shield (8202-E4B or 8205-E6B)
Figure 36. Replacing the disk-drive cage shield (8202-E4C or 8205-E6C)
Removing and replacing SAS RAID adapters and batteries 49
b. Press the PCI riser latches (A) down until they lock into place.
c. Tighten the two thumbscrews (D).
11. Replace the PCI-riser air divider.
a. Lower the PCI-riser air divider and press it into place.
Figure 37. Replacing the PCI riser
50 Power Systems: Serial-attached SCSI RAID enablement
12. Replace the disk drive cage:
a. With the disk drive latches extended, push the disk drive cage into the system.
b. Push the disk drive latches toward system until they lock.
Figure 38. Replacing the PCI riser air divider
Removing and replacing SAS RAID adapters and batteries 51
13. Connect the internal cables to the backplane.
Replace the SPCN cable to the chassis rear (8202-E4C or 8205-E6C).
Figure 39. Replacing the disk drive cage
Figure 40. Connecting the internal cables
52 Power Systems: Serial-attached SCSI RAID enablement
14. Replace the power supplies. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C power supply with the power off or Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C power supply with the power on.
15. Replace the fans. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C fans.
16. Replace the memory riser. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C memory riser.
17. Replace all the voltage regulator modules. For instructions, see Replacing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C voltage regulator module.
18. Replace the Host Ethernet Adapter (8202-E4B or 8205-E6B). For instructions, see Replacing the
8202-E4B or 8205-E6B Host Ethernet adapter with the power off.
19. Replace the GX adapters. For instructions, see Replacing GX adapters for the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C.
20. Replace the PCI adapters. For instructions, see Replacing a PCI adapter from the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C server with the power off.
21. Replace the service access cover:
v For a rack-mounted system, see Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system.
v For a stand-alone system, see Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system.
22. Replace the front cover:
v For a rack-mounted system, see Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system.
v For a stand-alone system, see Installing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system.
23. Place the system in the operating position. For instructions, see Placing the rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system into the operating position.
24. Start the system or logical partition. See Starting the system or logical partition.
25. Restore network connectivity.
The service processor must connect to the network to be accessible from the management console.
How will the IP Address of the Service Processor be managed?
Obtain an IP address automatically from a Dynamic Host
Configuration Protocol (DHCP) server
Go to next step.
Configure a static IP address Go to step 27 on page 54
26. Restore service processor settings.
Enable network access to the service processor. If the network connection uses the Dynamic Host
Configuration Protocol (DHCP) to establish an IP address, and uses the Management consoles as the
DHCP server, no additional configuration is necessary to enable network access to the service
processor. Perform the following steps:
a. Ensure that the service processor is connected to the existing service network by verifying that
the management console cable is connected to the management console port on the system
backplane.
b. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
Yes No
Go to step 29 on page 54. Go to step 32 on page 55
Removing and replacing SAS RAID adapters and batteries 53
27. Restore service processor settings.
The Advanced System Management Interface (ASMI) is the interface to the service processor that is
required to perform general and administrator-level service tasks, such as reading service processor
error logs, reading vital product data, setting up the service processor, controlling the system power,
and setting service processor network ports. The ASMI may also be referred to as the service
processor menus. The ASMI can be accessed through https. As the network connection uses static IP
address assignments, perform the following steps.
a. To know your service processor network ports, if necessary, you can use the control panel in
manual mode. Refer to service functions to get control panel documentation in that case.
b. Connect a client with a Web browser directly to the service processor network ports you
previously noted. It could be something like the following URLs:
v https://169.254.2.147
v https://169.254.3.147
c. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
d. Log on to the ASMI with the admin User ID and admin default Password.
e. Change the admin user ID's Password and the general user ID's Password when prompted.
f. To configure network access perform the following steps.
1) Click Network Configuration under the Network Services node.
2) On the right pane, configure eth0 and eth1 network interfaces, select Static for the Type of IP
Address field, and set and a Host name, an IP address, a Subnet mask.
28. Restore system processor settings.
With the network connection now configured to use static IP address assignments, try to access the
service processor network ports.
Can you access the service network?
Yes No
Go to next step. Go to step 33 on page 55
29. Access the ASMI.
The Advanced System Management Interface (ASMI) is the interface to the service processor that is
required to perform general and administrator-level service tasks, such as reading service processor
error logs, reading vital product data, setting up the service processor, and controlling the system
power. The ASMI might also be referred to as the service processor menus. To access the ASMI, refer
to Accessing the Advanced System Management Interface. After you have accessed the ASMI,
continue with the next step.
30. Set System Enclosure Type and Feature Code.
The enclosure serial number is a seven-digit number printed on a bar-coded label on the system
unit. When the system backplane is replaced, the seven-digit enclosure serial number and feature
code number needs to be manually inputted and stored in the new system backplane using the
Advanced System Management Interface (ASMI).
Note: The feature code number for 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane is
78AA-001.
Note: To perform this operation, your authority level must be Authorized Service Provider (ID:
celogin).
To input the system serial number:
a. On the ASMI Welcome pane, specify your user ID and password, and click Log in.
b. In the navigation area, expand System Configuration and Program Vital Product Data.
c. Select System Enclosures. In the right pane, the current system enclosures are displayed.
54 Power Systems: Serial-attached SCSI RAID enablement
d. On the system being serviced, record the enclosure serial number printed on the bar-coded label
on the system unit.
e. Enter the enclosure serial number recorded from the system unit in the Enclosure serial number
field.
f. Enter the feature code number in the Feature Code/Sequence Number field.
g. Click Save settings to update and save the enclosure serial number and the feature code.
31. Go to step 35 on page 56
32. Restore service processor settings.
If the network connection uses DHCP to establish an IP address, but does not use a management
console as the DHCP server, perform the following steps.
a. Complete any network configuration necessary to allow the DHCP server to recognize and assign
an IP address to the service processor.
b. Ensure that the service processor is connected to the existing service network by verifying that
the network cable is connected to the network port.
c. If not already connected, connect all system power cables by plugging them into power outlets.
Note: Do not start the system at this time.
Can you access the service network?
Yes No
Go to 29 on page 54 Go to next step.
33. Restore service processor settings.
You have indicated that you are still not able to access the service network. You will need to reset
the service processor by pressing the pin-hole switch (E) on the operator panel.
Removing and replacing SAS RAID adapters and batteries 55
34. Got to step 25 on page 53.
35. Restore service processor settings.
Change the password of the admin user ID when prompted by performing the following steps.
a. In the navigation area of the ASMI, expand Login Profile.
b. Select Change Password under Login Profile.
c. Specify the required information, and click Continue.
As a security measure, you are required to enter your current user password in the Current
password for current user ID field.
Note: Passwords can be any combination of up to 64 alphanumeric characters. The default
password for the admin User ID is admin. After your initial login to the ASMI, the admin password
must be changed.
As a security measure, you are required to enter the password for your user ID in the Current
password for current user field.
36. Restore service processor settings.
Figure 41. Pin-hole switch on the operator information panel
56 Power Systems: Serial-attached SCSI RAID enablement
Set the system name. The system name can be changed to any valid ASCII string. It does not have to
follow the initialized machine type_model_serial number format. To change the system name, complete
the following steps.
a. In the navigation area of the ASMI, expand System Configuration.
b. Select System Name under System Configuration.
c. Enter the desired system name.
d. Click Save settings to update the system name to the new value.
The new system name is displayed in the status frame, the area where the Log out button is located.
If another method, such as the Management consoles, are used to change the system name, the
status frame does not reflect the change.
37. Restore service processor settings.
To set the time-of-day, complete the following steps.
a. In the navigation area of the ASMI, expand System Configuration.
b. Select Time of Day under System Configuration.
c. If the system is powered off, the right pane displays a form that shows the current date (month,
day, and year) and time (hours, minutes, seconds).
d. Change either the date or the time or both, and click Save settings.
38. Restore service processor settings.
Verify if the server firmware level is same as the value recorded prior to replacing the system
backplane. If the value is different then it must be fixed. For information on how to fix the firmware,
see Getting firmware fixes.
39. Restore the server Host Ethernet Adapter (HEA) device settings.
v For HMC:
a. Expand the Systems Management folder in the navigation tree pane.
b. Double click the Servers folder.
c. Select a server in the central panel.
d. On the Tasks menu, click Hardware Information > Adapters > Host Ethernet. The Host
Ethernet Adapters window opens.
e. Select the required physical port and click Configure. The HEA Physical Port Configuration
window opens.
f. Enter the value in the Pending Port Group Multi-Core Scaling value field and click OK.
v For SDMC:
a. Go to the Resources tab and click Hosts.
b. Select a server in the right pane.
c. On the Actions menu, click Hardware Information > Adapters > Host Ethernet. The Host
Ethernet Adapters window opens.
d. Select the required physical port and click Configure. The HEA Physical Port Configuration
window opens.
e. Enter the value in the Pending Port Group Multi-Core Scaling value field and click OK.
40. Restore service processor settings.
Reenter any of the following settings that were previously changed through the Advanced System
Management Interface (ASMI), unless you want to use the default settings.
v System name setting
v System Power Control settings
v System Service Aids settings
v System Configuration settings
v Network Services settings
Removing and replacing SAS RAID adapters and batteries 57
v Performance Options settings
v Login Profile settings
v Boot List settings
v Host Ethernet Adapter settings
41. Restore service processor settings.
a. Reset any service processor settings that you may have set using operating system commands.
You recorded these settings at the beginning of this procedure.
b. If you choose to reset the Management consoles Access password, perform the following:
To change the HMC access password, select one of the following methods:
1) From the HMC GUI (preferred method):
a) Expand the Systems Management folder in the navigation tree pane.
b) Double click the Servers folder.
c) Select a server in the central panel.
d) Under the Operations folder, select Change Password.
e) Provide the desired information and click OK.
2) From the HMC command line, type:
/usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword
where:
v The value for managedsystem is the new service processor's managed system name.
v No value for --passwd is entered thereby allowing authentication.
v The value for newpasswd is the newpassword value.
To change the SDMC access password, follow these steps:
1) Go to the Resources tab and click Hosts.
2) Select a server in the right pane.
3) On the Actions menu, click Operations > Change Password.
4) Provide the desired information and click OK.
42. Recover partition data on the service processor.
Profile data stored in the managed server has been cleared or corrupted.
To recover profile data using the HMC, complete the following steps.
a. Expand the Systems Management folder in the navigation tree pane.
b. Double click the Servers folder.
c. Use the check box to select a server, in the central panel.
d. In the Configuration folder if you have the Manage Partition Data folder, select Restore.
To recover profile data using the SDMC, complete the following steps:
a. Go to the Resources tab and click Hosts.
b. Select a server in the right pane.
c. On the Actions menu, click System Configuration > Manage Partition Data > Restore.
d. Select the desired restore option and click OK.
43. Update vital product data.
If your system is running IBM i and is not managed by the Management consoles, you must update
the load source after replacing the system backplane. To update the load source without using the
Management consoles, do the following steps:
a. Load the I_BASE_01 Licensed Internal Code optical media into the device that is defined for the
system and access it to perform an initial program load (IPL).
Note: This IPL can take several minutes.
58 Power Systems: Serial-attached SCSI RAID enablement
b. After the Select a Language Group display is shown, press Enter to select the default language
feature 2924 (English).
Note: To change the language feature, type the language feature that appears on the media that
contains the operating system, and then press Enter.
Confirm the language feature by pressing Enter.
c. From the Install Licensed Internal Code display, select option 2 (Work with Dedicated Service
Tools (DST)) and press Enter.
Use the Dedicated Service Tools to make sure the disk units, including the load source, are
reporting correctly and that there are no errors that might prevent the IPL from disk. When all
errors have been resolved and the load source disk is reporting correctly, continue with the next
step.
d. From the Work with Dedicated Service Tools (DST) display, select option 2 (Work with disk units)
and press Enter.
e. Select option 8 (Update system vital product data) to identify the location of the load source disk
unit. Press Enter. The Update System Vital Product Data display is shown.
f. Press Enter to confirm that you want the vital product data to be written.
g. Press F12 (Previous) twice.
h. Type 1 and press Enter to exit DST. Use the system control panel to delay turning off the system.
Change the IPL mode to B, and power on the system to perform an IPL from disk.
44. Restore the boot list and the Host Ethernet Adapter settings.
The boot list is not present on a new system backplane, therefore; it is necessary to create the boot
devices by booting into the system management services (SMS) menus. See Selecting boot options for
information on setting the boot device list.
After the operating system is booted, the bootlist command can be used to verify, or make additional
changes to, the list of boot devices for a system or partition.
To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet
Adapter.
45. Verify the repair. For more information, see Verifying a repair.
8202-E4B or 8205-E6B SAS subsystem service considerations
There are several considerations to review before servicing features within the SAS subsystem for the
8202-E4B or 8205-E6B system.
Attention: Do not attempt to remove any parts related to the SAS subsystem if the cache data present
LED on the card in either P1-C19 or P1-C14 is flashing. To see this LED, you must shut down the system
and remove the service cover. The LED shows a no-touch icon (a hand with a slash through it), and
indicates that there is possible data in cache on the adapter. To prevent data loss, first make sure that the
system has been shut down correctly. If either cache data present LED is flashing, refer to one of the
following topics before removing any SAS subsystem parts. You can safely replace the cache battery pack
when Yes is displayed next to Battery pack can be safely replaced on the Battery Information panel
(for Linux and IBM i systems) or COMMAND STATUS panel (for AIX systems).
Before servicing any features, review the I/O configurations for each logical partition on your system. It
is important to understand the differences between the various configurations and the resulting service
considerations. In addition, consider possible impacts to the following areas.
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
Removing and replacing SAS RAID adapters and batteries 59
Service considerations for the cache battery pack
To prevent data loss, follow proper procedures before replacing the cache battery pack on the Cache
RAID cards.
Note: Concurrent maintenance of adapter battery is not supported. Even with failed or missing batteries,
the card associated with the battery must first be removed. For more information, see Removing and
replacing SAS RAID adapters and batteries.
Attention: To maintain system availability and prevent possible data loss, you must understand that
these features have implications on the SAS configuration and data accessibility of the system.
Attention: Do not attempt to remove any parts related to the SAS subsystem if the cache data present
LED on the card in either P1-C19 or P1-C14 is flashing. To see this LED, you must shut down the system
and remove the service cover. The LED shows a no-touch icon (a hand with a slash through it), and
indicates that there is possible data in cache on the adapter. To prevent data loss, first make sure the
system has been shut down correctly. If either cache data present LED is flashing, refer to one of the
following topics before removing any SAS subsystem parts. You can safely replace the cache battery pack
when Yes is displayed next to Battery pack can be safely replaced on the Battery Information panel
(for the Linux and IBM i systems) or COMMAND STATUS panel (for AIX systems).
The following figure shows the location of the cache data present LED in the 8202-E4B or 8205-E6B SAS
subsystem.
The following table describes the states of the cache data present LED and what to consider before
attempting a service action.
Figure 42. Cache data present LED
60 Power Systems: Serial-attached SCSI RAID enablement
Table 1. LED indications and actions
Cache battery card
(Un-P1-C14)
RAID/Cache storage
controller (Un-P1-C19) Indication Action
Off Off
Cache is not active. Service actions can proceed.
Off Flashing
Cache is active on C19. Cache
might have been flushed on
C14 if C19 is failed.
Attempt to power on and
perform a proper shutdown
prior to any service action.
Flashing Off
Cache is active on C14. Cache
might have been flushed on
C19 if C14 is failed.
Attempt to power on and
perform a proper shutdown
prior to any service action.
Flashing Flashing
Cache is active on the dual
controllers.
Attempt to power on and
perform a proper shutdown
prior to any service action. If
this is unsuccessful in clearing
the cache active LEDs, contact
support. See Contacting IBM
service and support.
Removing the 8202-E4B or 8205-E6B RAID enablement card
Use this procedure to remove a Redundant Array of Independent Disks (RAID) enablement card before
installing a new or upgraded enablement card.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the
enablement card from the server. For instructions, see “Removing a part by using the Hardware
Management Console” on page 106.
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the enablement card from the server. For instructions, see “Removing a part by using the Systems
Director Management Console” on page 143.
If you do not have an HMC or an SDMC, complete the following steps to remove the enablement card:
1. Complete the prerequisite tasks described in “Before you begin” on page 79.
2. Stop the system, as described in “Stopping a system or logical partition” on page 89.
Removing and replacing SAS RAID adapters and batteries 61
3. Ensure the system is powered down. Check the operator panel to see that the power LED is flashing
on or off.
4. Remove the front cover.
v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 92.
v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system” on page 92.
5. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
6. Remove the service access cover.
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
7. Check the status of the green Write Cache LEDs on cards located in P1-C19 and P1-C14 of Figure 1.
v If both the green cache LEDs are flashing, replace the system cover and follow the “Starting the
system or logical partition” on page 87 before returning here.
v If both LEDs are off, proceed to the next step.
v If by chance, one cache LED is flashing and the other is not, contact your service provider for
assistance.
8. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
62 Power Systems: Serial-attached SCSI RAID enablement
9. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
10. Locate the RAID enablement card that you want to remove.
Note: If an optional tape drive is installed, then move the tape drive to access the RAID enablement
card.
11. Press the tab and slide the card out of the system as shown in the following figures. The RAID
enablement card is attached to a plastic plate. You must slide the plastic plate out of the system to
remove the RAID enablement card.
Figure 43. Slot location for RAID enablement card (P1-C13)
Removing and replacing SAS RAID adapters and batteries 63
To replace the RAID enablement card, see “Replacing the 8202-E4B or 8205-E6B RAID enablement
card.”
12. Replace the service access cover.
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
13. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
14. Replace the front cover.
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
15. Reconnect the power source.
16. Start the system as described in “Starting the system or logical partition” on page 87.
17. Verify the installed part:
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Replacing the 8202-E4B or 8205-E6B RAID enablement card
Use this procedure to replace an existing Redundant Array of Independent Disks (RAID) enablement
card.
Figure 44. RAID enablement card removal from a rack-mounted system unit
64 Power Systems: Serial-attached SCSI RAID enablement
You must have already completed the procedure “Removing the 8202-E4B or 8205-E6B RAID enablement
card” on page 61.
To replace a RAID enablement card, complete the following steps:
1. Locate the slot for the RAID enablement card.
Note: If an optional tape drive is installed, then move the tape drive to access the RAID enablement
card.
2. Align the RAID enablement card with the slot in the system.
3. Slide the RAID enablement card fully into the system until the tab (A) locks into place.
4. Replace the service access cover:
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
5. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page
101.
6. Replace the front cover:
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
7. Reconnect the power source.
8. Start the system as described in “Starting the system or logical partition” on page 87.
9. Verify the installed part:
Figure 45. RAID enablement card replacement in a rack-mounted system unit
Removing and replacing SAS RAID adapters and batteries 65
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache
storage controller
Use this procedure to remove a Redundant Array of Independent Disks (RAID) storage controller before
installing a new or upgraded storage controller. You can also use this procedure to remove a RAID-10
storage controller.
Identify the system that you will be working on by using the identify the system process to turn on the
system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
If your system is managed by the Hardware Management Console (HMC), use the HMC to remove
RAID/cache storage controller from the server. For instructions, see “Removing a part by using the
Hardware Management Console” on page 106
If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to
remove the RAID/cache storage controller from the server. For instructions, see “Removing a part by
using the Systems Director Management Console” on page 143.
If you do not have an HMC or an SDMC, complete the following steps to remove RAID/cache storage
controller:
1. Complete the prerequisite tasks described in “Before you begin” on page 79.
2. Stop the system, as described in “Stopping a system or logical partition” on page 89.
3. Remove the front cover.
v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 92.
v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system” on page 92.
4. For a rack-mounted system, place the system into the service position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page
100.
5. Remove the service access cover.
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
66 Power Systems: Serial-attached SCSI RAID enablement
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
6. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in
“Identifying a part” on page 81.
7. Disconnect the power source from the system by unplugging the system.
Note: This system is equipped with a second power supply. Before continuing with this procedure,
ensure that the power source to the system has been completely disconnected.
(L003)
or
8. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
9. Locate the RAID/cache storage controller that you want to remove.
10. Pull the tab (A) out and slide the card (B) out of the system as shown in the following figure.
Removing and replacing SAS RAID adapters and batteries 67
To replace the RAID/cache storage controller, see “Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C RAID/cache storage controller” on page 69.
11. Replace the service access cover:
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
12. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on
page 101.
13. Replace the front cover:
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
14. Reconnect the power source.
15. Start the system as described in “Starting the system or logical partition” on page 87.
16. Verify the installed part:
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Figure 46. RAID/cache storage controller removal from a rack-mounted system unit
68 Power Systems: Serial-attached SCSI RAID enablement
Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache
storage controller
Use this procedure to replace an existing Redundant Array of Independent Disks (RAID) storage
controller. You can also use this procedure to replace a RAID-10 storage controller.
Identify the system that you will be working on by using the identify the system process to turn on the
system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81.
You must have already completed the procedure “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C RAID/cache storage controller” on page 66.
To replace a RAID/cache storage controller, complete the following steps:
1. Locate the slot for the RAID/cache storage controller.
2. Align the RAID/cache storage controller with the slot in the system.
3. Slide the RAID/cache storage controller (A) fully into the system until the tab (B) locks into place.
4. Replace the service access cover:
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
5. For a rack-mounted system, put the system into the operating position, as described in “Placing the
rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page
101.
6. Replace the front cover.
Figure 47. RAID/cache storage controller replacement in a rack-mounted system unit
Removing and replacing SAS RAID adapters and batteries 69
v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system” on page 93.
v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 94.
7. Reconnect the power source.
8. Start the system as described in “Starting the system or logical partition” on page 87
9. Verify the installed part:
v If you replaced the part because of a service action, verify the installed part as described in
Verifying a repair.
v If you installed the part for any other reason, verify the installed part as described in Verifying the
installed part.
Removing and replacing the cache battery card
To service the cache battery card, complete the following procedure:
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
Complete the following steps to remove and install the cache battery card:
1. Begin with power on.
To prevent data loss, the cache battery card must be in an error state before it is removed from the
system.
If you have powered the system off, power it on before you continue.
2. Select your operating system:
v AIX: Go to step 3.
v Linux: Go to step 5.
v IBM i: Go to step 7.
3. Force the cache battery card into an error state on AIX (to prevent possible data loss), as follows:
a. Navigate to the IBM SAS Disk Array Manager by using the following steps:
1) At the command prompt, type smit, and press Enter.
2) Select Devices.
3) Select Disk Array.
4) Select IBM SAS Disk Array.
5) Select IBM SAS Disk Array Manager from the menu with options for configuring and
managing the IBM SAS RAID Controller.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Force Controller Rechargeable Battery Error.
70 Power Systems: Serial-attached SCSI RAID enablement
e. Select the IBM SAS RAID Controller for the battery you want to replace.
This option places the battery into the error state, which requires it to be replaced.
f. Determine whether it is safe to replace the cache battery card. See the Displaying rechargeable
battery information procedure. It is safe to replace the cache battery card when Yes is displayed
next to Battery pack can be safely replaced. You might need to reselect the option Display
Controller Rechargeable Battery Information multiple times as it might take several minutes
before it is safe to replace the cache battery card.
g. Verify that the Cache Present LED is not flashing.
Displaying rechargeable battery information
a. Navigate to the IBM SAS Disk Array Manager by using the previous procedure.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Display Controller Rechargeable Battery Information.
e. Select IBM SAS RAID Controller.
4. Go to step 8.
5. Force the cache battery card into an error state on the Linux operating system (to prevent possible
data loss), as follows:
a. Run the iprconfig utility by typing iprconfig.
b. Select Work with disk unit recovery.
c. Select Work with resources containing cache battery packs.
d. Select your adapter and type 2. Then press Enter to force the battery error.
Note: This option places the battery into the error state, which requires it to be replaced.
e. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a
battery error, type q to cancel.
f. Determine whether it is safe to replace the cache battery card. See Displaying rechargeable battery
information.
It is safe to replace the cache battery card when Yes is displayed next to Battery pack can be
safely replaced. You might need to reselect the option Display Controller Rechargeable Battery
Information multiple times, because it might take several minutes before it is safe to replace the
cache battery card.
6. Go to step 8.
7. Force the cache battery card into an error state on IBM i (to prevent possible data loss), on the
system or partition that contains the adapter as follows:
a. Be sure that you are signed on to the system with at least service level authority.
b. Type strsst on the command line and press Enter.
c. Type your service tools user ID and service tools password on the System Service Tools (SST)
Sign On display. Press Enter.
d. Select Start a Service Tool from the System Service Tools (SST) display and press Enter.
e. Select Hardware Service Manager from the Start a Service Tool display and press Enter.
f. Select Work with resources containing cache battery packs from the Hardware Service Manager
display and press Enter.
g. Select Force battery pack into error state for the I/O card that you are working with from the
Work with Resources containing cache battery packs display and press Enter.
h. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been
selected and press the function key that confirms your choice.
i. Return to the Work with Resources containing cache battery packs display and select Display
battery information. Ensure that the field Safe to replace cache battery is set to Yes. This might
take several minutes and you might need to press the refresh key to see the field update.
Removing and replacing SAS RAID adapters and batteries 71
8. Place a rack-mounted system into operating position. See “Placing the rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101.
9. Remove the service access cover:
v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 95.
v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 96.
10. Check cache data present LED (A) by examining the LED on the cache battery card.
v If the LED is flashing, go to next step.
v If the LED is not flashing, go to step 12.
11. Do not remove the cache battery card if the LED is flashing.
The flashing LED indicates that there is cached data on the battery that has not been saved. This
data will be lost if the battery is removed at this time. Complete this service procedure without
replacing the battery, and then retry it to ensure all cache data is saved before the battery is replaced.
Go to step 18.
12. Remove the cache battery card as follows:
Attention: Do not remove the cache battery card if the cache data present LED is flashing because
this indicates cache data exists.
a. Disengage the cache battery card (B) from the connector (A).
b. Slide the cache battery card out of the mounting guides and remove it from the system.
c. After the card has been removed from the system, remove the battery pack from the battery card:
1) Unplug the battery pack from the battery card.
2) Grasp the retaining clips and squeeze them closed to allow the battery pack to lift off of the
retaining clips.
Figure 48. Cache data present LED
72 Power Systems: Serial-attached SCSI RAID enablement
If you are replacing the cache battery pack, go to the “Removing and replacing a cache battery pack”
on page 75, and then return here and continue with the next step.
13. Replace the cache battery card as follows:
Attention:
Ensure that the cache battery card is disconnected for at least 60 seconds before you connect the new
battery. This is the minimum amount of time needed for the adapter to recognize that the battery has
been replaced.
a. Slide the cache battery card into the mounting guides on the controller until it is seated in the
battery connector.
Figure 49. Removing the cache battery card
Removing and replacing SAS RAID adapters and batteries 73
14. Power on the system and restart the adapter's write cache by doing the following steps:
Select your operating system:
v AIX: Go to step 15.
v Linux: Go to step 16.
v IBM i: Go to step 17.
15. Perform the following steps:
a. Navigate to the IBM SAS Disk Array Manager by using the information in step 3a.
b. Select Diagnostics and Recovery Options.
c. Select Controller Rechargeable Battery Maintenance.
d. Select Start Adapter Cache.
e. Select the controller with the battery you just replaced and press Enter.
f. Go to step 18.
16. Perform the following steps:
a. Run the iprconfig utility by typing iprconfig.
b. Select Work with disk unit recovery.
c. Select Work with resources containing cache battery packs.
d. Start the I/O adapter cache on the adapter for the battery that you just replaced by typing 3 and
press Enter.
e. Go to step 18.
17. Perform the following steps:
a. Return to the Work with Resources containing Cache Battery Packs display using the information
in step 7a - 7f and select the Start IOA cache. Press Enter.
b. Ensure that you get the message Cache was started.
c. Go to step 18.
Figure 50. Installing the cache battery card
74 Power Systems: Serial-attached SCSI RAID enablement
18. Install the service access cover:
v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system” on page 98.
v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system” on page 98.
19. Place into operating position. See “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system into the operating position” on page 101.
20. Close the back door on the unit you are servicing.
21. Verify that the new resource is working correctly, as described in “Verifying the installed part” on
page 107.
Removing and replacing a cache battery pack
Use this procedure to replace the cache battery pack on a battery card.
Identify the system that you will be working on by using the identify the system process to turn on the
system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81.
Attention: Before continuing with this procedures, determine that it is safe to replace the cache battery
pack. See Displaying rechargeable battery information. It is safe to replace the cache battery pack when
Yes is displayed next to Battery pack can be safely replaced.
Attention: To maintain system availability and prevent possible data loss, it is important to understand
that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility
of the system. Review the SAS subsystem topic to understand the various SAS configuration and service
considerations.
Before servicing or installing any features, review the I/O configurations for each logical partition on
your system. It is important to understand the differences between the various configurations and the
resulting service considerations. In addition, consider possible impacts to the following areas:
v Logical partitions and their I/O adapter assignments
v RAID array configurations
v Physical disk locations
v Path to your boot device
Complete the following steps to replace a nonconcurrent maintainable battery pack:
1. Remove the controller from the system. For more information, see “Removing the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C RAID/cache storage controller” on page 66.
2. Place the controller on a surface that is electrostatic-discharge protected.
3. Unplug the battery connector (A) from its connector on the adapter, squeezing the retaining latch
while gently pulling on the plug. The plug connects to the board in only one way; therefore, it cannot
be inserted incorrectly during the replacement procedure.
Note: Ensure that the cache battery pack is disconnected for at least 60 seconds before connecting the
new battery. This is the minimum amount of time needed for the adapter to recognize that the battery
has been replaced.
Removing and replacing SAS RAID adapters and batteries 75
(A) Battery connector
(B) Cache battery pack
4. The battery is mounted on a black plastic plate with two tabs, secured by white plastic standoffs with
retaining tabs. To remove the battery assembly, remove the mounting plate from the standoffs, one
standoff at a time.
5. Push in the standoff retaining tab and pull up on the mounting tab until the standoff releases through
the hole in the mounting tab. Repeat for the second standoff.
Note: If the retaining tab is difficult to push in, turn the white standoff so its retaining tab faces away
from the battery pack.
6. Install the battery on to the posts until the latches click into place.
Figure 51. Removing the cache battery
76 Power Systems: Serial-attached SCSI RAID enablement
7. Reinsert the pins into the rivets from the rear of the adapter.
8. Connect the cache battery pack connector (A) to the adapter. The plug connects to the adapter in only
one way; therefore it cannot be inserted incorrectly.
9. Reinstall the adapter and restart the cache. For more information, see the following procedures:
v “Removing and replacing the cache battery card” on page 70.
v “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page
66.
Figure 52. Replacing the cache battery
Removing and replacing SAS RAID adapters and batteries 77
78 Power Systems: Serial-attached SCSI RAID enablement
Common procedures for installable features
This section contains all the common procedures related to installing, removing and replacing features.
Before you begin
Observe these precautions when you are installing, removing, or replacing features and parts.
These precautions are intended to create a safe environment to service your system and do not provide
steps for servicing your system. The installation, removal, and replacement procedures provide the
step-by-step processes required to service your system.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
© Copyright IBM Corp. 2010, 2011 79
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Before you begin a replacement or installation procedure, perform these tasks:
1. If you are installing a new feature, ensure that you have the software required to support the new
feature.
To do this, go to the following Web site: IBM Prerequisite
2. If you are performing an installation or replacement procedure that might put your data at risk,
ensure, wherever possible, that you have a current backup of your system or logical partition
(including operating systems, licensed programs, and data).
3. Review the installation or replacement procedure for the feature or part.
4. Note the significance of color on your system.
80 Power Systems: Serial-attached SCSI RAID enablement
Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware
to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also
indicate that the part can be removed and replaced with the system or logical partition power on.
5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of
scissors.
6. If parts are incorrect, missing, or visibly damaged, do the following:
v If you are replacing a part, contact the provider of your parts or next level of support.
v If you are installing a feature, contact one of the following service organizations:
– The provider of your parts or next level of support.
– In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL)
at 1–800–300–8751.
In countries and regions outside of the United States, use the following Web site to locate your service
and support telephone numbers:
http://www.ibm.com/planetwide
7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller,
or your next level of support.
8. If you are installing new hardware in a logical partition, you need to understand and plan for the
implications of partitioning your system. For information, see Logical Partitioning.
Identifying a part
Use these instructions to learn how to identify the location of a failed part, the location of a part to be
removed, or the location to install a new part on your system or expansion unit using the appropriate
method for your system.
For IBM Power Systems™ servers that contain the POWER7® processor, the light-emitting diodes (LEDs)
can be used to identify or verify the location of a part that you are removing, servicing, or installing.
The combination identify and fault LED (amber color) shows the location of a field replaceable unit
(FRU). When removing a FRU, first verify whether you are working on the correct FRU by using the
identify function in the management console or other user interface. When removing a FRU under the
direction of the hardware management console, the identify function is activated and deactivated
automatically at the correct times.
The identify function causes the amber LED to blink. When you turn off the identify function, the LED
returns to the state it was previously. For parts that have a blue service button, the identify function sets
LED information for the service button so that when the button is pressed, the correct LEDs on that part
blink.
If you need to use the identify function, use the following procedures.
Control panel LEDs
Use this information as a guide to the control panel LEDs and buttons.
The control panel has LEDs that indicate various system status.
Common procedures for installable features 81
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
– A constant light indicates the identify state, which is used to identify a part.
– No light indicates that the system is operating normally.
v D: System information light
– No light indicates that the system is operating normally.
– Light on indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault in the system unit.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
Figure 53. Control panel
82 Power Systems: Serial-attached SCSI RAID enablement
Related concepts
Identifying a failing part
Use these instructions to learn how to locate and identify a failing part on your system or expansion unit
using the appropriate method for your system.
Identifying a failing part in an AIX system or logical partition
Use these instructions to learn how to locate a failing part, and then activate the indicator light for that
part on a system or logical partition running the AIX operating system.
Locating a failing part in an AIX system or logical partition
You might need to use AIX tools, before activating the indicator light, to locate a part that is failing.
1. Log in as root user or celogin-.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The
Display Diagnostic Log display shows a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Select Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. See “Activating the indicator light for the failing part.”
Activating the indicator light for the failing part
Use these instructions to help physically identify the location of a part you are servicing.
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter. When a light is
activated for a failing part, an I character precedes the location code.
6. Select Commit. This turns off the system attention and indicator light for the failing part.
7. Exit to the command line.
Common procedures for installable features 83
Identifying a failing part in an IBM i system or logical partition
You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part.
Activating the failing-part indicator light
You can search the service action log for an entry that matches the time, reference code, or resource of a
problem, and then activate the indicator light for a failing part.
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point™ utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. If location information is available, select option 6 (Indicator on) to turn on the failing part's
indicator light.
Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is
activated. For example, the indicator light for the backplane or unit that contains the failing part
might be lit. In this case, use the location information to locate the actual failing part.
12. Look for the enclosure indicator light to locate the enclosure that contains the failing part.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
84 Power Systems: Serial-attached SCSI RAID enablement
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
Identifying a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you can activate or deactivate the
indicator lights to locate a part or compete a service action.
Locating a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you need to activate the indicator
lights to locate a part.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
3. Look for the system attention light to identify the enclosure that contains the failing part.
Related information
Service and productivity tools for Linux on POWER systems
IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux
operating systems on IBM Power systems.
Finding the location code of a failing part in a Linux system or logical partition
To retrieve the location code of the failing part, if you do not know the location code, use the procedure
in this topic.
To locate the failing part in a system or logical partition follow these steps:
1. Log in as root user.
2. At the command line, type grep diagela /var/log/platform and press Enter.
3. Look for the most recent entry that contains a system reference code (SRC).
4. Record the location information.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on
Power Systems servers
Common procedures for installable features 85
Activating the indicator light for the failing part
If you know the location code of the failing part, activate the indicator light to help you locate which part
to replace.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
3. Look for the system attention light to identify the enclosure that contains the failing part.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power
Systems servers
Deactivating the failing-part indicator light
After you complete a removal and replacement procedure, you must deactivate the failing-part indicator
light.
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power
Systems servers
Locating a failing part in a Virtual I/O Server system or logical partition
You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is
failing.
1. Log in as root user or celogin-.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A
Display Diagnostic Log display appears. This display contains a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Choose Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. For instructions, see “Identifying a part by using the Virtual I/O Server.”
Identifying a part by using the Virtual I/O Server
Use these instructions to turn on the indicator light to help you physically locate a part by using the
Virtual I/O Server (VIOS).
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection. Press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
86 Power Systems: Serial-attached SCSI RAID enablement
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
Starting the system or logical partition
Learn how to start a system or logical partition after performing a service action or system upgrade.
Starting a system that is not managed by a Hardware Management
Console or or a Systems Director Management Console
You can use the power button or the Advanced System Management Interface to start a system that is
not managed by a Hardware Management Console (HMC) or an IBM Systems Director Management
Console (SDMC).
To start a system that is not managed by a HMC or SDMC, follow these steps:
1. Open the front rack door, if necessary.
2. Before you press the power button on the control panel, ensure that power is connected to the system
unit as follows:
v All system power cables are connected to a power source.
v The Power LED, as shown in the following figure, is slowly blinking.
v The top of the display, as shown in the following figure, shows 01 V=F.
3. Press the power button (A), as shown in the following figure, on the control panel.
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Figure 54. Control panel
Common procedures for installable features 87
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
– A constant light indicates the identify state for the enclosure or for a resource within the
enclosure.
– No light indicates that no resources in the enclosure are being identified.
v D: Attention light
– No light indicates that the system is operating normally.
– A solid light indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault indicator active in the system.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
4. Observe the following after pressing the power button:
v The power-on light begins to blink faster.
v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to
operating speed.
v Progress indicators, also referred to as checkpoints, appear on the control panel display while the
system is being started. The power-on light on the control panel stops blinking and remains on,
indicating that system power is on.
Tip: If pressing the power button does not start the system, do the following steps to start the system
using the Advanced System Management Interface (ASMI):
1. Access the ASMI. For instructions, see Accessing the ASMI.
2. Start the system using the ASMI. For instructions, see Powering the system on and off.
Starting a system or logical partition by using the Hardware
Management Console
You can use the Hardware Management Console (HMC) user interface to start the system or logical
partition after the required cables are installed and the power cables are connected to a power source.
For instructions on working with the HMC, see Managing the Hardware Management Console. For
instructions on starting a logical partition, see Logical partitioning. For instructions on starting the
system, see Powering on the managed system.
Progress indicators, also referred to as checkpoints, appear on the control panel display while the system
is being started. When the power-on light on the control panel stops blinking and remains on, the system
power is on.
88 Power Systems: Serial-attached SCSI RAID enablement
Starting a system or virtual server with the Systems Director
Management Console
You can use the IBM Systems Director Management Console (SDMC) user interface to start the system or
virtual server after the required cables are installed and the power cables are connected to a power
source.
For instructions on working with the SDMC, see Managing and configuring the SDMC. For instructions
on starting a virtual server, see Managing virtual servers. For instructions on shutting down and
restarting virtual servers, see Shutting down and restarting virtual servers.
Progress indicators, also known as checkpoints, display on the control panel while the system is being
started. When the power-on light on the control panel stops flashing and remains on, the system power is
on.
Stopping a system or logical partition
Learn how to stop a system or logical partition as a part of a system upgrade or service action.
Attention: Using either the power-on button on the control panel or entering commands at the
Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data
files. Also, the next time you start the system, it might take longer if all applications are not ended before
stopping the system.
To stop the system or logical partition, select the appropriate procedure.
Stopping a system that is not managed by an HMC or an SDMC
You might need to stop the system to perform another task. Use these instructions to stop the system
using the power button or Advanced System Management Interface.
Before you stop the system, follow these steps:
1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options.
2. Ensure that all jobs are completed and end all applications.
3. Ensure that the operating system is stopped.
Attention: Failure to do so can result in the loss of data.
4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that
the clients have access to their devices using an alternate method.
The following procedure describes how to stop a system that is not managed by a Hardware
Management Console (HMC).
1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down
System) command.
2. At the command line, enter one of the following commands:
v If your system is running the AIX operating system, type shutdown.
v If your system is running the Linux operating system, type shutdown -h now.
v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned,
use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the
PWRDWNSYS command to power down the primary partition.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
3. At the Linux command line, type shutdown -h now.
Common procedures for installable features 89
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
4. At the Linux command line, type shutdown -h now.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
5. Record the IPL type and the IPL mode from the control panel display to help you return the system
to this state when the installation or replacement procedure is completed.
6. Set the power switches of any devices connected to the system to off.
7. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug
power cables from peripheral devices, such as printers and expansion units.
Important: The system is equipped with a second power supply. Before continuing with this
procedure, ensure that all power sources to the system have been disconnected.
(L003)
or
Stopping a system by using the Hardware Management Console
You can use the Hardware Management Console (HMC) user interface to stop the system or a logical
partition.
By default, the managed system is set to power off automatically when you shut down the last running
logical partition on the managed system. If you set the managed system properties on the HMC so that
the managed system does not power off automatically, you must use this procedure to power off your
managed system.
90 Power Systems: Serial-attached SCSI RAID enablement
Attention: If possible, shut down the running logical partitions on the managed system before powering
off the managed system. Powering off the managed system without shutting down the logical partitions
first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual
I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to
their devices using an alternate method.
To power off a managed system, you must be a member of one of the following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
Use the following steps to stop the system using the HMC.
1. In the Navigation area, expand the Systems Management folder.
2. Click the Servers icon.
3. In the Contents area, select the managed system.
4. Select Tasks, then Operations, and then Power Off
5. Select the appropriate power-off mode and click OK.
Related information
Shutting down and restarting logical partitions
Stopping a system with the Systems Director Management Console
You can use the IBM Systems Director Management Console (SDMC) user interface to stop the system or
a virtual server.
By default, the managed system is set to power off automatically when you shut down the last running
virtual server on the managed system. If you set the managed system properties on the SDMC so that the
managed system does not power off automatically, you must use this procedure to power off your
managed system.
Attention: If possible, shut down the running virtual servers on the managed system before powering
off the managed system. Powering off the managed system without shutting down the virtual servers
first causes the virtual servers to shut down abnormally and can cause data loss. If you use a Virtual I/O
Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to
their devices with an alternate method.
To power off a managed system, you must be a member of one of the following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
Use the following steps to stop the system using the SDMC.
1. In the Power Systems Resource area, select the managed system you want to power off.
2. From the Actions menu, select Operations ? Power Off.
3. Select the appropriate power-off mode and click OK.
Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C
Use these instructions to remove, replace, or install covers to access components or perform service.
Common procedures for installable features 91
Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two shipping screws (if present) that secure the system to the rack.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Open the front door (A) by swinging it towards the left as shown in the following figure.
2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can
hold onto it more easily.
Figure 55. Removing the front cover
92 Power Systems: Serial-attached SCSI RAID enablement
Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use this procedure to install the cover to access components or perform service.
To install the front cover, follow these steps:
1. Push the cover onto the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as
shown in the following figure. The cover snaps into place and has an indentation where you can hold
onto it more easily.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
Figure 56. Removing the front cover
Common procedures for installable features 93
Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to install the cover to access components or perform service.
To install the front cover and front door, complete the following steps.
1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown
in the following figure.
2. Rotate the cover up and towards the system until the release latch is seated into its respective slot.
The cover has an indentation where you can hold onto it more easily.
Figure 57. Removing the front cover
94 Power Systems: Serial-attached SCSI RAID enablement
3. Hold the front door at an approximate 120 degree angle to the system as shown in the following
figure. Engage the latches.
Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system
Use this procedure to remove the service access cover.
1. Loosen the two thumbscrews (A) located at the back of the cover.
2. Slide the cover toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 58. Installing the front cover
Figure 59. Installing the front door
Common procedures for installable features 95
Attention: When the service access cover is removed, the system powers down.
Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to remove the service access cover.
1. Loosen the thumbscrew (A) located at the back of the outside cover.
2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 60. Removing the service access cover
96 Power Systems: Serial-attached SCSI RAID enablement
3. Loosen the two thumbscrews (A) located at the back of the inside cover.
4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Attention: When the service access cover is removed, the system powers down.
Figure 61. Installing the outside service access cover
Figure 62. Removing the inside service access cover
Common procedures for installable features 97
Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system
Use this procedure to install the service access cover.
1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to install the service access cover.
1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper
chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Figure 63. Installing the service access cover
98 Power Systems: Serial-attached SCSI RAID enablement
5. Place the outside service access cover (A) on top of the system.
6. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of
the system chassis.
8. Tighten the thumbscrew to secure the service access cover.
Figure 64. Installing the inside service access cover
Common procedures for installable features 99
Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the service or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
Figure 65. Installing the outside service access cover
100 Power Systems: Serial-attached SCSI RAID enablement
4. While holding the system unit release latches (A) down on both the left and right sides, pull the
system unit (B) out from the rack until the rails are fully extended and locked.
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system into the operating position
Use this procedure to place the rack-mounted system into the operating position.
Figure 66. Placing the system in the service position
Common procedures for installable features 101
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near
the middle of the system.
2. Push the system unit (B) back into the rack until both system-unit release latches have locked into
position.
3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack.
Note: Replacing the two shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
4. Close the front rack door of the unit you are servicing.
Figure 67. Placing the system into the operating position
102 Power Systems: Serial-attached SCSI RAID enablement
Disconnecting the power cords from the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use these procedures to disconnect the power cords from the system.
1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the
system unit you are servicing in the rack.
2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit
you are servicing. This system might be equipped with two power supplies. Before continuing with
any remove and replace procedures that require the power off, ensure that both power sources to the
system have been completely disconnected.
Figure 68. Placing the system into the operating position
Common procedures for installable features 103
Figure 69. Removing the power cords for a rack-mounted system
Figure 70. Removing the power cords for a stand-alone system
104 Power Systems: Serial-attached SCSI RAID enablement
Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system
Use this procedure to connect the power cords to the system.
1. Identify the system unit you are servicing in the rack.
2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the
handles.
3. Close the back rack door.
Installing a feature by using the Hardware Management Console
You can use the Hardware Management Console (HMC) to perform many service actions, including the
installation of a new feature or part.
To use the HMC user interface to install a feature or part into a system or expansion unit that is managed
by an HMC, follow these steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system you will install the part in.
3. In the Tasks area expand Serviceability ? Hardware ? MES Tasks.
4. Select Add FRU (field replaceable unit).
5. In the Add/Install/Remove Hardware window select the system or enclosure into which you are
installing the feature.
6. Select the type of feature you are installing from the menu and click Next
7. Select the location code for where you will install the feature, and click Add.
8. After the FRU is placed in the pending actions category click Launch Procedure and follow the
instructions to install the feature.
Figure 71. Connecting the power cords
Common procedures for installable features 105
Note: The HMC might open external instructions for installing the feature. If so, follow those
instructions to install the feature.
Removing a part by using the Hardware Management Console
You can use the Hardware Management Console (HMC) to perform many service actions, including the
removal of a field replaceable unit (FRU) or part.
To use the HMC user interface to remove a part in a system or expansion unit that is managed by an
HMC, follow these steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system from which you are removing a part.
3. Select one of the following options.
v If you are installing a new part as part of an upgrade, in the Tasks area expand Serviceability ?
Hardware ? MES Tasks.
v If you are replacing a part as part of a service action, in the Tasks area expand Serviceability ?
Hardware ? Remove FRU.
4. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service
action, select Remove FRU.
5. In the Add/Install/Remove Hardware - Remove FRU, Select FRU Type window, select the system or
enclosure from which you are removing the part.
6. Select the type of part you are removing from the menu and click Next.
7. Select the location of the part you are removing and click Add.
8. After the part is listed in the Pending Actions display click Launch Procedure and follow the
instructions to remove the part.
Note: The HMC might open the information center instructions for removing the part. If so, follow
those instructions to remove the part.
Replacing a part by using the Hardware Management Console
You can use the Hardware Management Console (HMC) to perform many service actions, including
exchanging a field replaceable unit (FRU) or part.
If you are exchanging a part to repair a serviceable event follow those instructions. If you are exchanging
a part as a part of any other procedure using HMC Version 7, or later, complete the following steps:
1. In the navigation area, expand Systems Management ? Servers.
2. Select the managed system in which you are exchanging a part.
3. Select one of the following options.
v If you are installing a new part as part of an upgrade, in the Tasks area expand Serviceability ?
Hardware ? MES Tasks.
v If you are replacing a part as part of a service action, in the Tasks area expand Serviceability ?
Hardware ? Exchange FRU.
4. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service
action, select Exchange FRU.
5. In the Replace Hardware - Replace FRU, Select FRU Type window select the system or enclosure in
which you will exchange a part from the list.
6. Select the type of part you will exchange from the menu, and click Next.
7. Select the location code of the part you will exchange from the menu, and click Add.
8. After the FRU is placed in the pending actions category click Launch Procedure, and follow the
instructions to exchange the feature.
106 Power Systems: Serial-attached SCSI RAID enablement
Note: The HMC might open external instructions for replacing the part. If so, follow those
instructions to replace the part.
Verifying the installed part
You can verify a newly installed or replaced part on your system, logical partition, or expansion unit
using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC).
Verifying an installed feature or replaced part in an AIX system or
logical partition
If you installed feature or replaced a part, you might want to use the tools in the AIX operating system to
verify that the feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using AIX
v Verifying the replaced part using AIX
Verify the installed feature by using the AIX operating system:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in logical partitioning (LPAR) mode, note the
logical partition in which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using the AIX operating system:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you use either the AIX operating system or the online diagnostics service aid concurrent
(hot-swap) service to replace the part?
No: Go to step 2.
Yes: Go to step 5 on page 108.
2. Is the system powered off?
No: Go to step 4 on page 108.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent
system activity on the operator panel or display has stopped.
Common procedures for installable features 107
Did the AIX login prompt display?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose
adapter or cable connection. Review the procedures for the part that you replaced to ensure that
the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other
reference code information that you see. If the system does not start or you have no login prompt,
see: Problems with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4.
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If the Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to step 5.
5. Test the part by doing the following steps:
a. At the command line, type diag and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the resource list, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9 on page 109.
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with an indicator light for the failing part, this action changes the indicator light
to the normal state.
108 Power Systems: Serial-attached SCSI RAID enablement
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the resource list, select sysplanar0
and press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This action changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the resource list,
select sysplanar0 and press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9.
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11.
Yes: Go to step 12.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See the following for instructions: Changing service indicators.
Verifying the installed part in an IBM i system or logical partition
If you have installed a new feature or part, verify that the system recognizes the feature or part by using
the IBM i system service tools.
To verify the installed part, follow these steps:
1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator
light” on page 84.
2. Sign on with at least service level authority.
3. On the command line of the IBM i session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service
Focal Point Utilities to get to the Dedicated Service Tools (DST) display.
4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On
display and press Enter.
Common procedures for installable features 109
Note: The service tools password is case-sensitive.
5. Select Start a service tool from the System Service Tools (SST) display and press Enter.
6. Select Hardware service manager from the Start a Service Tool display and press Enter.
7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager
display and press Enter. This option allows you to display and work with logical resources. Logical
hardware resources are the functional resources of the system used by the operating system.
With the Logical Hardware Resources display, you can show logical hardware resource status or
information, and associated packaging hardware resources. Use the online Help information to better
understand specific functions, fields, or symbols.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
Verifying the installed part in a Linux system or logical partition
If you have installed a new part, learn how to verify that the system recognizes the part.
To verify the newly installed or replaced part, continue with “Verifying an installed part by using
stand-alone diagnostics” on page 111.
110 Power Systems: Serial-attached SCSI RAID enablement
Verifying an installed part by using stand-alone diagnostics
If you have installed or replaced a part, verify that the system recognizes the new part. You can use
stand-alone diagnostics to verify an installed part in an AIX or Linux system, expansion unit, or logical
partition.
v If this server is directly attached to another server or attached to a network, ensure communications
with the other servers have stopped.
v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be
running on the logical partition.
v The stand-alone diagnostics require access to the system console.
You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM)
server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running
diagnostics from the Network Installation Management (NIM) server, see Running stand-alone
diagnostics from a Network Installation Management server.
To use stand-alone diagnostics, follow these steps:
1. Stop all jobs and applications and then stop the operating system on the system or logical partition.
2. Remove all tapes, diskettes, and CD-ROM.
3. Turn off the system unit power. The next step boots the server or logical partition from the
stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the
server or logical partition on which you are working, follow these steps:
a. Access the ASMI. See Accessing the ASMI for information on using the ASMI.
b. On the ASMI main menu, click on Power/Restart Control.
c. Click Power On/Off System.
d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition
mode boot drop-down menu.
e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone
diagnostic CD-ROM.
f. Go to step 5.
4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical
drive.
5. After the keyboard POST indicator displays on the system console and before the last POST
indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service
mode boot should be initiated using the default-service mode boot list.
6. Enter any requested password.
7. At the Diagnostic Operating Instructions display, press Enter.
Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection.
Note: If you received an SRN or any other reference code when you attempted to start the system,
contact your service provider for assistance..
8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection
menu to initialize the operating system.
9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
10. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part
you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics
for the individual part and press Enter.
12. Did the Testing Complete, No trouble was found message appear?
Common procedures for installable features 111
v No: There is still a problem. Contact your service provider.
v Yes: Go to step 13.
13. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the value they had prior to servicing the system.
14. If the indicator lights are still on, follow these steps:
a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system
attention and indicator lights and press Enter.
b. Select Set System Attention Indicator to NORMAL and press Enter.
c. Select Set All Identify Indicators to NORMAL and press Enter.
d. Choose Commit.
Note: This changes the system attention and identify indicators from the Fault state to the Normal
state.
e. Exit to the command line.
Verifying the installed part by using Hardware Management Console
If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your
HMC records after you have completed a service action on your server. If you have reference codes,
symptoms, or location codes that you used during the service action, locate the records for use during
this procedure.
To verify the installed part, complete these steps:
1. At the HMC, examine the service action event log for any open service action events. See “Viewing
serviceable events by using the HMC” on page 114 for details.
2. Are there any service action events that are open?
No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and
deactivating LEDs by using the HMC” on page 113. This ends the procedure.
Yes: Continue with the next step.
3. Record the list of open service action events.
4. Examine the details of the open service action event. Is the error code associated with this service
action event the same as you gathered earlier.
v No: Select one of the following options:
– Review the other serviceable events, find one that does match, and continue with the next step.
– If the log does not match what you had gathered earlier, contact your service provider.
v Yes: Continue with the next step.
5. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
6. Click Close Event.
7. Add comments for the serviceable event. Include any unique additional information. Click OK.
8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event?
v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the
service action event.
v Yes: Perform the following steps:
a. From the FRU list, select a FRU that you need to update.
b. Double-click the FRU and update the FRU information.
c. Click OK to close the service action event.
9. If you continue to have problems, contact your service provider.
112 Power Systems: Serial-attached SCSI RAID enablement
Activating and deactivating LEDs by using the HMC
Use this procedure to activate or deactivate LEDs using Service Focal Point for the Hardware
Management Console (HMC).
Choose from the following:
v “Deactivating a system attention LED or partition LED”
v “Activating or deactivating an identify LED”
Deactivating a system attention LED or partition LED:
You can deactivate a system attention LED or a logical partition LED. For example, you might determine
that a problem is not a high priority and decide to repair the problem at a later time. However, you want
to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be
activated again if another problem occurs.
1. In the navigation area, open Systems Management.
2. Open Servers and select the appropriate system.
3. In the content area, check the box for the appropriate Partition.
4. Select Tasks, then Operations, and then Manage Attention LED.
5. Select the appropriate Partition.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
Activating or deactivating an identify LED:
The system provides several LEDs that help identify various components, such as enclosures or field
replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. In the navigation area, open Systems Management.
2. Select Servers.
3. In the content area, check the box for the appropriate System.
4. Select Tasks, then Operations, then LED Status, and then Identify LED.
Common procedures for installable features 113
5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected
? List FRUs.
7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Viewing serviceable events by using the HMC
Use this procedure to view a serviceable event, including details, comments, and service history.
To view serviceable events and other information about the events, you must be a member of one of the
following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
v Viewer
To view serviceable events, follow these steps:
1. In the navigation area, select Service Management.
2. Select Manage Serviceable Events.
3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable
Event Overview window opens. The list shows all serviceable events that match your selection
criteria. You can use the menu options to perform actions on the serviceable events.
4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The
Serviceable Event Details window opens, showing detailed information about the serviceable event.
The upper table shows information, such as problem number and reference code. The lower table
shows the field replaceable units (FRUs) associated with this event.
5. Select the error for which you want to view comments and history, and follow these steps:
a. Select Actions ? View Comments.
b. When you are finished viewing the comments, click Close.
c. Select Actions ? View Service History. The Service History window opens, showing service
history associated with the selected error.
d. When you are finished viewing the service history, click Close.
6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the
Serviceable Event Overview window.
Verifying the installed part by using Systems Director Management
Console
If you installed or replaced a part, use the IBM Systems Director Management Console (SDMC) to update
your SDMC records after you have completed a service action on your server. If you have reference
codes, symptoms, or location codes that you used during the service action, locate the records for use
during this procedure.
To verify the installed part, complete these steps:
1. From the SDMC, examine the service action event log for any open service action events. See
“Viewing serviceable events by using the Systems Director Management Console” on page 116 for
details.
2. Are there any service action events that are open?
114 Power Systems: Serial-attached SCSI RAID enablement
No: If the system attention LED is still on, use the SDMC to turn off the LED. See “Activating and
deactivating LEDs by using the Systems Director Management Console.” This ends the procedure.
Yes: Continue with the next step.
3. Record the list of open service action events.
4. Examine the details of the open service action event. Is the error code associated with this service
action event the same as you gathered earlier.
v No: Select one of the following options:
– Review the other serviceable events, find one that does match, and continue with the next step.
– If the log does not match what you had gathered earlier, contact your service provider.
v Yes: Continue with the next step.
5. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
6. Click Delete or Ignore.
Note: These options are only available from the problem event log.
Activating and deactivating LEDs by using the Systems Director Management
Console
Use this procedure to activate or deactivate LEDs by using the IBM Systems Director Management
Console (SDMC).
Choose from the following:
v “Deactivating a system attention LED or partition LED”
v “Activating or deactivating an identify LED by using the SDMC”
Deactivating a system attention LED or partition LED:
You can deactivate a system attention LED or a logical partition LED. For example, you might determine
that a problem is not a high priority and decide to repair the problem at a later time. However, you want
to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be
activated again if another problem occurs.
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? Service and Support ? Hardware ? System Attention LED.
3. Select Deactivate System Attention LED. A confirmation window is displayed that provides the
following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
4. Select one of the virtual servers, and select Deactivate System Attention LED. A confirmation
window is displayed that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the virtual server LED.
Activating or deactivating an identify LED by using the SDMC:
The system provides several LEDs that help identify various components, such as enclosures or field
replaceable units (FRUs). For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
Common procedures for installable features 115
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? Service and Support ? Hardware ? Identify LED.
3. In the Identify LED, Select Enclosure window, select the system unit or enclosure.
4. To activate or deactivate an identify LED, click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select a system or enclosure from the table, and
then select List FRUs....
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Viewing serviceable events by using the Systems Director Management Console
Use this procedure to view a serviceable event, including details, comments, and service history.
To view serviceable events, follow these steps:
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? System Status and Health ? Event Log.
3. Optional: You can narrow the event criteria using the Event filter menu.
4. Select a line in the Events window, and select Actions ? Properties The Properties window opens,
showing detailed information about the serviceable event. The table shows information, such as
problem number, reference code, and the field replaceable units (FRUs) associated with this event.
Verifying an installed feature or replaced part on a system or logical
partition by using Virtual I/O Server tools
If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS)
to verify that the feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using VIOS
v Verifying the replaced part using VIOS
Verify the installed feature by using VIOS:
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
116 Power Systems: Serial-attached SCSI RAID enablement
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in LPAR mode, note the logical partition in
which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using VIOS:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent
(hot-swap) service operation?
No: Go to step 2.
Yes: Go to step 5.
2. Is the system powered off?
No: Go to step 4.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
3. Start the system and wait until the VIOS operating system login prompt displays or until apparent
system activity on the operator panel or display has stopped.
Did the VIOS login prompt display?
v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection.
Review the procedures for the part that you replaced to ensure that the new part is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system does not start or you have no login prompt, see: Problems
with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If a Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to 5.
5. Test the part by doing the following:
a. At the command line, type diagmenu and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
Common procedures for installable features 117
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced, and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the Resource List, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with a indicator light for the failing part, this changes the indicator light to the
normal state.
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the Resource List, select sysplanar0.
Press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the Resource
List, select sysplanar0. Press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9.
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11.
Yes: Go to step 12 on page 119.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
118 Power Systems: Serial-attached SCSI RAID enablement
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators
Verifying a repair
Use these procedures to verify hardware operation after making repairs to the system.
Choose from the following options:
v To verify the repair of a system that is currently powered off, go to step 1.
v To verify the repair of a system that is currently powered on with no operating system loaded, go to
step 3.
v To verify the repair of a system that is currently powered on and that has an operating system loaded,
go to step 5.
1. Power on the server and all attached I/O enclosures.
Did all the enclosures power on?
Yes: Go to step 3.? Go to step 3.
No: Continue with the next step.
2. Choose from the following options:
v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate
and replace the next field-replaceable unit (FRU).
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
3. Load the operating system.
Did the operating system load successfully?
Yes: Go to step 5.
No: Continue with the next step.
4. Choose from the following options:
v If the original problem was a failing disk drive that contained the operating system software, go to step 5.
v If the original problem was that the operating system would not load and you have another FRU to replace, go to
your FRU locations section to locate the next FRU.
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that the operating system would not load and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that the operating system would not load and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
5. Choose from the following options:
v “Verifying the repair in AIX” on page 120
v “Verifying the repair in Linux” on page 125
Common procedures for installable features 119
v “Verifying a repair using an IBM i system or logical partition” on page 123
Verifying the repair in AIX
You can use this procedure to verify that a repair is complete using the AIX operating system.
Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed.
1. Did you replace a disk drive in the root volume group?
No Go to step 3.
Yes Continue with the next step.
2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM)
server.
Did you encounter any problems?
No Reinstall the operating system and continue with step 5.
Yes If the original problem still exists, replace the
field-replaceable unit (FRU) or perform the isolation
procedure that is next in the FRU list. If you have reached
the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
3. Did you replace a FRU with the power turned on and concurrently with system operations?
No Go to step 5.
Yes Continue with the next step.
4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
Yes Go to step 6.
No Go to step 7 on page 121.
Note: The AIX diagnostic service aid was
used if a resource was removed using the Hot
Plug task.
5.
Note: If any FRUs have been removed that should be reinstalled, reinstall them now.
Perform the following steps:
1. If the system is not powered on, power it on now.
2. Perform a slow boot.
3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or
display apparently has stopped.
4. Did you encounter any problems?
No Continue at step 6.
Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU
list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis.
6. If the Resource Repair Action menu is already displayed, go to step 9 on page 121; otherwise,
perform the following steps:
120 Power Systems: Serial-attached SCSI RAID enablement
1. Log into the operating system either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an SRN displays, suspect a
loose card or connection. If no instructions display, no resources
were detected as missing. Continue with the next step.
7. Perform the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select System verification.
5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged,
and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual
FRU.
Did the Resource Repair Action menu (801015) display?
No Continue with the next step.
Yes Go to step 9.
8. Did the Testing Complete, no trouble was found menu (801010) display?
Yes Use the Log Repair Action option, if not previously logged,
in the TASK SELECTION menu to update the AIX error log.
If the repair action was reseating a cable or adapter, select
the resource associated with that repair action.
If the resource associated with your action is not displayed
on the resource list, select sysplanar0.
Note: If the system attention indicator is on, this action will
set it back to the normal state.
Go to step 11 on page 122.
No If the original problem still exists, replace the FRU or
perform the isolation procedure that is next in the FRU list.
If you have reached the end of the FRU list, contact your
next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on that resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
Common procedures for installable features 121
1. Select the resource that has been replaced from the Resource
Repair Action menu. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If
the resource associated with your action is not displayed on the
resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
No If the No Trouble Found menu displays, go to step 11.
Yes Continue with the next step.
10. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on the resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
1. From the RESOURCE REPAIR ACTION menu, select the parent
or child of the resource that has been replaced. If the repair action
was reseating a cable or adapter, select the resource associated
with that repair action. If the resource associated with your action
is not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
3. If the No Trouble Found menu displays, continue with the next
step.
11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter
cache card or changing the configuration?
Note: This information does not apply to the PCI-X RAID adapter or cache.
No Go to the close of call procedure.
Yes Continue with the next step.
12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the
following steps:
122 Power Systems: Serial-attached SCSI RAID enablement
1. On the PCI SCSI Disk Array Manager display, select Recovery
options.
2. If a previous configuration exists on the replacement adapter, this
must be cleared. Select Clear PCI SCSI Adapter Configuration
and press F3.
3. On the Recovery Options screen, select Resolve PCI SCSI RAID
Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration screen,
select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the
adapter that you changed.
6. On the next screen, press Enter.
7. When you see the Are You Sure selection menu, press Enter to
continue.
8. If you see a Failed status message, verify that you selected the
correct adapter, and then repeat this procedure. When the
recovery is complete, exit the operating system.
9. Go to the close of call procedure.
Verifying a repair using an IBM i system or logical partition
Use this procedure to verify a repair using the IBM i operating system.
1. Was the system powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 3.
2. Perform the following tasks:
a. Verify that the power cable is plugged into the power outlet.
b. Verify that power is available at the customer's power outlet.
3. Was the partition powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 6.
4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type
mode and speed options in the Service functions).
5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the
system complete the IPL?
Yes: Continue with the next step.
No: This might be a new problem. Go to the Start of call procedure. This ends the procedure.
6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O
adapter, or storage device replaced?
Yes: Continue with step 10.
No: Continue with the next step.
7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look
for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any
reference codes that are related to this IPL?
Yes: Continue with the next step.
No: If the problem was related to removable media or communications, perform the verification
procedures in the Service functions to verify that the problem is corrected. Then return the
system to the customer and have the customer verify the system date and time. This ends the
procedure.
Common procedures for installable features 123
8. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure. This ends the
procedure.
9. Are there any other failing items that remain to be replaced?
Yes: Replace the next failing item listed for this reference code. This ends the procedure.
No: Contact your next level of support for assistance. This ends the procedure.
10. Was concurrent maintenance performed on an optical storage unit?
Yes: The product activity log and service action log, in most cases, contain a reference code for
the optical storage unit when concurrent maintenance is performed. You can ignore this reference
code. Perform the following:
- Perform the verification procedures in the Service functions topic to verify that the problem is
corrected.
- Return the system to the customer and have the customer verify the system date and time. This
ends the procedure.
No: Continue with the next step.
11. Use the service action log to look for any new reference codes (see Using the Service Action Log).
Are there any new reference codes?
Yes: Continue with the next step.
No: Go to step 14.
12. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure to determine the
cause of the problem. This ends the procedure.
13. Are there any other failing items that need to be replaced?
Yes: Replace the next failing item listed for the reference code. This ends the procedure..
No: Contact your next level of support for assistance. This ends the procedure.
14. Are you working with a tape device?
Yes: Perform the verification procedures in the Service functions to verify that the problem is
corrected. After the verification test is complete, the tape device description will be set to the
failed state because a resource change was detected. Perform the following tasks:
- Vary the tape device description off and then on.
- Return the system to the customer and have the customer verify the system date and time.
Then go to Verifying the repair from the HMC. This ends the procedure.
No: Continue with the next step.
15. Are you working with an IOP or an IOA?
Yes: Use the display hardware configuration service function to check for any missing or failed
hardware:
- On the command line, enter the STRSST (Start System Service Tools command). If you cannot get
to SST, select DST. Do not IPL the system or partition to get to DST.
- On the Start Service Tools Sign On display, enter the user ID with the service authority and
password.
- Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System
bus resources.
- Select the function key for Include nonreporting resources.
- If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has
not been fixed. Continue to the next failing item in the failing item list. This ends the procedure.
124 Power Systems: Serial-attached SCSI RAID enablement
No: Perform the verification procedures in the Service functions topics to verify that the problem
is corrected. Resources that usually vary on automatically during an IPL, or that were previously
varied on manually, might need to be varied on again after the verification procedures are
complete. Return the system to the customer and have the customer verify the system date and
time. This ends the procedure.
Verifying the repair in Linux
You can use this procedure to verify that a repair is complete using the Linux operating system.
1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM)
server. See Running the stand-alone diagnostics from CD-ROM.
Did you encounter any problems?
No Reboot the operating system and continue with the close of call procedure.
Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure
that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis and repair the new problem.
Verifying the repair from the management console
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer using the management console.
Follow this checklist before performing the procedures:
v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Server verification has been performed and there are no problems that require additional service
actions.
v If the repair was done using the HMC online repair procedures, ensure that the original serviceable
event is now closed.
1. Is a management console used to manage the server that you are servicing?
v Yes: Continue with the next step.
v No: Return to the “Verifying a repair” on page 119. This ends the procedure.
2. Are you closing a service event that was a repair on the management console personal computer?
v Yes: Continue with the next step.
v No: Go to step 4.
3. Power on the management console. Did the power-on process complete without errors?
v Yes: Ensure that the management console can be used to perform server management tasks, and return the
management console to normal operations. Go to “Closing a service call” on page 126. This ends the procedure.
v No: Go to the HMC isolation procedures. This ends the procedure.
4. Log into the management console as the service representative. If invalid user or invalid password
displays, get the correct login information from the system administer.
Common procedures for installable features 125
1. If logged on the System Manager, select Exit from the Console, located on the System Manager window.
2. Login to the System Manager with the following: User identification service Password service mode
5. View Serviceable Event Details.
1. In the navigation area, select the Service Applications icon.
2. In the navigation area, select the Service Focal Point icon.
3. In the Contents area, select Manage Serviceable Events.
4. Designate the set of Serviceable Events you want to view. When
you are finished, click OK. The Service Event Overview window
opens.
Note: Only the events that match all of the
criteria you specify are shown.
6. Close open or delayed events.
1. Select the problem to close, on the Service Event Overview window.
2. Select the menu Selected, located on the menu bar.
3. Click Close Event.
4. Enter your comments in the Serviceable Event Comments window, and click Close Event.
5. Close all events associated with the problem on which you were working.
7. Did the Service Event Overview window contain the event or events on which you were working?
v Yes: Return the HMC to normal operations. Go to “Closing a service call.” This ends the procedure.
v No: Go to Detecting problems. This ends the procedure.
Closing a service call
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other serviceable
events might have been opened. Close all serviceable events that were opened as a result of your
service activity.
v Ensure that server verification has been performed and no problems require additional service actions.
v If the repair was done using the management console online repair procedures, ensure that the original
serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference. Is the server managed by a management console?
v Yes: Continue with the next step.
v No: Do one of the following steps:
– If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using
Integrated Virtualization Manager” on page 135.
– If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call
using AIX or Linux” on page 131.
126 Power Systems: Serial-attached SCSI RAID enablement
2. On the HMC, open Manage Serviceable Events and examine the service action event log for any
open service action events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 139. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32
on page 129 for each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code that is associated with this service action event the same as recorded in Step 1 on page 126?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 126?
v Yes: Go to step 11.
v No: Continue with the next step.
10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 126 in the list of
FRUs for this service action event?
v Yes: Continue with the next step.
v No: Go to step 32 on page 129.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
Common procedures for installable features 127
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 127.
Go to step 16.
15. Add the partition list obtained in step 11 on page 127 to the existing list of Axx partitions obtained
from processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11 on page 127. If you are referred
to the list of partitions obtained in step 11 on page 127 in future steps, the list is empty. Go to step
17.
17. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK. The
following steps will add or update FRU information.
20. Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU
information. Go to step 23.
22. Select the No FRU Replaced for this Serviceable Event option.
23. Click OK to close the service action event.
128 Power Systems: Serial-attached SCSI RAID enablement
24. Is the list of all partitions you recorded in step 11 on page 127 empty?
v Yes: Go to step 32.
v No: Continue with the next step.
25. Does the list of all partitions you recorded in step 11 on page 127 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 32.
26. Is the error class recorded in step 25 AIX?
v Yes: Continue with the next step.
v No: Go to step 32.
27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
127, except the partition you were using to debug the original problem.
28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the
AIX command prompt.
29. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
30. Exit from diagnostics in this partition and return to the AIX command prompt.
31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 127 been
processed?
v Yes: Continue with the next step.
v No: Go to step 24 to process the next partition in the list you recorded in step 11 on page 127.
32. Have all the serviceable events recorded in step 4 on page 127 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 127 and process the next service action event in the list of serviceable events recorded in
step 4 on page 127.
Common procedures for installable features 129
33. While processing all service action events, were you directed to step 14 on page 128?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 139. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
34. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 128, except the partition you were using to debug the original problem.
35. From the management console virtual terminal window of a partition in the list of Axx partitions,
type diag at the AIX command prompt.
36. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
37. Exit from diagnostics in this partition and return to the AIX command prompt.
38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 128 been
processed?
v Yes: Continue with the next step.
v No: Go to step 34 to process the next partition in the list you recorded in step 14 on page 128.
39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 139. This completes the repair. Return the system to the customer.
Note: If during the processing of the list of open service action events, some service action events
remained open, further service actions might be required to complete the repair.
130 Power Systems: Serial-attached SCSI RAID enablement
Closing a service call using AIX or Linux
If the server is not connected to an management console and not using Integrated Virtualization Manager
(IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the
server to return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and that no problems require additional service
actions.
v If the repair was done using the IVM online repair procedures, ensure that the original serviceable
event is now closed.
1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
v Yes: Go to step 4
v No: Continue with the next step.
2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that
were removed during problem analysis that you want to put back into the system?
Note: If the system planar or battery has been replaced and you are loading diagnostics from a
server over a network, it might be necessary for the customer to set the network boot information
for this system before diagnostics can be loaded. The system time and date information should also
be set when the repair is completed.
v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3
v No: Continue with the next step.
3. Is the system or logical partition that you are performing a repair action on running the AIX
operating system?
v Yes: Continue with the next step.
v No: Go to step 5.
4. Does the system or logical partition you are performing a repair action on have AIX installed?
Note: Answer no to this question if you have just replaced a hard disk in the root volume group.
v Yes: Go to step 7 on page 132.
v No: Continue with the next step.
5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a
Network Installation Management (NIM) server.
Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to
Running the stand-alone diagnostics from CD on a server without an HMC attached.
Common procedures for installable features 131
For instructions on running stand-alone diagnostics from a NIM server, go to Running the
stand-alone diagnostics from a Network Installation Management server.
Did you encounter any problems?
v Yes: Go to problem analysis.
v No: Continue with the next step.
6. The system hardware is functioning correctly.
If the system attention LED is still on, turn off the LED as described
in “Activating and deactivating LEDs” on page 139.
This completes the repair.
Note: If, during the processing of the list of open service action
events, some service action events remained open, further service
actions might be required to complete the repair.
Return the server to the state that the customer normally uses, such
as IPL type, IPL mode, and the way the system is configured or
partitioned. This might require you to reboot the operating system.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
7. Complete the following steps:
1. If the system supports slow boot (see Performing a slow boot), do
a slow boot on the system. If the system does not support slow
boot, do a normal boot.
2. Power on the system.
3. Wait until the AIX operating system login prompt displays or
until system activity on the operator panel or display apparently
has stopped.
Did the AIX Login Prompt display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
8. If the Resource Repair Action menu is already displayed, go to 12 on page 133; otherwise, do the
following steps:
1. Log into the operating system, either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an system request number
(SRN) displays, suspect a loose card or connection. If no
instructions display, no resources were detected as missing.
Continue with 9.
9. Complete the following steps:
132 Power Systems: Serial-attached SCSI RAID enablement
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select
Problem determination.
5. When the Advanced Diagnostic Selection menu displays, select
the All Resources option, or test the FRUs you exchanged, and
any devices that are attached to the FRUs you exchanged, by
selecting the diagnostics for the individual FRU.
Did the Resource Repair Action menu (801015) display?
v Yes: Go to step 13.
v No: Continue with the next step.
10. Did the TESTING COMPLETE, no trouble was found menu (801010) display?
v Yes: Continue with the next step.
v No: There is still a problem. Go to problem analysis.
11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to
update the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action.
If the resource associated with your action is not displayed on the
resource list, select sysplanar0.
Note: If the system attention indicator is on, this will set it back to
the normal state. Go to step 14 on page 134.
12. When a test is run on a resource in system verification mode, and that resource has an entry in the
AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This
updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Do the following steps:
1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was
reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with
your action is not displayed on the resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
v Yes: Continue with the next step.
v No: If the No Trouble Found menu displays, go to step 14 on page 134.
13. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
Common procedures for installable features 133
When a test is run on a resource in system verification mode, and
that resource has an entry in the AIX error log, if the test on the
resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU
from the Resource Repair Action menu. This updates the AIX error
log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it
back to the normal state.
Do the following steps:
1. From the Resource Repair Action menu, select the parent or child
of the resource that has been replaced. If the repair action was
reseating a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action is
not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
Did the No Trouble Found menu display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing
the configuration?
Note: This does not refer to the PCI-X RAID adapter or cache.
v Yes: Continue with the next step.
v No: Go to step 16.
15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following
steps:
1. On the PCI SCSI Disk Array Manager dialog, select Recovery options.
2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI
Adapter Configuration and press F3.
3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed.
6. On the next dialog, press Enter.
7. When you see the Are You Sure selection menu, press Enter to continue.
8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that
you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating
system.
9. Go to 16.
16. The system hardware is functioning correctly. Return the server to the state that the customer
normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned.
134 Power Systems: Serial-attached SCSI RAID enablement
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
Closing a service call using Integrated Virtualization Manager
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and there are no problems that require additional
service actions.
v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures,
ensure that the original serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference.
2. On the IVM, open Manage Serviceable Events and look at existing serviceable events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 139. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 138 for
each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code associated with this service action event the same as
recorded in step 1?
Common procedures for installable features 135
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 135?
v Yes: Go to step 11.
v No: Continue with the next step.
10. Is the FRU you replaced and recorded in step 1 on page 135 in the list of FRUs for this service action
event?
v Yes: Continue with the next step.
v No: Go to step 30 on page 138.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16.
15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from
processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of
partitions obtained in step 11 in future steps, the list is empty. Go to step 17.
17. Select and highlight the service action event from the Manage Serviceable Events window.
136 Power Systems: Serial-attached SCSI RAID enablement
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK.
20. Add or update FRU information:
Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. Click OK to close the service action event.
22. Is the list of all partitions you recorded in step 11 on page 136 empty?
v Yes: Go to step 30 on page 138.
v No: Continue with the next step.
23. Does the list of all partitions you recorded in step 11 on page 136 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 30 on page 138.
24. Is the error class recorded in step 23?
v Yes: Continue with the next step.
v No: Go to step 30 on page 138.
25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
136, except the partition you were using to debug the original problem.
26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX
command prompt.
27. When the diagnostic operating instructions are displayed, do the following steps:
Common procedures for installable features 137
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
28. Exit from diagnostics in this partition and return to the AIX command prompt.
29. Have all the partitions in the list of all partitions you recorded in step 11 on page 136 been
processed?
v Yes: Continue with the next step.
v No: Go to step 25 on page 137 to process the next partition in the list you recorded in step 11 on page 136.
30. Have all the serviceable events recorded in step 4 on page 135 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 135 and process the next service action event in the list of serviceable events recorded in
step 4 on page 135.
31. While processing all service action events, were you directed to step 14 on page 136?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 139. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
32. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 136, except the partition you were using to debug the original problem.
33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the
AIX command prompt.
34. When the diagnostic operating instructions are displayed, do the following steps:
138 Power Systems: Serial-attached SCSI RAID enablement
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
35. Exit from diagnostics in this partition and return to the AIX command prompt.
36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 136 been
processed?
v Yes: Continue with the next step.
v No: Go to step 32 on page 138 to process the next partition in the list you recorded in step 14 on page 136.
37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs.” This completes the repair. Return the system to the customer.
Note: If, during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
Activating and deactivating LEDs
You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the
management console or the Advanced System Management Interface (ASMI).
For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or
verify a part that you are servicing. The error and identify function (amber) LED indicates an error and
corresponds to the location code in the system reference code (SRC). The LED is activated and
deactivated automatically.
Additionally, the following procedures can also be used to activate and deactivate LEDs.
v “Deactivating a system attention LED or partition LED using the management console”
v “Activating or deactivating an identify LED using the management console” on page 140
v “Deactivating a system attention LED or logical partition LED using the Advanced System
Management Interface” on page 141
v “Activating or deactivating an identify LED using the Advanced System Management Interface” on
page 141
Deactivating a system attention LED or partition LED using the
management console
You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not
a high priority and you decide to repair the problem at a later time. You can do this task from the
Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
Common procedures for installable features 139
If you want to be alerted if another problem occurs, you must deactivate the system attention LED so
that it can be activated again if another problem occurs.
To deactivate a system attention LED by using HMC, complete the following steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on by checking the box next to its name.
3. Open Operations.
4. Open LED Status.
5. Select View System Attention. The system attention LED window opens. The selected system and its
LED state are displayed in the upper part of the window. The logical partition and its LED state are
displayed in the lower part of the window. From the system attention LED window, you can
deactivate both the system attention LED and the logical partition LED.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
To deactivate a system attention LED by using SDMC, complete the following steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> System Attention LED.
3. Click OK.
Activating or deactivating an identify LED using the management
console
You can activate or deactivate an identify LED for components attached to the system from the Hardware
Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
The system provides several LEDs that help identify various components, such as enclosures or
field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on.
140 Power Systems: Serial-attached SCSI RAID enablement
3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens.
4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select
Selected ? List FRUs.
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these
steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> Identify LED.
3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click
List FRUs.
5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Deactivating a system attention LED or logical partition LED using the
Advanced System Management Interface
You can deactivate a system attention LED or a logical partition LED using the Advanced System
Management Interface (ASMI).
The system attention indicator provides a visual signal that the system as a whole requires attention or
service. Each system has a single system attention indicator. When an event occurs that either needs your
intervention or that of service and support, the system attention indicator lights continuously. The system
attention indicator is turned on when an entry is made in the service processor error log. The error entry
is transmitted to the system error log and the operating system error log.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To turn off the system attention indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention
Indicator.
3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error
message is displayed.
Activating or deactivating an identify LED using the Advanced System
Management Interface
You can activate or deactivate an identify LED using the Advanced System Management Interface
(ASMI).
You can specify the location code of any indicator to view or modify its current state. If you provide the
wrong location code, the advanced system manager attempts to go to the next higher level of the location
code.
Common procedures for installable features 141
The next level is the base-level location code for that field replaceable unit (FRU). For example, a user
types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If
the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt
to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no
other level is available.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To change the current state of an indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location
code.
3. In the right pane, enter the location code of the FRU and click Continue.
4. Select the preferred state from the list.
5. Click Save settings.
Installing a part by using the Systems Director Management Console
You can use the IBM Systems Director Management Console (SDMC) to perform many service actions,
including the installation of a new field-replaceable unit (FRU) or part.
To use the SDMC user interface to install a part into a system or expansion unit that is managed by an
SDMC, follow these steps:
1. In the Power Systems Resources area, select the system on which you want to install a part.
2. From the Actions menu, expand Service and Support ? Hardware ? MES Tasks ? Add FRU.
3. On the ADD FRU tab, select the system or enclosure type.
4. Select the FRU type you are installing from the menu and click Next
5. Select the location code for where you want to install the part, and click Add.
6. After the FRU is placed in the pending actions category, click Launch Procedure and follow the
instructions to install the part.
Note: The SDMC might open external instructions for installing the feature. If so, follow those
instructions to install the part.
Replacing a part by using the Systems Director Management Console
You can use the IBM Systems Director Management Console (SDMC) to perform many service actions,
including exchanging a field replaceable unit (FRU) or part.
If you are exchanging a part to repair a serviceable event follow those instructions.
1. In the Power Systems Resource area, select the managed system in which you are exchanging a part.
2. Select one of the following options:
v If you are replacing a part that is not part of a serviceable event, from the Actions menu, expand
Service and Support ? Hardware ? Exchange FRU.
v If you are exchanging a part to repair a serviceable event, see Starting a repair action.
3. To replace a part as part of a service action, select Exchange FRU.
4. From the Exchange FRU tab, select the system or expansion unit in which you want to exchange a
part.
5. Select the type of part you want to exchange from the menu and click Next.
142 Power Systems: Serial-attached SCSI RAID enablement
6. Select the location code of the part you want to exchange from the menu and click Add.
7. After the FRU is placed in the Pending actions window, click Launch Procedure and follow the
instructions to exchange the part.
Note: The SDMC might open external instructions for replacing the part. If so, follow those
instructions to replace the part.
Removing a part by using the Systems Director Management Console
You can use the IBM Systems Director Management Console (SDMC) to perform many service actions,
including the removal of a field replaceable unit (FRU) or part.
To use the SDMC user interface to remove a part in a system or expansion unit that is managed by an
SDMC, follow these steps:
1. In the Power Systems Resources area, select the managed system from which you are removing a
part.
2. Select one of the following options:
v If you are installing a new part as part of an upgrade, from the Actions menu, expand Service and
Support ? Hardware ? MES Tasks.
v If you are removing a part as part of a service action, from the Actions menu, expand Service and
Support ? Hardware ? Exchange FRU.
3. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service
action, select Exchange FRU.
4. On the Add FRU tab or the Exchange FRU tab, select the system or expansion unit from which you
are removing the part.
5. Select the type of part you are removing from the menu and click Next.
6. Select the location of the part you are removing and click Add.
7. After the part is listed in the Pending Actions window, click Launch Procedure and follow the
instructions to remove the part.
Note: The SDMC might open external instructions for removing the part. If so, follow those
instructions to remove the part.
Common procedures for installable features 143
144 Power Systems: Serial-attached SCSI RAID enablement
Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer's representative for information on the products and services
currently available in your area. Any reference to the manufacturer's product, program, or service is not
intended to state or imply that only that product, program, or service may be used. Any functionally
equivalent product, program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not grant you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
The manufacturer may make improvements and/or changes in the product(s) and/or the program(s)
described in this publication at any time without notice.
Any references in this information to websites not owned by the manufacturer are provided for
convenience only and do not in any manner serve as an endorsement of those websites. The materials at
those websites are not part of the materials for this product and use of those websites is at your own risk.
The manufacturer may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Any performance data contained herein was determined in a controlled environment. Therefore, the
results obtained in other operating environments may vary significantly. Some measurements may have
been made on development-level systems and there is no guarantee that these measurements will be the
same on generally available systems. Furthermore, some measurements may have been estimated through
extrapolation. Actual results may vary. Users of this document should verify the applicable data for their
specific environment.
Information concerning products not produced by this manufacturer was obtained from the suppliers of
those products, their published announcements or other publicly available sources. This manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products not produced by this manufacturer. Questions on the capabilities of products
not produced by this manufacturer should be addressed to the suppliers of those products.
All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal
without notice, and represent goals and objectives only.
The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are
subject to change without notice. Dealer prices may vary.
© Copyright IBM Corp. 2010, 2011 145
This information is for planning purposes only. The information herein is subject to change before the
products described become available.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
If you are viewing this information in softcopy, the photographs and color illustrations may not appear.
The drawings and specifications contained herein shall not be reproduced in whole or in part without the
written permission of the manufacturer.
The manufacturer has prepared this information for use with the specific machines indicated. The
manufacturer makes no representations that it is suitable for any other purpose.
The manufacturer's computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience
unplanned outages, system failures, power fluctuations or outages, or component failures must verify the
accuracy of operations performed and data saved or transmitted by the system at or near the time of the
outage or failure. In addition, users must establish procedures to ensure that there is independent data
verification before relying on such data in sensitive or critical operations. Users should periodically check
the manufacturer's support websites for updated information and fixes applicable to the system and
related software.
Ethernet connection usage restriction
This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces
of public telecommunications networks.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Other product and service names might be trademarks of IBM or other companies.
Electronic emission notices
When attaching a monitor to the equipment, you must use the designated monitor cable and any
interference suppression devices supplied with the monitor.
Class A Notices
The following Class A statements apply to the IBM servers that contain the POWER7 processor and its
features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
Federal Communications Commission (FCC) statement
Note: This equipment has been tested and found to comply with the limits for a Class A digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference when the equipment is operated in a commercial environment. This equipment
146 Power Systems: Serial-attached SCSI RAID enablement
generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, may cause harmful interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the user will be
required to correct the interference at his own expense.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. IBM is not responsible for any radio or television interference caused by using other than
recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class A digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for
commercial and industrial environments to provide reasonable protection against interference with
licensed communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Warning: This is a Class A product. In a domestic environment, this product may cause radio
interference, in which case the user may be required to take adequate measures.
VCCI Statement - Japan
The following is a summary of the VCCI Japanese statement in the box above:
Notices 147
This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a
domestic environment, radio interference may occur, in which case, the user may be required to take
corrective actions.
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Electromagnetic Interference (EMI) Statement - People's Republic of China
Declaration: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may need to perform practical action.
Electromagnetic Interference (EMI) Statement - Taiwan
The following is a summary of the EMI Taiwan statement above.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
IBM Taiwan Contact Information:
148 Power Systems: Serial-attached SCSI RAID enablement
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Notices 149
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Electromagnetic Interference (EMI) Statement - Russia
Class B Notices
The following Class B statements apply to features designated as electromagnetic compatibility (EMC)
Class B in the feature installation information.
Federal Communications Commission (FCC) statement
This equipment has been tested and found to comply with the limits for a Class B digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference in a residential installation.
This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in
accordance with the instructions, may cause harmful interference to radio communications. However,
there is no guarantee that interference will not occur in a particular installation.
If this equipment does cause harmful interference to radio or television reception, which can be
determined by turning the equipment off and on, the user is encouraged to try to correct the interference
by one or more of the following measures:
v Reorient or relocate the receiving antenna.
v Increase the separation between the equipment and receiver.
v Connect the equipment into an outlet on a circuit different from that to which the receiver is
connected.
v Consult an IBM-authorized dealer or service representative for help.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for
150 Power Systems: Serial-attached SCSI RAID enablement
any radio or television interference caused by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate this equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class B digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class B Information Technology
Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for
typical residential environments to provide reasonable protection against interference with licensed
communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
VCCI Statement - Japan
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Notices 151
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
IBM Taiwan Contact Information
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
152 Power Systems: Serial-attached SCSI RAID enablement
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Terms and conditions
Permissions for the use of these publications is granted subject to the following terms and conditions.
Personal Use: You may reproduce these publications for your personal, noncommercial use provided that
all proprietary notices are preserved. You may not distribute, display or make derivative works of these
publications, or any portion thereof, without the express consent of the manufacturer.
Commercial Use: You may reproduce, distribute and display these publications solely within your
enterprise provided that all proprietary notices are preserved. You may not make derivative works of
these publications, or reproduce, distribute or display these publications or any portion thereof outside
your enterprise, without the express consent of the manufacturer.
Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either
express or implied, to the publications or any information, data, software or other intellectual property
contained therein.
The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its
discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer,
the above instructions are not being properly followed.
You may not download, export or re-export this information except in full compliance with all applicable
laws and regulations, including all United States export laws and regulations.
THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF
ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED
WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR
PURPOSE.
Notices 153
ESCALA Power7
Serial-Attached SCSI RAID Enablement in
E2-700, E2-700T, E2-705, E2-705T, E4-
700, E4-700T or E4-705
REFERENCE
86 A1 90FF 03
ESCALA Power7
Serial-Attached SCSI RAID Enablement in E2-700, E2-
700T, E2-705, E2-705T, E4-700, E4-700T or E4-705
The ESCALA Power7 publications concern the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 90FF 03
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying,
distributing, modifying, and making derivative works.
Copyright Bull SAS 2011
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this book are
invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you are invited
to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
Disk drives for the E1-700, E1-705, E3-
700 or E3-705
REFERENCE
86 A1 01FH 01
ESCALA Power7
Disk drives for the E1-700, E1-705, E3-700 or E3-705
The ESCALA Power7 publications concern the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M6-705 (Power 770 / 9117-MMC)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala M7-705 (Power 780 / 9179-MHC)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E1-705 (Power 710 / 8231-E1C)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E3-705 (Power 730 / 8231-E2C)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
- Bull Escala E4-705 (Power 740 / 8205-E6C)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
October 2011
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 01FH 01
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying,
distributing, modifying, and making derivative works.
Copyright Bull SAS 2011
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this book are
invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you are invited
to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or
f or incidental or consequential damages in connection with the use of this material.
Contents
Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . 5
Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical
partition power turned on. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 10
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 11
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 12
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical
partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 15
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 16
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 17
Verifying that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 18
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Virtual I/O Server
system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 20
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 22
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 22
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 23
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 24
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical
partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 27
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 28
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 29
Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 30
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical
partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 33
Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 35
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 37
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power
turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 38
Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 39
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 39
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C using Linux kernel 2.6 with the Linux system
or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
© Copyright IBM Corp. 2010, 2011 iii
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 43
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 46
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or an expansion unit controlled by IBM i . . . 47
Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 48
Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 49
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 51
Removing a disk drive using hot-spare in an IBM i system or logical partition from the 8231-E2B, 8231-E1C, or
8231-E2C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 52
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 54
Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 55
Preparing to remove the SCSI disk drive by using the HSM . . . . . . . . . . . . . . . . . . 55
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 55
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power
turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 59
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . 61
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power
turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 63
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 64
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
Replacing a disk drive using Linux kernel 2.6, with the Linux system or logical partition power turned on . . . 65
Preparing the system to replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . 66
Preparing to replace a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 67
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 67
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 68
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power
turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
Preparing to replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 71
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 72
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Replacing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C using hot-spare in an IBM i system or logical
partition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 73
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 73
Completing the procedure by using the HSM . . . . . . . . . . . . . . . . . . . . . . . 74
Replacing a SCSI disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . 75
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 75
Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 76
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned
off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 78
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Installing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system. . . . 81
Removing the external SAS port from the 8231-E2B, 8231-E1C, or 8231-E2C system . . 85
Replacing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system . . . 87
Removing or installing a disk drive filler . . . . . . . . . . . . . . . . . . . . . 89
iv Disk drives
Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system. . . . . . . . . . . . . 89
Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system . . . . . . . . . . . . . . 90
Removing or installing a disk drive bezel. . . . . . . . . . . . . . . . . . . . . 93
Removing a disk drive bezel from a disk drive or filler. . . . . . . . . . . . . . . . . . . . . 93
Installing a disk drive bezel in a disk drive or filler . . . . . . . . . . . . . . . . . . . . . . 94
Disk drive locations and service indicators . . . . . . . . . . . . . . . . . . . . 97
PCIe RAID and SSD SAS adapter locations. . . . . . . . . . . . . . . . . . . . . . . . . 97
Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system . . . . . . . . 97
Disk drive locations and service indicators for the 5886 disk-drive enclosure . . . . . . . . . . . . . 99
Disk-drive locations and service indicators for the 5887 disk-drive enclosure . . . . . . . . . . . . . 100
Disk drive locations and service indicators for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures 101
Disk-drive locations and service indicators for the 5802 expansion unit . . . . . . . . . . . . . . . 103
Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 105
Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 105
Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 105
Removing a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . . 105
Preparing to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 105
Removing the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Replacing the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 114
Rebuild data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . 114
Preparing to remove a disk-drive from a system or logical partition controlled by IBM i . . . . . . . . . 116
Determining the IBM i disk-drive protection status. . . . . . . . . . . . . . . . . . . . . . 117
Configuring a disk drive in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 118
Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . . 120
Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 121
Replacing a SAS conduit card in a 5802 expansion unit with the power turned off . . . . . . . . . . . 122
Removing a SAS conduit card from a 5802 expansion unit with the power turned off . . . . . . . . . . 129
Replacing a SAS expander card in a 5802 expansion unit with the power turned off. . . . . . . . . . . 138
Removing a SAS expander card from a 5802 expansion unit with power turned off . . . . . . . . . . . 138
Common procedures for installable features . . . . . . . . . . . . . . . . . 141
Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Identifying a part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 148
Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 148
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 149
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 149
Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 149
Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 149
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 150
Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 151
Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 151
Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 151
Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 151
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 151
Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 152
Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 152
Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 153
Contents v
Starting a system that is not managed by a Hardware Management Console or or a Systems Director
Management Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153
Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 154
Starting a system or virtual server with the Systems Director Management Console. . . . . . . . . . . 155
Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 157
Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . . 157
Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 158
Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . . 159
Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 161
Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 161
Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 161
System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 163
Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 163
Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . 164
Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 164
Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. . . . . 165
Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 166
Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . 167
Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 169
Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 169
Removing and replacing covers for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 171
Removing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . 171
Installing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . 172
Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 173
Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 173
Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 174
Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 175
Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 176
Removing and replacing covers for the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC. . . . . . . . . 177
Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . 177
Removing the front cover with the cover removal tool. . . . . . . . . . . . . . . . . . . . 178
Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . . 183
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating
position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . . 185
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . . 186
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the service position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191
Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 193
Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 193
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 193
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 194
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position . . . . . . . . 196
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . 196
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . 197
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 199
Placing the 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 199
Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 201
vi Disk drives
Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203
Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 203
Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 204
Disconnecting the power cords from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 205
Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . . . . 206
Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 209
Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 209
Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 211
Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 212
Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 212
Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 213
Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 214
Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . . 215
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 215
Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . . 215
Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . . 216
Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . . 216
Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . . 217
Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 217
Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . . 217
Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . . 218
Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 218
Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 227
Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . . 229
Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 235
Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 239
Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 245
Deactivating a system attention LED or partition LED using the management console . . . . . . . . . . 245
Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . . 246
Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 246
Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 247
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
Contents vii
viii Disk drives
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010, 2011 ix
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
x Disk drives
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Safety notices xi
CAUTION:
Removing components from the upper positions in the rack cabinet improves rack stability during
relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a
room or building:
v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack
cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you
received it. If this configuration is not known, you must observe the following precautions:
– Remove all devices in the 32U position and above.
– Ensure that the heaviest devices are installed in the bottom of the rack cabinet.
– Ensure that there are no empty U-levels between devices installed in the rack cabinet below the
32U level.
v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from
the suite.
v Inspect the route that you plan to take to eliminate potential hazards.
v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the
documentation that comes with your rack cabinet for the weight of a loaded rack cabinet.
v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.).
v Ensure that all devices, shelves, drawers, doors, and cables are secure.
v Ensure that the four leveling pads are raised to their highest position.
v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement.
v Do not use a ramp inclined at more than 10 degrees.
v When the rack cabinet is in the new location, complete the following steps:
– Lower the four leveling pads.
– Install stabilizer brackets on the rack cabinet.
– If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest
position to the highest position.
v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack
cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent.
Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the
pallet.
(R002)
(L001)
(L002)
xii Disk drives
(L003)
or
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser
product. Consult the label on each part for laser certification numbers and approval information.
CAUTION:
This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive,
DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information:
v Do not remove the covers. Removing the covers of the laser product could result in exposure to
hazardous laser radiation. There are no serviceable parts inside the device.
v Use of the controls or adjustments or performance of procedures other than those specified herein
might result in hazardous radiation exposure.
(C026)
Safety notices xiii
CAUTION:
Data processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical
fiber cable or open receptacle. (C027)
CAUTION:
This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
CAUTION:
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following
information: laser radiation when open. Do not stare into the beam, do not view directly with optical
instruments, and avoid direct exposure to the beam. (C030)
CAUTION:
The battery contains lithium. To avoid possible explosion, do not burn or charge the battery.
Do Not:
v ___ Throw or immerse into water
v ___ Heat to more than 100°C (212°F)
v ___ Repair or disassemble
Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local
regulations. In the United States, IBM has a process for the collection of this battery. For information,
call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003)
Power and cabling information for NEBS (Network Equipment-Building System)
GR-1089-CORE
The following comments apply to the IBM servers that have been designated as conforming to NEBS
(Network Equipment-Building System) GR-1089-CORE:
The equipment is suitable for installation in the following:
v Network telecommunications facilities
v Locations where the NEC (National Electrical Code) applies
The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed
wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the
interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as
intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation
from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect
these interfaces metallically to OSP wiring.
Note: All Ethernet cables must be shielded and grounded at both ends.
The ac-powered system does not require the use of an external surge protection device (SPD).
The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal
shall not be connected to the chassis or frame ground.
xiv Disk drives
Disk drives
Learn about installing, removing, and replacing a disk drive for the 8231-E2B, 8231-E1C, or 8231-E2C.
© Copyright IBM Corp. 2010, 2011 1
2 Disk drives
What's new in Disk drives
Read about new or changed information in Disk drives since the previous update of this topic collection.
October 2011
The following updates were made to the content:
v Added information for the 8231-E1C and 8231-E2C.
May 2011
The following updates were made to the content:
v Added information for installing a disk drive in the 5887 under the installing a disk drive section.
v Added information for Disk drive locations and concurrent maintenance lights for the 5887 disk drive
enclosure unit.
September 2010
The following updates were made to the content:
v Added information for Internal disk drive sharing on an 8202-E4B or 8205-E6B system.
v Added information for Disk drive locations and concurrent maintenance lights for 8231-E2B.
v Added information for Disk drive locations and concurrent maintenance lights for an 8202-E4B or
8205-E6B system.
v Added information for PCIe RAID and SSD SAS adapter locations.
v Added a procedure for Installing the external SAS port on an 8231-E2B system.
v Added a procedure for Installing the external SAS port on an 8202-E4B or 8205-E6B system.
March 2010
The following updates were made to the content:
v Added information for Disk drive locations and concurrent maintenance lights for models 9117-MMB
or 9179-MHB.
v Added a procedure for Internal disk drive sharing in a 9117-MMB or 9179-MHB system.
February 2010
v Added information for servers that contain the POWER7® processor.
© Copyright IBM Corp. 2010, 2011 3
4 Disk drives
Installing a disk drive or solid-state drive in the 8231-E2B,
8231-E1C, or 8231-E2C
You can install a Small Computer System Interface (SCSI), serial-attached SCSI (SAS) disk drive, or
solid-state drive (SSD) in the server or expansion unit.
The system or logical partition can either be turned off or turned on when you install the disk drive.
Identify the system for performing the installation or an upgrade. Turn on the System Identify (BLUE)
LED.
Solid-state drives
To install and configure solid-state drives (SSD), you must review the supported machine type, adapter,
and configuration details.
SSDs, also known as flash drives, follow many of the same rules as a regular hard disk drive (HDD). For
example, SSDs physically resemble, install in the same manner, and in most of the same slots as HDDs.
However, some restrictions and configuration rules that apply to SSD are different from HDD. The
following tables can help you determine which operating system, adapter, machine type model, and disk
drive configuration rules apply to your system or expansion unit.
After you have reviewed the configuration rules, you can install the drive as you would any other disk
drive.
Restriction: Solid-state drives must be part of a RAID array or System Mirroring (IBM i).
Refer to the Solid-state drive rules for the IBM AIX®, IBM i, or Linux operating system for the server or
partition where you want to install drives.
Attention: Before installing SSD into your system, you must ensure that you have all the prerequisite
updates for both the SSD and the adapters. To check for the prerequisites, go to the website: IBM
Prerequisite.
© Copyright IBM Corp. 2010, 2011 5
Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system
Machine type and model Adapter Mixing rules
5887 v FC 5805: PCIe 380 MB Cache Dual
- x4 3Gb SAS RAID Adapter
v FC 5901: PCIe Dual - x4 SAS
Adapter
v FC 5908: PCI-X DDR 1.5 GB Cache
SAS RAID Adapter (BSC)
v A maximum of 24 drives are
supported in this expansion unit.
v The EXP24S can be configured as
one set of 24 bays (mode 1), two
sets of 12 bays (mode 2), or four
sets of 6 bays (mode 4).
v The EXP24S serial-attached SCSI
(SAS) ports are attached to SAS
controllers, which can be a SAS
Peripheral Component
Interconnect-X (PCI-X) adapter, PCI
Express (PCIe) adapter, or a pair of
adapters. The EXP24S can also be
attached to an imbedded SAS
controller in a server with an
imbedded SAS port.
v The mixing of the Solid-state
drives and hard disk drives (HDD)
is as follows:
– SSDs and HDDs cannot be
mixed in a logical group when
configured as in mode 1.
– SSDs and HDDs can be mixed
when configured as in mode 2:
one set can be SSDs and the
other set can be HDDs.
– SSDs and HDDs cannot be
mixed when configured as in
mode 4 because the SSDs are
not supported by the adapter
that is required to configure the
5887 for mode 4.
5886 v FC 5904: PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v FC 5906 and FC 5908: PCI-X DDR
1.5 GB Cache SAS RAID Adapter
(BSC)
v FC 5903: PCIe 380 MB Cache Dual
- x4 3 Gb SAS RAID Adapter
v SSDs and hard disk drives can be
mixed in this drive enclosure.
v A maximum of 24 drives are
supported in this expansion unit.
v SSDs and HDDs can be attached to
the SAS Controller PCI or PCI-X
adapters. Also, attached to the
imbedded SAS controller in a
server with an imbedded SAS port.
8231-E2B, 8231-E1C, or 8231-E2C v FC 2053: 3 Gbps SAS RAID
Controller with 4X SSD Sockets
v FC 5278: SAS Controller
v SSD and HDD can never be mixed
in the same RAID array or
mirroring group.
v SSDs and HDDs cannot split the
disks.
v SSDs and HDDs can be connected
to the 5886 from the external SAS
port.
6 Disk drives
Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system (continued)
Machine type and model Adapter Mixing rules
8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C
v FC 5631: Split disks Enablement
adapter
v SSD and HDD can never be mixed
in the same RAID array or
mirroring group.
v SSDs and HDDs can be connected
to the 5886 from the external SAS
port.
v 5631 can be used to split disks
8233-E8B and 8236-E8C v FC 5679: SAS RAID Enablement:
internal adapter
v FC 5904: PCI-X DDR 1.5GB Cache
SAS RAID Adapter
v FC 5903: PCIe 380 MB Cache Dual
- x4 3 Gb SAS RAID Adapter
v SSD and HDD can never be mixed
in the same RAID array or
mirroring group.
v SSD and HDD can be mixed in the
system disk drive backplane if the
backplane is not split.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD.
Restriction: Split backplane for
these models is not supported with
dual SAS Raid adapters.
Remember: SSD must be on one side
of the split backplane and HDD must
be on the other side of the split
backplane. SSD and HDD drive types
cannot be mixed within a parity set.
If you are mixing SSD and HDD in a
system, ensure that you have enough
of each drive type to manage your
protection scheme.
9117-MMB, 9117-MMC, 9179-MHB, or
9179-MHC
v FC 5662: Disk/Media Backplane
v FC 5662: 175 MB Cache RAID -
Dual IOA Enablement Card
v FC 5903: PCIe 380 MB Cache Dual
- x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same RAID array or mirroring
group.
v SSD and HDD can be mixed in the
system disk-drive backplane.
Remember: You cannot mix SSD
and HDD drive types within a
parity set. If you are mixing SSD
and HDD in a system, ensure that
you have enough of each drive
type to manage your protection
scheme.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 7
Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system (continued)
Machine type and model Adapter Mixing rules
5802 FC 5903: PCIe 380 MB Cache Dual -
x4 3 Gb SAS RAID Adapter
v SSD and HDD cannot be mixed in
the same system disk drive
backplane.
v SSD and HDD can both be present
in a split backplane if each side of
the split backplane has only SSD or
HDD. SSD must be on one side of
the split backplane and HDD must
be on the other side of the split
backplane.
v SSD and HDD cannot be mixed on
a pair of 5903 adapters.
v A maximum of nine SSDs are
supported on a pair of 5903
adapters.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C,
or 8231-E2C with the AIX system or logical partition power turned on
Learn how to install a disk-drive or solid-state drive in a system or expansion unit while the power of
the AIX operating system or logical partition that controls the disk drive location is turned on.
If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 61. This
procedure is intended for installing a new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM prerequisite. If the
required software is not installed, go to Fix Central to download it, and install it before continuing.
If you are installing solid-state drives (SSD), review the configuration rules and then return here. For
more information, see “Solid-state drives” on page 5.
8 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: This procedure is a complex set of steps that must be performed precisely as instructed to
avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk
drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or
system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge
(ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on electrical
equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing
or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 9
Preparing the system
Complete the following steps for preparing the system before installing a disk drive.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 97.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
4. Record the position where the new disk drive must be installed. For example, the next available disk
drive slot might be P3-D4.
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C
Complete the following steps to install a disk drive in the system or an expansion unit.
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the disk drive to the correct location.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. If the disk drive you are installing has a bezel preinstalled, then go to step 8.
8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle
(A) in until it locks, as shown in Figure 1 on page 11. The service indicator on the disk drive turns
on.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
Note: The light stops flashing and remains on when you have installed the disk drive properly.
10 Disk drives
11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk
drives are installed.
Installing a disk drive in the 5887 disk-drive expansion unit
Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit.
1. With the handle in the unlocked position, see Figure 2 on page 12, support the bottom of the disk unit
as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the
handle.
2. Slide the disk unit into the expansion unit until the drive stops.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
3. Rotate the handle to the locked position.
Figure 1. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 11
Completing the disk drive installation by using the HSM
Perform the following steps in the Hardware Service Manager (HSM):
1. Press Enter.
2. Exit to the command line.
3. If you are installing more than one disk drive, repeat this procedure until all disk drives are installed.
4. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu.
5. Select Configure Added/Replaced Devices.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Figure 2. Installing the disk drive in the 5887 expansion unit
12 Disk drives
Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C,
or 8231-E2C with the Linux system or logical partition power turned on
Learn how to install a disk drive or solid-state drive in a system or expansion unit while the power to
the Linux operating system or logical partition that controls the disk drive location is turned on. This
procedure uses the Linux kernel 2.6.
Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive
using Linux kernel 2.6, with the Linux system or logical partition power turned on” on page 65. This
procedure is intended for installing a new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM prerequisite. If the
required software is not installed, go to Fix Central to download it, and install it before continuing.
If you are installing solid-state drives (SSD), review the configuration rules and then return here. For
more information, see “Solid-state drives” on page 5.
Attention: This procedure is a complex set of steps that must be performed precisely as instructed to
avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk
drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or
system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 13
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Preparing the system
Complete the following steps for preparing the system before installing a disk drive.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 97.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
4. Record the position where the new disk drive must be installed. For example, the next available disk
drive slot might be P3-D4.
14 Disk drives
Preparing to install a disk drive by using the HSM
Perform the following steps to install a disk drive by using the Hardware Service Manager (HSM):
1. Log in as root user.
2. Type lsslot on the command line of the Linux session and then press Enter.
A display like the following figure, is shown.
3. Note the Linux Name for the slot that contains the adapter under which you want to install the disk
drive.
4. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
5. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press
Enter.
6. Select Concurrent add device from the Work with Disk Unit Recovery display. Press Enter.
A Concurrent Device Add display, like the following figure, is shown.
x0537p1:~ # lsslot
# Slot Description Linux Name Device(s)
U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet
U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci
U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166
U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166
U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180
U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet
U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb
U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi
U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide
U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty
Figure 3. Find the Linux Name for the location for which you want to install the disk drive.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 15
7. Type option 1 next to the location into which you want to install the disk drive and press Enter.
The Verify Device Concurrent Add display is shown.
A service indicator indicates the slot where the disk drive can be installed.
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C
Complete the following steps to install a disk drive in the system or an expansion unit.
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the disk drive to the correct location.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. If the disk drive you are installing has a bezel preinstalled, then go to step 8.
8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle
(A) in until it locks, as shown in Figure 5 on page 17. The service indicator on the disk drive turns
on.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
Note: The light stops flashing and remains on when you have installed the disk drive properly.
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 4. Example Concurrent Device Remove display
16 Disk drives
11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk
drives are installed.
Installing a disk drive in the 5887 disk-drive expansion unit
Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit.
1. With the handle in the unlocked position, see Figure 6 on page 18, support the bottom of the disk unit
as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the
handle.
2. Slide the disk unit into the expansion unit until the drive stops.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
3. Rotate the handle to the locked position.
Figure 5. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 17
Verifying that the new disk drive is operational
Do the following to confirm if you have installed the disk drive correctly:
v Press Enter on the Verify Device Concurrent Add display to verify the location.
v Press Enter. The service indicator stops flashing and remains on when you have installed the disk
drive correctly.
Perform the following steps to verify that the new disk drive is operational:
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
3. Select Display hardware status. The Display Hardware Status display is shown, like the following
figure.
Figure 6. Installing the disk drive in the 5887 expansion unit
18 Disk drives
4. Verify that the disk drive that you installed is shown on this display.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 7. Example Display Hardware Status
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 19
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C,
or 8231-E2C with the Virtual I/O Server system or logical partition
power turned on
You can install a disk-drive or solid-state drive in a system or expansion unit while the power to the
Virtual I/O Server operating system or logical partition that controls the disk-drive location is turned on.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM prerequisite. If the
required software is not installed, go to Fix Central to download it, and install it before continuing.
Attention: This procedure is a complex set of steps that must be performed precisely as instructed to
avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk
drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or
system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31.
20 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system
Complete the following steps for preparing the system before installing a disk drive.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 97.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
4. Record the position where the new disk drive must be installed. For example, the next available disk
drive slot might be P3-D4.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 21
Preparing to install a disk drive by using the HSM
Complete the following steps by using the Hardware Service Manager (HSM) before installing a disk
drive.
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Press Enter to continue.
4. On the Function Selection display, select Task Selection.
5. On the Tasks Selection display, select Hot Plug Task.
6. Select SCSI and SCSI RAID Hot Plug Manager.
7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot
swap enclosure device is shown.
8. Select the slot where you want to install the disk drive.
9. Do not press Enter yet, first complete installing the disk drive.
Important: When you press Enter, after a one minute delay, the disk drive slot enters the Identify
state and the service indicator comes on and begins to flash rapidly. You then have nine seconds to
slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. You
can then press Enter on the console.
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C
Complete the following steps to install a disk drive in the system or an expansion unit.
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the disk drive to the correct location.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. If the disk drive you are installing has a bezel preinstalled, then go to step 8.
8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle
(A) in until it locks, as shown in Figure 8 on page 23. The service indicator on the disk drive turns
on.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
Note: The light stops flashing and remains on when you have installed the disk drive properly.
22 Disk drives
11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk
drives are installed.
Installing a disk drive in the 5887 disk-drive expansion unit
Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit.
1. With the handle in the unlocked position, see Figure 9 on page 24, support the bottom of the disk unit
as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the
handle.
2. Slide the disk unit into the expansion unit until the drive stops.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
3. Rotate the handle to the locked position.
Figure 8. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 23
Completing the disk drive installation by using the HSM
Perform the following steps in the Hardware Service Manager (HSM) after you have installed the disk
drive:
1. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu.
2. Select Configure Added/Replaced Devices.
3. Exit to the command line.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Figure 9. Installing the disk drive in the 5887 expansion unit
24 Disk drives
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C,
or 8231-E2C with the IBM i system or logical partition power turned on
Learn how to install a disk drive or solid-state drive in a system or expansion unit while the power to
the IBM i operating system or logical partition that controls the disk drive location is turned on.
If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 69. This
procedure is intended for installing a new or upgraded disk drive.
Attention: This procedure is a complex set of steps that must be performed precisely as instructed to
avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk
drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or
system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM prerequisite. If the
required software is not installed, go to Fix Central to download it, and install it before continuing.
If you are installing solid-state drives (SSD), review the configuration rules and then return here. For
more information, see “Solid-state drives” on page 5.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 25
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system
Complete the following steps for preparing the system before installing a disk drive.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 97.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
4. Record the position where the new disk drive must be installed. For example, the next available disk
drive slot might be P3-D4.
26 Disk drives
Preparing to install a disk drive by using the HSM
Perform the following steps to install a disk drive by using the Hardware Service Manager (HSM):
1. Record the physical location where the new disk drive can be installed.
a. The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the
position of the disk drive in the system unit or expansion unit. Find the U787A.001.AAAXXXX
location information from the control panel display on the system unit or expansion unit.
b. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the
label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the
label, and P3–D4 is the slot where you are installing the disk drive as shown in “Disk drive
locations and service indicators” on page 97.
c. If you cannot find the location information from the control panel, find the feature code (FC) and
sequence number (SN or SEQ) on the label that is shown on the front of the system unit or
expansion unit, as shown in Figure 10.
In the example, U5074.001.AAAXXXX-DB3-D32, you would install a disk drive in the expansion unit
with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
2. Sign on to service level authority.
3. Type strsst on the command line of the IBM i session and press Enter.
4. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
5. Select Start a service tool from the System Service Tools (SST) display. Press Enter.
6. Select Hardware service manager from the Start a Service Tool display. Press Enter.
7. If the slot you want to use contains a disk drive filler, do not remove the disk drive filler yet.
8. Select Device Concurrent Maintenance from the Hardware Service Manager display, and then press
Enter.
The Device Concurrent Maintenance is displayed, as shown in the following figure.
Figure 10. Example of a system or expansion unit label
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 27
9. In the Specify either physical location or resource name field, type the physical location that you
had previously recorded in this procedure.
10. Select option 2 (Install device) for the Action to be performed.
11. Set the time delay for one minute - 01.
12. Do not press Enter yet, first complete installing the disk drive.
Important: When you press Enter, after a one minute delay, the disk drive slot enters the Identify
state and the service indicator comes on and begins to flash rapidly. You then have nine seconds to
slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. You
can then press Enter on the console.
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C
Complete the following steps to install a disk drive in the system or an expansion unit.
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the disk drive to the correct location.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 2
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 11. Example Device Concurrent Maintenance display
28 Disk drives
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. If the disk drive you are installing has a bezel preinstalled, then go to step 8.
8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle
(A) in until it locks, as shown in Figure 12. The service indicator on the disk drive turns on.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
Note: The light stops flashing and remains on when you have installed the disk drive properly.
11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk
drives are installed.
Installing a disk drive in the 5887 disk-drive expansion unit
Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit.
1. With the handle in the unlocked position, see Figure 13 on page 30, support the bottom of the disk
unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the
handle.
2. Slide the disk unit into the expansion unit until the drive stops.
Figure 12. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 29
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
3. Rotate the handle to the locked position.
Completing the disk drive installation by using the HSM
1. Return to the console and wait until the Concurrent Maintenance Results display is shown, then press
Enter.
2. When the Hardware Service Manager display is shown, press F12 twice and then press Enter.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Figure 13. Installing the disk drive in the 5887 expansion unit
30 Disk drives
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C,
or 8231-E2C with the system or logical partition power turned off
Learn how to install a disk drive in a system or logical partition that controls the disk-drive location with
the power turned off.
If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77. This procedure
is intended for installing a new or upgraded disk drive.
Before installing or replacing a feature, ensure that the software required to support the feature is
installed on your system. For information about software prerequisites, see IBM prerequisite. If the
required software is not installed, go to Fix Central to download it, and install it before continuing.
If you are installing solid-state drives (SSD), review the configuration rules and then return here. For
more information, see “Solid-state drives” on page 5.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 31
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Preparing the system
Complete the following steps for preparing the system before installing a disk drive.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack.
3. Determine the next available disk drive position in your system or expansion unit. For information,
see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 97.
Note: When you have a system with an expansion unit, fill the slot positions in the system unit first.
However, you can choose a different disk drive placement depending on your data protection
strategy.
4. Record the position where the new disk drive must be installed. For example, the next available disk
drive slot might be P3-D4.
5. Stop the system. For instructions, see Stopping the system or logical partition.
32 Disk drives
6. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the service position.
7. Remove the service access cover. For instructions, see Removing the service access cover from a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
8. Disconnect the power source from the system by unplugging the system. For instructions, see
Disconnecting the power cords.
Note: This system can be equipped with a second power supply. Disconnect it before continuing with
this procedure. Ensure that the power source to the system has been disconnected.
(L003)
or
9. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of
5 seconds.
Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C
Complete the following steps to install a disk drive in the system or an expansion unit.
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 33
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package. Record the serial number (use the last four
digits) of the disk drive.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle
(A) in until it locks, as shown in Figure 14.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
10. If you are installing more than one disk drive, perform all the steps in this procedure until all disk
drives are installed.
Figure 14. Installing the disk drive into the 8231-E2B, 8231-E1C, or 8231-E2C
34 Disk drives
Installing a disk drive in the 5887 disk-drive expansion unit
Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit.
1. With the handle in the unlocked position, see Figure 15, support the bottom of the disk unit as you
align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle.
2. Slide the disk unit into the expansion unit until the drive stops.
Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way
into the system.
3. Rotate the handle to the locked position.
Completing the procedure
1. Replace the service access cover. For instructions, see Installing the service access cover on a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
2. Connect the power cords to the system. For instructions, see Connecting the power cords to the
8231-E2B, 8231-E1C, or 8231-E2C.
3. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the operating position.
4. Start the system or logical partition. For instructions, see Starting the system or logical partition.
5. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
6. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
Figure 15. Installing the disk drive in the 5887 expansion unit
Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 35
7. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
36 Disk drives
Removing a disk drive or solid-state drive from the 8231-E2B,
8231-E1C, or 8231-E2C
Learn to remove a failed disk drive or solid-state drive from the server or an expansion unit with the
system or logical partition power turned on or turned off.
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with
the AIX system or logical partition power turned on
Learn to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or an expansion unit while the power of the AIX operating system or logical partition that
controls the disk drive location is turned on.
If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the
8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 8. If
you are removing a disk drive as part of a service procedure, continue to use the following procedure.
Attention: This procedure is a complex set of steps that must be performed exactly to avoid potential
data loss or system damage. Alternatively, you can use the procedure, which is a simpler task that
involves less risk of data loss or system damage. If the system or logical partition that controls the disk
drive you are replacing is at the AIX command line, you can use this powered-on procedure.
If the disk drive that you are removing is in the root volume group (rootvg) and it is not protected by
either Redundant Array of Independent Disks (RAID) or mirroring, or to use the powered-off procedure,
go to “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical
partition power turned off” on page 56.
To replace a Just a Bunch of Disks (JBOD) disk drive that is attached to any SCSI or SCSI RAID adapter,
see the procedure in “Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C” on page
55.
© Copyright IBM Corp. 2010, 2011 37
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system for removing a disk drive
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. Prepare to remove the disk drive from the system or expansion unit. For instructions, see “Preparing
to remove a disk drive from a system or expansion unit controlled by AIX” on page 105.
3. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the service position.
4. Remove the service access cover. For instructions, see Removing the service access cover from a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
5. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
38 Disk drives
Preparing to remove a disk drive by using the HSM
1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure. Then, return here and perform the
following steps.
Important: Ensure to follow the concurrent or nonconcurrent replacement procedures depending on
the type of data protection enabled:
v If the data protection is RAID, use the concurrent procedure.
v If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803
expansion unit, use the concurrent procedure.
v If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803
expansion unit, use the nonconcurrent procedure.
2. Perform the following steps on the Hardware Service Manager (HSM)
3. Log in as root user.
4. At the command line, type diag and press Enter.
5. Press Enter to continue.
6. On the Function Selection display, select Task Selection.
7. Select Hot Plug Task.
8. Select SCSI and SCSI RAID Hot Plug Manager or RAID Hot Plug Devices, depending on the
adapter that your drive is attached to.
Tip: One or both of these options might be available, depending on the configuration of your
system. If the drive you want to replace is not listed when you follow the steps for the SCSI and
SCSI RAID Hot Plug Manager, go to step 9.
v If you have a RAID disk drive attached to a 2498 PCI SCSI RAID adapter, continue with step 9.
v If you have a JBOD disk drive attached to any SCSI or SCSI RAID adapter, go to “Removing a
SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C” on page 55.
v If you have RAID drives attached to any other adapter, see the PCI-X SCSI RAID Controller
Reference Guide for AIX. This guide is available from the SCSI PCI Adapters web page.
9. Select RAID Hot Plug Devices.
10. On the Resource Selection List, select the adapter that is connected to the array that contains the disk
drive you want to remove or replace. Press Enter.
11. Select Commit.
12. On the Identify and Remove Resources menu, select Remove. Press Enter.
A list of the disk drives that you can remove is shown. If the disk drive you want to remove does
not appear in the list, ask your system administrator to put the disk drive in the Failed state before
continuing with this procedure. For information, see AIX System Management Guide: Operating System
and Devices. This guide is available at the System p® and AIX Information Center .
13. Select the drive that you want to remove and press Enter. The service indicator for the slot flashes
rapidly.
Attention: If the light does not flash rapidly, do not remove the disk drive.
To view the locations of the service indicators, go to “Disk drive locations and service indicators for
the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 16 on
page 40.
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 39
2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the
disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the
disk drive cannot slide out of the system or expansion unit.
Important: When you press Enter, after a one minute delay, the service indicator comes on and begins
to flash rapidly. You then have nine seconds to remove the disk drive.
Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on
(solid) and does not flash during this procedure.
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
Note: The service indicator stops flashing and remains off when the disk drive is removed.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk
drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure
for replacing a disk drive for your operating system.
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see
“Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the
procedure.
Figure 16. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
40 Disk drives
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
using Linux kernel 2.6 with the Linux system or logical partition power
turned on
Learn to remove a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a
system or expansion unit while the power of the Linux operating system or logical partition that controls
the disk drive location is turned on. This procedure uses the Linux kernel 2.6.
If you are installing a new or upgraded disk drive, see “Installing a disk-drive or solid-state drive in the
8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on” on page 13.
If you are removing a disk drive as part of a service procedure, continue to use the following procedure.
Attention: Read these instructions carefully and understand the complexity of the powered-on
procedure before you begin. If the system or logical partition that controls the disk drive you are
replacing is at the Linux command line, you can use this powered-on procedure.
Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition
power turned off” on page 77, if:
v You are installing a disk drive that is to be connected with the 6203 or 6204 SCSI adapter
v You do not feel comfortable using the power turned on procedure
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 41
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) Level 0 array or
is the disk drive you are replacing in a failed RAID Level 5 or Level 10 array?
v If yes, you must delete the disk array before removing or replacing the disk. For instructions, see
PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded
array. Proceed to the next step.
3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk
array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before
replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
42 Disk drives
4. If applicable, remove the system unit door or expansion unit door or open the rack front door.
Preparing to remove the disk drive by using the HSM
Perform the following tasks to remove a disk drive by using the Hardware Service Manager (HSM):
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter.
The IBM Power RAID Configuration Utility display is shown.
3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown.
4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter.
5. Find the entry in the log for the disk drive you want to replace.
6. Record the location information for the disk drive.
Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0
is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit).
7. Return to the command line.
8. Type the following command:
ls -ld /sys/class/scsi_host/host#/device
Where # is the SCSI host number. Press Enter.
9. Record the PCI location information.
Note: The PCI location information has the form of 61:01:0:2.
Kernel Messages Log
Select one of the following:
1. View most recent ipr error messages
2. View ipr error messages
3. View all kernel error messages
4. View iprconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. View ipr boot time messages
Selection:
e=Exit
Figure 17. Kernel Messages Log
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 43
10. Type iprconfig on the command line and press Enter.
The IBM Power RAID Configuration Utility display is shown.
11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter.
The Display Hardware Status display is shown.
12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status.
13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive
before continuing with this procedure.
For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
14. Type option 1 (Display hardware resource information details) next to the disk drive you want to
replace. Press Enter.
The Disk Hardware Resource Information Details is displayed like the shown in the following
sample.
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 18. Example Display Hardware Status
44 Disk drives
15. Record the physical location information.
16. Return to the IBM Power RAID Configuration Utility display.
17. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and perform that procedure. Then, return here and perform the
following steps:
Important: Ensure that you follow the concurrent or nonconcurrent replacement procedure
depending on the type of data protection enabled.
a. If the data protection is RAID, use the concurrent procedure.
b. If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803
expansion unit, use the concurrent procedure.
c. If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803
expansion unit, use the nonconcurrent procedure.
18. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
Disk Unit Hardware Resource Information Details
Manufacturer . . . . . . . . . . . . . . : IBM
Product ID . . . . . . . . . . . . . . . : ST336607LC
Firmware Version . . . . . . . . . . . . : 43353048 (C50H)
Serial Number. . . . . . . . . . . . . . : 00006719
Capacity . . . . . . . . . . . . . . . . : 36.40 GB
Resource Name. . . . . . . . . . . . . . : /dev/sdd
Physical location
PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0
SCSI Host Number . . . . . . . . . . . . : 2
SCSI Channel . . . . . . . . . . . . . . : 0
SCSI Id. . . . . . . . . . . . . . . . . : 4
SCSI Lun . . . . . . . . . . . . . . . . : 0
Extended Details
FRU Number . . . . . . . . . . . . . . . : 00P2676
More...
Press Enter to Continue
e=Exit q=Cancel f=PageDn b=PageUp
Figure 19. Example Disk Hardware Resource Information Details display
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 45
19. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A
Concurrent Device Remove display is shown, like the following sample.
20. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter.
21. The Verify Device Concurrent Remove display is shown. The service indicator turns on for that disk
drive slot. To view the locations of the service indicators, see “Disk drive locations and service
indicators” on page 97.
22. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive
you want to replace, then press Enter. The identify light turns on for the disk drive.
Note: Ensure that the disk drive is not in use to prevent loss of data.
23. The Device Concurrent Remove display is shown.
24. Complete the removing of the disk drive as detailed in this procedure and then press Enter on the
IBM Power RAID Configuration Utility to indicate that you have removed the disk drive.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 21 on
page 47.
2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the
disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the
disk drive cannot slide out of the system or expansion unit.
Important: When you press Enter, after a one minute delay, the service indicator comes on and begins
to flash rapidly. You then have nine seconds to remove the disk drive.
Concurrent Device Remove
Choose a single location for remove operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
0000:41:01.0.1/0:5: Empty
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
0000:41:01.0.1/1:8: Empty
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
0001:61:01.0.2/0:8: Empty
sdh 0001:61:01.0.2/0:9:0 Physical Disk Active
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
Figure 20. Example Concurrent Device Remove display
46 Disk drives
Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on
(solid) and does not flash during this procedure.
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
Note: The service indicator stops flashing and remains off when the disk drive is removed.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk
drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure
for replacing a disk drive for your operating system.
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see
“Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the
procedure.
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or
an expansion unit controlled by IBM i
Learn how to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk
drive from a system or an expansion unit with the power turned on to the IBM i operating system or a
logical partition that controls the disk drive location.
Figure 21. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 47
If you are removing a disk drive in order to install a new or upgraded disk drive, see “Installing a disk
drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical
partition power turned on” on page 25. If you are removing a disk drive as part of a service procedure,
continue to use the following procedure.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Preparing the system for removing a disk drive
1. See Preparing to remove a disk drive from a system or logical partition controlled by IBM i and
follow the steps before you start the tasks in this procedure.
2. Before removing a disk drive, ensure to back up all the data from the disk drive or the array that the
drive is a part of, and then move the data from the disk drive that you want to remove. For
instructions, choose one of the following options:
v Move and remove disk units
v Recovering your system in the IBM i Information Center
To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/infocenter
and select from the following options:
48 Disk drives
– If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and
recovery > Recovering your system
– If you select V5R4, expand i5/OS Information Center > Systems management > Backup and
recovery > Backing up and recovering your system > Recovering your system
3. Locate the disk drive to be removed. See “Disk drive locations and service indicators” on page 97.
4. Stop the system. For instructions, see Stopping the system or logical partition.
Preparing to remove the disk drive by using the HSM
Perform the following steps to remove a disk drive by using the Hardware Service Manager (HSM):
1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and perform that procedure. Then, return here and perform the
following steps.
Important: Ensure to follow the concurrent or nonconcurrent replacement procedures depending on
the type of data protection enabled:
v If the data protection is RAID, use the concurrent procedure.
v If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803
expansion unit, use the concurrent procedure.
v If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803
expansion unit, use the nonconcurrent procedure.
2. Locate the disk drive to be replaced and record the location information, then use the location
information to activate the identify light for that disk drive.
3. Determine if the disk drive to be removed is a non-configured unit by performing the following
steps:
a. Type strsst on the command line of the IBM i session and press Enter.
b. Type your service tools user ID and service tools password and press Enter.
c. Select Work with disk units and press Enter.
d. Select Display Disk Configuration and press Enter.
e. Select Display non-configured units and press Enter. Is the failed drive a non-configured unit?
v No: Continue to 4
v Yes: Continue to next step.
f. Determine if the disk drive to be removed was replaced by a hot spare when it failed. An
xxxx9031 error logged close to the same time of the original disk failure indicates the automatic
Rebuild of the data to a hot spare device.
4. Verify that the disk drive you are replacing has been suspended before continuing with this
procedure. For instructions, see Preparing to remove a disk drive from a system or logical partition
controlled by IBM i.
5. Did you find the service action log entry related to this problem?
v No: Exit the service action log and go to step 7
v Yes: Continue with the next step.
6. Select the concurrent maintenance option. This option displays the disk drive location on the
concurrent maintenance display. Go to step 9 on page 50
7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter.
The Device Concurrent Maintenance display is shown, as shown in the following figure.
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 49
8. If the physical location does not show in the Device Concurrent Maintenance display, type the
physical location where you are replacing the disk drive as shown in the following example:
The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position
of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location
information from the control panel display on the system unit or expansion unit.
If you cannot get the information from the control panel, find the feature code (FC) and sequence
number (SN or SEQ) on the label that is on the front of the system unit or expansion unit as shown
in the following figure.
In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label,
AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and
P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and
service indicators” on page 97.
In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit
with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
9. Select option 1 (Remove device) for the Action to be performed.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 1
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 22. Example Device Concurrent Maintenance display
Figure 23. Example system or expansion unit label
50 Disk drives
10. Set the time delay for one minute: 01.
Important: Do not press Enter at this time.
11. Locate the service indicator that corresponds to the position of the disk drive that you are replacing.
You can view the locations of the service indicators for the system you are replacing the disk drive
shown in “Disk drive locations and service indicators” on page 97.
Proceed to remove the disk drive.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 24.
2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the
disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the
disk drive cannot slide out of the system or expansion unit.
Important: When you press Enter, after a one minute delay, the service indicator comes on and begins
to flash rapidly. You then have nine seconds to remove the disk drive.
Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on
(solid) and does not flash during this procedure.
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
Note: The service indicator stops flashing and remains off when the disk drive is removed.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Figure 24. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 51
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk
drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure
for replacing a disk drive for your operating system.
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see
“Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the
procedure.
Removing a disk drive using hot-spare in an IBM i system or logical
partition from the 8231-E2B, 8231-E1C, or 8231-E2C
You can use the hot-spare function to replace a disk drive in a system or expansion unit while the IBM i
operating system or logical partition that controls the disk drive location is powered on.
Preparing the system
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. Remove the front cover and door, if you need to, to access the disk drives. For instructions on
removing the cover, see Removing and replacing covers and doors.
3. Locate the disk drive to be replaced, and record the location information. Then, use the location
information to activate the identify light for that disk drive. For instructions, see Identifying a failing
part in an IBM i system or logical partition
Preparing to remove a disk drive by using the HSM
Perform the following tasks to remove a disk drive by using the Hardware Service Manager (HSM):
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. Type strsst on the command line of the IBM i session and press Enter.
3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display, and press Enter.
4. Select Work with disk units from the System Service Tools display, press Enter.
5. Select Display Disk Configuration from the Work with Disk Units display, and press Enter.
6. Select Display non-configured units from the Display Disk Configuration display, and press Enter.
7. Is the failed drive a non-configured unit?
v No: Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system
or logical partition power turned on” on page 69.
v Yes: Determine if the disk drive to be removed was replaced by a hot spare when it failed. A
xxxx9031 error logged close to the same time of the original disk failure indicates the automatic
Rebuild of the data to a hot spare device, then go on to the next step.
8. Is the disk drive to be removed controlled by the load source adapter?
v No: Go to step 11 on page 53.
v Yes: Go to the next step.
9. Is the disk drive to be removed located in a valid load source position?
v No: Go to Configuring a disk drive on a load source adapter for hot-spare using the IBM i
operating system.
v Yes: Go to step 11 on page 53.
52 Disk drives
10. Choose the concurrent maintenance option. This option fills in the disk drive location on the
concurrent maintenance display. Go to step 13 on page 54.
11. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter.
The Device Concurrent Maintenance display is shown, as shown in the following figure.
12. If the physical location does not appear in the Device Concurrent Maintenance display, type the
physical location where you will be replacing the disk drive as shown in the preceding example,
Figure 25.
The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position
of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location
information from the control panel display on the system unit or expansion unit.
If you cannot get the information from the control panel, find the feature code (FC) and sequence
number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as
shown in the following figure.
Device Concurrent Maintenance
Type the choices, then press Enter.
Specify either physical location or resource name.
Physical location:
U787A.001.1234567-P3-D4
or
Device resource name:
Specify action as 1=Remove device 2=Install device
Action to be performed . . . . . . . .: 1
Enter a time value between 00 and 19.
Time delay needed in minutes . . . . .: 01
F3=Exit F5=Refresh F12=Cancel
Figure 25. Example Device Concurrent Maintenance display
Figure 26. Example system or expansion unit label
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 53
In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label,
AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and
P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and
service indicators” on page 97.
In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit
with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32.
13. Select option 1 (Remove device) for the Action to be performed.
14. Set the time delay for one minute: 01.
Important: Do not press Enter at this time.
Continue to remove the disk drive from your system of expansion unit.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 27.
2. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If
the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit.
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive. For instructions, see
“Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61.
Figure 27. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
54 Disk drives
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive. For
instructions, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page
90.
Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
Learn how to remove a Small Computer System Interface (SCSI) disk drive from a system or an
expansion unit.
Preparing to remove the SCSI disk drive by using the HSM
Perform the following tasks by using the Hardware Service Manager (HSM):
1. Log in as root user on the Hardware Service Manager (HSM).
2. At the command line, type diag and press Enter.
3. Press Enter to continue.
4. On the Function Selection display, select Task Selection.
5. On the Task Selection display, select Hot Plug Task.
6. Select SCSI and SCSI RAID Hot Plug Manager.
7. Select Identify a Device Attached to a SCSI Hot Swap Enclosure Device.
8. Select the disk drive you want to replace and press Enter.
The disk drive slot enters the Identify state, indicated by the flashing service indicator. To view the
locations of the service indicator, see “Disk drive locations and service indicators” on page 97.
9. Exit to the SCSI and SCSI RAID Hot Plug Manager display.
10. On the SCSI and SCSI RAID Hot Plug Manager display, select Replace/Remove a Device Attached
to a SCSI Hot Swap Enclosure Device. A list of all slots from which you can remove a disk drive is
shown.
11. Select the slot that contains the disk drive you want to remove. The disk drive slot enters the
Remove state and power is removed from the slot. The service indicator for the slot begins to flash
rapidly.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 28 on
page 56.
2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the
disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the
disk drive cannot slide out of the system or expansion unit.
Important: When you press Enter, after a one minute delay, the service indicator comes on and begins
to flash rapidly. You then have nine seconds to remove the disk drive.
Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on
(solid) and does not flash during this procedure.
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 55
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
Note: The service indicator stops flashing and remains off when the disk drive is removed.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk
drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure
for replacing a disk drive for your operating system.
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see
“Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the
procedure.
Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with
the system or logical partition power turned off
Learn to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or logical partition that controls the disk drive location with the power turned off.
Attention: If you are installing a new or upgraded disk drive with the power turned off, see “Installing
a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical
partition power turned off” on page 31. If you are removing a disk drive as part of a service procedure,
continue to use the following procedure.
Figure 28. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
56 Disk drives
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Preparing the system
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. If you cannot start the system or logical partition that controls the disk drive location, or if the
system or logical partition is not powered on, go to step 6 on page 58.
3. Identify the disk drive to be removed and record the location information. For information, see
Identifying a part.
4. Prepare to remove the disk drive from the system or expansion unit as follows:
v If the system or logical partition that controls the disk drive location is running AIX, follow the
steps in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on
page 105.
v If the system or logical partition that controls the disk drive location is running IBM i, follow the
steps in “Preparing to remove a disk-drive from a system or logical partition controlled by IBM i”
on page 116.
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 57
v If the system or logical partition that controls the disk drive location is running Linux and the
disk drive you want to replace is unprotected or in use, move the data from the disk drive before
continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference
Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
5. Stop the system. For instructions, see Stopping the system or logical partition.
6. If you stopped the logical partition, do not disconnect the power source.
7. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the service position.
8. Remove the service access cover. For instructions, see Removing the service access cover from a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
9. Disconnect the power source from the system by unplugging the system. For instructions, see
Disconnecting the power cords.
Note: This system can be equipped with a second power supply. Disconnect it before continuing
with this procedure. Ensure that the power source to the system has been disconnected.
(L003)
or
10. Attach the wrist strap.
58 Disk drives
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 29.
2. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If
the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit.
3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For
instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93.
Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and
the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel.
v You can proceed to install a disk drive as a replacement to the failed disk drive. For instructions, see
“Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61.
Figure 29. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C
Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 59
v You can proceed to install a disk drive filler into the slot from where you removed the disk drive. For
instructions, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page
90.
60 Disk drives
Replacing a disk drive or solid state drive in the 8231-E2B,
8231-E1C, or 8231-E2C
Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in the server or expansion unit.
Note: These instructions can also be used if you are replacing an SAS disk drive.
To remove and replace a disk drive in a different enclosure, use the information for that enclosure.
Use this procedure to remove a failed disk drive and replace it with the same type of disk drive. If you
plan to remove a disk drive and install a different type of disk drive, remove the existing disk drive.
The system or logical partition can be turned off or turned on when you replace the disk drive. If the
disk drive that is being replaced is in the AIX, IBM i or Linux root volume group (rootvg) and does not
have disk protection, or if it is a PCIe RAID adapter with integrated solid-state disk drives, then replace
the disk drive using the powered-off procedure.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with
the AIX system or logical partition power turned on
Learn to replace a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or an expansion unit while the power to the AIX operating system or logical partition that
controls the disk drive location is turned on.
To complete this procedure, you must have already completed the removal procedure. For the removal
procedure instructions, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the
AIX system or logical partition power turned on” on page 37. If you are installing a new or upgraded
disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with
the AIX system or logical partition power turned on” on page 8. If you are replacing a disk drive as part
of a service procedure, continue to use the following procedure.
© Copyright IBM Corp. 2010, 2011 61
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Attention: This procedure is a complex set of steps that must be performed exactly to avoid potential
data loss or system damage. Alternatively, you can use the procedure, which is a simpler task that
involves less risk of data loss or system damage. If the system or logical partition that controls the disk
drive you are replacing is at the AIX command line, you can use this power turned on procedure. If the
disk drive that you are replacing is in the root volume group (rootvg) and it is not protected by either
Redundant Array of Independent Disks (RAID) or mirroring, or to use the powered-off procedure, go to
“Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition
power turned off” on page 77.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system
1. Perform prerequisite tasks as described in Before you begin.
2. Prepare to remove the disk drive from the system or expansion unit as described in “Preparing to
remove a disk drive from a system or expansion unit controlled by AIX” on page 105.
62 Disk drives
3. Place the system into the service position, as described in Placing a rack-mounted model into the
service position.
4. Remove the service access cover, as described in Removing the service access cover from a
rack-mounted model.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
10. Press Enter on the console.
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 30 on page
64.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 63
Completing the disk drive replace by using the HSM
Perform the following tasks by using the Hardware Service Manager (HSM):
1. Perform the following steps using the HSM after you have replaced the disk drive:
2. Press Enter. The disk drive slot exits the Remove state and enters the Normal state.
3. Exit to the RAID Hot-Plug Devices menu. Press F3 to return.
4. Exit to the Task selection display.
5. Select Log Repair Action.
6. Select the disk drive that you replaced and then press Enter.
7. Select Commit after you have made your selection and then press Enter.
8. Exit to the command line.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
Figure 30. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
64 Disk drives
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Replacing a disk drive using Linux kernel 2.6, with the Linux system or
logical partition power turned on
Learn to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a
system or expansion unit while the power to the Linux operating system or logical partition that controls
the disk drive location is turned on. This procedure uses the Linux kernel 2.6.
You must have removed the failed disk drive before you attempt to replace it. For the procedure to
remove a disk drive using Linux kernel 2.6, see “Removing the disk drive from the 8231-E2B, 8231-E1C,
or 8231-E2C using Linux kernel 2.6 with the Linux system or logical partition power turned on” on page
41. If you are installing a new or upgraded disk drive, see “Installing a disk-drive or solid-state drive in
the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on” on
page 13. If you are replacing a disk drive as part of a service procedure, continue to use the following
procedure.
Attention: Read these instructions carefully and understand the complexity of the powered-on
procedure before you begin. If the system or logical partition that controls the disk drive you are
replacing is at the Linux command line, you can use this powered-on procedure.
Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition
power turned off” on page 77 if:
v You are installing a disk drive that is to be connected with the 6203 or 6204 SCSI adapter
v You do not feel comfortable using the powered-on procedure
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 65
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing the system to replace a disk drive
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) 0 array or is the
disk drive you are replacing in a failed RAID level 5 or 10 array?
v If yes, you must delete the disk array before replacing the disk. For instructions, see PCI-X SCSI
RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded
array. Proceed to the next step.
3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk
array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before
replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
66 Disk drives
Preparing to replace a disk drive by using the HSM
Perform the following steps to replace a disk drive by using the Hardware Service Manager (HSM):
1. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
2. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter.
A Concurrent Device Add display like the following sample is shown.
3. Type option1 (Select) next to the location from which you removed the disk drive.
The Verify Device Concurrent Add display is shown.
4. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add
display is shown.
Proceed to install the replacement disk drive.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 31. Example Concurrent Device Add display
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 67
10. Press Enter on the console.
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 32.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Completing the disk drive replace by using the HSM
Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
Figure 32. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
68 Disk drives
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with
the IBM i system or logical partition power turned on
Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or expansion unit while the power of the IBM i operating system or logical partition that
controls the disk drive location is turned on.
To complete this procedure you must have removed the disk drive. For the procedure to remove the disk
drive, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or an expansion unit
controlled by IBM i” on page 47. If you are installing a new or upgraded disk drive, see “Installing a disk
drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical
partition power turned on” on page 25. If you are removing a disk drive as part of a service procedure,
continue to use the following procedure.
Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid
potential data loss or system damage. Alternatively, you can use the powered-off procedure, which is a
simpler task that involves less risk of data loss or system damage. For instructions, see “Replacing a disk
drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on
page 77.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 69
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Preparing to replace the disk drive
1. Return to the Hardware Service Manager (HSM) console and wait until the Concurrent Maintenance
Results display is shown.
2. Press F12.
3. The physical locations you entered while removing the disk drive might still show on the display. If
not, retype the physical location where you want to install the new disk drive.
In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type
9406, Model 520, sequence number 1234567, and disk drive position P3–D4.
In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with
type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32.
4. Select option 2 (Install device) for the Action to be performed.
5. Set the time delay for one minute:01.
70 Disk drives
Important: Do not press Enter at this time.
Proceed to replace the disk drive.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
10. Press Enter on the console.
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 33 on page
72.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 71
Completing the disk drive replace by using the HSM
1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press
Enter.
2. If you return to the Service Action Log display, exit the service action log.
3. When the Hardware Service Manager display is shown, press F3.
Completing the procedure
1. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Figure 33. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
72 Disk drives
Replacing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C using
hot-spare in an IBM i system or logical partition
You can use the hot-spare function to replace a disk drive in a system or expansion unit while the IBM i
operating system or logical partition that controls the disk drive location is powered on.
To complete this procedure, you must have already completed the removal procedure. For the removal
procedure, see “Removing a disk drive using hot-spare in an IBM i system or logical partition from the
8231-E2B, 8231-E1C, or 8231-E2C” on page 52. If you are replacing a disk drive as part of a service
procedure, continue to use the following procedure.
Preparing to replace the disk drive by using the HSM
Perform the following tasks by using the Hardware Service Manager (HSM):
1. Continue with the replacing of the disk drive after you have removed the disk drive.
2. Return to the console and wait until the Concurrent Maintenance Results display is shown.
3. Press F12.
4. The physical locations you entered while removing the disk drive might still appear on the display. If
not, retype the physical location where you are installing the disk drive.
In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type
9406, Model 520, sequence number 1234567, and disk drive position P3–D4.
In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with
type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32.
5. Select option 2 (Install device) for the Action to be performed.
6. Set the time delay for one minute:01.
Important: Do not press Enter at this time.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
10. Press Enter on the console.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 73
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 34.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Completing the procedure by using the HSM
1. Return to the HSM console and wait until the Concurrent Maintenance Results display is shown,
and press Enter if you used the concurrent maintenance process above, otherwise continue with the
next step.
2. If you return to the Hardware Service Manager (HSM), exit the HSM.
3. Select Work with disk units from the Start System Service Tools display.
4. Select Non-configured disk units from the Work with Disk Units display, and press Enter.
5. Is the new disk unit displayed with a status of non-configured?
Note: It may take several minutes for the new disk drive to display.
v No: Contact your next level of support.
v Yes: Continue with the next step.
6. Press F3 to return to the Work with Disk Units display.
7. Select Work with disk unit recovery from the Work with Disk Units display, and press Enter.
8. Select Disk unit problem recovery procedure, and press Enter.
9. Select Initialize and format disk unit, and press Enter.
Figure 34. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
74 Disk drives
10. Select the new disk and press Enter.
Note: This will take several minutes to complete.
11. When the disk drive is initialized and formatted, press F3 to return to the Work with Disk Units
display.
12. Select Start hot spare, and press Enter.
13. Select the IOA with the new disk, then press Enter and press Enter again.
14. Does the new hot spare disk drive have a smaller capacity than previous hot spare disk drive? Refer
to the resource name identified in the xxxx9031 error which was logged close to the same time of the
original disk failure.
v No: The procedure is complete. Press F3 to return to the System service tools display.
v Yes: Go to the next step.
15. Select Display Device Parity Status, and press Enter. If the new disk drive is not active and hot
spare protected, do not continue. Call your next level of support.
Restriction: You must complete the following steps to ensure your data is protected. The following
steps will allow you to exchange the two disk drives so that the larger disk drive will become the
hot spare, and the smaller disk drive will become a parity set member.
16. Fail the previous hot spare disk drive by unlocking the disk drive handle by squeezing it and
pulling it out toward you.
17. Pull the drive out about one inch and wait ten seconds.
18. Push the disk drive back into the slot and close the handle.
19. Return to step 1 on page 74 and start hot spare on the larger drive.
Replacing a SCSI disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Learn how to replace a Small Computer System Interface (SCSI) disk drive in a system or expansion unit.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
10. Press Enter on the console.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 75
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 35.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Completing the disk drive replace by using the HSM
Perform the following steps using the Hardware Service Manager (HSM):
1. Press Enter.
2. Exit to the SCSI and SCSI RAID Hot-Plug Manager menu.
3. Select Configure Added/Replaced Devices. Press Enter.
4. Select the disk drive that you replaced and then press Enter.
The disk drive is now configured.
5. Exit to the Task Selection display.
6. Select Log Repair Action.
7. Select the disk drive that you replaced and then press Enter.
8. Select Commit after you have made your selection and then press Enter.
9. Exit to the command line.
Completing the procedure
1. Verify the installed part:
Figure 35. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
76 Disk drives
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
2. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with
the system or logical partition power turned off
Learn to replace a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive
in a system or logical partition that controls the disk drive location with the power turned off.
To complete this procedure, you must have already completed the removal procedure. For the removal
procedure, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or
logical partition power turned off” on page 56. If you are installing a new or upgraded disk drive, see
“Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or
logical partition power turned off” on page 31. If you are replacing a disk drive as part of a service
procedure, continue to use the following procedure.
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 77
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see
Identifying a part.
2. Use the location information that you recorded while removing the disk drives to install or replace
the internal disk drives to the correct locations.
3. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle with care.
4. Remove the disk drive from the static-protective package.
5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For
instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on
page 89.
6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to
install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to
install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94.
7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all
the way out, the disk drive does not slide into the system or expansion unit.
78 Disk drives
8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the
system or the expansion unit. Do not hold the disk drive by the handle.
9. Slide the disk drive halfway into the system or expansion unit.
10. Press Enter on the console.
11. When the service indicator starts flashing, slide the disk drive all the way into the system or
expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 36.
Note:
v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the
way into the system.
v The light stops flashing and remains on when you have installed the disk drive properly.
Completing the procedure
1. Replace the service access cover. For instructions, see Installing the service access cover on a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
2. Connect the power cords to the system. For instructions, see Connecting the power cords to the
8231-E2B, 8231-E1C, or 8231-E2C.
3. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the operating position.
4. Start the system or logical partition. For instructions, see Starting the system or logical partition.
5. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
Figure 36. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C
Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 79
6. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
7. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
80 Disk drives
Installing the external SAS port in the 8231-E2B, 8231-E1C, or
8231-E2C system
Learn to install an external SAS port as a part of upgrading your disk drive backplane or to enable
internal disk sharing.
If you want to replace a failed SAS port, see “Replacing the external SAS port in the 8231-E2B, 8231-E1C,
or 8231-E2C system” on page 87. Perform the tasks in this procedure to install an SAS port in the system.
Perform the prerequisite tasks as described in Before you begin.
Perform the following steps to connect the SAS cable to the external SAS connection port on the server
system backplane:
1. Stop the system. For instructions, see Stopping the system or logical partition.
2. Disconnect the power source from the system by unplugging the system, as described in
Disconnecting the power cords.
(L003)
or
3. Attach the wrist strap.
© Copyright IBM Corp. 2010, 2011 81
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
4. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the service position.
5. Remove the service access cover. For instructions, see Removing the service access cover from a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
6. Disconnect all PCI adapters that are attached to the system. For instructions, see Removing a PCI
adapter.
7. If you are installing into a new system, remove the filler from the external SAS connection port on
the disk drive backplane.
8. Connect the SAS shroud of the SAS cable to the external SAS connection port (C) at the rear of the
system backplane until it snaps in place, see Figure 37.
9. Lay the cable (A) beside the side wall and slide it towards the SAS connection port on the disk drive
backplane.
10. Push the SAS shroud (B) into the SAS connection port on the disk drive backplane until it snaps in
place.
11. Connect all PCI adapters. For instructions, see Replacing a PCI adapter.
12. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the operating position.
Figure 37. Installing the SAS cable
82 Disk drives
13. Replace the service access cover. For instructions, see Installing the service access cover on a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
14. Connect the power cords to the system. For instructions, see Connecting the power cords to the
8231-E2B, 8231-E1C, or 8231-E2C.
15. Start the system or logical partition. For instructions, see Starting the system or logical partition.
16. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see
Verifying a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
You can now attach other supported SAS features to the system.
Installing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system 83
84 Disk drives
Removing the external SAS port from the 8231-E2B,
8231-E1C, or 8231-E2C system
Learn to remove a failed SAS port from the disk drive backplane as part of a service procedure.
Perform the prerequisite tasks as described in Before you begin.
Perform the following steps to remove the SAS cable from the external SAS connection port on the server
system backplane:
1. Stop the system. For instructions, see Stopping the system or logical partition.
2. Disconnect the power source from the system by unplugging the system. For instructions, see
Disconnecting the power cords.
Note: This system can be equipped with a second power supply. Disconnect it before continuing with
this procedure. Ensure that the power source to the system has been disconnected.
(L003)
or
3. Attach the wrist strap.
© Copyright IBM Corp. 2010, 2011 85
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of
5 seconds.
4. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B,
8231-E1C, or 8231-E2C system into the service position.
5. Remove the service access cover. For instructions, see Removing the service access cover from a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
6. Disconnect all PCI adapters. For instructions, see Removing a PCI adapter.
7. Press the side tab (B) and pull out the SAS shroud from the external SAS connection port on the disk
drive backplane. See Figure 38.
8. Pull out the SAS cable (C) from the system.
9. Push the SAS connection port down (A) and then dislodge the SAS cable (C) out of the system
backplane.
v You can continue with the procedure to replace the SAS port. See “Replacing the external SAS port in
the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 87.
v If you are not replacing the SAS port, you can install a filler into the external SAS connection port on
the disk drive backplane.
Figure 38. Removing the external SAS port
86 Disk drives
Replacing the external SAS port in the 8231-E2B, 8231-E1C,
or 8231-E2C system
Learn to replace an external SAS port as part of a service procedure.
Perform the prerequisite tasks as described in Before you begin.
Attention: You must have removed the failed external SAS port before you can replace it. Perform this
procedure to replace the failed SAS port on the system backplane:
1. Attach the wrist strap.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
2. Connect the SAS shroud of the SAS cable to the external SAS connection port (C) at the rear of the
system backplane until it snaps in place. See Figure 39 on page 88.
3. Lay the cable (A) beside the side wall and slide it toward the SAS connection port on the disk drive
backplane.
4. Push the SAS shroud (B) into the SAS connection port on the disk drive backplane until it snaps in
place.
© Copyright IBM Corp. 2010, 2011 87
5. Connect all PCI adapters. For instructions, see Replacing a PCI adapter.
6. For a rack-mounted system, place the system into the operating position as described in Placing a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position.
7. Replace the service access cover. For instructions, see Installing the service access cover on a
rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system.
8. Connect the power source to the system by plugging the power cords. For instructions, see
Connecting the power cords.
9. Start the system or logical partition. For instructions, see Starting the system or logical partition.
10. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see
Verifying a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
You can now attach other supported SAS features to the system.
Figure 39. Replacing the SAS cable
88 Disk drives
Removing or installing a disk drive filler
Learn how to remove or install a Small Computer System Interface (SCSI) disk drive filler in the server or
expansion unit.
Note: Refill the disk drive slots with either another disk drive or a disk drive filler. Filling the disk drive
slot ensures proper air flow for cooling and maintains an optimal EMI compliance.
Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C
system
Learn how to remove a Small Computer System Interface (SCSI) disk drive filler from the server or
expansion unit.
When you are installing a disk drive in a system or an expansion unit, the slot you want to use can
contain a disk drive filler.
To remove the filler before installing a disk drive in the slot, complete the following steps:
1. Remove the system unit door, expansion unit door, or open the rack front door.
2. Identify the slot that contains the disk drive filler.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of
5 seconds.
3. Hold the filler handle (A) and pull out the disk drive filler from the slot, as shown in Figure 40 on
page 90.
© Copyright IBM Corp. 2010, 2011 89
Return to the procedure that you brought you here.
Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C
system
You might need to install a Small Computer System Interface (SCSI) disk drive filler in the server or
expansion unit.
When you remove a disk drive from a system or an expansion unit, you can install a disk drive filler in
the slot if you are not installing a replacement disk drive.
To install a disk drive filler in a system or an expansion unit, complete the following steps:
1. Hold the disk drive filler by the top and bottom edges as you position the filler, and insert it into the
system or the expansion unit.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of
5 seconds.
2. Slide the disk drive filler all the way into the system or expansion unit, and push the disk drive filler
handle (A) until it locks, as shown in Figure 41 on page 91.
Figure 40. Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C
90 Disk drives
Return to the procedure that you brought you here.
Figure 41. Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C.
Removing or installing a disk drive filler 91
92 Disk drives
Removing or installing a disk drive bezel
Learn how to remove or install the disk drive bezel from and into a disk drive.
Removing a disk drive bezel from a disk drive or filler
Learn how to remove a disk drive bezel from a disk drive or a disk drive filler.
Learn to remove the disk drive bezel from either a failed disk drive that had a bezel installed in it or
from the disk drive filler in the slot where you are replacing the failed disk drive. The removed disk
drive bezel must be attached in the replacement disk drive or a disk drive filler before installing the disk
drive or filler in the system or expansion unit. To install a disk drive filler, see “Installing a disk drive
bezel in a disk drive or filler” on page 94. Perform the steps in this procedure only if the disk drive or
the disk drive filler has a disk drive bezel attached to it.
Important: If the disk drive bezel does not have a thumbscrew, it is not removable and is a SAS disk
drive.
Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that
you have. However, the steps to perform the task are the same.
1. From the removed disk drive or disk drive filler, gently pull off the disk drive bezel latch (A) from
the front of the filler, as shown in Figure 42 on page 94.
2. Loosen and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the
filler.
© Copyright IBM Corp. 2010, 2011 93
3. Retain the removed disk drive bezel safely and use the removed bezel to attach it to the replacement
disk drive or a disk drive filler.
Return to the procedure that brought you here.
Installing a disk drive bezel in a disk drive or filler
Learn how to install a disk drive bezel into a disk drive.
You must have removed the disk drive bezel from either a failed disk drive that had a bezel installed in
it or from the disk drive filler. To remove a disk drive filler, see “Removing a disk drive bezel from a disk
drive or filler” on page 93. Perform the steps in this procedure only if the disk drive or the disk drive
filler removed had a disk drive bezel attached to it.
Note: If the disk drive bezel does not have a thumbscrew, it is not removable and is a SAS disk drive.
The replacement SAS disk drive comes with an attached bezel.
1. Attach the disk drive bezel (A) to the disk drive or to a disk drive filler.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It
does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of
5 seconds.
2. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive or filler.
Figure 42. Removing a disk drive bezel
94 Disk drives
3. Install the disk drive bezel latch (C) onto the front of the disk drive or filler, as shown in Figure 43.
Return to the procedure that brought you here.
Figure 43. Attaching a bezel to a disk drive filler
Removing or installing a disk drive bezel 95
96 Disk drives
Disk drive locations and service indicators
Find out where the disk drives and service indicators are located on the system or an expansion unit.
Learn about the disk drive locations and location of the service indicators on the system or expansion
unit.
PCIe RAID and SSD SAS adapter locations
Learn about the PCIe RAID and SSD SAS adapter locations and the location of the service indicators on
the PCIe RAID and SSD SAS adapter.
The figure Figure 44 shows the PCIe RAID and SSD SAS adapter locations.
Disk drive locations and service indicators for the 8231-E2B, 8231-E1C,
or 8231-E2C system
Learn about the disk drive locations and the location of the service indicators on the system.
The following figures show the disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or
8231-E2C. Service indicators are located above the latch handle on the disk drives.
Figure 44. PCIe RAID and SSD SAS adapter locations
© Copyright IBM Corp. 2010, 2011 97
Figure 45. Disk-drive locations for systems with six small-form factor (SFF) bays
Figure 46. Disk-drive locations for systems with three SFF bays (optional)
Figure 47. Service indicators locations for systems with six SFF bays
98 Disk drives
Disk drive locations and service indicators for the 5886 disk-drive
enclosure
Learn about the disk drive locations and where the service indicators are located for your disk-drive
enclosure.
The following figure shows the disk drive locations for the 5886 disk-drive enclosure.
Figure 48. Service indicators locations for systems with three SFF bays (optional)
Figure 49. Disk drive locations for the 5886 disk-drive enclosure
Disk drive locations and service indicators 99
The following figure shows the locations of the disk drive service indicators for the 5886 disk-drive
enclosure.
Disk-drive locations and service indicators for the 5887 disk-drive
enclosure
Learn about the disk-drive locations and where the service indicators are located for the 5887 disk-drive
enclosure.
The following figures show the front view and rear view of the 5887 disk-drive enclosure and the disk
drive locations in the 5887 enclosure.
Figure 50. Front view of disk-drive enclosure service indicators for the 5886
Figure 51. Front view of the 5887 disk-drive enclosure showing the disk-drive locations
100 Disk drives
The following figure shows the locations of the disk-drive service indicators for the 5887 disk-drive
enclosure.
Disk drive locations and service indicators for the 5786, 5787,
7031-D24, and 7031-T24 SCSI disk-drive enclosures
Learn about the disk drive locations and where the service indicators are located for your disk-drive
enclosure.
The availability of disk drive locations depends on the number of SCSI interface cards installed in the
SCSI disk-drive enclosure. If the next available disk drive position is the first slot in a disk-drive
enclosure, such as D21, D26, or D46, look at the disk drive controller to verify that the enclosure is
connected to a SCSI bus port. The controller end of the cable is clearly labeled with the disk-drive
enclosure to which it connects. If the enclosure is not connected to a disk drive controller, you must either
install the disk drive in a different disk-drive enclosure or connect the enclosure to a SCSI bus port on a
disk drive controller. You might need to add a disk drive controller before you can add disk drives to the
expansion unit.
Figure 52. Rear view of the 5887 disk-drive enclosure
Figure 53. Front view of the 5887 disk-drive enclosure showing the service indicators
Disk drive locations and service indicators 101
The following figures show the disk drive locations for the front and rear of the 5786, 5787, 7031-D24,
and 7031-T24 SCSI disk-drive enclosures.
The following figures show the locations of the disk drive service indicators (A) for the 5786, 5787,
7031-D24, and 7031-T24 SCSI disk-drive enclosures.
Figure 54. Front view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures
Figure 55. Rear view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures
102 Disk drives
Disk-drive locations and service indicators for the 5802 expansion unit
Learn about the disk-drive locations and where the service indicators are located for your expansion unit.
The following figure shows the disk-drive locations for the 5802.
Service indicators are located at the bottom of the latch handle on each disk drive.
Figure 56. Front view of service indicators for 5786, 5787, 7031-D24, and 7031-T24
Figure 57. Rear view of service indicators for 5786, 5787, 7031-D24, and 7031-T24
Disk drive locations and service indicators 103
Related information
Cabling the 5802 disk subsystem
Figure 58. Disk drive locations
104 Disk drives
Related information
Locate additional disk-drive related information and instructions you might need.
Preparing a disk drive for use in an AIX system or logical partition
Configure the newly installed disk drive for your environment.
If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference
Guide for AIX. If the disk drive is attached to another controller, see the AIX System Management Guide:
Operating System and Devices. These guides are available at the System p and AIX Information Center.
(http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp)
Preparing to remove a disk drive from a system or expansion unit
controlled by AIX
Learn the actions you must take to properly remove a disk drive.
Before removing a disk drive from a system or expansion unit that is controlled by AIX, ensure that all
data on that disk drive or the array that contains the drive is backed up and removed from the disk drive
and that the disk drive is in the Defined state.
If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference
Guide for AIX. If the disk drive is attached to another controller, see the information for that controller or
the AIX System Management Guide: Operating System and Devices. These guides are available at the System
p and AIX Information Center.
Rebuilding data on a replacement disk drive using AIX
To rebuild data on the replacement disk drive, refer to the information for the controller to which the
disk drive is attached.
v If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller
Reference Guide for AIX.
v If the disk drive is attached to a PCI SCSI RAID adapter (feature code 2498), see the PCI 4–Channel
Ultra3 SCSI RAID Adapter Reference Guide.
For more information, see the AIX System Management Guide: Operating System and Devices.These guides
are available at the System p and AIX Information Center.
Removing a disk drive using Linux kernel 2.6
Learn how to remove a disk drive in a location that is controlled by a system or logical partition that is
running Linux kernel 2.6.
Preparing to remove the disk drive
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter.
The IBM Power RAID Configuration Utility display is shown.
3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown.
© Copyright IBM Corp. 2010, 2011 105
4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter.
5. Find the entry in the log for the disk drive you want to replace.
6. Record the location information for the disk drive.
Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0
is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit).
7. Return to the command line.
8. Type the following command:
ls -ld /sys/class/scsi_host/host#/device
Where # is the SCSI host number. Press Enter.
9. Record the PCI location information.
Note: The PCI location information has the form of 61:01:0:2.
10. Type iprconfig on the command line and press Enter.
The IBM Power RAID Configuration Utility display is shown.
11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter.
The Display Hardware Status display is shown.
Kernel Messages Log
Select one of the following:
1. View most recent ipr error messages
2. View ipr error messages
3. View all kernel error messages
4. View iprconfig error messages
5. Set root kernel message log directory
6. Set default editor
7. Restore defaults
8. View ipr boot time messages
Selection:
e=Exit
Figure 59. Kernel Messages Log
106 Disk drives
12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status.
13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive
before continuing with this procedure.
For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters web page.
14. Type option 1 (Display hardware resource information details) next to the disk drive you want to
replace. Press Enter.
The Disk Hardware Resource Information Details is displayed like the shown in the following
sample.
Display Hardware Status
Type option, press Enter.
1=Display hardware resource information details
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ----------------
0000:01:01.0.0/ PCI-X SCSI Adapter Operational
0000:41:01.0.1/ PCI-X SCSI Adapter Operational
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
0001:61:01.0.2/0:6:0 Advanced Function Disk Active
sdi 0001:61:01.0.2/0:9:0 Physical Disk Active
sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed
0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed
0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed
e=Exit q=Cancel r=Refresh t=Toggle
Figure 60. Example Display Hardware Status
Related information 107
15. Record the physical location information.
16. Return to the IBM Power RAID Configuration Utility display.
Removing the disk drive
1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to
Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe
RAID and SSD SAS adapter and complete that procedure, then return here and perform the
following steps.
2. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
3. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A
Concurrent Device Remove display is shown, like the following sample.
Disk Unit Hardware Resource Information Details
Manufacturer . . . . . . . . . . . . . . : IBM
Product ID . . . . . . . . . . . . . . . : ST336607LC
Firmware Version . . . . . . . . . . . . : 43353048 (C50H)
Serial Number. . . . . . . . . . . . . . : 00006719
Capacity . . . . . . . . . . . . . . . . : 36.40 GB
Resource Name. . . . . . . . . . . . . . : /dev/sdd
Physical location
PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0
SCSI Host Number . . . . . . . . . . . . : 2
SCSI Channel . . . . . . . . . . . . . . : 0
SCSI Id. . . . . . . . . . . . . . . . . : 4
SCSI Lun . . . . . . . . . . . . . . . . : 0
Extended Details
FRU Number . . . . . . . . . . . . . . . : 00P2676
More...
Press Enter to Continue
e=Exit q=Cancel f=PageDn b=PageUp
Figure 61. Example Disk Hardware Resource Information Details display
108 Disk drives
4. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter.
5. The Verify Device Concurrent Remove display is shown. The service indicator turns on for that disk
drive slot. .
6. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive
you want to replace, then press Enter. The identify light turns on for the disk drive.
Note: Ensure that the disk drive is not in use to prevent loss of data.
7. The Device Concurrent Remove display is shown.
Attention:
v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic
discharge (ESD) from damaging your hardware.
v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control.
It does not increase or decrease your risk of receiving electric shock when using or working on
electrical equipment.
v If you do not have a wrist strap, just prior to removing the product from ESD packaging and
installing or replacing hardware, touch an unpainted metal surface of the system for a minimum
of 5 seconds.
8. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as
shown in Figure 63 on page 110. If the handle is not all the way out, the disk drive does not slide
out of the system or expansion unit.
Concurrent Device Remove
Choose a single location for remove operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
sdc 0000:41:01.0.1/0:8:0 Physical Disk Active
0000:41:01.0.1/0:5: Empty
sdb 0000:41:01.0.1/0:4:0 Physical Disk Active
sda 0000:41:01.0.1/0:3:0 Physical Disk Active
0000:41:01.0.1/1:8: Empty
sdf 0000:41:01.0.1/1:5:0 Physical Disk Active
sde 0000:41:01.0.1/1:4:0 Physical Disk Active
sdd 0000:41:01.0.1/1:3:0 Physical Disk Active
0001:61:01.0.2/0:8: Empty
sdh 0001:61:01.0.2/0:9:0 Physical Disk Active
sdg 0001:61:01.0.2/0:3:0 Physical Disk Active
Figure 62. Example Concurrent Device Remove display
Related information 109
9. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold
the disk drive by the handle.
10. If your replacement disk drive does not have a bezel preinstalled, then perform the following step.
Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive,
as shown in the following figure.
Figure 63. Removing a disk drive from a model or 7311-D20
110 Disk drives
11. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the
disk drive.
Replacing the disk drive
1. Find the package that contains the new disk drive.
Attention: Disk drives are fragile. Handle them with care.
2. Remove the disk drive from its protective package.
3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk
drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion
unit.
4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install
the disk-drive bezel on the replacement disk drive:
a. Attach the bezel (A) to the disk drive, as shown in Figure 65 on page 112.
b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive.
Figure 64. Removing a disk-drive bezel
Related information 111
5. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press
Enter.
6. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter.
A Concurrent Device Add display like the following sample is shown.
7. Type option1 (Select) next to the location from which you removed the disk drive.
The Verify Device Concurrent Add display is shown.
Figure 65. Attaching a bezel to the disk drive
Concurrent Device Add
Choose a single location for add operations
1=Select
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -----------------
0000:41:01.0.1/0:5: Empty
0000:41:01.0.1/1:8: Empty
0001:61:01.0.2/0:8: Empty
Figure 66. Example Concurrent Device Add display
112 Disk drives
8. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add
display is shown.
9. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion
unit. Do not hold the disk drive by the handle.
10. Slide the disk drive all the way into the system or expansion unit and then lock it in place by
pushing in the disk drive handle, as shown in Figure 67.
11. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is
installed.
Completing the procedure
1. Replace the front cover.
2. Install or close the system, expansion unit, or rack front door.
3. Verify the installed part:
v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying
a repair.
v If you installed the part for any other reason, verify the installed part. For instructions, see
Verifying the installed part.
4. To configure the newly installed disk drive, see the procedure for the operating system or logical
partition that controls the disk drive:
v Preparing a disk drive for use in an AIX system or logical partition.
v Preparing a disk drive for use in a Linux system or logical partition
v Configuring a disk drive in an IBM i system or logical partition.
5. To rebuild the data on the replacement disk drive, see the procedure for the operating system or
logical partition that controls the disk drive:
v Rebuilding data on a replacement drive using AIX.
v Rebuilding data on a replacement drive using Linux.
v Rebuilding data on a replacement drive using IBM i.
Figure 67. Installing a disk drive in a model or 7311-D20
Related information 113
Rebuilding data on a replacement disk drive using Linux
You can rebuild data on the replacement disk drive that was replaced because of a failed disk drive or a
new installation.
For an unprotected disk drive
If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID
Level 10 disk array, perform these tasks:
1. Re-create the disk array.
2. Re-create the file systems on the disk array.
3. Copy the data back to the restored disk array from your backup media.
For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is
available from the SCSI PCI Adapters web page.
For a disk drive in a RAID Level 5 or RAID Level 10 disk array, select the appropriate procedure.
Rebuild data on a disk drive using Linux kernel 2.6
Learn the steps to rebuild your data on a disk drive.
If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you
can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID
Controller Reference Guide for Linux.
This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/
infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm.
If a hot-spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly
installed disk drive. To initiate the rebuild, follow these steps:
1. Log in as root user.
2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID
Configuration Utility display is shown.
3. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press
Enter. The Work with Disk Unit Recovery display is shown.
4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data.
A Rebuild Disk Unit Data display similar to the following is shown.
114 Disk drives
5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter.
Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive.
A Confirm Rebuild Disk Unit Data display similar to the following is shown.
Rebuild Disk Unit Data
Select the disks to be rebuilt
Type choice, press Enter.
1=Rebuild
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- -------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
e=Exit q=Cancel t=Toggle
Figure 68. Example Rebuild Disk Unit Data display
Related information 115
6. To confirm that you want to rebuild the data on the disk drive, press Enter.
The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery
display. The rebuild process might take several minutes to complete.
Preparing to remove a disk-drive from a system or logical partition
controlled by IBM i
Learn the actions you must take to properly remove a disk drive.
Before removing a disk-drive from a system or expansion unit that is controlled by IBM i, follow these
steps:
1. Determine the protection status of the disk-drive that you are replacing. For instructions, see
“Determining the IBM i disk-drive protection status” on page 117.
2. For mirrored disk-drives on an IBM i system or logical partition, follow these steps to verify that the
disk-drive you are replacing has been suspended:
a. Sign on with service level authority.
b. Type strsst on the command line of the IBM i session and press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case-sensitive.
d. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press
Enter.
Confirm Rebuild Disk Unit Data
Rebuilding the disk unit data might take several minutes for each disk
selected.
Press Enter to confirm having the data rebuilt.
q=Cancel to return and change your choice.
OPT Name PCI/SCSI Location Description Status
--- ------ -------------------------- ------------------------- ---------------
1 0000:58:01.0.0/0:4:0 RAID Array Member Failed
q=Cancel t=Toggle
Figure 69. Example Confirm Rebuild Disk Unit Data display
116 Disk drives
g. Does the disk-drive that is mirrored to the disk-drive that you are replacing (same drive number)
have a status of Active?
v No: The replacement must be performed by a service provider.
v Yes: Does the disk-drive that you are replacing have a status of Suspended?
– Yes: Continue with the next step.
– No: Follow these steps to suspend mirrored protection on the disk-drive that you are
replacing:
1) Press F3 from the Display Disk Configuration display to return to the Work with Disk
Units display.
2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press
Enter.
3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display.
Press Enter.
4) Select the option to suspend the disk-drive that you are replacing from the Suspend
Mirrored Protection display. Press Enter.
h. Press F3 repeatedly to exit System Service Tools and return to the main menu.
Determining the IBM i disk-drive protection status
To determine the protection status of the disk-drives in your IBM i system or logical partition, follow
these steps from the System Service Tools (SST) display:
Tip: If you do not know the protection status of the disk-drives and you cannot get to the System Service
Tools display, contact your next level of support.
1. Sign on with at least service level authority.
2. Type strsst on the command line of the IBM i session and press Enter.
3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case-sensitive.
4. Select Work with disk units from the System Service Tools (SST) display. Press Enter.
5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter.
6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter.
A list of each auxiliary storage pool (ASP) displays, showing the disk-drives that are members of the
ASP. The Status column shows one of the following protection statuses for the ASP:
v Mirrored. These ASPs are configured for mirrored protection. If the failing disk-drive has mirrored
protection, record the status of both disk-drives in the mirrored pair. You need this information in
the recovery procedure.
v Unprotected. An ASP with a status of Unprotected might contain disk-drives that are device parity
protected. Record the status of the failing disk-drive as it is shown in the Status column of the
Display Disk Configuration Status display.
A disk-drive with one of the following statuses has device parity protection only if all other
disk-drives in the array are operational:
– DPY/Active
– DPY/Failed
– DPY/HDW Failure
– DPY/Degraded
– DPY/Power Loss
– DPY/Not Ready
Related information 117
– DPY/Unknown
A disk-drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have
device parity protection.
Examples:
– Suppose one disk-drive has a status of DPY/Failed while all other disk-drives have a status of
DPY/Unprotected. In this case, only the disk-drive with the DPY/Failed status is protected.
Servicing the failed disk-drive does not result in data loss. Removing any of the unprotected
disk-drives might result in data loss.
– All disk-drives have a status of DPY/Active or DPY/Degraded, therefore, all disk-drives are
protected. In this case, exchanging any disk-drive does not result in data loss.
Tip: For more information about the meaning of a status for any disk-drive, use the Help key on
the display.
Configuring a disk drive in an IBM i system or logical partition
To configure a newly installed disk drive into an existing device parity set, follow these steps:
If you want to start a new device parity set or change the protection on your disks, see Manage device
parity protection or Start device parity protection in the Disk Management section of the IBM i
Information Center web site at http://www.ibm.com/systems/i/infocenter .
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Restriction: The service tools password is case sensitive.
3. Select Work with Disk Units on the System Service Tools display. Press Enter.
4. Select Display disk configuration on the Work with Disk Units display. Press Enter.
5. Select Display non-configured units on the Display disk configuration display. Press Enter.
6. The disk drive that you installed is listed. The serial number must match the serial number you
recorded (the last four digits).
Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not
listed, ensure that the disk drive was properly installed.
7. Press F12 twice.
8. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
9. Are you adding a disk drive to an existing device parity set?
v Yes: Continue with the next step.
v No: Go to step 23 on page 119 for mirroring or no protection, go to 1 on page 120 for hot spare.
Tip: Select No for mirroring, hot spare or no protection.
10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter.
11. Select Include unit in device parity protection on the Work with Device Parity Protection display.
Press Enter.
12. Did the Include Device Parity Protection Failed display appear?
v Yes: Continue with the next step.
v No: Go to step 15 on page 119.
13. The following conditions must be met before you can start device parity protection.
118 Disk drives
v If there are enough disk drives available to create a new parity set, the drives are eligible for Start
Device Parity Protection operation and not for the Include Device Parity Protection operation.
v All disk drives in a parity set must be the same capacity with a minimum number of three or four
disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the
disk drive type.
v Not all disk drives attached to an advanced function input/output processor have reported to the
system. Repeat the operation.
v The type or model of the disk drives cannot be used for the requested operation.
When these conditions are met, press F12 to return to the Work with Disk Configuration display.
14. Select Include unit in device parity protection on the Work with disk configuration display. Press
Enter.
15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included
in Device Parity Protection. Type a 1 in the Option column. Press Enter.
16. Press Enter to confirm your choice. The configuration starts and might take 30 - 60 minutes to
complete.
17. When the device parity protection is complete, the message “Selected drives have been included
successfully” is shown on the Work with device parity protection display. Press F12.
18. Select Display disk configuration on the Work with disk configuration display. Press Enter.
19. Select Display disk configuration status on the Display disk configuration display.
20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to
add the disk drive.
21. Press F12 twice to get to the Work with Disk Units display.
22. Select Work with disk configuration on the Work with Disk Units display. Press Enter.
23. Select the option to add units to ASPs and balance data on the Work with disk configuration display.
Press Enter.
24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1.
Press Enter.
Notes:
a. If ASP is mirror protected, disk drives must be added in pairs of the same size.
b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become
mirror protected.
c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information
Center.
To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/
infocenter and select from the following options:
v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup
and recovery > Recovering your system
v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and
recovery > Backing up and recovering your system > Recovering your system
25. The Confirm Add Units displays showing the configuration your system must have when the add
operation completes.
Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to
continue. The add process takes several minutes to complete.
26. When the message Selected units have been added successfully is shown, press F3 three times
and press Enter to return to the Main menu display.
27. If you have a printer, print the configuration list. A service provider might refer to the configuration
list in the future.
a. You must sign on with at least service level authority
Related information 119
b. Type strsst on the Main menu command line. Press Enter.
c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST)
Sign On display. Press Enter.
Remember: The service tools password is case-sensitive.
d. Select Start a service tool on the System Service Tools (SST) display. Press Enter.
e. Select Hardware service manager on the Start a Service Tool display. Press Enter.
f. Press F6 (print the configuration) from the Hardware Service Manager display.
g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter.
h. Place the configuration list where you can refer to it in the future.
28. If you are a service provider performing this task, return to the procedure that sent you here.
1. Select Work with hot spare protection from the Work with disk configuration menu.
2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection -
RAID 6 with hot spare depending on the level of parity protection that is wanted.
3. Return to the procedure that sent you here.
Configuring a disk drive on a load source adapter for hot spare using
the IBM i operating system
To start hot spare protection on your load source adapter you must determine the status of the disk drive
in the load source slot and configure or exchange that disk drive, which might require you to turn off the
power to the system.
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case sensitive.
3. Determine the protection status of all disks controlled by the load source adapter. Do all disk drives
that are a member of a parity sets or mirror sets have a status of active?
v No: Replace the disk using the following procedure in, “Replacing a disk drive in the 8231-E2B,
8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 69.
v Yes: Continue to the next step.
Restriction: There must be a disk drive of the same or larger capacity located in a load source
capable slot to act as a hot spare for the active load source disk drive. The next steps will have you
change the disk drive in the desired slot if needed.
Attention: If the disk drive acting as hot spare for the load source is of smaller capacity you can
suffer data loss.
4. Determine whether a non-configured disk drive of the same or larger capacity is located in a load
source capable slot but is not the active load source. Only parallel SCSI places a restriction on the
location of the load source. Any SAS, Fibre Channel, or virtual disk drive does not have this
restriction.
v No non-configured drive is located in a load source capable slot. Continue with the next step
v Yes a non-configured drive is located in a load source capable slot. Return to the procedure that
sent you here, and continue to the next step of that procedure.
5. Choose the disk drive to be installed or exchanged with the drive in the load source slot by doing the
following:
a. Select a disk drive in a parity set located in a load source capable slot, but is not the active load
source drive, and record the location of this disk drive.
120 Disk drives
b. Select a non-configured disk drive of the same or larger capacity from either a non-load source
slot or a failed drive you are replacing with a non-configured disk drive.
6. Stop the system if it is running. For instructions on stopping the system, see Stopping the system or
logical partition.
7. Exchange the disk drives from the two locations you recorded in step 5 on page 120 by removing
both disk drives and placing them into the other disk drives slot.
8. Start the system. For instructions, see Starting the system or logical partition
9. Return to the procedure that sent you here and continue with the next step in that procedure.
Rebuilding data on a replacement disk drive using IBM i
Learn how to rebuild data on replacement disks.
To rebuild data on the replacement disk drive, complete the following steps:
1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i
session and pressing Enter.
2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign
On display. Press Enter.
Note: The service tools password is case sensitive.
3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter.
4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter.
5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these
steps to restore device parity protection (RAID) on the disk drive:
a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter.
b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you
removed) on the Rebuild Disk Unit Data display. Press Enter.
c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several
minutes to complete.
d. Press F5 to refresh the display until the Percent complete shows 5%.
e. When the display shows at least 5% complete, you can either continue to monitor this display to
completion, or press F3 (Exit) to return to the Work with disk units display.
f. Press F3 (Exit) to return to the System service tools display.
g. Press F3 (Exit) to return to the Exit SST display and press Enter.
h. Return to the procedure that sent you here.
6. If the disk drive that you replaced had mirrored protection, follow these steps:
a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter.
b. Select the configured disk drive that you are exchanging (suspended drive) on the Select
Configured Unit to Replace display. Press Enter.
c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has
a non-configured status.
Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these
steps until the new drive is shown.
Press Enter.
d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for
replacement.
The replacement process might take several minutes to complete. When the process is complete,
the Work with Disk unit recovery display is shown.
e. Press F3 (Exit) to return to the Work with disk units display.
Related information 121
f. Select Display disk configuration on the Work with disk units display.
g. Select Display disk configuration status on the Display Disk Configuration display.
Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process
might take several minutes to complete. You can either monitor this display to completion, or
press F3 (Exit) three times, and then press Enter to return to the IBM i main menu.
Replacing a SAS conduit card in a 5802 expansion unit with the power
turned off
Learn how to replace a SAS conduit card in the 5802 expansion unit with the power turned off.
Perform the prerequisite tasks as described in Before you begin.
1. Stop the system if it is running. For instructions, see Stopping the system or logical partition.
2. Disconnect the power source from the system by unplugging the system.
Attention: This system might be equipped with a second power supply. You must disconnect the
power source from the system by disconnecting all power cords to prevent system damage during
this procedure.
(L003)
or
3. Perform the following steps to replace the midplane and SAS conduit card into the system:
a. Place the card connector (C) on the midplane connection (D) using the guide track (B) to ensure
a good connection.
b. Lift the handle (A) to seat and secure the SAS conduit card to the midplane as shown in
Figure 70 on page 123.
122 Disk drives
4. Perform the following steps to replace the midplane and SAS conduit card into the system:
a. Align the midplane with the track and ensure that the slot (A) on the midplane attaches to the
post (B) in the system.
b. Secure the thumbscrews (C) and (D) on the SAS conduit card.
c. Secure the thumbscrews (E) on the sides of the midplane.
Figure 70. Installing the SAS conduit card into the midplane
Related information 123
5. Attach the power cord cover and secure the thumbscrew (A) to the system.
Figure 71. Installing the SAS conduit card and the midplane
Figure 72. Installing the power cord plate
124 Disk drives
6. Carefully slide the enclosure management controller into the system and close the latch (A) as shown
in Figure 73.
7. Carefully lift and align the I/O system board with the slot at the rear of the expansion unit.
8. Insert the I/O system board firmly into the server, as shown in Figure 74 on page 126.
9. Secure the I/O system board with the locking tabs (A), as shown in Figure 74 on page 126.
Figure 73. Replacing the enclosure management controller
Related information 125
10. To replace the power supply, with the locking handle (A) in the open position, push the power
supply into the expansion unit as shown in Figure 75 on page 127.
11. Close the locking handle (A) until the power supply locks into position.
Figure 74. Install the I/O system board on a rack mounted model
126 Disk drives
12. To replace the disk drive backplane, lift the backplane along the two sides and align the backplane
with the guide rails in the expansion unit.
CAUTION:
The backplane can be heavy to lift. Ensure that you can safely perform this task before you
begin.
13. Using your hand to support the bottom of the backplane, slide it into the expansion unit as shown
in Figure 76 on page 128.
14. Secure the backplane in place by moving the locking tabs from the open position (B) to the locked
position (A).
Figure 75. Replacing a power supply
Related information 127
15. Replace all disk drive units and fillers, if you had removed them earlier.
a. Support the bottom of the disk drive unit as you align it with the guide rails in the backplane.
b. Slide the disk drive unit all the way into the backplane.
c. Push the handle (A) toward the disk drive unit to lock it into place.
Figure 76. Installing the backplane on a rack-mounted model
128 Disk drives
16. Replace all expander cards and fillers, if you had removed them earlier.
a. Supporting the bottom of the expander card with your hand, align it with the guide rails in the
backplane.
b. Slide the expander card all the way into the backplane.
c. Push the handle (A) toward the card to lock it into place.
17. Start the system. For instructions, see Starting the system or logical partition.
Continue with any other service actions you are required to perform.
v For information about connecting the system, see Connecting the expansion units.
v For information about installing the PCI adapters into the system board, see Model 5802 and 5877
expansion units, PCI adapters, and cassettes.
Removing a SAS conduit card from a 5802 expansion unit with the
power turned off
Learn how to remove a SAS conduit card from a 5802 rack-mounted expansion unit if you must replace a
failing part with the power turned off.
Perform the prerequisite tasks as described in Before you begin.
1. Identify the card you want to remove. For instructions on identifying a failing part, see Identifying a
failing part.
2. Stop the system if it is running. For instructions on stopping the system, see Stopping the system or
logical partition.
3. Remove the disk drives and fillers as shown in Figure 77 on page 130.
a. Squeeze the latch of the disk drive and pull the handle (A) toward you to release the disk drive.
b. Support the bottom of the disk drive with your hand and slide the disk drive out of the
backplane.
Related information 129
4. Remove the port cards and fillers as shown in Figure 78 on page 131.
a. Hold the end of the latching handle (A) and pull it upward to unlock the port card.
b. Using your hand to support the bottom of the port card, slide the port card out of the backplane.
c. Place the port card on an electrostatic discharge (ESD) surface.
Figure 77. Removing the disk drives from the backplane
130 Disk drives
5. Remove the backplane as shown in Figure 79 on page 132.
CAUTION:
The backplane might be heavy. Ensure that you can safely perform the procedure.
a. Squeeze the blue latches (A) to unlock the handles.
b. Pull the handles (B) outward in the direction shown to release the backplane.
c. Slide the backplane out of the enclosure and use your hand to support the bottom of the
backplane.
Figure 78. Removing the port card from the backplane
Related information 131
6. Disconnect both the offline converter assembly (OCA) power cords by performing the following
steps:
a. Slide the power cord holders (A) to the left, located on the front of both the OCAs.
b. Label and disconnect both the OCA power cords (B) from each OCA.
Figure 79. Removing the backplane from the enclosure
132 Disk drives
7. Remove the offline converter assemblies by performing the following steps:
a. Pull the OCA handle (A) all the way down in the direction shown to unseat the OCA.
b. Hold both sides of the OCA and pull it out of the enclosure.
Note: Use both the hands to support the OCA as you pull it out of the enclosure.
c. Repeat these steps to remove both OCAs.
Figure 80. Disconnecting the OCA power cords
Related information 133
8. Remove the enclosure management controller by performing the following steps:
a. Label and disconnect any cables connected to the enclosure management controller card.
b. Pinch the latch (A) to unlock the handle (B).
c. Pull the handle outward in the direction shown to unseat the EMC card.
d. Slide the enclosure management controller card straight out of the enclosure.
Figure 81. Remove the offline converter assemblies
134 Disk drives
9. Remove the power cord plate by turning the thumbscrew (A) and pulling the plate away from the
system, as shown in Figure 83.
10. Disconnect the cables from the back of the I/O system board.
Figure 82. Remove the enclosure management controller
Figure 83. Removing the power cord plate
Related information 135
11. Squeeze the release latches (A) and pull the levers (B) out away from the system, as shown in
Figure 84.
12. Slide the system board out of the system.
13. Remove the SAS conduit card and midplane from the enclosure by doing the following steps:
a. Pull the spring plungers (A) up and turn them one quarter to the left.
b. Turn the captive screws (B) and (C) counterclockwise until they are completely loosened.
c. Pull the SAS conduit and midplane component out of the enclosure.
Figure 84. Removing the I/O system board
136 Disk drives
14. Push the handles (A) on the conduit card down to disengage the card from the midplane, and
remove the card.
Continue with replacing the SAS expander card or return to the procedure that sent you here.
Figure 85. Removing the SAS conduit card and midplane
Figure 86. Removing the SAS conduit card from the midplane
Related information 137
Replacing a SAS expander card in a 5802 expansion unit with the
power turned off
Learn how to replace a SAS expander card in a 5802 expansion unit with the power turned off.
Review the requirements in Before you begin.
1. Before installing or reinstalling a SAS expander card, inspect the signal connector pins to ensure that
the pins are not bent or damaged.
Attention: Bent pins damage the backplane receptacle. Be careful not to touch or damage the signal
connector pins.
2. Slide the card completely into the slot and push the lever (A) down to secure the card as shown in
the following figure.
3. Start the system. For instructions see, Starting the system or logical partition.
4. Verify the presence of the newly installed card. For instructions see, Verifying the installed part.
Continue with any other service actions you need to perform.
Removing a SAS expander card from a 5802 expansion unit with
power turned off
Learn how to remove a SAS expander card from a 5802 (rack-mounted) expansion unit if you need to
replace a failing part with power turned off.
1. Perform prerequisite tasks. For instructions, see Before you begin.
2. Identify the card you are going to remove. For information, see Identifying a part.
3. Lift the handle (A) and pull the card out of the system, as shown in the following figure.
Figure 87. Installing the SAS expander card
138 Disk drives
Continue with replacing the SAS expander card or return to the procedure that sent you here.
Figure 88. Removing the SAS expander card.
Related information 139
140 Disk drives
Common procedures for installable features
This section contains all the common procedures that are related to installing, removing, and replacing
features.
© Copyright IBM Corp. 2010, 2011 141
142 Disk drives
Before you begin
Observe these precautions when you are installing, removing, or replacing features and parts.
These precautions are intended to create a safe environment to service your system and do not provide
steps for servicing your system. The installation, removal, and replacement procedures provide the
step-by-step processes required to service your system.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
© Copyright IBM Corp. 2010, 2011 143
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Before you begin a replacement or installation procedure, perform these tasks:
1. If you are installing a new feature, ensure that you have the software required to support the new
feature.
To do this, go to the following Web site: IBM Prerequisite
2. If you are performing an installation or replacement procedure that might put your data at risk,
ensure, wherever possible, that you have a current backup of your system or logical partition
(including operating systems, licensed programs, and data).
3. Review the installation or replacement procedure for the feature or part.
4. Note the significance of color on your system.
144 Disk drives
Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware
to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also
indicate that the part can be removed and replaced with the system or logical partition power on.
5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of
scissors.
6. If parts are incorrect, missing, or visibly damaged, do the following:
v If you are replacing a part, contact the provider of your parts or next level of support.
v If you are installing a feature, contact one of the following service organizations:
– The provider of your parts or next level of support.
– In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL)
at 1–800–300–8751.
In countries and regions outside of the United States, use the following Web site to locate your service
and support telephone numbers:
http://www.ibm.com/planetwide
7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller,
or your next level of support.
8. If you are installing new hardware in a logical partition, you need to understand and plan for the
implications of partitioning your system. For information, see Logical Partitioning.
Before you begin 145
146 Disk drives
Identifying a part
Use these instructions to learn how to identify the location of a failed part, the location of a part to be
removed, or the location to install a new part on your system or expansion unit using the appropriate
method for your system.
For IBM Power Systems™ servers that contain the POWER7 processor, the light-emitting diodes (LEDs)
can be used to identify or verify the location of a part that you are removing, servicing, or installing.
The combination identify and fault LED (amber color) shows the location of a field replaceable unit
(FRU). When removing a FRU, first verify whether you are working on the correct FRU by using the
identify function in the management console or other user interface. When removing a FRU under the
direction of the hardware management console, the identify function is activated and deactivated
automatically at the correct times.
The identify function causes the amber LED to blink. When you turn off the identify function, the LED
returns to the state it was previously. For parts that have a blue service button, the identify function sets
LED information for the service button so that when the button is pressed, the correct LEDs on that part
blink.
If you need to use the identify function, use the following procedures.
Control panel LEDs
Use this information as a guide to the control panel LEDs and buttons.
The control panel has LEDs that indicate various system status.
Figure 89. Control panel
© Copyright IBM Corp. 2010, 2011 147
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
– A constant light indicates the identify state, which is used to identify a part.
– No light indicates that the system is operating normally.
v D: System information light
– No light indicates that the system is operating normally.
– Light on indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault in the system unit.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
Related concepts
Identifying a failing part
Use these instructions to learn how to locate and identify a failing part on your system or expansion unit
using the appropriate method for your system.
Identifying a failing part in an AIX system or logical partition
Use these instructions to learn how to locate a failing part, and then activate the indicator light for that
part on a system or logical partition running the AIX operating system.
Locating a failing part in an AIX system or logical partition
You might need to use AIX tools, before activating the indicator light, to locate a part that is failing.
1. Log in as root user or celogin-.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The
Display Diagnostic Log display shows a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Select Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. See “Activating the indicator light for the failing part” on page 149.
148 Disk drives
Activating the indicator light for the failing part
Use these instructions to help physically identify the location of a part you are servicing.
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators and press Enter.
5. From the list of lights, select the location code for the failing part and press Enter. When a light is
activated for a failing part, an I character precedes the location code.
6. Select Commit. This turns off the system attention and indicator light for the failing part.
7. Exit to the command line.
Identifying a failing part in an IBM i system or logical partition
You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part.
Activating the failing-part indicator light
You can search the service action log for an entry that matches the time, reference code, or resource of a
problem, and then activate the indicator light for a failing part.
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point™ utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
Identifying a part 149
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. If location information is available, select option 6 (Indicator on) to turn on the failing part's
indicator light.
Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is
activated. For example, the indicator light for the backplane or unit that contains the failing part
might be lit. In this case, use the location information to locate the actual failing part.
12. Look for the enclosure indicator light to locate the enclosure that contains the failing part.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
150 Disk drives
Identifying a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you can activate or deactivate the
indicator lights to locate a part or compete a service action.
Locating a failing part in a Linux system or logical partition
If the service aids have been installed on a system or logical partition, you need to activate the indicator
lights to locate a part.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
3. Look for the system attention light to identify the enclosure that contains the failing part.
Related information
Service and productivity tools for Linux on POWER systems
IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux
operating systems on IBM Power systems.
Finding the location code of a failing part in a Linux system or logical
partition
To retrieve the location code of the failing part, if you do not know the location code, use the procedure
in this topic.
To locate the failing part in a system or logical partition follow these steps:
1. Log in as root user.
2. At the command line, type grep diagela /var/log/platform and press Enter.
3. Look for the most recent entry that contains a system reference code (SRC).
4. Record the location information.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on
Power Systems servers
Activating the indicator light for the failing part
If you know the location code of the failing part, activate the indicator light to help you locate which part
to replace.
To activate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter.
3. Look for the system attention light to identify the enclosure that contains the failing part.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power
Systems servers
Deactivating the failing-part indicator light
After you complete a removal and replacement procedure, you must deactivate the failing-part indicator
light.
Identifying a part 151
To deactivate the indicator light, follow these steps:
1. Log in as root user.
2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter.
Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for
Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power
Systems servers
Locating a failing part in a Virtual I/O Server system or logical partition
You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is
failing.
1. Log in as root user or celogin-.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection and press Enter.
4. Select Display Previous Diagnostic Results and press Enter.
5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A
Display Diagnostic Log display appears. This display contains a chronological list of events.
6. Look in the T column for the most recent S entry. Select this row in the table and press Enter.
7. Choose Commit. The details of this log entry are shown.
8. Record the location information and the SRN value shown near the end of the entry.
9. Exit to the command line.
Use the location information for the failing part to activate the indicator light that identifies the failing
part. For instructions, see “Identifying a part by using the Virtual I/O Server.”
Identifying a part by using the Virtual I/O Server
Use these instructions to turn on the indicator light to help you physically locate a part by using the
Virtual I/O Server (VIOS).
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. From the Function Selection menu, select Task Selection. Press Enter.
4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter.
5. From the list of lights, select the location code for the failing part and press Enter.
6. Select Commit. This turns on the system attention and indicator light for the failing part.
7. Exit to the command line.
152 Disk drives
Starting the system or logical partition
Learn how to start a system or logical partition after performing a service action or system upgrade.
Starting a system that is not managed by a Hardware Management
Console or or a Systems Director Management Console
You can use the power button or the Advanced System Management Interface to start a system that is
not managed by a Hardware Management Console (HMC) or an IBM Systems Director Management
Console (SDMC).
To start a system that is not managed by a HMC or SDMC, follow these steps:
1. Open the front rack door, if necessary.
2. Before you press the power button on the control panel, ensure that power is connected to the system
unit as follows:
v All system power cables are connected to a power source.
v The Power LED, as shown in the following figure, is slowly blinking.
v The top of the display, as shown in the following figure, shows 01 V=F.
3. Press the power button (A), as shown in the following figure, on the control panel.
v A: Power-on button
v B: Power LED
– A constant light indicates full system power to the unit.
– A blinking light indicates standby power to the unit.
Figure 90. Control panel
© Copyright IBM Corp. 2010, 2011 153
Note: There is approximately a 30-second transition period from the time the power-on button is
pressed to when the power LED goes from blinking to solid. During the transition period, the LED
might blink faster.
v C: Enclosure identify light
– A constant light indicates the identify state for the enclosure or for a resource within the
enclosure.
– No light indicates that no resources in the enclosure are being identified.
v D: Attention light
– No light indicates that the system is operating normally.
– A solid light indicates that the system requires attention.
v E: USB port
v F: Enclosure fault roll-up light
– A constant light indicates a fault indicator active in the system.
– No light indicates that the system is operating normally.
v G: Function/Data display
v H: Decrement button
v I: Enter button
v J: Increment button
v K: Pinhole reset button
4. Observe the following after pressing the power button:
v The power-on light begins to blink faster.
v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to
operating speed.
v Progress indicators, also referred to as checkpoints, appear on the control panel display while the
system is being started. The power-on light on the control panel stops blinking and remains on,
indicating that system power is on.
Tip: If pressing the power button does not start the system, do the following steps to start the system
using the Advanced System Management Interface (ASMI):
1. Access the ASMI. For instructions, see Accessing the ASMI.
2. Start the system using the ASMI. For instructions, see Powering the system on and off.
Starting a system or logical partition by using the Hardware
Management Console
You can use the Hardware Management Console (HMC) user interface to start the system or logical
partition after the required cables are installed and the power cables are connected to a power source.
For instructions on working with the HMC, see Managing the Hardware Management Console. For
instructions on starting a logical partition, see Logical partitioning. For instructions on starting the
system, see Powering on the managed system.
Progress indicators, also referred to as checkpoints, appear on the control panel display while the system
is being started. When the power-on light on the control panel stops blinking and remains on, the system
power is on.
154 Disk drives
Starting a system or virtual server with the Systems Director
Management Console
You can use the IBM Systems Director Management Console (SDMC) user interface to start the system or
virtual server after the required cables are installed and the power cables are connected to a power
source.
For instructions on working with the SDMC, see Managing and configuring the SDMC. For instructions
on starting a virtual server, see Managing virtual servers. For instructions on shutting down and
restarting virtual servers, see Shutting down and restarting virtual servers.
Progress indicators, also known as checkpoints, display on the control panel while the system is being
started. When the power-on light on the control panel stops flashing and remains on, the system power is
on.
Starting the system 155
156 Disk drives
Stopping a system or logical partition
Learn how to stop a system or logical partition as a part of a system upgrade or service action.
Attention: Using either the power-on button on the control panel or entering commands at the
Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data
files. Also, the next time you start the system, it might take longer if all applications are not ended before
stopping the system.
To stop the system or logical partition, select the appropriate procedure.
Stopping a system that is not managed by an HMC or an SDMC
You might need to stop the system to perform another task. Use these instructions to stop the system
using the power button or Advanced System Management Interface.
Before you stop the system, follow these steps:
1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options.
2. Ensure that all jobs are completed and end all applications.
3. Ensure that the operating system is stopped.
Attention: Failure to do so can result in the loss of data.
4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that
the clients have access to their devices using an alternate method.
The following procedure describes how to stop a system that is not managed by a Hardware
Management Console (HMC).
1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down
System) command.
2. At the command line, enter one of the following commands:
v If your system is running the AIX operating system, type shutdown.
v If your system is running the Linux operating system, type shutdown -h now.
v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned,
use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the
PWRDWNSYS command to power down the primary partition.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
3. At the Linux command line, type shutdown -h now.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
4. At the Linux command line, type shutdown -h now.
The command stops the operating system. The system power turns off, the power-on light begins to
slowly flash, and the system goes into a standby state.
5. Record the IPL type and the IPL mode from the control panel display to help you return the system
to this state when the installation or replacement procedure is completed.
6. Set the power switches of any devices connected to the system to off.
7. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug
power cables from peripheral devices, such as printers and expansion units.
© Copyright IBM Corp. 2010, 2011 157
Important: The system is equipped with a second power supply. Before continuing with this
procedure, ensure that all power sources to the system have been disconnected.
(L003)
or
Stopping a system by using the Hardware Management Console
You can use the Hardware Management Console (HMC) user interface to stop the system or a logical
partition.
By default, the managed system is set to power off automatically when you shut down the last running
logical partition on the managed system. If you set the managed system properties on the HMC so that
the managed system does not power off automatically, you must use this procedure to power off your
managed system.
Attention: If possible, shut down the running logical partitions on the managed system before powering
off the managed system. Powering off the managed system without shutting down the logical partitions
first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual
I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to
their devices using an alternate method.
To power off a managed system, you must be a member of one of the following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
Use the following steps to stop the system using the HMC.
1. In the Navigation area, expand the Systems Management folder.
158 Disk drives
2. Click the Servers icon.
3. In the Contents area, select the managed system.
4. Select Tasks, then Operations, and then Power Off
5. Select the appropriate power-off mode and click OK.
Related information
Shutting down and restarting logical partitions
Stopping a system with the Systems Director Management Console
You can use the IBM Systems Director Management Console (SDMC) user interface to stop the system or
a virtual server.
By default, the managed system is set to power off automatically when you shut down the last running
virtual server on the managed system. If you set the managed system properties on the SDMC so that the
managed system does not power off automatically, you must use this procedure to power off your
managed system.
Attention: If possible, shut down the running virtual servers on the managed system before powering
off the managed system. Powering off the managed system without shutting down the virtual servers
first causes the virtual servers to shut down abnormally and can cause data loss. If you use a Virtual I/O
Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to
their devices with an alternate method.
To power off a managed system, you must be a member of one of the following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
Use the following steps to stop the system using the SDMC.
1. In the Power Systems Resource area, select the managed system you want to power off.
2. From the Actions menu, select Operations ? Power Off.
3. Select the appropriate power-off mode and click OK.
Stopping a system or logical partition 159
160 Disk drives
Removing and replacing the expansion unit cover or door
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Removing the front cover on the 7314-G30 or 5796
You might need to remove the front cover to perform service to the system.
To remove the front cover, follow these steps:
1. Remove the two thumbscrews (B) located on the left and right of the cover.
2. Press in on the two latches (C) located left and right of the cover to release the cover.
3. Lift out the cover (A) and away from the chassis.
Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you
are populating that side of the chassis.
Installing the front cover on the 7314-G30 or 5796
Use this procedure to install the front cover after installing the system or performing a service action.
If you only have one side of the chassis populated, ensure that the airflow block is present on the
unpopulated side.
1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure.
Figure 91. Remove the front cover for model 7314-G30 or 5796
© Copyright IBM Corp. 2010, 2011 161
2. Press in on the two latches (C) on the right and left of the front cover.
3. Press the cover onto the chassis and release the latches. The cover securely snaps into place.
4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover.
Figure 92. Front cover on a model 7314-G30 or 5796
162 Disk drives
System covers
Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the
POWER7 processor.
Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B,
or 8205-E6C
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two shipping screws (if present) that secure the system to the rack.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Figure 93. Removing the front cover
© Copyright IBM Corp. 2010, 2011 163
Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or
8205-E6B system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Open the front door (A) by swinging it towards the left as shown in the following figure.
2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can
hold onto it more easily.
Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use this procedure to install the cover to access components or perform service.
To install the front cover, follow these steps:
1. Push the cover onto the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as
shown in the following figure. The cover snaps into place and has an indentation where you can hold
onto it more easily.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
Figure 94. Removing the front cover
164 Disk drives
Installing the front cover and front door on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to install the cover to access components or perform service.
To install the front cover and front door, complete the following steps.
1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown
in the following figure.
2. Rotate the cover up and towards the system until the release latch is seated into its respective slot.
The cover has an indentation where you can hold onto it more easily.
Figure 95. Removing the front cover
System covers 165
3. Hold the front door at an approximate 120 degree angle to the system as shown in the following
figure. Engage the latches.
Removing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system
Use this procedure to remove the service access cover.
1. Loosen the two thumbscrews (A) located at the back of the cover.
2. Slide the cover toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 96. Installing the front cover
Figure 97. Installing the front door
166 Disk drives
Attention: When the service access cover is removed, the system powers down.
Removing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to remove the service access cover.
1. Loosen the thumbscrew (A) located at the back of the outside cover.
2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Figure 98. Removing the service access cover
System covers 167
3. Loosen the two thumbscrews (A) located at the back of the inside cover.
4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Attention: When the service access cover is removed, the system powers down.
Figure 99. Installing the outside service access cover
Figure 100. Removing the inside service access cover
168 Disk drives
Installing the service access cover on a rack-mounted 8202-E4B,
8202-E4C, 8205-E6B, or 8205-E6C system
Use this procedure to install the service access cover.
1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Installing the service access cover on a stand-alone 8202-E4B,
8202-E4C, or 8205-E6B system
Use this procedure to install the service access cover.
1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper
chassis ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on
the back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Figure 101. Installing the service access cover
System covers 169
5. Place the outside service access cover (A) on top of the system.
6. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of
the system chassis.
8. Tighten the thumbscrew to secure the service access cover.
Figure 102. Installing the inside service access cover
170 Disk drives
Removing and replacing covers for the 8231-E2B, 8231-E1C, or
8231-E2C
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the service access cover for the 8231-E2B, 8231-E1C, or
8231-E2C
Use this procedure to remove the service access cover.
1. Loosen the thumbscrew (A) located at the back of the cover.
2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has
cleared the upper frame ledge, lift the cover up and off the system unit.
Attention: When the service access cover is removed, the system powers down.
Figure 103. Installing the outside service access cover
System covers 171
Installing the service access cover for the 8231-E2B, 8231-E1C, or
8231-E2C
Use this procedure to install the service access cover.
1. Place the service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis
ledge.
2. Hold the service access cover against the system unit and slide it toward the front of the system. The
tabs of the service access cover slide beneath the upper chassis ledge.
3. Align the thumbscrew (B) located on the back of the service access cover with the two holes on the
back of the system chassis.
4. Tighten the thumbscrews to secure the service access cover.
Figure 104. Removing the service access cover
172 Disk drives
Removing and replacing covers on the 8233-E8B or 8236-E8C system
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover from an 8233-E8B or 8236-E8C system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Figure 105. Installing the service access cover
System covers 173
Installing the front cover on an 8233-E8B or 8236-E8C system
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Align the cover with the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B)
on the system.
3. Replace the two screws (C) that secure the system to the rack (A).
Figure 106. Removing the front cover
174 Disk drives
Removing the service access cover from an 8233-E8B or 8236-E8C
system
Use this procedure to remove the service access cover to perform service or to gain access to internal
components.
To remove the service access cover, follow these steps:
1. Place the system into the service position. For instructions, see “Placing the 8233-E8B or 8236-E8C in
the service position” on page 199.
2. Loosen the two thumbscrews (A) located at the back of the cover.
3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover
clears the upper frame ledge, lift the cover up and off the system unit.
Attention: For proper cooling and airflow, install the cover before starting the system. Operating the
system without the cover for more than 30 minutes could damage the system components.
Figure 107. Installing the front cover
System covers 175
Installing the service access cover on an 8233-E8B or 8236-E8C
system
Use this procedure to install the service access cover after performing service or accessing internal
components.
To install the service access cover, follow these steps:
1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the
front of the system unit.
2. Hold the service access cover against the system unit, and slide it toward the front of the system.
The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align
with the screw holes at the back of the system unit.
Important: Ensure that the fan LED cables do not get caught on the front edge of the service access
cover as you move it forward.
3. Tighten the screws (B) located at the back of the cover.
Figure 108. Removing the service access cover
176 Disk drives
Removing and replacing covers for the 9117-MMB, 9117-MMC,
9179-MHB, or 9179-MHC
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB,
or 9179-MHC
Use this procedure to remove the cover to access components or perform service.
This procedure applies to covers that are shipped after 1 March 2011. For covers that are shipped prior to
1 March 2011, use the Removing the front cover with the cover removal tool procedure.
Note: The older covers shipped before 1 March 2011 can be identified by a single blue stripe on the
cover assembly latches. Newer covers shipped after 1 March 2011 have two blue stripes on the cover
assembly latches.
To remove the front cover, complete the following steps.
1. If necessary, open the front rack door.
2. Pull out the latches (A) located on both sides of the cover as shown in the following figure.
Figure 109. Installing the service access cover
System covers 177
3. Pull off the cover (B) to remove it from the system unit.
Removing the front cover with the cover removal tool
Removing the front cover on the 9117-MMB or 9179-MHB systems shipped before 1 March 2011 requires
the use of a cover removal tool.
Systems shipped before 1 March 2011 require the use of a cover removal tool to prevent the front fans
from being inadvertently removed and causing an unexpected outage.
Note: The older covers shipped before 1 March 2011 can be identified by a single blue stripe (A) on the
cover assembly latches. Newer covers shipped after 1 March 2011 have two blue stripes (B) on the cover
assembly latches. See Figure 111 on page 179.
Figure 110. Removing the front cover
178 Disk drives
Important: Do not remove the cover to view the enclosure serial number. If you need to check the
enclosure serial number, use one of the following methods:
v From the Advanced System Management Interface (ASMI), complete the following steps. This
procedure can be performed in FSP standby mode.
1. Expand System Information.
2. Click Vital Product Data in left column.
3. From the Vital Product Data panel, click Enclosure VPD (EV) for each enclosure configured.
4. Click Details to view results.
v From the Hardware Management Console (HMC), complete the following steps. The server must be
powered on to partition standby mode.
1. Expand Hardware Information.
2. Select View Hardware Topology to view the Enclosure ID for each enclosure.
To remove the front cover with the cover removal tool, complete the following steps:
1. Remove the cover removal tool (PN 74Y9219) from the shipping box, if needed.
2. If necessary, open the front rack door.
3. Align the cover removal tool (A) with the front cover of the system as shown in the following figure.
Figure 111. Identifying the older and newer covers
System covers 179
Note: The top pins of the cover removal tool align with the second hole down from the top of the
cover. Pins must be inserted through the hole until the removal tool pins contact the system and
cannot be pushed forward any further. There is a gap between the system and the cover removal tool.
4. Place hands on the cover removal tool as shown in the following figure while pressing the tool
against the system.
5. While keeping forward pressure on the cover removal tool, pull the cover assembly latches to
disengage them as shown in the following figure, which releases the retention mechanism.
Figure 112. Aligning the cover removal tool
Figure 113. Cover removal tool use
180 Disk drives
6. If the front cover is difficult to remove, the cover might be adhering to the fan assemblies at the top
of the system. Apply forward pressure against the cover removal tool and place fingers underneath
the edge of the cover near the top corners. Then, pull the cover towards the cover removal tool as
shown in the following figure.
Figure 114. Releasing the cover latches
System covers 181
7. After the cover assembly is unlatched and lying against the cover removal tool, remove both from the
front of the system as shown in the following figure.
8. Repeat these steps for all system covers that must be removed.
Figure 115. Removing a cover that adheres to the fan assemblies
Figure 116. Removing the cover
182 Disk drives
Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or
9179-MHC
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Position the cover (A) on the front of the system unit so that the four pins on the system match the
four holes on the back of the cover.
2. Press the tabs (B) to snap the cover into position.
3. Close the front rack door.
Figure 117. Installing the front cover
System covers 183
184 Disk drives
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the
service or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system
into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. While holding the system unit release latches down on both the left and right sides, pull the system
unit out from the rack until the rails are fully extended and locked.
Figure 118. Placing the system in the service position
© Copyright IBM Corp. 2010, 2011 185
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system
into the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the blue rail safety latches (A) by lifting them upwards.
2. Push the system unit back into the rack until both system-unit release latches have locked into
position.
Figure 119. Unlocking the safety rail latches
186 Disk drives
3. Close the front rack door of the unit you are servicing.
Figure 120. Placing the system into the operating position
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position 187
188 Disk drives
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
4. While holding the system unit release latches (A) down on both the left and right sides, pull the
system unit (B) out from the rack until the rails are fully extended and locked.
Figure 121. Placing the system in the service position
© Copyright IBM Corp. 2010, 2011 189
190 Disk drives
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system into the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near
the middle of the system.
2. Push the system unit (B) back into the rack until both system-unit release latches have locked into
position.
3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack.
Note: Replacing the two shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
4. Close the front rack door of the unit you are servicing.
Figure 122. Placing the system into the operating position
© Copyright IBM Corp. 2010, 2011 191
Figure 123. Placing the system into the operating position
192 Disk drives
Service and operating positions
Use these procedures to place IBM Power Systems servers into the service or operating position.
Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into
the service or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack.
Note: Replacing the shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
© Copyright IBM Corp. 2010, 2011 193
4. While holding the system unit release latches (A) down on both the left and right sides, pull the
system unit (B) out from the rack until the rails are fully extended and locked.
Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C
system into the operating position
Use this procedure to place the rack-mounted system into the operating position.
Figure 124. Placing the system in the service position
194 Disk drives
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near
the middle of the system.
2. Push the system unit (B) back into the rack until both system-unit release latches have locked into
position.
3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack.
Note: Replacing the two shipping screws is optional but should be done in geographic areas that are
prone to seismic activity.
4. Close the front rack door of the unit you are servicing.
Figure 125. Placing the system into the operating position
Service and operating positions 195
Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service
or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system
into the service position
Use this procedure to place the rack-mounted system into the service position.
Notes:
v When placing the system into the service position, it is essential that all stability plates are firmly in
position to prevent the rack from toppling. Ensure that only one system unit is in the service position
at a time.
v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward
in the rack.
v When the system unit rails are fully extended, the rail safety latches lock into place. This action
prevents the system from being pulled out too far.
1. Open the front rack door.
2. Identify the system unit you are servicing in the rack.
3. While holding the system unit release latches down on both the left and right sides, pull the system
unit out from the rack until the rails are fully extended and locked.
Figure 126. Placing the system into the operating position
196 Disk drives
Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system
into the operating position
Use this procedure to place the rack-mounted system into the operating position.
When placing the system into operating position, ensure that the cables at the back of the system do not
catch or bind as you push the system unit back into the rack.
1. Unlock the blue rail safety latches (A) by lifting them upwards.
Figure 127. Placing the system in the service position
Service and operating positions 197
2. Push the system unit back into the rack until both system-unit release latches have locked into
position.
Figure 128. Unlocking the safety rail latches
Figure 129. Placing the system into the operating position
198 Disk drives
3. Close the front rack door of the unit you are servicing.
Placing the rack-mounted 8233-E8B or 8236-E8C system into the
service position or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the 8233-E8B or 8236-E8C in the service position
Use this procedure to perform service or gain access to internal components by placing the rack-mounted
system or expansion unit in the service position.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
Service and operating positions 199
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
To place the rack-mounted system or expansion unit into the service position, follow these steps:
1. If necessary, open the front rack door.
2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure.
3. Release the rack latches (B) on both the left and right sides as shown in the following figure.
200 Disk drives
4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit
out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or
bind as you pull the unit out from the rack.
5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and
locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety
latches lock into place. This action prevents the system or expansion unit from being pulled out too
far.
Placing the 8233-E8B or 8236-E8C in the operating position
Use this procedure to place the system or expansion unit in the operating position to make the unit
available for use.
To place the system into the operating position follow these steps:
Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the
system or expansion unit into the rack as shown in the following figure.
Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you
push the unit back into the rack.
Figure 130. Releasing the rack latches
Service and operating positions 201
2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack
as shown in the following figure.
3. Close the front rack door.
Figure 131. Releasing the rail safety latches
Figure 132. Replacing the thumbscrews
202 Disk drives
Power cords
Use these procedures to disconnect and connect the power cords on IBM Power Systems servers that
contain the POWER7 processor.
Disconnecting the power cords from the 8202-E4B, 8202-E4C,
8205-E6B, or 8205-E6C system
Use these procedures to disconnect the power cords from the system.
1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the
system unit you are servicing in the rack.
2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit
you are servicing. This system might be equipped with two power supplies. Before continuing with
any remove and replace procedures that require the power off, ensure that both power sources to the
system have been completely disconnected.
Figure 133. Removing the power cords for a rack-mounted system
© Copyright IBM Corp. 2010, 2011 203
Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or
8205-E6C system
Use this procedure to connect the power cords to the system.
1. Identify the system unit you are servicing in the rack.
2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the
handles.
Figure 134. Removing the power cords for a stand-alone system
204 Disk drives
3. Close the back rack door.
Disconnecting the power cords from the 8231-E2B, 8231-E1C, or
8231-E2C
Use these procedures to disconnect the power cords from the system.
1. Open the back rack door on the unit you are servicing.
2. Identify the system unit you are servicing in the rack.
3. Disconnect all power cords from the unit you are servicing. This system might be equipped with two
power supplies. Before continuing with any remove and replace procedures that require the power
off, ensure that both power sources to the system have been completely disconnected.
Figure 135. Connecting the power cords
Power cords 205
Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C
Use these procedures to connect the power cords to the system.
1. Open the back rack door on the unit you are servicing.
2. Identify the system unit you are servicing in the rack.
3. Disconnect all power cords from the unit you are servicing. Ensure that the cords are threaded
through the handles.
Figure 136. Removing the power cords
206 Disk drives
Figure 137. Connecting the power cords
Power cords 207
208 Disk drives
Verifying the installed part
You can verify a newly installed or replaced part on your system, logical partition, or expansion unit
using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC).
Verifying an installed feature or replaced part in an AIX system or
logical partition
If you installed feature or replaced a part, you might want to use the tools in the AIX operating system to
verify that the feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using AIX
v Verifying the replaced part using AIX
Verify the installed feature by using the AIX operating system:
1. Log in as root user.
2. At the command line, type diag and press Enter.
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in logical partitioning (LPAR) mode, note the
logical partition in which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using the AIX operating system:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you use either the AIX operating system or the online diagnostics service aid concurrent
(hot-swap) service to replace the part?
No: Go to step 2.
Yes: Go to step 5 on page 210.
2. Is the system powered off?
No: Go to step 4 on page 210.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
© Copyright IBM Corp. 2010, 2011 209
3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent
system activity on the operator panel or display has stopped.
Did the AIX login prompt display?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose
adapter or cable connection. Review the procedures for the part that you replaced to ensure that
the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other
reference code information that you see. If the system does not start or you have no login prompt,
see: Problems with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4.
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If the Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to step 5.
5. Test the part by doing the following steps:
a. At the command line, type diag and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the resource list, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9 on page 211.
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with an indicator light for the failing part, this action changes the indicator light
to the normal state.
210 Disk drives
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the resource list, select sysplanar0
and press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This action changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the resource list,
select sysplanar0 and press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9.
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11.
Yes: Go to step 12.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See the following for instructions: Changing service indicators.
Verifying the installed part in an IBM i system or logical partition
If you have installed a new feature or part, verify that the system recognizes the feature or part by using
the IBM i system service tools.
To verify the installed part, follow these steps:
1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator
light” on page 150.
2. Sign on with at least service level authority.
3. On the command line of the IBM i session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service
Focal Point Utilities to get to the Dedicated Service Tools (DST) display.
4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On
display and press Enter.
Verifying the installed part 211
Note: The service tools password is case-sensitive.
5. Select Start a service tool from the System Service Tools (SST) display and press Enter.
6. Select Hardware service manager from the Start a Service Tool display and press Enter.
7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager
display and press Enter. This option allows you to display and work with logical resources. Logical
hardware resources are the functional resources of the system used by the operating system.
With the Logical Hardware Resources display, you can show logical hardware resource status or
information, and associated packaging hardware resources. Use the online Help information to better
understand specific functions, fields, or symbols.
Deactivating the failing-part indicator light
Use this procedure to turn off any indicator light that you turned on as a part of a service action.
To deactivate the indicator light, follow these steps:
1. Sign on to an IBM i session, with at least service level authority.
2. On the command line of the session, type strsst and press Enter.
Note: If you cannot get to the System Service Tools display, use function 21 from the control panel.
Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service
Focal Point utilities to get to the Dedicated Service Tools (DST) display.
3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign
On display. Press Enter.
Remember: The service tools password is case-sensitive.
4. Select Start a service tool from the System Service Tools (SST) display and press Enter.
5. Select Hardware service manager from the Start a Service Tool display and press Enter.
6. Select Work with service action log from the Hardware Service Manager display and press Enter.
7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to
when the problem occurred.
8. Search for an entry that matches one or more conditions of the problem:
v System Reference code
v Resource
v Date and time
v Failing item list
9. Select option 2 (Display failing item information) to display the service action log entry.
10. Select option 2 (Display details) to display location information for the failing part to be replaced.
The information displayed in the date and time fields is the date and time for the first occurrence of
the specific System reference code for the resource displayed during the time range selected.
11. Select option 7 (Indicator off) to turn off the indicator light.
12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all
problems have been resolved.
13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display.
Verifying the installed part in a Linux system or logical partition
If you have installed a new part, learn how to verify that the system recognizes the part.
To verify the newly installed or replaced part, continue with “Verifying an installed part by using
stand-alone diagnostics” on page 213.
212 Disk drives
Verifying an installed part by using stand-alone diagnostics
If you have installed or replaced a part, verify that the system recognizes the new part. You can use
stand-alone diagnostics to verify an installed part in an AIX or Linux system, expansion unit, or logical
partition.
v If this server is directly attached to another server or attached to a network, ensure communications
with the other servers have stopped.
v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be
running on the logical partition.
v The stand-alone diagnostics require access to the system console.
You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM)
server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running
diagnostics from the Network Installation Management (NIM) server, see Running stand-alone
diagnostics from a Network Installation Management server.
To use stand-alone diagnostics, follow these steps:
1. Stop all jobs and applications and then stop the operating system on the system or logical partition.
2. Remove all tapes, diskettes, and CD-ROM.
3. Turn off the system unit power. The next step boots the server or logical partition from the
stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the
server or logical partition on which you are working, follow these steps:
a. Access the ASMI. See Accessing the ASMI for information on using the ASMI.
b. On the ASMI main menu, click on Power/Restart Control.
c. Click Power On/Off System.
d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition
mode boot drop-down menu.
e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone
diagnostic CD-ROM.
f. Go to step 5.
4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical
drive.
5. After the keyboard POST indicator displays on the system console and before the last POST
indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service
mode boot should be initiated using the default-service mode boot list.
6. Enter any requested password.
7. At the Diagnostic Operating Instructions display, press Enter.
Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection.
Note: If you received an SRN or any other reference code when you attempted to start the system,
contact your service provider for assistance..
8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection
menu to initialize the operating system.
9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
10. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part
you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics
for the individual part and press Enter.
12. Did the Testing Complete, No trouble was found message appear?
Verifying the installed part 213
v No: There is still a problem. Contact your service provider.
v Yes: Go to step 13.
13. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the value they had prior to servicing the system.
14. If the indicator lights are still on, follow these steps:
a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system
attention and indicator lights and press Enter.
b. Select Set System Attention Indicator to NORMAL and press Enter.
c. Select Set All Identify Indicators to NORMAL and press Enter.
d. Choose Commit.
Note: This changes the system attention and identify indicators from the Fault state to the Normal
state.
e. Exit to the command line.
Verifying the installed part by using Hardware Management Console
If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your
HMC records after you have completed a service action on your server. If you have reference codes,
symptoms, or location codes that you used during the service action, locate the records for use during
this procedure.
To verify the installed part, complete these steps:
1. At the HMC, examine the service action event log for any open service action events. See “Viewing
serviceable events by using the HMC” on page 216 for details.
2. Are there any service action events that are open?
No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and
deactivating LEDs by using the HMC” on page 215. This ends the procedure.
Yes: Continue with the next step.
3. Record the list of open service action events.
4. Examine the details of the open service action event. Is the error code associated with this service
action event the same as you gathered earlier.
v No: Select one of the following options:
– Review the other serviceable events, find one that does match, and continue with the next step.
– If the log does not match what you had gathered earlier, contact your service provider.
v Yes: Continue with the next step.
5. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
6. Click Close Event.
7. Add comments for the serviceable event. Include any unique additional information. Click OK.
8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event?
v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the
service action event.
v Yes: Perform the following steps:
a. From the FRU list, select a FRU that you need to update.
b. Double-click the FRU and update the FRU information.
c. Click OK to close the service action event.
9. If you continue to have problems, contact your service provider.
214 Disk drives
Activating and deactivating LEDs by using the HMC
Use this procedure to activate or deactivate LEDs using Service Focal Point for the Hardware
Management Console (HMC).
Choose from the following:
v “Deactivating a system attention LED or partition LED”
v “Activating or deactivating an identify LED”
Deactivating a system attention LED or partition LED
You can deactivate a system attention LED or a logical partition LED. For example, you might determine
that a problem is not a high priority and decide to repair the problem at a later time. However, you want
to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be
activated again if another problem occurs.
1. In the navigation area, open Systems Management.
2. Open Servers and select the appropriate system.
3. In the content area, check the box for the appropriate Partition.
4. Select Tasks, then Operations, and then Manage Attention LED.
5. Select the appropriate Partition.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
Activating or deactivating an identify LED
The system provides several LEDs that help identify various components, such as enclosures or field
replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. In the navigation area, open Systems Management.
2. Select Servers.
3. In the content area, check the box for the appropriate System.
4. Select Tasks, then Operations, then LED Status, and then Identify LED.
Verifying the installed part 215
5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected
? List FRUs.
7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Viewing serviceable events by using the HMC
Use this procedure to view a serviceable event, including details, comments, and service history.
To view serviceable events and other information about the events, you must be a member of one of the
following roles:
v Super administrator
v Service representative
v Operator
v Product engineer
v Viewer
To view serviceable events, follow these steps:
1. In the navigation area, select Service Management.
2. Select Manage Serviceable Events.
3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable
Event Overview window opens. The list shows all serviceable events that match your selection
criteria. You can use the menu options to perform actions on the serviceable events.
4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The
Serviceable Event Details window opens, showing detailed information about the serviceable event.
The upper table shows information, such as problem number and reference code. The lower table
shows the field replaceable units (FRUs) associated with this event.
5. Select the error for which you want to view comments and history, and follow these steps:
a. Select Actions ? View Comments.
b. When you are finished viewing the comments, click Close.
c. Select Actions ? View Service History. The Service History window opens, showing service
history associated with the selected error.
d. When you are finished viewing the service history, click Close.
6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the
Serviceable Event Overview window.
Verifying the installed part by using Systems Director Management
Console
If you installed or replaced a part, use the IBM Systems Director Management Console (SDMC) to update
your SDMC records after you have completed a service action on your server. If you have reference
codes, symptoms, or location codes that you used during the service action, locate the records for use
during this procedure.
To verify the installed part, complete these steps:
1. From the SDMC, examine the service action event log for any open service action events. See
“Viewing serviceable events by using the Systems Director Management Console” on page 218 for
details.
2. Are there any service action events that are open?
216 Disk drives
No: If the system attention LED is still on, use the SDMC to turn off the LED. See “Activating and
deactivating LEDs by using the Systems Director Management Console.” This ends the procedure.
Yes: Continue with the next step.
3. Record the list of open service action events.
4. Examine the details of the open service action event. Is the error code associated with this service
action event the same as you gathered earlier.
v No: Select one of the following options:
– Review the other serviceable events, find one that does match, and continue with the next step.
– If the log does not match what you had gathered earlier, contact your service provider.
v Yes: Continue with the next step.
5. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
6. Click Delete or Ignore.
Note: These options are only available from the problem event log.
Activating and deactivating LEDs by using the Systems Director
Management Console
Use this procedure to activate or deactivate LEDs by using the IBM Systems Director Management
Console (SDMC).
Choose from the following:
v “Deactivating a system attention LED or partition LED”
v “Activating or deactivating an identify LED by using the SDMC”
Deactivating a system attention LED or partition LED
You can deactivate a system attention LED or a logical partition LED. For example, you might determine
that a problem is not a high priority and decide to repair the problem at a later time. However, you want
to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be
activated again if another problem occurs.
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? Service and Support ? Hardware ? System Attention LED.
3. Select Deactivate System Attention LED. A confirmation window is displayed that provides the
following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
4. Select one of the virtual servers, and select Deactivate System Attention LED. A confirmation
window is displayed that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the virtual server LED.
Activating or deactivating an identify LED by using the SDMC
The system provides several LEDs that help identify various components, such as enclosures or field
replaceable units (FRUs). For this reason, they are called identify LEDs.
Verifying the installed part 217
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU, follow these steps:
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? Service and Support ? Hardware ? Identify LED.
3. In the Identify LED, Select Enclosure window, select the system unit or enclosure.
4. To activate or deactivate an identify LED, click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select a system or enclosure from the table, and
then select List FRUs....
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Viewing serviceable events by using the Systems Director
Management Console
Use this procedure to view a serviceable event, including details, comments, and service history.
To view serviceable events, follow these steps:
1. On the Resources tab, select the appropriate host or virtual server.
2. Select Actions ? System Status and Health ? Event Log.
3. Optional: You can narrow the event criteria using the Event filter menu.
4. Select a line in the Events window, and select Actions ? Properties The Properties window opens,
showing detailed information about the serviceable event. The table shows information, such as
problem number, reference code, and the field replaceable units (FRUs) associated with this event.
Verifying an installed feature or replaced part on a system or logical
partition by using Virtual I/O Server tools
If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS)
to verify that the feature or part is recognized by the system or logical partition.
To verify the operation of a newly installed feature or replacement part, select the appropriate procedure:
v Verify the installed feature using VIOS
v Verifying the replaced part using VIOS
Verify the installed feature by using VIOS:
1. Log in as root user.
2. At the command line, type diagmenu and press Enter.
3. Select Advanced Diagnostics Routines and press Enter.
4. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
5. When the Advanced Diagnostic Selection menu appears, do one of the following:
218 Disk drives
v To test a single resource, select the resource that you just installed from the list of resources and
press Enter.
v To test all the resources available to the operating system, select All Resources and press Enter.
6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts
that appear.
7. Did the diagnostics run to completion and display the message No trouble was found?
v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter
or cable connection. Review the installation procedures to ensure that the new feature is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system is running in LPAR mode, note the logical partition in
which you installed the feature. Contact your service provider for assistance.
v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to
normal operations.
Verify the replacement part by using VIOS:
To verify the operation of a newly installed feature or replacement part, follow these steps:
1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent
(hot-swap) service operation?
No: Go to step 2.
Yes: Go to step 5.
2. Is the system powered off?
No: Go to step 4.
Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see
Performing a slow boot.
3. Start the system and wait until the VIOS operating system login prompt displays or until apparent
system activity on the operator panel or display has stopped.
Did the VIOS login prompt display?
v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection.
Review the procedures for the part that you replaced to ensure that the new part is installed
correctly. If you cannot correct the problem, collect all SRNs or any other reference code
information that you see. If the system does not start or you have no login prompt, see: Problems
with loading and starting the operating system.
If the system is partitioned, note the logical partition in which you replaced the part. Contact your
service provider for assistance.
v Yes: Go to step 4
4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a
command prompt, go to step 5.
If the Diagnostic selection menu is shown with M appearing next to any resource, follow these
steps:
a. Select the resource and press Enter.
b. Select Commit.
c. Follow any instructions that are shown.
d. If a Do you want to review the previously displayed error? message is shown, select Yes and press
Enter.
e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record
the SRN and contact your service provider for assistance..
f. If no SRN is shown, go to 5.
5. Test the part by doing the following:
Verifying the installed part 219
a. At the command line, type diagmenu and press Enter.
b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter.
c. From the Diagnostic Mode Selection menu, select System Verification and press Enter.
d. Select All Resources, or select the diagnostics for the individual part to test only the part you
replaced, and any devices that are attached to the part you replaced and press Enter.
Did the Resource Repair Action menu appear?
No: Go to step 6.
Yes: Go to step 7.
6. Did the Testing Complete, No trouble was found message appear?
v No: There is still a problem. Contact your service provider. This ends the procedure.
v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update
the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with your action is not displayed on
the Resource List, select sysplanar0 and press Enter.
Tip: This action changes the indicator light for the part from the fault state to the normal state.
Go to step 9
7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run
on a resource in system verification mode, and that resource has an entry in the AIX error log, if the
test on the resource was successful, the Resource Repair Action menu appears. Complete the
following steps to update the AIX error log to indicate that a system-detectable part has been
replaced.
Note: On systems with a indicator light for the failing part, this changes the indicator light to the
normal state.
a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair
action was reseating a cable or adapter, select the resource associated with that repair action. If
the resource associated with your action does not appear on the Resource List, select sysplanar0.
Press Enter.
b. Select Commit after you make your selections. Did another Resource Repair Action display
appear?
No: If the No Trouble Found display appears, go to step 9
Yes: Go to step 8.
8. Select the parent or child of the resource for the replaced part from the Resource Repair Action
menu if necessary. When a test is run on a resource in system verification mode, and that resource
has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair
Action menu appears. Complete the following steps to update the AIX error log to indicate that a
system-detectable part has been replaced.
Note: This changes the indicator light for the part from the fault state to the normal state.
a. From the Resource Repair Action menu, select the parent or child of the resource that has been
replaced. If the repair action was to reseat a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action does not appear on the Resource
List, select sysplanar0. Press Enter.
b. Select Commit after you make your selections.
c. If the No Trouble Found display appears, go to step 9.
9. If you changed the service processor or network settings, as instructed in previous procedures,
restore the settings to the values they had prior to servicing the system.
10. Did you do any hot-plug procedures before doing this procedure?
No: Go to step 11 on page 221.
220 Disk drives
Yes: Go to step 12.
11. Start the operating system, with the system or logical partition in normal mode. Were you able to
start the operating system?
No: Contact your service provider. This ends the procedure.
Yes: Go to step 12.
12. Are the indicator lights still on?
v No. This ends the procedure.
v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators
Verifying the installed part 221
222 Disk drives
Verifying a repair
Use these procedures to verify hardware operation after making repairs to the system.
Choose from the following options:
v To verify the repair of a system that is currently powered off, go to step 1.
v To verify the repair of a system that is currently powered on with no operating system loaded, go to
step 3.
v To verify the repair of a system that is currently powered on and that has an operating system loaded,
go to step 5.
1. Power on the server and all attached I/O enclosures.
Did all the enclosures power on?
Yes: Go to step 3.? Go to step 3.
No: Continue with the next step.
2. Choose from the following options:
v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate
and replace the next field-replaceable unit (FRU).
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
3. Load the operating system.
Did the operating system load successfully?
Yes: Go to step 5.
No: Continue with the next step.
4. Choose from the following options:
v If the original problem was a failing disk drive that contained the operating system software, go to step 5.
v If the original problem was that the operating system would not load and you have another FRU to replace, go to
your FRU locations section to locate the next FRU.
v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure.
v If the original problem was that the operating system would not load and you have an isolation procedure to
complete, perform the isolation procedure.
v If the original problem was that the operating system would not load and there are no more FRUs or isolation
procedures in the FRU list, contact your next level of support.
v If you have a new problem, perform problem analysis and repair the new problem.
5. Choose from the following options:
v “Verifying the repair in AIX” on page 224
v “Verifying the repair in Linux” on page 229
v “Verifying a repair using an IBM i system or logical partition” on page 227
© Copyright IBM Corp. 2010, 2011 223
Verifying the repair in AIX
You can use this procedure to verify that a repair is complete using the AIX operating system.
Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed.
1. Did you replace a disk drive in the root volume group?
No Go to step 3.
Yes Continue with the next step.
2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM)
server.
Did you encounter any problems?
No Reinstall the operating system and continue with step 5.
Yes If the original problem still exists, replace the
field-replaceable unit (FRU) or perform the isolation
procedure that is next in the FRU list. If you have reached
the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
3. Did you replace a FRU with the power turned on and concurrently with system operations?
No Go to step 5.
Yes Continue with the next step.
4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
Yes Go to step 6.
No Go to step 7 on page 225.
Note: The AIX diagnostic service aid was
used if a resource was removed using the Hot
Plug task.
5.
Note: If any FRUs have been removed that should be reinstalled, reinstall them now.
Perform the following steps:
1. If the system is not powered on, power it on now.
2. Perform a slow boot.
3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or
display apparently has stopped.
4. Did you encounter any problems?
No Continue at step 6.
Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU
list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis.
6. If the Resource Repair Action menu is already displayed, go to step 9 on page 225; otherwise,
perform the following steps:
224 Disk drives
1. Log into the operating system either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an SRN displays, suspect a
loose card or connection. If no instructions display, no resources
were detected as missing. Continue with the next step.
7. Perform the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select System verification.
5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged,
and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual
FRU.
Did the Resource Repair Action menu (801015) display?
No Continue with the next step.
Yes Go to step 9.
8. Did the Testing Complete, no trouble was found menu (801010) display?
Yes Use the Log Repair Action option, if not previously logged,
in the TASK SELECTION menu to update the AIX error log.
If the repair action was reseating a cable or adapter, select
the resource associated with that repair action.
If the resource associated with your action is not displayed
on the resource list, select sysplanar0.
Note: If the system attention indicator is on, this action will
set it back to the normal state.
Go to step 11 on page 226.
No If the original problem still exists, replace the FRU or
perform the isolation procedure that is next in the FRU list.
If you have reached the end of the FRU list, contact your
next level of support.
If a new problem has occurred, go to Beginning problem
analysis.
9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on that resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
Verifying a repair 225
1. Select the resource that has been replaced from the Resource
Repair Action menu. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If
the resource associated with your action is not displayed on the
resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
No If the No Trouble Found menu displays, go to step 11.
Yes Continue with the next step.
10. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, that resource has an entry in the AIX
error log. If the test on the resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action
menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Perform the following steps:
1. From the RESOURCE REPAIR ACTION menu, select the parent
or child of the resource that has been replaced. If the repair action
was reseating a cable or adapter, select the resource associated
with that repair action. If the resource associated with your action
is not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
3. If the No Trouble Found menu displays, continue with the next
step.
11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter
cache card or changing the configuration?
Note: This information does not apply to the PCI-X RAID adapter or cache.
No Go to the close of call procedure.
Yes Continue with the next step.
12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the
following steps:
226 Disk drives
1. On the PCI SCSI Disk Array Manager display, select Recovery
options.
2. If a previous configuration exists on the replacement adapter, this
must be cleared. Select Clear PCI SCSI Adapter Configuration
and press F3.
3. On the Recovery Options screen, select Resolve PCI SCSI RAID
Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration screen,
select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the
adapter that you changed.
6. On the next screen, press Enter.
7. When you see the Are You Sure selection menu, press Enter to
continue.
8. If you see a Failed status message, verify that you selected the
correct adapter, and then repeat this procedure. When the
recovery is complete, exit the operating system.
9. Go to the close of call procedure.
Verifying a repair using an IBM i system or logical partition
Use this procedure to verify a repair using the IBM i operating system.
1. Was the system powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 3.
2. Perform the following tasks:
a. Verify that the power cable is plugged into the power outlet.
b. Verify that power is available at the customer's power outlet.
3. Was the partition powered off during the repair?
Yes: Continue with the next step.
No: Continue with step 6.
4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type
mode and speed options in the Service functions).
5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the
system complete the IPL?
Yes: Continue with the next step.
No: This might be a new problem. Go to the Start of call procedure. This ends the procedure.
6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O
adapter, or storage device replaced?
Yes: Continue with step 10.
No: Continue with the next step.
7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look
for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any
reference codes that are related to this IPL?
Yes: Continue with the next step.
No: If the problem was related to removable media or communications, perform the verification
procedures in the Service functions to verify that the problem is corrected. Then return the
system to the customer and have the customer verify the system date and time. This ends the
procedure.
Verifying a repair 227
8. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure. This ends the
procedure.
9. Are there any other failing items that remain to be replaced?
Yes: Replace the next failing item listed for this reference code. This ends the procedure.
No: Contact your next level of support for assistance. This ends the procedure.
10. Was concurrent maintenance performed on an optical storage unit?
Yes: The product activity log and service action log, in most cases, contain a reference code for
the optical storage unit when concurrent maintenance is performed. You can ignore this reference
code. Perform the following:
- Perform the verification procedures in the Service functions topic to verify that the problem is
corrected.
- Return the system to the customer and have the customer verify the system date and time. This
ends the procedure.
No: Continue with the next step.
11. Use the service action log to look for any new reference codes (see Using the Service Action Log).
Are there any new reference codes?
Yes: Continue with the next step.
No: Go to step 14.
12. Is the new reference code the same as the original reference code?
Yes: Continue with the next step.
No: A new symptom might have occurred. Go to the Start of call procedure to determine the
cause of the problem. This ends the procedure.
13. Are there any other failing items that need to be replaced?
Yes: Replace the next failing item listed for the reference code. This ends the procedure..
No: Contact your next level of support for assistance. This ends the procedure.
14. Are you working with a tape device?
Yes: Perform the verification procedures in the Service functions to verify that the problem is
corrected. After the verification test is complete, the tape device description will be set to the
failed state because a resource change was detected. Perform the following tasks:
- Vary the tape device description off and then on.
- Return the system to the customer and have the customer verify the system date and time.
Then go to Verifying the repair from the HMC. This ends the procedure.
No: Continue with the next step.
15. Are you working with an IOP or an IOA?
Yes: Use the display hardware configuration service function to check for any missing or failed
hardware:
- On the command line, enter the STRSST (Start System Service Tools command). If you cannot get
to SST, select DST. Do not IPL the system or partition to get to DST.
- On the Start Service Tools Sign On display, enter the user ID with the service authority and
password.
- Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System
bus resources.
- Select the function key for Include nonreporting resources.
- If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has
not been fixed. Continue to the next failing item in the failing item list. This ends the procedure.
228 Disk drives
No: Perform the verification procedures in the Service functions topics to verify that the problem
is corrected. Resources that usually vary on automatically during an IPL, or that were previously
varied on manually, might need to be varied on again after the verification procedures are
complete. Return the system to the customer and have the customer verify the system date and
time. This ends the procedure.
Verifying the repair in Linux
You can use this procedure to verify that a repair is complete using the Linux operating system.
1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM)
server. See Running the stand-alone diagnostics from CD-ROM.
Did you encounter any problems?
No Reboot the operating system and continue with the close of call procedure.
Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure
that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support.
If a new problem has occurred, go to Beginning problem analysis and repair the new problem.
Verifying the repair from the management console
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer using the management console.
Follow this checklist before performing the procedures:
v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Server verification has been performed and there are no problems that require additional service
actions.
v If the repair was done using the HMC online repair procedures, ensure that the original serviceable
event is now closed.
1. Is a management console used to manage the server that you are servicing?
v Yes: Continue with the next step.
v No: Return to the “Verifying a repair” on page 223. This ends the procedure.
2. Are you closing a service event that was a repair on the management console personal computer?
v Yes: Continue with the next step.
v No: Go to step 4 on page 230.
3. Power on the management console. Did the power-on process complete without errors?
v Yes: Ensure that the management console can be used to perform server management tasks, and return the
management console to normal operations. Go to “Closing a service call” on page 231. This ends the procedure.
v No: Go to the HMC isolation procedures. This ends the procedure.
Verifying a repair 229
4. Log into the management console as the service representative. If invalid user or invalid password
displays, get the correct login information from the system administer.
1. If logged on the System Manager, select Exit from the Console, located on the System Manager window.
2. Login to the System Manager with the following: User identification service Password service mode
5. View Serviceable Event Details.
1. In the navigation area, select the Service Applications icon.
2. In the navigation area, select the Service Focal Point icon.
3. In the Contents area, select Manage Serviceable Events.
4. Designate the set of Serviceable Events you want to view. When
you are finished, click OK. The Service Event Overview window
opens.
Note: Only the events that match all of the
criteria you specify are shown.
6. Close open or delayed events.
1. Select the problem to close, on the Service Event Overview window.
2. Select the menu Selected, located on the menu bar.
3. Click Close Event.
4. Enter your comments in the Serviceable Event Comments window, and click Close Event.
5. Close all events associated with the problem on which you were working.
7. Did the Service Event Overview window contain the event or events on which you were working?
v Yes: Return the HMC to normal operations. Go to “Closing a service call” on page 231. This ends the procedure.
v No: Go to Detecting problems. This ends the procedure.
230 Disk drives
Closing a service call
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other serviceable
events might have been opened. Close all serviceable events that were opened as a result of your
service activity.
v Ensure that server verification has been performed and no problems require additional service actions.
v If the repair was done using the management console online repair procedures, ensure that the original
serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference. Is the server managed by a management console?
v Yes: Continue with the next step.
v No: Do one of the following steps:
– If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using
Integrated Virtualization Manager” on page 239.
– If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call
using AIX or Linux” on page 235.
2. On the HMC, open Manage Serviceable Events and examine the service action event log for any
open service action events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 245. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32
on page 234 for each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
© Copyright IBM Corp. 2010, 2011 231
Is the error code that is associated with this service action event the same as recorded in Step 1 on page 231?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 231?
v Yes: Go to step 11.
v No: Continue with the next step.
10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 231 in the list of
FRUs for this service action event?
v Yes: Continue with the next step.
v No: Go to step 32 on page 234.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17 on page 233.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16.
15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from
processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of
partitions obtained in step 11 in future steps, the list is empty. Go to step 17 on page 233.
232 Disk drives
17. Select and highlight the service action event from the Error Associated With This Serviceable Event
window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK. The
following steps will add or update FRU information.
20. Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU
information. Go to step 23.
22. Select the No FRU Replaced for this Serviceable Event option.
23. Click OK to close the service action event.
24. Is the list of all partitions you recorded in step 11 on page 232 empty?
v Yes: Go to step 32 on page 234.
v No: Continue with the next step.
25. Does the list of all partitions you recorded in step 11 on page 232 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 32 on page 234.
26. Is the error class recorded in step 25 AIX?
v Yes: Continue with the next step.
v No: Go to step 32 on page 234.
27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
232, except the partition you were using to debug the original problem.
28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the
AIX command prompt.
Closing a service call 233
29. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
30. Exit from diagnostics in this partition and return to the AIX command prompt.
31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 232 been
processed?
v Yes: Continue with the next step.
v No: Go to step 24 on page 233 to process the next partition in the list you recorded in step 11 on page 232.
32. Have all the serviceable events recorded in step 4 on page 231 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 231 and process the next service action event in the list of serviceable events recorded in
step 4 on page 231.
33. While processing all service action events, were you directed to step 14 on page 232?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 245. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
34. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 232, except the partition you were using to debug the original problem.
35. From the management console virtual terminal window of a partition in the list of Axx partitions,
type diag at the AIX command prompt.
36. When the diagnostic operating instructions are displayed, do the following steps:
234 Disk drives
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
37. Exit from diagnostics in this partition and return to the AIX command prompt.
38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 232 been
processed?
v Yes: Continue with the next step.
v No: Go to step 34 on page 234 to process the next partition in the list you recorded in step 14 on page 232.
39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 245. This completes the repair. Return the system to the customer.
Note: If during the processing of the list of open service action events, some service action events
remained open, further service actions might be required to complete the repair.
Closing a service call using AIX or Linux
If the server is not connected to an management console and not using Integrated Virtualization Manager
(IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the
server to return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and that no problems require additional service
actions.
v If the repair was done using the IVM online repair procedures, ensure that the original serviceable
event is now closed.
1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU?
v Yes: Go to step 4 on page 236
v No: Continue with the next step.
Closing a service call 235
2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that
were removed during problem analysis that you want to put back into the system?
Note: If the system planar or battery has been replaced and you are loading diagnostics from a
server over a network, it might be necessary for the customer to set the network boot information
for this system before diagnostics can be loaded. The system time and date information should also
be set when the repair is completed.
v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3
v No: Continue with the next step.
3. Is the system or logical partition that you are performing a repair action on running the AIX
operating system?
v Yes: Continue with the next step.
v No: Go to step 5.
4. Does the system or logical partition you are performing a repair action on have AIX installed?
Note: Answer no to this question if you have just replaced a hard disk in the root volume group.
v Yes: Go to step 7.
v No: Continue with the next step.
5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a
Network Installation Management (NIM) server.
Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to
Running the stand-alone diagnostics from CD on a server without an HMC attached.
For instructions on running stand-alone diagnostics from a NIM server, go to Running the
stand-alone diagnostics from a Network Installation Management server.
Did you encounter any problems?
v Yes: Go to problem analysis.
v No: Continue with the next step.
6. The system hardware is functioning correctly.
If the system attention LED is still on, turn off the LED as described
in “Activating and deactivating LEDs” on page 245.
This completes the repair.
Note: If, during the processing of the list of open service action
events, some service action events remained open, further service
actions might be required to complete the repair.
Return the server to the state that the customer normally uses, such
as IPL type, IPL mode, and the way the system is configured or
partitioned. This might require you to reboot the operating system.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
7. Complete the following steps:
236 Disk drives
1. If the system supports slow boot (see Performing a slow boot), do
a slow boot on the system. If the system does not support slow
boot, do a normal boot.
2. Power on the system.
3. Wait until the AIX operating system login prompt displays or
until system activity on the operator panel or display apparently
has stopped.
Did the AIX Login Prompt display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
8. If the Resource Repair Action menu is already displayed, go to 12 on page 238; otherwise, do the
following steps:
1. Log into the operating system, either with root authority (if
needed, ask the customer to enter the password) or use the CE
login.
2. Enter the diag -a command and check for missing resources.
Follow any instructions that display. If an system request number
(SRN) displays, suspect a loose card or connection. If no
instructions display, no resources were detected as missing.
Continue with 9.
9. Complete the following steps:
1. Enter diag at the command prompt.
2. Press Enter.
3. Select the Diagnostics Routines option.
4. When the Diagnostic Mode Selection menu displays, select
Problem determination.
5. When the Advanced Diagnostic Selection menu displays, select
the All Resources option, or test the FRUs you exchanged, and
any devices that are attached to the FRUs you exchanged, by
selecting the diagnostics for the individual FRU.
Did the Resource Repair Action menu (801015) display?
v Yes: Go to step 13 on page 238.
v No: Continue with the next step.
10. Did the TESTING COMPLETE, no trouble was found menu (801010) display?
v Yes: Continue with the next step.
v No: There is still a problem. Go to problem analysis.
11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to
update the AIX error log. If the repair action was reseating a cable or adapter, select the resource
associated with that repair action.
Closing a service call 237
If the resource associated with your action is not displayed on the
resource list, select sysplanar0.
Note: If the system attention indicator is on, this will set it back to
the normal state. Go to step 14.
12. When a test is run on a resource in system verification mode, and that resource has an entry in the
AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays.
After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This
updates the AIX error log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it back to the normal state.
Do the following steps:
1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was
reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with
your action is not displayed on the resource list, select sysplanar0.
2. Press Commit after you make your selections.
Did another Resource Repair Action (801015) display?
v Yes: Continue with the next step.
v No: If the No Trouble Found menu displays, go to step 14.
13. The parent or child of the resource you just replaced might also require that you run the Resource
Repair Action service aid on it.
When a test is run on a resource in system verification mode, and
that resource has an entry in the AIX error log, if the test on the
resource was successful, the Resource Repair Action menu displays.
After replacing that FRU, you must select the resource for that FRU
from the Resource Repair Action menu. This updates the AIX error
log to indicate that a system-detectable FRU has been replaced.
Note: If the system attention indicator is on, this action will set it
back to the normal state.
Do the following steps:
1. From the Resource Repair Action menu, select the parent or child
of the resource that has been replaced. If the repair action was
reseating a cable or adapter, select the resource associated with
that repair action. If the resource associated with your action is
not displayed on the resource list, select sysplanar0.
2. Press COMMIT after you make your selections.
Did the No Trouble Found menu display?
v Yes: Continue with the next step.
v No: Go to problem analysis.
14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the
settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from
CD-ROM, remove the stand-alone diagnostics CD-ROM from the system.
238 Disk drives
Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing
the configuration?
Note: This does not refer to the PCI-X RAID adapter or cache.
v Yes: Continue with the next step.
v No: Go to step 16.
15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following
steps:
1. On the PCI SCSI Disk Array Manager dialog, select Recovery options.
2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI
Adapter Configuration and press F3.
3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration.
4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives.
5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed.
6. On the next dialog, press Enter.
7. When you see the Are You Sure selection menu, press Enter to continue.
8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that
you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating
system.
9. Go to 16.
16. The system hardware is functioning correctly. Return the server to the state that the customer
normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the
system from service mode. If the system is left in service mode, it
automatically places a call for service every two hours.
Closing a service call using Integrated Virtualization Manager
Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to
return to the customer.
Follow this checklist before performing the procedures:
v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the
way the system is configured or partitioned.
Attention: Before returning the system to the customer, remove the system from service mode. If the
system is left in service mode, it automatically places a call for service every two hours.
v While you were performing the problem analysis on the original serviceable event, other
serviceable-event numbers might have been opened. Close all serviceable events that were opened as a
result of your service activity.
v Ensure that server verification has been performed and there are no problems that require additional
service actions.
v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures,
ensure that the original serviceable event is now closed.
1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable
unit (FRU) you replaced, for future reference.
Closing a service call 239
2. On the IVM, open Manage Serviceable Events and look at existing serviceable events.
3. Are there any service action events that are open?
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 245. Return the system to the customer. This completes the repair.
4. Record the list of open service action events.
5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 242 for
each open service action event.
6. Determine the error class of the serviceable event. Record for future use.
7. Examine the details of the open service action event.
Is the error code associated with this service action event the same as
recorded in step 1 on page 239?
v Yes: Go to step 11.
v No: Continue with the next step.
8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event?
v Yes: Continue with the next step.
v No: Go to step 11.
9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in step 1 on page 239?
v Yes: Go to step 11.
v No: Continue with the next step.
10. Is the FRU you replaced and recorded in step 1 on page 239 in the list of FRUs for this service action
event?
v Yes: Continue with the next step.
v No: Go to step 30 on page 242.
Note: There are service action events that will remain open when you leave this MAP. Further service actions
might be required to complete the repair.
11. Examine the details of this service action event, and record the partitions involved in this service
action event for use in a later step.
240 Disk drives
12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
v Yes: Continue with the next step.
v No: Go to step 17.
13. Have you begun a list of Axx partitions from prior service action events that you processed in this
maintenance analysis procedure (MAP)?
v Yes: Go to step 15.
v No: Continue with the next step.
14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 240.
Go to step 16.
15. Add the partition list obtained in step 11 on page 240 to the existing list of Axx partitions obtained
from processing previous service action events in this MAP.
16. Remove all entries in the list of all partitions you recorded in step 11 on page 240. If you are referred
to the list of partitions obtained in step 11 on page 240 in future steps, the list is empty. Go to step
17.
17. Select and highlight the service action event from the Manage Serviceable Events window.
18. Click Close Event.
19. Add comments for the serviceable event. Include any unique additional information. Click OK.
20. Add or update FRU information:
Did you replace, add, or modify a FRU of the open service action event?
v Yes: Continue with the next step.
v No: Go to step 22.
21. Click OK to close the service action event.
22. Is the list of all partitions you recorded in step 11 on page 240 empty?
v Yes: Go to step 30 on page 242.
v No: Continue with the next step.
Closing a service call 241
23. Does the list of all partitions you recorded in step 11 on page 240 contain more than one entry?
v Yes: Continue with the next step.
v No: Go to step 30.
24. Is the error class recorded in step 23?
v Yes: Continue with the next step.
v No: Go to step 30.
25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page
240, except the partition you were using to debug the original problem.
26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX
command prompt.
27. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
Note: If the terminal type is not defined, you
are prompted to define it before you can
proceed.
28. Exit from diagnostics in this partition and return to the AIX command prompt.
29. Have all the partitions in the list of all partitions you recorded in step 11 on page 240 been
processed?
v Yes: Continue with the next step.
v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 240.
30. Have all the serviceable events recorded in step 4 on page 240 been processed?
v Yes: Continue with the next step.
v No: Go to step 5 on page 240 and process the next service action event in the list of serviceable events recorded in
step 4 on page 240.
31. While processing all service action events, were you directed to step 14 on page 241?
242 Disk drives
v Yes: Continue with the next step.
v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on
page 245. Return the system to the customer. This completes the repair.
Note: If during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
32. Perform the following steps for each entry in the list of Axx partitions you began recording in step
14 on page 241, except the partition you were using to debug the original problem.
33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the
AIX command prompt.
34. When the diagnostic operating instructions are displayed, do the following steps:
1. Press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to
define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair
action was reseating a cable or adapter, select the resource
associated with that repair action. If the resource associated with
your repair action is not displayed on the Resource List, select
sysplanar0.
5. Click Commit after you have made your selection.
35. Exit from diagnostics in this partition and return to the AIX command prompt.
36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 241 been
processed?
v Yes: Continue with the next step.
v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 241.
37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating
LEDs” on page 245. This completes the repair. Return the system to the customer.
Note: If, during the processing of the list of open service action events, some service action events remained open,
further service actions might be required to complete the repair.
Closing a service call 243
244 Disk drives
Activating and deactivating LEDs
You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the
management console or the Advanced System Management Interface (ASMI).
For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or
verify a part that you are servicing. The error and identify function (amber) LED indicates an error and
corresponds to the location code in the system reference code (SRC). The LED is activated and
deactivated automatically.
Additionally, the following procedures can also be used to activate and deactivate LEDs.
v “Deactivating a system attention LED or partition LED using the management console”
v “Activating or deactivating an identify LED using the management console” on page 246
v “Deactivating a system attention LED or logical partition LED using the Advanced System
Management Interface” on page 246
v “Activating or deactivating an identify LED using the Advanced System Management Interface” on
page 247
Deactivating a system attention LED or partition LED using the
management console
You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not
a high priority and you decide to repair the problem at a later time. You can do this task from the
Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
If you want to be alerted if another problem occurs, you must deactivate the system attention LED so
that it can be activated again if another problem occurs.
To deactivate a system attention LED by using HMC, complete the following steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on by checking the box next to its name.
3. Open Operations.
4. Open LED Status.
5. Select View System Attention. The system attention LED window opens. The selected system and its
LED state are displayed in the upper part of the window. The logical partition and its LED state are
displayed in the lower part of the window. From the system attention LED window, you can
deactivate both the system attention LED and the logical partition LED.
6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed
that provides the following information:
v A verification that the system attention LED was deactivated.
v An indication that there still might be open problems within the system.
v An indication that you cannot activate the system attention LED.
7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the
Partition Operations menu. A confirmation window is displayed that provides the following
information:
v A verification that the logical partition LED was deactivated.
v An indication that there still might be open problems within the logical partition.
v An indication that you cannot activate the logical partition LED.
© Copyright IBM Corp. 2010, 2011 245
To deactivate a system attention LED by using SDMC, complete the following steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> System Attention LED.
3. Click OK.
Activating or deactivating an identify LED using the management
console
You can activate or deactivate an identify LED for components attached to the system from the Hardware
Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
The system provides several LEDs that help identify various components, such as enclosures or
field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs.
You can activate or deactivate the following types of identify LEDs:
v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you
need to know the machine type, model, and serial number (MTMS) of the drawer. To determine
whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the
LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter.
v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a
specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU),
and then physically check to see where you should hook up the cable. This is especially useful when
you have several adapters with open ports.
To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps:
1. In the navigation area, open Systems management.
2. Select the server you are working on.
3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens.
4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select
Selected ? List FRUs.
6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these
steps:
1. On the Welcome page, under the Resources tab, select the server.
2. Click Actions > Service and Support> Hardware> Identify LED.
3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click
either Activate LED or Deactivate LED. The associated LED is either turned on or off.
4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click
List FRUs.
5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The
associated LED is either turned on or off.
Deactivating a system attention LED or logical partition LED using the
Advanced System Management Interface
You can deactivate a system attention LED or a logical partition LED using the Advanced System
Management Interface (ASMI).
246 Disk drives
The system attention indicator provides a visual signal that the system as a whole requires attention or
service. Each system has a single system attention indicator. When an event occurs that either needs your
intervention or that of service and support, the system attention indicator lights continuously. The system
attention indicator is turned on when an entry is made in the service processor error log. The error entry
is transmitted to the system error log and the operating system error log.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To turn off the system attention indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention
Indicator.
3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error
message is displayed.
Activating or deactivating an identify LED using the Advanced System
Management Interface
You can activate or deactivate an identify LED using the Advanced System Management Interface
(ASMI).
You can specify the location code of any indicator to view or modify its current state. If you provide the
wrong location code, the advanced system manager attempts to go to the next higher level of the location
code.
The next level is the base-level location code for that field replaceable unit (FRU). For example, a user
types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If
the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt
to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no
other level is available.
To perform this operation, your authority level must be one of the following levels:
v Administrator
v Authorized service provider
To change the current state of an indicator, do the following steps:
1. On the ASMI Welcome pane, specify your user ID and password, and click Log In.
2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location
code.
3. In the right pane, enter the location code of the FRU and click Continue.
4. Select the preferred state from the list.
5. Click Save settings.
Activating and deactivating LEDs 247
248 Disk drives
Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer's representative for information on the products and services
currently available in your area. Any reference to the manufacturer's product, program, or service is not
intended to state or imply that only that product, program, or service may be used. Any functionally
equivalent product, program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not grant you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
The manufacturer may make improvements and/or changes in the product(s) and/or the program(s)
described in this publication at any time without notice.
Any references in this information to websites not owned by the manufacturer are provided for
convenience only and do not in any manner serve as an endorsement of those websites. The materials at
those websites are not part of the materials for this product and use of those websites is at your own risk.
The manufacturer may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Any performance data contained herein was determined in a controlled environment. Therefore, the
results obtained in other operating environments may vary significantly. Some measurements may have
been made on development-level systems and there is no guarantee that these measurements will be the
same on generally available systems. Furthermore, some measurements may have been estimated through
extrapolation. Actual results may vary. Users of this document should verify the applicable data for their
specific environment.
Information concerning products not produced by this manufacturer was obtained from the suppliers of
those products, their published announcements or other publicly available sources. This manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products not produced by this manufacturer. Questions on the capabilities of products
not produced by this manufacturer should be addressed to the suppliers of those products.
All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal
without notice, and represent goals and objectives only.
The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are
subject to change without notice. Dealer prices may vary.
© Copyright IBM Corp. 2010, 2011 249
This information is for planning purposes only. The information herein is subject to change before the
products described become available.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
If you are viewing this information in softcopy, the photographs and color illustrations may not appear.
The drawings and specifications contained herein shall not be reproduced in whole or in part without the
written permission of the manufacturer.
The manufacturer has prepared this information for use with the specific machines indicated. The
manufacturer makes no representations that it is suitable for any other purpose.
The manufacturer's computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience
unplanned outages, system failures, power fluctuations or outages, or component failures must verify the
accuracy of operations performed and data saved or transmitted by the system at or near the time of the
outage or failure. In addition, users must establish procedures to ensure that there is independent data
verification before relying on such data in sensitive or critical operations. Users should periodically check
the manufacturer's support websites for updated information and fixes applicable to the system and
related software.
Ethernet connection usage restriction
This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces
of public telecommunications networks.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Other product and service names might be trademarks of IBM or other companies.
Electronic emission notices
When attaching a monitor to the equipment, you must use the designated monitor cable and any
interference suppression devices supplied with the monitor.
Class A Notices
The following Class A statements apply to the IBM servers that contain the POWER7 processor and its
features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
Federal Communications Commission (FCC) statement
Note: This equipment has been tested and found to comply with the limits for a Class A digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference when the equipment is operated in a commercial environment. This equipment
250 Disk drives
generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, may cause harmful interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the user will be
required to correct the interference at his own expense.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. IBM is not responsible for any radio or television interference caused by using other than
recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class A digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for
commercial and industrial environments to provide reasonable protection against interference with
licensed communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
Warning: This is a Class A product. In a domestic environment, this product may cause radio
interference, in which case the user may be required to take adequate measures.
VCCI Statement - Japan
The following is a summary of the VCCI Japanese statement in the box above:
Notices 251
This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a
domestic environment, radio interference may occur, in which case, the user may be required to take
corrective actions.
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Electromagnetic Interference (EMI) Statement - People's Republic of China
Declaration: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may need to perform practical action.
Electromagnetic Interference (EMI) Statement - Taiwan
The following is a summary of the EMI Taiwan statement above.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
IBM Taiwan Contact Information:
252 Disk drives
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Notices 253
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Electromagnetic Interference (EMI) Statement - Russia
Class B Notices
The following Class B statements apply to features designated as electromagnetic compatibility (EMC)
Class B in the feature installation information.
Federal Communications Commission (FCC) statement
This equipment has been tested and found to comply with the limits for a Class B digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference in a residential installation.
This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in
accordance with the instructions, may cause harmful interference to radio communications. However,
there is no guarantee that interference will not occur in a particular installation.
If this equipment does cause harmful interference to radio or television reception, which can be
determined by turning the equipment off and on, the user is encouraged to try to correct the interference
by one or more of the following measures:
v Reorient or relocate the receiving antenna.
v Increase the separation between the equipment and receiver.
v Connect the equipment into an outlet on a circuit different from that to which the receiver is
connected.
v Consult an IBM-authorized dealer or service representative for help.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for
254 Disk drives
any radio or television interference caused by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate this equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class B digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class B Information Technology
Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for
typical residential environments to provide reasonable protection against interference with licensed
communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
email: tjahn@de.ibm.com
VCCI Statement - Japan
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Notices 255
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
IBM Taiwan Contact Information
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
256 Disk drives
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
email: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Terms and conditions
Permissions for the use of these publications is granted subject to the following terms and conditions.
Personal Use: You may reproduce these publications for your personal, noncommercial use provided that
all proprietary notices are preserved. You may not distribute, display or make derivative works of these
publications, or any portion thereof, without the express consent of the manufacturer.
Commercial Use: You may reproduce, distribute and display these publications solely within your
enterprise provided that all proprietary notices are preserved. You may not make derivative works of
these publications, or reproduce, distribute or display these publications or any portion thereof outside
your enterprise, without the express consent of the manufacturer.
Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either
express or implied, to the publications or any information, data, software or other intellectual property
contained therein.
The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its
discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer,
the above instructions are not being properly followed.
You may not download, export or re-export this information except in full compliance with all applicable
laws and regulations, including all United States export laws and regulations.
THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF
ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED
WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR
PURPOSE.
Armoires,dispositifs d'armoire
et installation en armoire de
systèmes ou d'unités
d'extension pour E1-700,
E3-700, E2-700, E2-700T,
E4-700 ou E4-700T
REFERENCE
86 F1 29FG 01
ESCALA Power7
Armoires,dispositifs d'armoire et installation
en armoire de systèmes ou d'unités
d'extension pour E1-700, E3-700, E2-700,
E2-700T, E4-700 ou E4-700T
Cette publication concerne les modèles suivants :
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
Les références au modèle Power 755 / 8236-E8C sont sans objet.
Hardware
Septembre 2010
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 F1 29FG 01
L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui
prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits
dérivés.
Copyright Bull SAS 2007-2010
Imprimé en France
Marques déposées
Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs.
Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des
marques et/ou des brevets.
La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit
des marques et/ou des brevets.
Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS
ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce
manuel, ni pour tout dommage pouvant résulter de son application.
Table des matières
Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités
d'extension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités
d'extension - Nouveautés . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Installation de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installation de l'armoire 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . . . . . . . 5
Inventaire des composants. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Positionnement de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Mise à niveau de l'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Fixation des équerres de stabilisation . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Fixation de l'armoire à un plancher en béton . . . . . . . . . . . . . . . . . . . . . . . 8
Fixation de l'armoire à un plancher en béton sous un faux plancher . . . . . . . . . . . . . . . 13
Raccordement du système de distribution de l'alimentation . . . . . . . . . . . . . . . . . . 18
Vérification des prises de courant alternatif. . . . . . . . . . . . . . . . . . . . . . . . 18
Fixation de la prise de courant alternatif avant ou arrière . . . . . . . . . . . . . . . . . . . 19
Installation des plaques de montage pour l'alimentation CA avec prises CA. . . . . . . . . . . . 19
Installation des plaques de montage pour l'alimentation CA sans prises CA. . . . . . . . . . . . 22
Connexion d'une source d'alimentation CC. . . . . . . . . . . . . . . . . . . . . . . . 23
Installation en armoire d'une unité centrale ou d'une unité d'extension . . . . . . . . 29
Installation du modèle 02E/4B ou 05F/6B en armoire . . . . . . . . . . . . . . . . . . . . . 29
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Fixation du support de montage 02E/4B ou 05F/6B dans l'armoire. . . . . . . . . . . . . . . . 32
Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 35
Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 38
Installation du modèle 31E/2B en armoire . . . . . . . . . . . . . . . . . . . . . . . . . 38
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Fixation du matériel de montage 31E/2B à l'armoire. . . . . . . . . . . . . . . . . . . . . 42
Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 44
Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 47
Installation du modèle 33E/8B ou 36E/8C en armoire . . . . . . . . . . . . . . . . . . . . . 48
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Fixation du support de montage 33E/8B ou 36E/8C dans l'armoire . . . . . . . . . . . . . . . 52
Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 59
Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 60
Installation en armoire du modèle 58/02 ou 58/77 . . . . . . . . . . . . . . . . . . . . . . 61
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . . 64
Installation du modèle 58/02 ou 58/77 sur les glissières . . . . . . . . . . . . . . . . . . . 67
Installation du modèle 58/86 en armoire . . . . . . . . . . . . . . . . . . . . . . . . . 70
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . 73
Installation du modèle 58/86 sur les glissières. . . . . . . . . . . . . . . . . . . . . . 75
iii
Installation en armoire de l'unité d'extension ou 14G/30 . . . . . . . . . . . . . . . . . . . . 78
Mise hors tension du système et coupure de l'alimentation . . . . . . . . . . . . . . . . . . 78
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
Mise en place des glissières extensibles et du support d'extension dans l'armoire . . . . . . . . . . . 80
Installation du boîtier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
Installation de l'unité d'extension . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Fixation des câbles d'alimentation et du réseau de contrôle de l'alimentation système (SPCN) . . . . . . . 85
Mise sous tension du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
Installation en armoire du modèle 57/86 ou 31D/24. . . . . . . . . . . . . . . . . . . . . . 85
Inventaire des composants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . . 89
Installation du modèle 57/86 ou 31D/24 sur les glissières. . . . . . . . . . . . . . . . . . . 91
Fixation du bras de routage des câbles aux glissières standard . . . . . . . . . . . . . . . . . 94
Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42 . . . . . . . . . . 97
Remise en place d'un panneau latéral 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . . 98
Retrait et remise en place de panneaux d'habillage 14T/00 ou 14T/42. . . . . . . . . 99
Remise en place de panneaux d'habillage 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . 99
Fixation des volets de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . 101
Fixation d'un volet avant de type High-perforation. . . . . . . . . . . . . . . . . . . . . . 101
Consignes de sécurité relatives aux armoires . . . . . . . . . . . . . . . . . . . . . . . . 102
Installation du kit de sécurité de l'armoire . . . . . . . . . . . . . . . . . . . . 105
Kit de renforcement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Libération de l'équerre renforcée . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
Libération du panneau latéral à l'aide d'un kit de renforcement. . . . . . . . . . . . . . . . . . 110
Connexion de plusieurs armoires à l'aide d'un kit de connexion multiarmoire . . . . 113
Alarme de statut d'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Installation de l'alarme de statut d'armoire . . . . . . . . . . . . . . . . . . . . . . . . 115
Connexion de plusieurs boîtes de jonction. . . . . . . . . . . . . . . . . . . . . . . . 117
Installation ou retrait d'un taquet de verrouillage d'unité centrale montée en armoire 119
Retrait et remise en place du capot supérieur sur l'armoire . . . . . . . . . . . . 121
Retrait du capot supérieur de l'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . 121
Remise en place du capot supérieur sur l'armoire . . . . . . . . . . . . . . . . . . . . . . 123
Unité d'alimentation plus . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
Installation de l'unité PDU+ sur le côté d'une armoire. . . . . . . . . . . . . . . . . . . . . 127
Installation horizontale en armoire de la PDU+ . . . . . . . . . . . . . . . . . . . . . . 129
Configuration du contrôle de l'alimentation via l'unité PDU+ . . . . . . . . . . . . . . . . . . 133
Utilisation de l'utilitaire de configuration . . . . . . . . . . . . . . . . . . . . . . . . 133
Connexion de la console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
Utilisation d'HyperTerminal . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
Options du menu de l'utilitaire de configuration . . . . . . . . . . . . . . . . . . . . 134
Configuration des adresses IP. . . . . . . . . . . . . . . . . . . . . . . . . . . 135
Utilisation de l'interface Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
Démarrage de l'interface Web . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
Modification des paramètres de base . . . . . . . . . . . . . . . . . . . . . . . . 136
Modification du nom de superutilisateur et du mot de passe . . . . . . . . . . . . . . . 136
Identification de la PDU+ et de la carte Web/SNMP . . . . . . . . . . . . . . . . . . 136
iv Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Ajout d'utilisateurs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
Modification de la date et de l'heure. . . . . . . . . . . . . . . . . . . . . . . . 137
Modification des alertes événement . . . . . . . . . . . . . . . . . . . . . . . . 137
Modification des informations réseau . . . . . . . . . . . . . . . . . . . . . . . . 138
Modification de la configuration de réseau . . . . . . . . . . . . . . . . . . . . . 138
Récapitulatifs du journal de l'historique et des événements . . . . . . . . . . . . . . . . . 138
Affichage du journal de l'historique . . . . . . . . . . . . . . . . . . . . . . . . 138
Affichage du journal des événements . . . . . . . . . . . . . . . . . . . . . . . 139
Retrait et remplacement du capot ou du volet de l'unité d'extension . . . . . . . . 141
Retrait du capot avant des modèles 14G/30 . . . . . . . . . . . . . . . . . . . . . . . . 141
Installation du capot avant des modèles 14G/30. . . . . . . . . . . . . . . . . . . . . . . 142
Retrait et installation des carters sur le système 33E/8B ou 36E/8C . . . . . . . . . 143
Retrait du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . . 143
Installation du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 144
Retrait du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . . . 145
Installation du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 146
Retrait et remise en place du capot avant sur les modèles 17M/MB et 79M/HB . . . . . . . . . . . . 147
Retrait du carter avant d'un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . . . 147
Installation du carter avant sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . 148
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou
36E/8C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151
Mise en position de maintenance de l'unité centrale ou de l'unité d'extension . . . . . . . . . . . . . 151
Mise en position fonctionnement de l'unité centrale ou de l'unité d'extension . . . . . . . . . . . . . 155
Mise en position de maintenance de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . 157
Mise en position de fonctionnement de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 160
Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170
Table des matières v
vi Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Avis aux lecteurs canadiens
Le présent document a été traduit en France. Voici les principales différences et particularités dont vous
devez tenir compte.
Illustrations
Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la
France.
Terminologie
La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au
besoin.
IBM France IBM Canada
ingénieur commercial représentant
agence commerciale succursale
ingénieur technico-commercial informaticien
inspecteur technicien du matériel
Claviers
Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier
français-canadien, de type QWERTY.
OS/2 - Paramètres canadiens
Au Canada, on utilise :
v les pages de codes 850 (multilingue) et 863 (français-canadien),
v le code pays 002,
v le code clavier CF.
Nomenclature
Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit
du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau
pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre
clavier.
vii
Recommandations à l'utilisateur
Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications
radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur
(instructions d'utilisation, manuels de référence et manuels d'entretien).
Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors
tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou
plusieurs des mesures suivantes :
v Réorienter l'antenne réceptrice ;
v Déplacer l'équipement par rapport au récepteur ;
v Éloigner l'équipement du récepteur ;
v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent
sur des circuits distincts ;
v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien
serrés ;
v Vérifier la mise en place des obturateurs sur les connecteurs libres.
Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des
câbles blindés mis à la terre, à travers des filtres si nécessaire.
En cas de besoin, adressez-vous à votre détaillant.
Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des
modifications non autorisées ont été effectuées sur l'équipement.
viii Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
L'obligation de corriger de telles interférences incombe à l'utilisateur.
Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus
amples renseignements.
Brevets
Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur
certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie
pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos
demandes de renseignements relatives aux permis d'utilisation au directeur général des relations
commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7.
Assistance téléphonique
Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des
publications IBM, contactez IBM direct au 1 800 465-1234.
Avis aux lecteurs canadiens ix
x Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Consignes de sécurité
Les consignes de sécurité peuvent être imprimées tout au long de ce guide.
v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles.
v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de
certaines circonstances réunies.
v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une
unité, un système ou des données.
Consignes de sécurité relatives au commerce international
Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du
produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est
inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité
dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication
en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous
familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce
livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez
claires pour vous.
Informations sur les appareils à laser
Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des
lasers ou des diodes électroluminescentes (LED).
Conformité aux normes relatives aux appareils à laser
Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique.
xi
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
xii Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Consignes de sécurité xiii
ATTENTION :
Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du
déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment,
procédez comme suit :
v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si
possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette
configuration, procédez comme suit :
– Retirez toutes les unités de la position 32U et plus.
– Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire.
– Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en
dessous du niveau 32U.
v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier.
v Vérifiez l'itinéraire envisagé pour éliminer tout risque.
v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus
d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre
armoire.
v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm.
v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles
sont bien fixés.
v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute.
v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement.
v N'utilisez pas de rampe inclinée à plus de dix degrés.
v Dès que l'armoire est à son nouvel emplacement, procédez comme suit :
– Abaissez les quatre vérins de mise à niveau.
– Installez des équerres de stabilisation sur l'armoire.
– Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie
inférieure à la partie supérieure de l'armoire.
v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez
l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les
vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez
l'armoire à celle-ci.
(R002)
(L001)
(L002)
xiv Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
(L003)
ou
Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le
sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont
certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes
sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité.
ATTENTION :
Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM,
DVD-RAM ou module à laser. Notez les informations suivantes :
v Ne retirez pas les carters. En ouvrant le produit à laser, vous vous exposez au rayonnement
dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur.
v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation
des commandes, ainsi que les procédures décrites dans le présent manuel.
(C026)
Consignes de sécurité xv
ATTENTION :
Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux
de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à
l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027)
ATTENTION :
Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028)
ATTENTION :
Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez
connaissance des informations suivantes. Rayonnement laser lorsque le carter est ouvert. Evitez toute
exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide
d'instruments optiques. (C030)
Informations sur l'alimentation électrique et sur le câblage relatives au document
GR-1089-CORE du NEBS (Network Equipment-Building System)
Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document
GR-1089-CORE du NEBS (Network Equipment-Building System) :
Cet équipement peut être installé :
v dans des infrastructures de télécommunications réseau
v aux endroits préconisés dans les directives NEC (National Electrical Code).
Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles
internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux
interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être
exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document
GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de
dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie
métallique au câblage du réseau extérieur.
Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités.
Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif
externe de protection contre les surtensions (SPD).
Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne
de retour de la batterie en courant continu ne doit pas être connectée à la masse.
xvi Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Armoires, dispositifs d'armoire et installation en armoire de
systèmes ou d'unités d'extension
Informations sur les procédures d'installation en armoire. Procédures détaillées concernant l'installation
de dispositifs, d'unités centrales et d'extension dans une armoire.
Vous pouvez exécuter ces tâches vous-même ou faire appel à un fournisseur de services. Il vous imputera
probablement les frais correspondants.
1
2 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Armoires, dispositifs d'armoire et installation en armoire de
systèmes ou d'unités d'extension - Nouveautés
Prenez connaissance des nouveautés et modifications significatives apportées à la rubrique Armoires,
dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension depuis la dernière
mise à jour de cette collection.
Septembre 2010
Mises à jour du contenu :
v Procédures supplémentaire pour «Installation du modèle 02E/4B ou 05F/6B en armoire», à la page 29.
v Procédures supplémentaire pour «Installation du modèle 31E/2B en armoire», à la page 38.
v Procédures supplémentaire pour «Retrait et remise en place de panneaux d'habillage 14T/00 ou
14T/42», à la page 99.
3
4 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Installation de l'armoire
Cette section décrit l'ensemble des tâches requises pour l'installation d'armoires (modèle 7014-T00 ou
7014-T42), depuis l'inventaire des composants jusqu'au branchement sur une source d'alimentation.
Pourquoi et quand exécuter cette tâche
Installation de l'armoire 14T/00 ou 14T/42
Cette procédure permet d'installer les armoires de type 14T/00 et 14T/42.
Pourquoi et quand exécuter cette tâche
Pour installer le kit de sécurité de l'armoire, voir «Installation du kit de sécurité de l'armoire», à la page
105 après l'installation de l'armoire.
Avant d'installer une armoire, voir «Consignes de sécurité relatives aux armoires», à la page 102.
Inventaire des composants
Il est préférable d'effectuer un inventaire des composants avant de commencer l'installation de l'armoire.
Cette section décrit la procédure à suivre.
Pourquoi et quand exécuter cette tâche
Si vous ne l'avez pas encore fait, effectuez un inventaire des pièces avant d'installer l'unité centrale dans
l'armoire :
Procédure
1. Localisez le rapport des kits dans une boîte d'accessoires.
2. Vérifiez que vous avez reçu tous les éléments commandés et toutes les pièces indiquées dans le kit.
Résultats
En cas de composants manquants, endommagés ou ne correspondant pas à la commande, vous disposez
des contacts suivants :
v Revendeur
Positionnement de l'armoire
La conformité aux normes de réglementation et de sécurité exige un positionnement correct de l'armoire.
La procédure de cette section permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Une fois l'armoire installée à sa place sur le sol, bloquez chaque roulette en serrant la vis de verrouillage.
Pour connaître l'emplacement de la vis de verrouillage (2), voir la figure suivante. Retirez toutes les
bandes de protection et les emballages de l'armoire.
5
1 Roulette
2 Vis de verrouillage
Pour passer à l'étape suivante, procédez comme suit :
v Pour fixer l'armoire sur un plancher en béton, voir «Fixation de l'armoire à un plancher en béton», à la
page 8.
v Pour fixer l'armoire sur un plancher en béton recouvert d'un faux plancher, voir «Fixation de l'armoire
à un plancher en béton sous un faux plancher», à la page 13.
v Pour fixer l'armoire au plancher, voir «Mise à niveau de l'armoire».
Mise à niveau de l'armoire
S'il est nécessaire de niveler l'armoire, procédez comme décrit dans cette section.
Pourquoi et quand exécuter cette tâche
Pour niveler l'armoire, procédez comme suit :
Procédure
1. Desserrez le contre-écrou sur chaque pied de nivellement.
2. Faites pivoter chaque pied de nivellement vers le bas jusqu'à ce qu'il soit en contact avec la surface
sur laquelle est placée l'armoire.
3. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire à
niveau, serrez les contre-écrous sur la base.
Figure 1. Serrage de la vis de verrouillage
6 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Avant de l'armoire (base)
2 Pied de nivellement (quantité 4)
3 Contre-écrou (x4)
Résultats
Fixation des équerres de stabilisation
Il se peut que vous deviez fixer les équerres de stabilisation à l'armoire. Cette section vous aide à
déterminer si des équerres de stabilisation sont nécessaires et décrit comment les fixer.
Pourquoi et quand exécuter cette tâche
Si vous devez installer des prises électriques en courant alternatif avant et arrière dans l'armoire, vous ne
pouvez pas fixer d'équerre de stabilisation. Vous devez fixer l'armoire au sol. Vous utilisez des équerres
de stabilisation uniquement lorsque vous ne fixez pas l'armoire au plancher. Pour fixer l'armoire au
plancher, voir «Fixation de l'armoire à un plancher en béton», à la page 8.
Pour fixer les équerres de stabilisation au bas de l'armoire, procédez comme suit :
Remarque : Avant d'installer les équerres de stabilisation, consultez «Fixation de la prise de courant
alternatif avant ou arrière», à la page 19 pour obtenir des instructions sur l'installation des plaques de
montage des prises de courant alternatif.
Procédure
1. Alignez les emplacements de l'une des équerres de stabilisation sur les trous de vis, au bas de l'avant
de l'armoire.
2. Installez les deux vis de fixation.
3. Vérifiez que la base de l'équerre de stabilisation repose fermement sur le sol. A l'aide de la clé Allen
fournie avec l'armoire, serrez les vis de fixation, en passant de l'une à l'autre jusqu'à vissage complet.
Figure 2. Réglage des pieds de nivellement
Installation de l'armoire 7
1 Avant de l'armoire (base) 3 Equerre de stabilisation
2 Vis de fixation du stabilisateur 4 Arrière de l'armoire (base)
4. Pour installer la seconde équerre de stabilisation à l'arrière de l'armoire, répétez les étapes 1 à 3.
Résultats
Fixation de l'armoire à un plancher en béton
Si vous devez installer des prises électriques en courant alternatif avant et arrière dans l'armoire, vous
devez fixer l'armoire au sol. Cette section explique comment effectuer cette tâche sur un plancher en
béton.
Pourquoi et quand exécuter cette tâche
Faites appel à un entrepreneur en mécanique pour fixer les plaques de montage de l'armoire au plancher
en béton. Celui-ci doit déterminer que le matériel utilisé pour fixer les plaques de montage de l'armoire
au plancher en béton respecte les exigences de l'installation.
Pour fixer l'armoire à un plancher en béton, procédez comme suit :
Procédure
1. Installez l'armoire sur son emplacement prédéfini et serrez les vis de verrouillage sur les roulettes.
2. S'ils sont en place, retirez les panneaux d'habillage haut, gauche et droit. Les panneaux d'habillage
sont maintenus en place par des pinces de fixation Reportez-vous à l'illustration suivante
Figure 3. Fixation des équerres de stabilisation
8 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Boîtier d'armoire 4 Panneau d'habillage droit
2 Panneau d'habillage haut 5 Pince de fixation
3 Panneau d'habillage gauche
3. S'ils sont en place, retirez les volets avant et arrière. Une fois les volets de l'armoire retirés, passez à
l'étape suivante. Pour enlever un volet d'armoire :
a. Déverrouillez et ouvrez le volet.
b. Saisissez le volet fermement des deux mains et extrayez-le de la charnière en le tirant vers
l'extérieur.
4. Localisez le kit de montage du matériel et les deux plaques de montage. Lors de l'inventaire du
contenu du kit de montage du matériel, voir la figure suivante. Le kit de montage du matériel
contient les éléments suivants :
v 4 boulons de montage d'armoire
v 4 rondelles plates
v 8 bagues d'isolation en plastique
v 4 rondelles épaisses
v 4 entretoises
5. Si vous installez une armoire alimentée en courant alternatif, placez provisoirement les bagues
d'isolation en plastique inférieures afin de repérer les emplacements de montage de l'équerre de
Figure 4. Retrait des panneaux d'habillage
Installation de l'armoire 9
stabilisation. Une fois l'équerre de stabilisation correctement mise en place, retirez les bagues
d'isolation en plastique inférieures.
1 Boîtier d'armoire 7 Contre-écrou
2 Boulon de montage d'armoire 8 Pied de nivellement
3 Rondelle plate 9 Bague d'isolation en plastique
inférieure (utilisée uniquement sur
les systèmes alimentés en courant
continu)
4 Bague d'isolation en plastique
supérieure
CA Installation standard des pieds de
nivellement sur une armoire
alimentée en courant alternatif
5 Rondelle épaisse CC Installation standard des pieds de
nivellement sur une armoire
alimentée en courant continu
6 Entretoise
6. Placez les deux plaques de montage dans la position de montage approximative, sous l'armoire.
7. Créez un assemblage de boulon de montage d'armoire en ajoutant les pièces suivantes, dans l'ordre
indiqué, à chaque boulon de montage d'armoire :
a. Rondelle plate
b. Bague d'isolation en plastique supérieure
c. Rondelle épaisse
d. Entretoise
8. Insérez un assemblage de boulon de montage d'armoire sur chaque pied de nivellement.
9. Repositionnez les plaques de montage d'armoire sous les quatre boulons de montage d'armoire afin
que ces derniers soient centrés directement sur les trous taraudés.
Figure 5. Installation de plaques de montages CA
10 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
10. Exercez quatre tours complets sur les boulons de montage d'armoire dans les trous taraudés de la
plaque de montage.
1 Boulon de montage d'armoire 7 Pied de nivellement
2 Rondelle plate 8 Bague d'isolation en plastique
inférieure (utilisée uniquement sur
les systèmes alimentés en courant
continu)
3 Bague d'isolation en plastique
supérieure
9 Plaque de montage
4 Rondelle épaisse 10 Trou taraudé (permet de fixer
l'armoire à l'équerre de stabilisation)
5 Entretoise 11 Trou de boulon d'ancrage
6 Contre-écrou 12 Schéma tracé (schéma à tracer sur le
sol, en utilisant l'équerre de
stabilisation comme gabarit)
11. Faites des repères sur le sol reproduisant les contours des deux équerres de stabilisation.
12. Marquez les trous de boulon inférieurs de la plaque qui sont accessibles par l'ouverture, à l'arrière de
l'armoire.
13. Retirez les assemblages de boulon de montage d'armoire.
14. Si vous installez une armoire alimentée en courant alternatif, retirez la bague d'isolation inférieure
sur chaque pied de nivellement.
15. Retirez les équerres de stabilisation des emplacements marqués.
16. Desserrez les vis de verrouillage sur les roulettes.
17. Eloignez l'armoire des deux zones identifiées au sol pour les emplacements des équerres de
stabilisation.
18. Repositionnez les équerres de stabilisation sur les zones marquées.
Figure 6. Fixation de l'armoire au plancher
Installation de l'armoire 11
19. Faites un repère au sol, au centre de tous les trous des deux équerres de stabilisation.
20. Retirez les deux plaques de montage d'armoire des emplacements repérés.
21. Sur les repères des trous de boulon de montage d'armoire taraudés, percez quatre trous de passage
dans le plancher en béton. Chacun d'eux doit avoir environ 2,5 cm de profondeur. Cette profondeur
permet aux boulons de montage de traverser l'épaisseur des équerres de stabilisation.
Remarque : Vous devez utiliser au minimum deux boulons d'ancrage sur chaque plaque de montage
pour fixer la plaque au plancher en béton. Il est possible que certains trous d'une plaque de montage
s'alignent sur des armatures en béton du plancher ; ils peuvent donc ne pas être utilisables.
22. Sélectionnez au moins deux emplacements de trous appropriés pour chaque boulon d'équerre de
stabilisation. Les emplacements sélectionnés doivent être aussi proches que possible des trous de
boulon taraudés. Vérifiez que les trous choisis à l'arrière de l'armoire sont accessibles. Percez des
trous sur les emplacements sélectionnés du plancher en béton.
23. Positionnez les équerres de stabilisation sur les ancrages en béton.
24. Serrez les boulons de l'équerre de stabilisation avant sur le plancher en béton.
25. Positionnez l'équerre de stabilisation sur les ancrages en béton.
26. Serrez les boulons de l'équerre de stabilisation arrière sur le plancher en béton.
Remarque : La taille des boulons d'ancrage et des ancrages en béton doit être déterminée par
l'entrepreneur en mécanique qui effectue l'installation des plaques de montage.
27. Positionnez l'armoire sur l'équerre de stabilisation.
28. Insérez chaque boulon d'équerre de stabilisation dans une rondelle plate, une bague d'isolation en
plastique et une rondelle épaisse et faites passer le tout par un pied de nivellement.
29. Alignez les quatre boulons d'équerre de stabilisation sur les quatre trous taraudés des deux plaques
de montage, puis effectuez trois à quatre tours.
30. Serrez les vis de verrouillage sur chaque roulette.
31. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire
à niveau, serrez les contre-écrous sur la base de l'armoire.
1 Avant de l'armoire (base)
2 Pied de nivellement (x4)
3 Contre-écrou (quantité 4)
12 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
32. Si plusieurs armoires sont reliées en série (fixées l'une à l'autre), voir «Connexion de plusieurs
armoires à l'aide d'un kit de connexion multiarmoire», à la page 113. Sinon, vissez les quatre boulons
à 54-67 nm.
33. Si vous n'installez pas de volets sur l'armoire, replacez les panneaux d'habillage haut, gauche et
droit.
34. Connectez le système de distribution de l'alimentation conformément à la description (voir
«Raccordement du système de distribution de l'alimentation», à la page 18).
35. Lorsque toutes les armoires sont fixées au plancher, passez à la section «Fixation de la prise de
courant alternatif avant ou arrière», à la page 19.
36. Si vous ne fixez pas de prise électrique avant et installez des volets d'armoire, voir «Fixation des
volets de l'armoire», à la page 101.
Résultats
Fixation de l'armoire à un plancher en béton sous un faux plancher
Si vous prévoyez d'installer des prises électriques en courant alternatif avant et arrière dans l'armoire,
vous devez fixer l'armoire au sol. Lorsque vous fixez l'armoire à un plancher en béton recouvert d'un
faux plancher, procédez comme il est expliqué dans cette section.
Pourquoi et quand exécuter cette tâche
Faites appel à un entrepreneur en mécanique pour fixer les plaques de montage de l'armoire au plancher
en béton. Il doit déterminer que le matériel utilisé pour fixer les plaques de montage de l'armoire au
plancher en béton respecte les exigences de l'installation.
Pour fixer l'armoire à un plancher en béton recouvert d'un faux plancher, procédez comme suit :
Procédure
1. Installez l'armoire sur son emplacement prédéfini et serrez les vis de verrouillage sur les roulettes.
2. Le cas échéant, retirez les panneaux d'habillage haut, gauche et droit. Les panneaux d'habillage sont
maintenus en place par des pinces de fixation. Reportez-vous à l'illustration suivante
Figure 7. Réglage des pieds de nivellement
Installation de l'armoire 13
1 Boîtier d'armoire 4 Panneau d'habillage droit
2 Panneau d'habillage haut 5 Pince de fixation
3 Panneau d'habillage gauche
3. Le cas échéant, retirez les volets avant et arrière. Pour retirer un volet d'armoire, voir . Une fois les
volets de l'armoire retirés, passez à l'étape suivante.
4. Localisez le kit de montage du matériel et les deux plaques de montage. Lors de l'inventaire du
contenu du kit de montage du matériel, voir la figure suivante. Le kit de montage du matériel
contient les éléments suivants :
v Quatre boulons de montage d'armoire
v Quatre rondelles plates
v Huit bagues d'isolation en plastique
v Quatre rondelles épaisses
v Quatre entretoises
5. Si vous installez une armoire alimentée en courant alternatif, placez provisoirement les bagues
d'isolation en plastique inférieures afin de mettre en place la plaque de montage de l'armoire. Une
fois la plaque de montage correctement mise en place, retirez les bagues d'isolation en plastique
inférieures.
Figure 8. Retrait des panneaux d'habillage
14 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Boîtier d'armoire 7 Contre-écrou
2 Boulon de montage d'armoire 8 Pied de nivellement
3 Rondelle plate 9 Bague d'isolation en plastique
inférieure (utilisée uniquement sur
les systèmes alimentés en courant
continu)
4 Bague d'isolation en plastique
supérieure
CA Installation standard des pieds de
nivellement sur une armoire
alimentée en courant alternatif
5 Rondelle épaisse CC Installation standard des pieds de
nivellement sur une armoire
alimentée en courant continu
6 Entretoise
6. Placez les deux plaques de montage dans la position de montage approximative, sous l'armoire.
7. Créez un assemblage de boulon de montage d'armoire en ajoutant les pièces suivantes, dans l'ordre
indiqué, à chaque boulon de montage d'armoire :
a. Rondelle plate
b. Bague d'isolation en plastique supérieure
c. Rondelle épaisse
d. Entretoise
8. Insérez un assemblage de boulon de montage d'armoire sur chaque pied de nivellement.
9. Repositionnez les plaques de montage d'armoire sous les quatre boulons de montage d'armoire afin
que ces derniers soient centrés directement sur les trous taraudés.
10. Exercez quatre tours complets sur les boulons de montage d'armoire dans les trous taraudés de la
plaque de montage.
11. Faites des repères sur le faux plancher autour des plaques de montage avant et arrière.
Figure 9. Installation des plaques de montage de l'unité d'alimentation
Installation de l'armoire 15
12. Marquez les trous de boulon inférieurs de la plaque qui sont accessibles par l'ouverture, à l'arrière de
l'armoire.
13. Retirez les assemblages de boulon de montage d'armoire.
14. Si vous installez une armoire alimentée en courant alternatif, retirez la bague d'isolation inférieure
sur chaque pied de nivellement.
15. Retirez les plaques de montage d'armoire des emplacements repérés.
16. Desserrez les vis de verrouillage sur les roulettes.
17. Eloignez l'armoire des deux zones identifiées au sol pour les emplacements des plaques de montage.
18. Repositionnez les plaques de montage sur les zones repérées.
19. Faites des repères sur le faux plancher au centre de chaque trou des plaques de montage (y compris
des trous taraudés).
20. Retirez les deux plaques de montage d'armoire des emplacements repérés sur le faux plancher.
21. Percez deux trous de passage à chaque extrémité de chaque plaque de montage. Chacun d'eux doit
avoir environ 2,5 cm de profondeur. Cette profondeur permettra le passage des boulons de montage
dans la plaque de montage lors de la fixation de cette dernière à l'armoire.
22. Sélectionnez au moins deux emplacements de trous appropriés pour chaque plaque de montage.
Sélectionnez les emplacements des trous aussi proches que possible des trous taraudés. Vérifiez que
les emplacements des trous choisis à l'arrière de l'armoire sont accessibles.
23. Percez des trous de passage dans le faux plancher. Ces trous permettent d'insérer les boulons
d'ancrage dans la plaque de montage et de traverser le faux plancher jusqu'au plancher en béton.
Remarque : Vous devez utiliser au minimum deux boulons d'ancrage sur chaque plaque de montage
pour fixer la plaque au plancher en béton via le faux plancher. Il est possible que certains trous
d'une plaque de montage s'alignent sur des armatures en béton du plancher ; ils peuvent donc ne
pas être utilisables.
24. Transférez les repères des trous des boulons d'ancrage (à l'exception de ceux des trous de passage
percés pour les boulons de montage de l'armoire) du faux plancher au plancher en béton et marquez
l'emplacement des trous sur le plancher en béton.
25. Percez des trous dans le plancher en béton pour fixer les boulons d'ancrage.
26. Remettez le faux plancher en position, au-dessus des trous des boulons d'ancrage.
27. Positionnez l'équerre de stabilisation avant sur la zone repérée du faux plancher.
28. A l'aide des boulons d'ancrage, fixez l'équerre de stabilisation avant sur le plancher en béton, en
traversant le faux plancher.
29. Positionnez l'équerre de stabilisation arrière sur la zone repérée du faux plancher.
16 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Boulon de montage d'armoire 7 Pied de nivellement
2 Rondelle plate 8 Bague d'isolation en plastique
inférieure (utilisée uniquement sur
les systèmes alimentés en courant
continu)
3 Bague d'isolation en plastique
supérieure
9 Equerres de stabilisation
4 Rondelle épaisse 10 Trou taraudé (permet de fixer
l'armoire à la plaque de montage)
5 Entretoise 11 Trou de boulon d'ancrage
6 Contre-écrou 12 Schéma tracé (schéma à tracer sur le
sol, en utilisant la plaque de montage
comme gabarit)
30. A l'aide des boulons d'ancrage, fixez l'équerre de stabilisation arrière sur le plancher en béton, en
traversant le faux plancher.
31. Replacez toutes les dalles de faux plancher qui ont été retirées lors de l'alignement et de la fixation
des boulons d'ancrage sur le plancher en béton.
32. Alignez l'armoire sur les équerres de stabilisation avant et arrière.
33. Insérez chaque assemblage de boulon dans un pied de nivellement.
34. Alignez les boulons de montage de l'armoire sur les trous taraudés de chaque équerre de
stabilisation. Appliquez trois ou quatre tours sur chaque boulon.
35. Serrez les vis de verrouillage sur chaque roulette.
36. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire
à niveau, serrez les contre-écrous sur la base de l'armoire.
Figure 10. Fixation de l'armoire au plancher
Installation de l'armoire 17
37. Si plusieurs armoires sont reliées en série (fixées l'une à l'autre), voir «Connexion de plusieurs
armoires à l'aide d'un kit de connexion multiarmoire», à la page 113. Sinon, vissez les quatre boulons
à 54-67 nm.
38. Si vous n'installez pas de volets sur l'armoire, replacez les panneaux d'habillage haut, gauche et
droit.
39. Connectez le système de distribution de l'alimentation conformément à la description (voir
«Raccordement du système de distribution de l'alimentation»).
40. Une fois l'armoire fixée au sol, passez à la section «Fixation de la prise de courant alternatif avant ou
arrière», à la page 19 pour fixer une prise électrique avant.
41. Si vous ne fixez pas de prise électrique avant et installez des volets d'armoire, voir «Fixation des
volets de l'armoire», à la page 101.
Résultats
Raccordement du système de distribution de l'alimentation
Vous pouvez utiliser un système de distribution de l'alimentation pour contrôler les charges électriques
des unités qui y sont branchés. La procédure de cette section permet de connecter ce système.
Pourquoi et quand exécuter cette tâche
Pour raccorder une unité d'alimentation, voir «Unité d'alimentation plus», à la page 127.
Vérification des prises de courant alternatif
Pour garantir une opération sûre et fiable, vous devrez vérifier les prises de courant alternatif. La
procédure de cette section permet d'effectuer cette tâche.
Avant de commencer
Avant de commencer, veillez à vous munir d'un multimètre pour vérifier la tension, et d'un testeur
d'impédance du sol homologué pour tester les résistances de mise à la terre.
Remarque : Utilisez uniquement un testeur d'impédance du sol homologué pour tester les résistances de
mise à la terre. N'utilisez pas de multimètre pour mesurer la résistance de mise à la terre.
Pourquoi et quand exécuter cette tâche
Avant de brancher l'armoire sur l'unité d'alimentation en courant alternatif, contrôlez les points suivants
sur l'unité d'alimentation en courant alternatif :
Procédure
1. Mettez hors tension le disjoncteur de la prise d'alimentation en courant alternatif à laquelle va être
raccordée l'armoire. Sur le commutateur du disjoncteur, placez l'étiquette S229-0237 demandant de ne
pas manipuler le disjoncteur.
Remarque : Lorsque vous faites les tests, le cache de la prise doit être installé dans sa position
habituelle.
2. Certaines prises se trouvent dans des logements métalliques. Pour ce type de prise, procédez comme
suit :
a. A l'aide d'un multimètre, vérifiez qu'il y a une tension inférieure à 1 volt entre le boîtier de la
prise et tout élément métallique du bâtiment relié à la terre, par exemple, la structure métallique
d'un faux plancher, une tuyauterie, un élément métallique de la structure du bâtiment, ou une
structure similaire.
18 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
b. A l'aide d'un multimètre, vérifiez qu'il y a une tension inférieure à 1 volt entre la borne terre de la
prise et un point relié à la terre dans le bâtiment.
Remarque : Si le boîtier ou le cache de la prise est peint, assurez-vous que la sonde de test
traverse la peinture et entre en contact avec le métal.
c. A l'aide d'un multimètre, testez la résistance entre la borne terre et le boîtier de la prise. Testez la
résistance entre la borne terre de la prise et la mise à la terre du bâtiment. Vous devez obtenir un
résultat inférieur à 1 ohm, ce qui indique la présence d'un conducteur de terre continu.
3. Si l'un des trois tests effectués à l'étape 2, à la page 18 n'est pas concluant, coupez le courant du
circuit de dérivation et modifiez les branchements. Ensuite, testez à nouveau la prise, une fois les
branchements corrigés.
4. A l'aide d'un testeur d'impédance du sol, testez la résistance infinie entre la borne terre de la prise et
chaque borne de phase. Cela permet de détecter un court-circuit à la terre ou une inversion de
branchement.
5. A l'aide d'un testeur d'impédance du sol, testez la résistance infinie entre les bornes de phase. Ce test
permet de vérifier l'absence de court-circuit.
6. Remettez le disjoncteur en position sous tension.
7. A l'aide d'un multimètre, vérifiez que la tension entre les phases est correcte. S'il n'y a pas de tension
au niveau du boîtier de la prise ou de la borne terre, vous pouvez toucher la prise en toute sécurité.
8. A l'aide d'un multimètre, vérifiez que la tension au niveau de la prise de courant alternatif est
correcte.
Résultats
Fixation de la prise de courant alternatif avant ou arrière
Utilisez cette procédure pour fixer une prise de courant alternatif.
Avertissement : Les plaques de montage de prise de courant alternatif avant et arrière sont fixées sur les
trous de montage permettant déjà de fixer les équerres de stabilisation au boîtier de l'armoire. Par
conséquent, vous devez retirer les équerres si vous fixez l'armoire au sol.
Installez les plaques de montage de la prise de courant alternatif uniquement une fois que l'armoire a été
fixée au sol et que les équerres de stabilisation ont été retirées.
Les éléments suivants sont installés sur le site du client :
v Plaques de montage de la prise de courant alternatif pour l'installation des prises électriques en
courant alternatif fournies par le client, à l'avant ou à l'arrière de l'armoire. La plaque de montage de la
prise de courant alternatif constitue l'emplacement de montage d'une prise électrique ca.
v Cosse de mise à la terre en laiton pour un raccordement de décharge d'électricité statique (ESD).
Remarque : Il appartient au client de fournir les prises et les cordons d'alimentation pour le
raccordement à l'unité d'alimentation. Il est également de sa responsabilité de brancher correctement la
prise de courant alternatif. Ces composants ne font pas partie des unités remplaçables sur site (FRU).
Installation des plaques de montage pour l'alimentation CA avec prises CA
Si vous choisissez d'installer des plaques de montage, vous pouvez suivre la procédure détaillée dans
cette section pour effectuer cette tâche. Cette section comporte également des illustrations des composants
matériel concernés et montre comment ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Si vous ne souhaitez pas installer de prises de courant alternatif sur l'armoire, voir «Installation des
plaques de montage pour l'alimentation CA sans prises CA», à la page 22.
Installation de l'armoire 19
Pour installer des prises de courant alternatif sur la plaque de montage de prise de courant alternatif
avant ou arrière, procédez comme suit :
Procédure
1. Déterminez le nombre de prises de courant alternatif que vous devez installer.
2. Vérifiez auprès de votre spécialiste que le nombre et l'emplacement des prises de courant alternatif à
installer sont corrects.
3. Retirez les obturateurs des plaques de montage des prises de courant alternatif correspondant au
nombre de prises de courant alternatif à installer.
4. Installez les prises de courant alternatif sur la plaque de montage de prise de courant alternatif.
5. Installez la cosse de mise à la terre sur la plaque de montage de prise de courant alternatif à l'aide
d'un seul écrou (voir la figure suivante).
6. Serrez l'écrou sur la cosse de mise à la terre.
7. Localisez le conducteur de terre en fourni avec la plaque de montage.
Remarque : Pour installer des prises de courant alternatif à l'avant ou à l'arrière de l'armoire,
procédez comme suit.
8. Placez la rondelle de blocage sur la cosse de mise à la terre de la plaque de montage de prise de
courant alternatif avant.
9. Placez la cosse située sur la plus longue extrémité du conducteur de terre sur la cosse de mise à la
terre.
10. Placez un écrou sur la cosse de mise à la terre et vissez-le fermement.
11. Placez la plaque de montage de prise de courant alternatif avant sur le cadre de l'armoire et insérez
complètement la cosse de mise à la terre dans les trous de montage de l'armoire.
12. Faites passer le câble sous l'armoire.
13. Placez la rondelle de blocage sur la cosse de mise à la terre de la plaque de montage de prise de
courant alternatif arrière.
14. Placez la cosse située sur la plus courte extrémité du conducteur de terre sur la cosse de mise à la
terre.
15. Placez un écrou sur la cosse de mise à la terre et vissez-le fermement.
20 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Cosse de conducteur de terre 7 Cosse de mise à la terre
2 Rondelle de blocage 8 Conducteur de terre (extrémité courte
du conducteur de terre)
3 Avant de l'armoire 9 Extrémité en Y du conducteur de
terre
4 Cordon d'alimentation de l'unité
d'alimentation
10 Cosse de mise à la terre (x2)
5 Plaque de montage 11 Extrémité longue du conducteur de
terre
6 Extrémité longue du conducteur de
terre
12 Cosse de mise à la terre (x2)
16. Placez la plaque de montage de prise de courant alternatif arrière sur le cadre de l'armoire et insérez
complètement la cosse de mise à la terre dans les trous de montage de l'armoire.
17. Placez les vis de la plaque de montage de prise de courant alternatif avant (vis de fixation des
stabilisateurs) dans la plaque de montage, en les insérant dans les trous de montage de l'armoire.
Serrez les vis fermement.
1 Avant ou arrière de l'armoire (selon
le cas)
4 Plaque de montage
2 Cordon d'alimentation de l'unité
d'alimentation
5 Clé Allen
3 Vis à tête demi-ronde 6 Extrémité longue du conducteur de
terre
18. Raccordez l'extrémité en Y du conducteur de terre au cadre de l'armoire, soit au centre du panneau
arrière de l'armoire, soit à la barre omnibus de mise à la terre, à l'arrière de l'armoire.
19. Placez les vis de la plaque de montage de prise de courant alternatif arrière (vis de fixation des
stabilisateurs) dans la plaque de montage, en les insérant dans les trous de montage de l'armoire.
Serrez les vis fermement.
Figure 11. Installation de la cosse de mise à la terre
Figure 12. Installation de la plaque de montage avant
Installation de l'armoire 21
Remarque : La barre omnibus peut se trouver en haut ou en bas de l'armoire.
1 Plaque de montage à barre omnibus 3 Vis à tête hexagonale (M5 x 20)
(quantité 2)
2 Rondelle de fixation (x2) 4 Barre omnibus de mise à la terre
Installation des plaques de montage pour l'alimentation CA sans prises CA
La procédure décrite dans cette section permet d'installer une plaque de montage sans prises.
Si vous ne souhaitez pas installer de prise de courant alternatif sur la plaque de montage de prise de
courant alternatif avant ou arrière, effectuez uniquement les étapes des sections «Fixation de l'armoire à
un plancher en béton sous un faux plancher», à la page 13 et «Raccordement du système de distribution
de l'alimentation», à la page 18.
Pour les plaques de montage de prise de courant alternatif avant ou arrière, voir «Installation des plaques
de montage pour l'alimentation CA avec prises CA», à la page 19.
Figure 13. Installation de la plaque de montage arrière
22 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Connexion d'une source d'alimentation CC
Certains modèles d'armoire (tel que le 14T/00) peuvent prendre en charge une configuration
d'alimentation en courant continu pour les systèmes qui le nécessitent. Utilisez la procédure décrite dans
cette section pour connecter une source d'alimentation en courant continu. Cette section comporte
également des illustrations des composants matériel concernés et montre comment ces composants sont
liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Remarque : Le client est responsable de la fourniture et du raccordement de la source d'alimentation -48
V cc et des câbles de retour électriques -48 V cc à partir de la source du client -48 V cc jusqu'aux barres
omnibus du panneau d'alimentation. Il est également responsable de la connexion du conducteur de terre
au cadre de l'armoire. Cette procédure fournit des informations sur l'accès au panneau d'alimentation.
Procédure
1. Retirez les six vis de fixation du capot supérieur du panneau d'alimentation en courant continu et
déposez le capot supérieur.
2. Si elles sont en place, retirez les quatre vis du capot du chemin de câble.
3. Retirez le capot du chemin de câble.
1 Vis de fixation du capot du chemin
de câble
5 Boîtier de protection
2 Capot du chemin de câble 6 Panneau d'alimentation
3 Vis de fixation du capot supérieur du
panneau d'alimentation
4 Capot supérieur du panneau
d'alimentation
4. Retirez le boîtier de protection de la barre omnibus -48 V cc du panneau d'alimentation.
Figure 14. Retrait du capot du chemin de câble
Installation de l'armoire 23
Avertissement : Le boîtier de protection de la barre omnibus doit être correctement réinstallé sur les
barres omnibus de retour -48 V cc afin d'empêcher d'éventuelles blessures pendant la maintenance du
panneau d'alimentation.
5. Vérifiez que les étapes suivantes sont exécutées pendant le branchement de la source d'alimentation
en courant continu.
a. A la source d'alimentation -48 V cc, mettez hors tension toutes les sources d'alimentation -48 V cc
qui seront raccordées au panneau d'alimentation.
b. Une fois les sources d'alimentation -48 V cc hors tension, assurez-vous qu'une étiquette, sur les
commutateurs ou les fusibles de la source d'alimentation (dispositif de blocage), indique que la
source d'alimentation a été mise hors tension volontairement.
Remarque : Vérifiez que les barres omnibus en cuivre ne subissent aucune oxydation.
c. S'il s'agit d'une installation sur un faux plancher et que vous travaillez à l'arrière de l'armoire,
faites passer les câbles le long du côté droit de l'armoire.
d. Vérifiez que le cordon d'alimentation -48 V cc externe est raccordé correctement à la barre
omnibus -48 V cc.
e. Vérifiez que le câble de retour -48 V cc externe est raccordé correctement et qu'il est installé sur la
barre omnibus de retour.
1 Cordon d'alimentation et câble de retour -48 V cc
2 Panneau d'alimentation
3 Avant de l'armoire
4 Cordon d'alimentation et câble de retour -48 V cc
Figure 15. Routage des cordons d'alimentation
24 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
1 Avant du panneau d'alimentation 6 (B) Retour (-) Cordon d'alimentation
2 (A) -48 V cc (-) Barre omnibus 7 (B) -48 V cc (-) Cordon d'alimentation
3 (A) -48 V cc (-) Cordon
d'alimentation
8 (B) Retour (-) Barre omnibus
4 (A) Retour (-) Barre omnibus 9 (B) -48 V cc (-) Barre omnibus
5 (A) Retour (-) Cordon d'alimentation
f. Pour installer une alarme d'état de l'alimentation, raccordez le câble d'alarme à la plaque à bornes
sur le capot arrière du panneau d'alimentation en courant continu.
Remarque : Veillez à éliminer l'oxydation sur les barres omnibus en cuivre.
g. Vérifiez que le conducteur de terre de l'alimentation est correctement placé et raccordez-le à la
barre de cuivre, au centre du panneau arrière inférieur ou supérieur de l'armoire.
h. Si l'armoire est posée sur un faux plancher, fixez les câbles de l'unité d'alimentation -48 V cc à
l'arrière de l'armoire à l'aide de sangles de maintien du câblage.
Figure 16. Barre omnibus de retour
Installation de l'armoire 25
1 Vue arrière d'une armoire (cc)
2 Cordon d'alimentation, câble de retour et conducteur de terre
3 Conducteur de terre (installation en haut ou en bas de l'armoire)
6. Réinstallez le boîtier de protection de la barre omnibus -48 V cc.
7. Réinstallez le capot supérieur sur le panneau d'alimentation en courant continu.
8. Réinstallez le capot du chemin de câble.
1 Capot du chemin de câble
Figure 17. Emplacements de câble
26 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2 Plaque à bornes (deux côtés)
3 Panneau d'alimentation
4 Avant de l'armoire
Figure 18. Réinstallation du capot du chemin de câble
Installation de l'armoire 27
28 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Installation en armoire d'une unité centrale ou d'une unité
d'extension
Il se peut que vous deviez installer une unité centrale ou une unité d'extension dans une armoire. Outre
les informations de sécurité, cette section fournit les procédures à suivre pour effectuer ces tâches. Cette
section comporte également des illustrations des composants matériel concernés et montre comment ces
composants sont liés les uns aux autres.
Les procédures suivantes expliquent comment installer des unités centrales ou des unités d'extension
dans une armoire. Vous pouvez exécuter cette tâche vous-même ou faire appel à un fournisseur de
services. Il vous imputera probablement les frais correspondants.
Installation du modèle 02E/4B ou 05F/6B en armoire
Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de cette section
permet d'effectuer cette tâche.
Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un
fournisseur de services. Ce dernier vous imputera probablement les frais correspondants.
Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à
l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes sont nécessaires pour soulever l'unité
centrale dans l'armoire.
Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et
mise en place. Si l'armoire n'est pas installée, consultez les instructions du «Installation de l'armoire», à la
page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de l'unité
d'extension dans l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
Pour installer le modèle 02E/4B ou 05F/6B dans une armoire, procédez comme suit :
ATTENTION :
L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte
des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Faites l'inventaire des composants. La figure ci-dessous présente les éléments dont vous avez besoin
pour installer le serveur dans l'armoire.
29
Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez
contacter :
v Revendeur
3. Repérez le kit de montage en armoire (A) et les guides (B) fournis avec l'unité centrale, comme
indiqué dans figure 19.
Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des
étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de
l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de
loquets doivent être placés à l'arrière de l'armoire.
4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Voir «Détermination de
l'emplacement».
5. Fixez le matériel de montage à l'armoire. Pour plus d'informations, voir Fixation du support de
montage à l'armoire.
Détermination de l'emplacement
Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette
procédure permet d'y parvenir.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension dans une armoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus
lourdes dans la partie inférieure de l'armoire.
Ce système fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure
44,45 mm de haut. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur.
Cette unité centrale mesure donc 177,8 mm (7 po) de haut et couvre 12 orifices de fixation de
l'armoire.
Figure 19. Inventaire des composants
30 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de
l'armoire.
4. Le cas échéant, retirez les volets avant et arrière de l'armoire.
Résultats
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
Figure 20. Retrait des panneaux obturateurs
Installation en armoire d'une unité centrale ou d'une unité d'extension 31
Procédure
1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas.
Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire.
Mémorisez l'emplacement EIA. Ce système fait quatre unités EIA (Electronic Industries Alliance) de
hauteur. Une unité EIA mesure 44,45 mm de haut. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Ce système mesure donc 177,8 mm (7 po) de haut et couvre 12 orifices
de fixation de l'armoire.
2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera
utilisée par votre unité d'extension. Mémorisez l'emplacement EIA. Utilisez du ruban adhésif, un
marqueur ou un crayon pour marquer l'orifice inférieur de cette unité EIA (A). Marquez l'armoire de
sorte que la marque reste visible au dos de l'armoire.
3. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde
marque (B). Marquez l'armoire de sorte que la marque soit également visible depuis l'arrière de
l'armoire (voir figure 21).
4. Répétez les deux étapes précédentes sur le côté gauche de l'armoire.
5. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui
sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice
inférieur de cette unité EIA (A).
6. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une deuxième
marque (B) (voir figure 21).
7. Commencez par l'orifice identifié par votre marque (B). Comptez un orifice et placez une troisième
marque (C) (voir figure 21).
8. Répétez les deux étapes précédentes sur le côté gauche de l'armoire.
Résultats
Fixation du support de montage 02E/4B ou 05F/6B dans l'armoire
Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer
cette tâche.
Figure 21. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire
32 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Pourquoi et quand exécuter cette tâche
Avertissement : Pour éviter d'entraîner une défaillance des guides, de vous blesser ou d'endommager
l'unité, vérifiez que vous disposez des guides et raccords adaptés à votre armoire. Si votre armoire a des
trous à rebord pour support carrés ou des trous à rebord pour support de filetage de vis, veillez à ce que
les guides et raccords correspondent aux trous à rebord pour support utilisés sur votre armoire.
N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si vous ne
disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre revendeur. De
même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant.
Pour installer le support de montage dans l'armoire, procédez comme suit :
Procédure
1. Chaque glissière est marquée d'un R (right, pour droite) ou d'un L (left, pour gauche). Sélectionnez
l'une des glissières et poussez vers le haut la languette mobile avant (1). Retirez le taquet avant (2)
pour faire glisser la glissière avant. Si une vis moletée est installée dans la glissière (3), retirez-la.
Remarque : Assurez-vous que le taquet mobile reste étendu et ne s'enclenche pas de nouveau.
2. Alignez les trois broches à l'arrière de la glissière avec les trois orifices dans l'unité EIA sélectionnée à
l'arrière de l'armoire. Poussez les glissières afin que les broches rentrent dans les orifices (1), et lâchez
la glissière (2) pour qu'elle s'enclenche.
Figure 22. Glissière et taquet mobile
Figure 23. Alignez les broches avec les orifices à l'arrière de l'armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 33
3. Tirez la glissière vers l'avant et insérez les deux broches (1) à l'avant de la glissière dans les deux
orifices inférieurs dans le U à l'avant de l'armoire. Faites descendre la glissière en place jusqu'à ce
qu'elle s'enclenche. Enfoncez le taquet avant (2) jusqu'au fond.
4. Répétez les trois étapes précédentes pour installer l'autre glissière dans l'armoire. Vérifiez que chaque
taquet avant est bien en place.
5. Tirez les glissières vers l'avant (1) jusqu'à ce qu'elles s'enclenchent, avec deux clics. Soulevez avec
précaution le serveur et inclinez-le pour le mettre en position au-dessus des glissières de sorte que les
têtes de clou arrière (2) sur le serveur s'alignent avec les logements arrière (3) sur les glissières. Faites
glisser le serveur vers le bas jusqu'à ce que les têtes de clou arrière s'enclenchent dans les deux
emplacements arrière, puis faites lentement descendre l'avant du serveur (4) jusqu'à ce que les autres
têtes de clou s'enclenchent dans les autres emplacements sur les glissières. Veillez à ce que le taquet
avant (5) passe au-dessus des têtes de clou.
6. Soulevez les taquets de déverrouillage bleus (1) sur les glissières et poussez le serveur (2) jusqu'au
fond de l'armoire jusqu'à ce qu'il s'enclenche.
Figure 24. Glissière avant et broches de l'armoire
Figure 25. Glissières étendues et têtes de clou du serveur alignées sur les emplacements des rails
34 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
7. Installez le bras de routage des câbles. Pour plus d'informations, voir «Installation du bras de routage
des câbles».
Installation du bras de routage des câbles
Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Pour installer le bras de routage des câbles, procédez comme suit :
Procédure
1. Installez le bras de routage des câbles sur le côté droit du serveur, lorsque vous êtes face à l'arrière.
Connectez une extrémité du bras de support (1) à la glissière sur laquelle vous comptez fixer le bras
de routage des câbles afin de pouvoir balancer l'autre extrémité du bras (2) vers l'armoire.
2. Installez le support d'arrêt de routage des câbles en forme de L (1) sur l'extrémité libre du bras de
support. Tournez le support (2) afin de le fixer au bras de support.
Figure 26. Taquets de déverrouillage et serveur
Figure 27. Connexion du bras de support
Installation en armoire d'une unité centrale ou d'une unité d'extension 35
3. Pour fixer l'autre côté du bras de support à l'arrière de la glissière, sortez la broche de (1), puis faites
glisser le support (2) dans la glissière.
4. Retirez la broche du support de fixation (1) et faites glisser le support de fixation (2) dans la glissière
sur laquelle vous installez le bras de routage des câbles. Poussez le support dans la glissière jusqu'à ce
que la broche s'enclenche.
Figure 28. Support d'arrêt de routage des câbles fixé au bras de support
Figure 29. Broche étendue et support installé dans la glissière
36 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
5. Placez le bras de routage des câbles sur le bras de support. Retirez la broche du bras de routage des
câbles (1), puis faites glisser la languette du bras de routage des câbles (2) dans l'encoche située à
l'intérieur de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. Retirez l'autre broche du
bras de routage des câbles (3), et faites glisser la languette de ce bras de routage des câbles dans
l'encoche (4) de la partie extérieure de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche.
6. Faites passer les cordons d'alimentation et autres câbles (notamment clavier, écran et souris, si
nécessaire) sur le bras de routage des câbles (1). Branchez tous les câbles à l'arrière du serveur, sauf le
cordon d'alimentation. Fixez le cordon d'alimentation et les câbles à l'aide des attache-câble ou des
attaches velcro.
Remarque : Laissez du mou à tous les câbles afin d'éviter tensions dans les câbles lorsque le bras de
routage des câbles bouge.
Figure 30. Broche du support de fixation étendue et support de fixation installé dans la glissière
Figure 31. Connexion du bras de routage des câbles
Installation en armoire d'une unité centrale ou d'une unité d'extension 37
7. Faites glisser le serveur dans l'armoire jusqu'à ce qu'il s'enclenche.
Résultats
Raccordement de câbles d'alimentation au système
Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour raccorder des câbles d'alimentation au système, procédez comme suit :
Procédure
1. En vous tenant face à l'arrière de l'unité centrale, insérez le cordon d'alimentation du système dans le
passe-câbles horizontale.
2. Branchez le cordon d'alimentation dans l'unité centrale.
Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre
cordon d'alimentation.
3. Branchez le cordon d'alimentation du système et les cordons d'alimentation des éventuelles
imprimantes ou périphériques d'affichage sur le secteur en CA (courant alternatif).
4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus.
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois
l'installation terminée, consultez Boîtiers et unités d'extension.
Installation du modèle 31E/2B en armoire
Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de cette section
permet d'effectuer cette tâche.
Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un
fournisseur de services. Ce dernier vous imputera probablement les frais correspondants.
Figure 32. Fixation et routage du cordon d'alimentation
38 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à
l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes sont nécessaires pour soulever l'unité
centrale dans l'armoire.
Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et
mise en place. Si l'armoire n'est pas installée, consultez les instructions du «Installation de l'armoire», à la
page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de l'unité
d'extension dans l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
Pour installer le modèle 31E/2B en armoire, procédez comme suit :
ATTENTION :
L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte
des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Faites l'inventaire des composants. La figure ci-dessous présente les éléments dont vous avez besoin
pour installer le serveur dans l'armoire.
Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez
contacter :
v Revendeur
3. Repérez le kit de montage en armoire (A) et les guides (B) fournis avec l'unité centrale, comme
indiqué dans figure 33.
Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des
étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de
l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de
loquets doivent être placés à l'arrière de l'armoire.
Figure 33. Inventaire des composants
Installation en armoire d'une unité centrale ou d'une unité d'extension 39
4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Voir «Détermination de
l'emplacement».
5. Fixez le matériel de montage à l'armoire. Pour plus d'informations, voir Fixation du support de
montage à l'armoire.
Détermination de l'emplacement
Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette
procédure permet d'y parvenir.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension dans une armoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus
lourdes dans la partie inférieure de l'armoire.
Ce système fait deux unités EIA (Electronic Industries Alliance) de haut. Une unité EIA mesure 44,45
mm de haut. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur. Cette
unité centrale mesure donc 88,9 mm (3,5 po) de haut et couvre six orifices de fixation de l'armoire.
3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de
l'armoire.
40 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Le cas échéant, retirez les volets avant et arrière de l'armoire.
Résultats
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
Procédure
1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas.
Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire.
Figure 34. Retrait des panneaux obturateurs
Installation en armoire d'une unité centrale ou d'une unité d'extension 41
Mémorisez l'emplacement EIA. Ce système fait deux unités EIA (Electronic Industries Alliance) de
haut. Une unité EIA mesure 44,45 mm de haut. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Ce système mesure donc 88,9 mm (3,5 po) de haut et couvre six orifices
de fixation de l'armoire.
2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera
utilisée par votre unité d'extension. Mémorisez l'emplacement EIA. Utilisez du ruban adhésif, un
marqueur ou un crayon pour marquer l'orifice inférieur de cette unité EIA (A). Marquez l'armoire de
sorte que la marque reste visible au dos de l'armoire.
3. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde
marque (B). Marquez l'armoire de sorte que la marque soit également visible depuis l'arrière de
l'armoire (voir figure 35).
4. Répétez les deux étapes précédentes sur le côté gauche de l'armoire.
5. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui
sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice
inférieur de cette unité EIA (A).
6. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde
marque (B) (voir figure 35).
7. Commencez par l'orifice identifié par votre marque (B). Comptez un orifice et placez une troisième
marque (C) (voir figure 35).
8. Répétez les deux étapes précédentes sur le côté gauche de l'armoire.
Résultats
Fixation du matériel de montage 31E/2B à l'armoire
Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer
cette tâche.
Pourquoi et quand exécuter cette tâche
Avertissement : Pour éviter d'entraîner une défaillance des guides, de vous blesser ou d'endommager
l'unité, vérifiez que vous disposez des guides et raccords adaptés à votre armoire. Si votre armoire a des
trous à rebord pour support carrés ou des trous à rebord pour support de filetage de vis, veillez à ce que
les guides et raccords correspondent aux trous à rebord pour support utilisés sur votre armoire.
N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si vous ne
disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre revendeur. De
même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant.
Pour installer le support de montage dans l'armoire, procédez comme suit :
Procédure
1. Chaque glissière est marquée d'un R (right, pour droite) ou d'un L (left, pour gauche). Sélectionnez
l'une des glissières et poussez vers le haut la languette mobile avant (1). Retirez le taquet avant (2)
Figure 35. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire
42 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
pour faire glisser la glissière avant. Si une vis moletée est installée dans la glissière (3), retirez-la.
Remarque : Assurez-vous que le taquet mobile reste étendu et ne s'enclenche pas de nouveau.
2. Alignez les trois broches à l'arrière de la glissière avec les trois orifices dans l'unité EIA sélectionnée à
l'arrière de l'armoire. Poussez les glissières afin que les broches rentrent dans les orifices (1), et lâchez
la glissière (2) pour qu'elle s'enclenche.
3. Tirez la glissière vers l'avant et insérez les deux broches (1) à l'avant de la glissière dans les deux
orifices inférieurs dans le U à l'avant de l'armoire. Faites descendre la glissière en place jusqu'à ce
qu'elle s'enclenche. Enfoncez le taquet avant (2) jusqu'au fond.
Figure 36. Glissière et taquet mobile
Figure 37. Alignez les broches avec les orifices à l'arrière de l'armoire
Figure 38. Glissière avant et broches de l'armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 43
4. Répétez les trois étapes précédentes pour installer l'autre glissière dans l'armoire. Vérifiez que chaque
taquet avant est bien en place.
5. Tirez les glissières vers l'avant (1) jusqu'à ce qu'elles s'enclenchent, avec deux clics. Soulevez avec
précaution le serveur et inclinez-le pour le mettre en position au-dessus des glissières de sorte que les
têtes de clou arrière (2) sur le serveur s'alignent avec les logements arrière (3) sur les glissières. Faites
glisser le serveur vers le bas jusqu'à ce que les têtes de clou arrière s'enclenchent dans les deux
emplacements arrière, puis faites lentement descendre l'avant du serveur (4) jusqu'à ce que les autres
têtes de clou s'enclenchent dans les autres emplacements sur les glissières. Veillez à ce que le taquet
avant (5) passe au-dessus des têtes de clou.
6. Soulevez les taquets de déverrouillage bleus (1) sur les glissières et poussez le serveur (2) jusqu'au
fond de l'armoire jusqu'à ce qu'il s'enclenche.
7. Installez le bras de routage des câbles. Pour plus d'informations, voir «Installation du bras de routage
des câbles».
Installation du bras de routage des câbles
Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette
tâche.
Figure 39. Glissières étendues et têtes de clou du serveur alignées sur les emplacements des rails
Figure 40. Taquets de déverrouillage et serveur
44 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Pourquoi et quand exécuter cette tâche
Pour installer le bras de routage des câbles, procédez comme suit :
Procédure
1. Installez le bras de routage des câbles sur le côté droit du serveur, lorsque vous êtes face à l'arrière.
Connectez une extrémité du bras de support (1) à la glissière sur laquelle vous comptez fixer le bras
de routage des câbles afin de pouvoir balancer l'autre extrémité du bras (2) vers l'armoire.
2. Installez le support d'arrêt de routage des câbles en forme de L (1) sur l'extrémité libre du bras de
support. Tournez le support (2) afin de le fixer au bras de support.
3. Pour fixer l'autre côté du bras de support à l'arrière de la glissière, sortez la broche de (1), puis faites
glisser le support (2) dans la glissière.
Figure 41. Connexion du bras de support
Figure 42. Support d'arrêt de routage des câbles fixé au bras de support
Installation en armoire d'une unité centrale ou d'une unité d'extension 45
4. Retirez la broche du support de fixation (1) et faites glisser le support de fixation (2) dans la glissière
sur laquelle vous installez le bras de routage des câbles. Poussez le support dans la glissière jusqu'à ce
que la broche s'enclenche.
5. Placez le bras de routage des câbles sur le bras de support. Retirez la broche du bras de routage des
câbles (1), puis faites glisser la languette du bras de routage des câbles (2) dans l'encoche située à
l'intérieur de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. Retirez l'autre broche du
bras de routage des câbles (3), et faites glisser la languette de ce bras de routage des câbles dans
l'encoche (4) de la partie extérieure de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche.
Figure 43. Broche étendue et support installé dans la glissière
Figure 44. Broche du support de fixation étendue et support de fixation installé dans la glissière
46 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
6. Faites passer les cordons d'alimentation et autres câbles (notamment clavier, écran et souris, si
nécessaire) sur le bras de routage des câbles (1). Branchez tous les câbles à l'arrière du serveur, sauf le
cordon d'alimentation. Fixez le cordon d'alimentation et les câbles à l'aide des attache-câble ou des
attaches velcro.
Remarque : Laissez du mou à tous les câbles afin d'éviter tensions dans les câbles lorsque le bras de
routage des câbles bouge.
7. Faites glisser le serveur dans l'armoire jusqu'à ce qu'il s'enclenche.
Résultats
Raccordement de câbles d'alimentation au système
Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour raccorder des câbles d'alimentation au système, procédez comme suit :
Figure 45. Connexion du bras de routage des câbles
Figure 46. Fixation et routage du cordon d'alimentation
Installation en armoire d'une unité centrale ou d'une unité d'extension 47
Procédure
1. En vous tenant face à l'arrière de l'unité centrale, insérez le cordon d'alimentation du système dans le
passe-câbles horizontale.
2. Branchez le cordon d'alimentation dans l'unité centrale.
Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre
cordon d'alimentation.
3. Branchez le cordon d'alimentation du système et les cordons d'alimentation des éventuelles
imprimantes ou périphériques d'affichage sur le secteur en CA (courant alternatif).
4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus.
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois
l'installation terminée, consultez Boîtiers et unités d'extension.
Installation du modèle 33E/8B ou 36E/8C en armoire
Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de la présente section
vous permet d'effectuer cette tâche. Outre les informations de sécurité permettant d'effectuer la procédure
de façon fiable et sans danger, cette section comporte également des illustrations des composants matériel
concernés et montre comment ces composants sont liés les uns aux autres.
Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un
fournisseur de services. Ce dernier vous imputera probablement les frais correspondants.
Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à
l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes seront nécessaires pour soulever
l'unité centrale dans l'armoire.
Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et
mise en place. Si l'armoire n'est pas installée, consultez les instructions de la rubrique «Installation de
l'armoire», à la page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de
l'unité d'extension dans l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
Pour installer le modèle 33E/8B ou 36E/8C dans une armoire, procédez comme suit :
ATTENTION :
L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte
des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Si ce n'est déjà fait, reportez-vous à la figure 47, à la page 49 et terminez l'inventaire des composants.
48 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
v A Kit de montage en armoire
– Quatre grandes vis de fixation
– Deux vis de taille moyenne
– Deux écrous clip
– Huit petites vis de fixation
– Deux étriers de fixation de glissière
– Deux passe-câbles
v B glissières d'armoire droite et gauche avec des supports d'armoire
v C Fixations de bras de routage des câbles
v D Deux équerres de bras de routage des câbles (une pour l'installation à gauche, une pour
l'installation à droite)
v E Bras de routage des câbles
v F Deux taquets d'armoire
Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez
contacter :
v Revendeur
3. Repérez le kit de montage en armoire (A), et les guides (B) fournis avec l'unité centrale, comme
indiqué dans la figure 47.
Figure 47. Kit de montage en armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 49
Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des
étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de
l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de
loquets doivent être placés à l'arrière de l'armoire.
4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Pour plus d'informations,
voir «Détermination de l'emplacement».
5. Ensuite, vous devez fixer le support de montage à l'armoire. Pour plus d'informations, voir Fixation
du support de montage à l'armoire.
Détermination de l'emplacement
Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette
procédure permet d'y parvenir.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension dans une armoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus
lourdes dans la partie inférieure de l'armoire.
Ce système fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure
44,45 mm de hauteur. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur.
Cette unité centrale mesure donc 177,8 mm de hauteur et correspond à 12 trous de vis sur l'armoire.
3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de
l'armoire.
50 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Le cas échéant, retirez les volets avant et arrière de l'armoire.
Résultats
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'y parvenir.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
Procédure
1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas.
Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire.
Mémorisez l'emplacement EIA. Ce système fait quatre unités EIA (Electronic Industries Alliance) de
Figure 48. Retrait des panneaux obturateurs
Installation en armoire d'une unité centrale ou d'une unité d'extension 51
hauteur. Une unité EIA mesure 44,45 mm de hauteur. L'armoire contient trois trous de montage par
unité EIA sur la hauteur. Cette unité fait donc 177,8 mm de haut et correspond à 12 trous de vis sur
l'armoire.
2. Mettez-vous face à l'avant de l'armoire et opérez à partir du côté droit. Utilisez du ruban adhésif, un
marqueur ou un crayon pour marquer les trous. Marquez le trou médian A de l'unité EIA se trouvant
le plus bas parmi les quatre utilisés pour cette unité centrale. Vous devez identifier le trou marqué
depuis l'avant et l'arrière de l'armoire.
3. Marquez le trou du milieu de l'unité EIA inférieure, sur le côté gauche de l'armoire (A) comme
indiqué sur la figure 49.
4. Placez-vous à l'arrière de l'armoire. Sur le côté droit, localisez l'unité EIA qui correspond à l'unité EIA
inférieure indiquée sur le devant de l'armoire.
5. Marquez l'orifice du milieu de cette unité EIA inférieure (B) comme indiqué sur la figure 49.
6. Marquez le trou inférieur de l'unité EIA suivante (située plus haut), (C) comme indiqué sur la
figure 49.
7. Marquez les trous correspondants sur le côté gauche de l'armoire.
Résultats
Fixation du support de montage 33E/8B ou 36E/8C dans l'armoire
Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer
cette tâche. Cette section fournit des informations permettant d'effectuer les opérations de manière fiable
et sans danger. Cette section comporte également des illustrations des composants matériel concernés et
montre comment ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Avertissement : Pour éviter un mauvais montage des glissières, ainsi que de vous blesser ou
d'endommager l'unité, vérifiez que vous disposez des glissières et raccords adaptés à votre armoire. Si
votre armoire a des trous à rebord pour support carrés ou des trous à rebord pour support de filetage de
vis, veillez à ce que les guides et raccords correspondent aux trous à rebord pour support utilisés sur
votre armoire. N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si
vous ne disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre
revendeur. De même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant.
Pour installer le support de montage dans l'armoire, procédez comme suit :
Figure 49. Marquage des trous sur l'avant et l'arrière du cadre de l'armoire
52 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Procédure
1. Avec la bonne glissière, tournez la patte de déverrouillage du bloc loquet (A) dans la figure 50. Faites
glisser la patte de déverrouillage (B) en position escamotée et verrouillez le bloc loquet. Les goupilles
d'alignement arrière doivent être entièrement escamotées.
2. Une fois les goupilles d'alignement escamotées, introduisez la goupille d'alignement avant de la
glissière de droite dans l'orifice du rebord avant de l'armoire, comme indiqué sur la figure 50. Cet
orifice est identifié par la marque précédemment apposée. Demandez à une deuxième personne de
maintenir fermement la glissière en position.
3. Alignez les goupilles d'alignement arrière de la glissière sur les orifices à l'arrière de l'armoire. Les
orifices sont identifiés par les marques situées à l'arrière de l'armoire. Veillez à ce que les guides
soient au même niveau.
4. Faites coulisser la patte de déverrouillage, (B), pour étendre les deux goupilles d'alignement arrière
dans l'arrière de l'armoire. Veillez à ce que les goupilles soient passées dans les trous correspondants
à l'arrière du cadre de l'armoire.
5. Placé à l'arrière de l'armoire, comme illustré sur figure 51, à la page 54, installez l'étrier de fixation
de la glissière sur les goupilles d'alignement. Puis introduisez une grande vis de retenue de glissière
dans l'orifice situé entre les deux goupilles d'alignement arrière. Serrez partiellement la vis. Ne serrez
pas complètement à cette étape.
Remarque : Les étriers sont étiquetés LEFT (GAUCHE) et RIGHT (DROIT).
Figure 50. Broche d'alignement, vis de fixation et taquet de verrouillage de la glissière avant
Installation en armoire d'une unité centrale ou d'une unité d'extension 53
6. Répétez les étapes 1, à la page 53 à 5, à la page 53 pour le guide gauche.
7. Localisez les taquets de verrouillage (F) sur figure 47, à la page 49. Pour installer les taquets de
verrouillage de l'armoire, procédez comme suit :
a. Faites pivoter le haut du support vers l'extérieur (voir la figure 52, à la page 55).
Figure 51. Installation des étriers de fixation de glissière
54 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
b. Alignez le support avec le logement sur le côté du serveur (voir la figure 53, à la page 56).
Figure 52. Rotation du haut du support vers l'extérieur
Installation en armoire d'une unité centrale ou d'une unité d'extension 55
c. Déplacez le support vers le haut afin d'engager le taquet (voir la figure 54).
d. Faites pivoter le haut du support vers l'arrière de façon à positionner la languette supérieure sur
le haut du système. Puis, installez les vis comme indiqué sur figure 55, à la page 57.
Figure 53. Alignez le support avec son logement sur le côté de l'unité.
Figure 54. Déplacement du support vers le haut afin d'engager le taquet
56 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
8. Etirez les glissières intérieures en les tirant vers l'extérieur. Elles doivent être tirées hors du cadre
comme les glissières illustrées dans figure 56.
9. Avec l'aide de trois personnes, saisissez les deux poignées situées de chaque côté du système et
placez-le sur la glissière intérieure. Alignez la languette A sur la fente B comme indiqué par la ligne
verticale tiretée sur figure 56.
10. Appuyez simultanément sur les taquets de sécurité et enfoncez le système dans l'armoire jusqu'à ce
qu'il soit solidement en place, comme indiqué dans la figure 57, à la page 58.
Figure 55. Faites pivoter le haut du support vers l'arrière de façon à positionner la languette supérieure sur le haut du
système
Figure 56. Placez le système sur les glissières.
Installation en armoire d'une unité centrale ou d'une unité d'extension 57
11. Installez et positionnez les écrous clip avant d'insérer les vis. Utilisez les vis de fixation pour fixer le
système à l'armoire, comme indiqué dans la figure 58, à la page 59.
Figure 57. Guide intérieur étendu
58 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
12. Une fois les deux glissières installées, vérifiez que toutes les vis de retenue des glissières ne sont
serrées qu'à la main. Les guides doivent être au même niveau d'arrière en avant et de gauche à
droite.
13. Ensuite, vous devez fixer le bras de routage des câbles. Pour plus d'informations, voir «Installation
du bras de routage des câbles».
Installation du bras de routage des câbles
Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Pour installer le bras de routage des câbles, procédez comme suit :
Procédure
1. Déterminez le côté de l'armoire sur lequel vous voulez installer le bras de routage des câbles.
2. Placez le support de bras approprié (le gauche ou le droit) au bras de routage des câbles.
3. Utilisez l'ergot (F) pour fixer le bras de routage des câbles (E) au châssis de l'armoire (D) comme
illustré sur figure 59, à la page 60.
Figure 58. Fixez fermement le système à l'armoire par le biais des taquets de l'armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 59
Conseil : Si l'accès à l'arrière de l'armoire est rendu difficile par le grand nombre de câbles présents, il
peut être plus aisé d'enlever la petite charnière d'articulation du bras de routage des câbles et de la
fixer en premier. Vous pouvez ensuite fixer la section restante du bras de routage des câbles à la
charnière de connexion.
4. Utilisez le deuxième ergot F pour fixer l'autre extrémité du bras de routage des câbles au rebord C
qui est fixé sur la partie coulissante du guide gauche du système B (voir figure 59).
Résultats
Raccordement de câbles d'alimentation au système
Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour raccorder des câbles d'alimentation au système, procédez comme suit :
Procédure
1. En vous tenant face à l'arrière de l'unité centrale, notez l'emplacement de la languette inférieure du
passe-câbles. Tournez le passe-câbles dans le sens inverse des aiguilles d'une montre pour le mettre en
position horizontale et insérez sa languette inférieure dans l'orifice inférieur de la glissière. Puis,
tournez le passe-câbles dans le sens des aiguilles d'une montre jusqu'à le remettre en position
verticale, comme indiqué dans figure 60, à la page 61. Faites passer le cordon d'alimentation du
système dans le passe-câbles comme indiqué dans cette figure. Puis appuyez sur le dessus du
passe-câbles et insérez la languette supérieure dans l'orifice supérieur.
Remarque : La glissière comporte deux séries d'orifices dans lesquels le passe-câbles peut être installé.
Ces séries d'orifices sont séparés d'environ 25,4 mm. Installez le passe-câbles dans la série d'orifices
placée le plus en arrière.
Figure 59. Fixation du bras de routage des câbles
60 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2. Branchez le cordon d'alimentation dans l'unité centrale.
Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre
cordon d'alimentation.
3. Brancher le cordon d'alimentation du système et les cordons d'alimentation des éventuels imprimantes
ou périphériques d'affichage sur le secteur en CA (courant alternatif).
4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus.
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois
l'installation terminée, consultez Boîtiers et unités d'extension.
Installation en armoire du modèle 58/02 ou 58/77
Cette procédure peut être utilisée pour installer le boîtier d'unité de disque 58/02 ou 58/77 dans une
armoire. La procédure inclut des informations destinées à promouvoir une opération sécurisée et fiable, et
fournit des illustrations sur la façon dont les composants matériels sont liés les uns aux autres.
v Cette procédure suppose que vous installez l'unité d'extension ou 14G/30 dans une armoire existante.
Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de l'armoire», à la
page 5 avant d'effectuer cette procédure pour installer l'unité d'extension dans l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
v Pour des instructions concernant l'installation, le retrait et le remplacement d'une unité de disque, d'un
fond de pânier d'unité de disque ou dun boîtier, voir Unités de disque.
Figure 60. Routage du cordon d'alimentation de l'unité centrale
Installation en armoire d'une unité centrale ou d'une unité d'extension 61
Pour installer le modèle 58/02ou 58/77 dans une armoire, procédez comme suit :
ATTENTION :
L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte
des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
Avertissement : L'installation des glissières dans l'armoire est une opération complexe. Pour une
installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas,
vous risquez de vous blesser ou d'endommager l'unité d'extension. Notez également que pour 58/02 et
58/77, trois personnes sont nécessaires pour soulever le système en toute sécurité. Le non-respect de cette
recommandation peut provoquer des blessures.
1. Reportez-vous à la liste des éléments du kit fournie dans le kit d'installation en armoire, et effectuez
un inventaire des composants, si ce n'est déjà fait.
2. Repérez le kit matériel de montage en armoire et les guides fournis avec l'unité d'extension.
3. Déterminez l'emplacement d'installation de l'unité d'extension dans l'armoire. Pour plus
d'informations, voir «Détermination de l'emplacement». Cette unité d'extension mesure quatre unités
EIA (Electronic Industries Alliance) de haut. Une unité EIA mesure 4,45 mm de hauteur. L'armoire
comporte trois orifices de montage pour chaque unité EIA de hauteur.
4. Marquez l'emplacement. Pour plus d'informations, voir «Marquage de l'emplacement», à la page 63.
Détermination de l'emplacement
Il se peut que vous deviez déterminer l'emplacement de l'unité d'extension dans l'armoire. Cette
procédure permet d'y parvenir.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension dans une armoire, procédez comme suit :
1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du
système. Lors de la préparation, organisez les parties les plus grandes et les plus lourdes dans la
partie inférieure de l'armoire et prévoyez d'installer les éléments de la partie inférieure de l'armoire en
premier. Enregistrez les emplacements EIA dans votre plan.
Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur.
Une unité EIA équivaut à 44,50 mm (1,75 po) de haut. L'armoire contient trois trous de montage par
unité EIA sur la hauteur. Cette unité centrale mesure donc 177,8 mm (7 po) de haut et couvre 12
orifices de fixation de l'armoire.
2. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de
l'armoire où vous allez placer l'unité d'extension.
62 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
3. Le cas échéant, retirez les volets avant et arrière de l'armoire.
4. Marquez l'emplacement où vous prévoyez d'installer le boîtier d'unité de disque.
a. Si vous possédez un gabarit de montage, utilisez-le pour marquer l'emplacement.
b. Si vous ne possédez pas de gabarit de montage, voir «Marquage de l'emplacement».
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation. La procédure de cette section permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et la position de mise en place des écrous clip sur une
armoire, procédez comme suit :
1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du
système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit :
a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire.
Figure 61. Retrait des panneaux obturateurs
Installation en armoire d'une unité centrale ou d'une unité d'extension 63
b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire.
c. Enregistrez les emplacements EIA dans votre plan.
Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur.
Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Cette unité centrale mesure donc 7 po (177.8 mm) de haut et couvre 12
orifices de fixation de l'armoire.
2. Mettez-vous face à l'avant de l'armoire et travillez du côté gauche. Notez l'unité EIA la plus élevée et
la plus basse qui vous serviront pour l'unité d'extension. Utilisez du ruban adhésif, un marqueur ou
un crayon pour marquer le trou de vis inférieur (A)de la plus haute unité EIA. Puis marquez le trou
de vis supérieur (B) de l'unité EIA la plus basse. Marquez l'armoire de sorte que la marque reste
visible au dos de l'armoire.
3. Comptez deux orifices de vis depuis l'endroit où vous avez placé la marque de l'orifice de vis (B) à la
dernière étape et placez une autre marque à côté de cet orifice de vis (B). Marquez l'armoire de sorte
que la marque reste visible au dos de l'armoire. Vous avez maintenant deux marques (B) sur
l'armoire, avec un trou de montage entre ces marques.
4. Répétez les deux dernières étapes pour placer trois marques sur les orifices de montage
correspondants sur la partie avant droite de l'armoire.
5. Placez-vous à l'arrière de l'armoire. Sur le côté gauche, localisez l'unité EIA qui correspond à l'unité
EIA inférieure indiquée sur le devant de l'armoire.
6. Placez une marque au niveau de l'orifice supérieur (C) de l'unité EIA la plus basse. Marquez l'armoire
de sorte que la marque reste visible au dos de l'armoire.
7. Comptez deux orifices de vis depuis l'endroit où vous avez placé la marque à la dernière étape et
placez une autre marque à côté de cet orifice de vis (C).
8. Répétez les deux dernières étapes pour placer deux marques sur les orifices de montage
correspondants sur la partie arrière droite de l'armoire.
Que faire ensuite
Fixation du support de montage dans l'armoire
Il se peut que vous deviez fixer le support de montage dans l'armoire. La procédure de cette section
permet d'effectuer cette tâche. Cette section intègre également des informations de sécurité permettant
d'effectuer les opérations de façon fiable et sans danger, ainsi que des illustrations des composants
matériels.
Figure 62. Marquage des emplacements d'installation
64 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Pourquoi et quand exécuter cette tâche
ATTENTION :
L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières,
respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou
d'endommager l'unité centrale.
Pour installer le support de montage dans l'armoire, procédez comme suit :
1. Faites pivoter vers le bas les fixations avant et arrière (C) des glissières gauche et droite.
2. Sur chaque glissière, ôtez la bande maintenant le ressort (B) sur la glissière. Enroulez l'extrémité du
ressort autour de la vis de butée de la glissière.
3. Commencez par l'avant de l'armoire. Tirez la glissière droite et introduisez les ergots (D) dans les
emplacements marqués de l'armoire de sorte que la glissière soit placée recto vers le haut. Faites
pivoter vers le haut la fixation avant (C) pour bloquer la glissière.
Remarques :
v Si votre armoire possède des trous de vis carrés, retirez les ergots (D) de la glissière. Installez les
ergots de plus grande taille fournis dans le kit d'installation de l'armoire.
v Bien que l'installation de la glissière puisse être effectuée par une seule personne, la procédure
sera facilitée si une personne est positionnée à l'avant de l'armoire et une autre à l'arrière.
4. Tirez la glissière de gauche et introduisez les ergots (D) dans les emplacements marqués. Faites
pivoter vers le haut la fixation avant (C) pour bloquer la glissière.
5. A l'avant de l'armoire, étendez la glissière de droite et insérez ses ergots (D) dans les encoches
indiquées.Faites pivoter vers le haut la fixation arrière (C) pour bloquer la glissière.
6. Tirez la glissière de gauche et introduisez les ergots (D) dans les emplacements marqués. Faites
pivoter vers le haut la fixation arrière (C) pour bloquer la glissière.
7. Fixez la glissière de gauche à l'arrière de l'armoire à l'aide de deux vis M5 (E), une au-dessous de
chaque ergot de la glissière (D).
Figure 63. Installation des glissières
Installation en armoire d'une unité centrale ou d'une unité d'extension 65
8. A l'aide de deux vis M5 (E), une en-dessous de chaque ergot de glissière (D), fixez la glissière de
droite à l'arrière de l'armoire.
9. Placez-vous à l'avant de l'armoire. A l'aide d'une vis M5 (E), en-dessous de la ergot de la glissière
supérieure, fixez la glissière de gauche à l'avant de l'armoire.
10. A l'aide d'une vis M5 (E), en-dessous de l'ergot de la glissière supérieure (D), fixez la glissière de
droite à l'avant de l'armoire.
11. Repérez les marques précédemment effectuées sur les bandes EIA (Electronic Industries Alliance)
avant de l'armoire. Chaque côté comporte une marque. Insérez un écrou clip (F) à la marque du côté
droit de l'armoire (voir figure 65, à la page 67).
12. Insérez un écrou clip (F) à la marque du côté gauche de l'armoire (voir figure 65, à la page 67).
13. Sur l'arrière de l'armoire, repérez les marques faites précédemment sur les bandes EIA (Electronic
Industries Alliance) arrière. Chaque côté comporte deux marques. Insérez un écrou clip (F) à chaque
marque du côté droit de l'armoire (voir figure 65, à la page 67).
Figure 64. Fixation des glissières
66 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
14. A l'aide de deux vis M5 (H), fixez le support (G) sur le côté droit de l'armoire.
15. Insérez un écrou clip (F) à chaque marque du côté gauche de l'armoire (voir figure 65).
16. A l'aide de deux vis M5 (H), fixez le support (G) sur le côté gauche de l'armoire.
Résultats
Installation du modèle 58/02 ou 58/77 sur les glissières
Il se peut que vous deviez installer l'unité d'extension sur les glissières. La procédure de cette section
permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte également des
illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux
autres.
Avant de commencer
1. Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus
et que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière
ne bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations,
voir «Consignes de sécurité relatives aux armoires», à la page 102.
Pourquoi et quand exécuter cette tâche
Pour installer l'unité d'extension sur ses glissières, procédez comme suit :
1. Pour ôter le capot de transport situé à l'arrière de l'unité d'extension, retirez les quatre vis moletées
servant à l'assujettir. Retirez le capot.
Remarque : Deux vis de serrage à main sont situées dans la partie supérieure du capot et une sur
chacun des côtés.
2. Si le panneau frontal est monté à l'avant du système, enlevez-le en procédant comme suit :
a. Retirez les deux vis de fixations M5 (A) comme indiqué sur la figure 66, à la page 68.
Figure 65. Installation du support de fixation
Installation en armoire d'une unité centrale ou d'une unité d'extension 67
b. Retirez le support des informations de maintenance de son emplacement situé dans le panneau
frontal.
c. A l'aide des deux points tactiles bleus situés de chaque côté de l'unité d'extension, tirez sur le
panneau frontal pour le retirer de l'avant de l'unité, en le maintenant bien droit.
d. Retirez les deux écrous clip servant à assujettir le panneau frontal à l'unité d'extension par le
rebord EIA.
3. Avec l'aide de trois autres personnes, saisissez les quatre poignées (I), soulevez l'unité d'extension et
positionnez-la sur l'avant des glissières (voir la figure suivante).
Avertissement : Trois personnes sont nécessaires pour effectuer cette opération en toute sécurité. Le
non-respect de cette recommandation peut provoquer des blessures.
Figure 66. Retrait du support des informations de maintenance et du panneau frontal.
Figure 67. Installation en armoire de l'unité d'extension
68 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Faites glisser l'unité d'extension dans l'armoire.
5. Installez les deux vis de fixation M5 (J) en les faisant passer à travers l'unité d'extension et les insérant
dans les écrous clip sur les glissières gauche et droite.
6. Installez le panneau frontal (L) sur l'unité d'extension. Maintenez le panneau en place en appuyant
dessus et fixez-le au moyen de deux vis M5 (M) sur les glissières gauche et droite.
7. Insérez la plaquette d'informations de maintenance (N) dans son emplacement situé dans le panneau
central.
8. A l'arrière de l'unité, installez les quatre vis de fixation (K) en les faisant passer à travers l'unité
d'extension et les supports de fixation situés sur les glissières gauche et droite.
Figure 68. Installation du panneau frontal et du support des informations de maintenance.
Figure 69. Installation de quatre vis à l'arrière de l'unité
Installation en armoire d'une unité centrale ou d'une unité d'extension 69
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois
l'installation terminée, consultez Boîtiers et unités d'extension.
Installation du modèle 58/86 en armoire
Cette procédure peut être utilisée pour installer le boîtier d'unité de disque 58/86 dans une armoire. La
procédure inclut des informations destinées à promouvoir une opération sécurisée et fiable, et fournit des
illustrations sur la façon dont les composants matériels sont liés les uns aux autres.
Avant de commencer
v Cette procédure suppose que vous installez le boîtier d'unité de disque 58/86 dans une armoire
existante. Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de
l'armoire», à la page 5 avant d'effectuer cette procédure pour installer le boîtier d'unité de disque dans
l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
Pourquoi et quand exécuter cette tâche
L'installation du boîtier d'unité de disque dans une armoire est une tâche client. Vous pouvez exécuter
vous-même cette tâche ou faire appel à un fournisseur de services. Il vous imputera probablement les
frais correspondants.
Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer les glissières à
l'armoire, l'une devant l'armoire et l'autre à l'arrière.
Pour installer le modèle 58/86 en armoire, procédez comme suit :
ATTENTION :
L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières,
respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou
d'endommager l'unité centrale.
1. Faites l'inventaire des composants si ce n'est déjà fait.
Remarque : Pour cette installation, vous aurez besoin d'un tournevis à la lame plate de 5 mm (3/16
po). Un tournevis cruciforme de taille 2 et une clé de 8 mm sont facultatifs.
Voici la liste des éléments dont vous avez besoin pour installer l'unité dans l'armoire :
v Glissière de droite (1)
v Glissière de gauche (1)
v Vis M5 (8)
v Cales d'espacement de petit diamètre (8) (déjà installées, 4 par glissière)
v Cales d'espacement de grand diamètre (8)
Remarque : Ces cales ne sont utilisées qu'avec les armoires à support carré.
v Vis à tête tronconique M4 (2)
v Rondelles de 15 mm (0,60 po) de diamètre (6)
Remarque : Ces rondelles ne sont utilisées qu'avec les armoires à support carré.
70 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2. Repérez le gabarit de montage du matériel de montage et les glissières fournis avec votre boîtier
d'unité de disque.
Le sens des glissières et d'avant en arrière et de gauche à droite.
3. Déterminez l'emplacement d'installation du boîtier d'unité de disque dans l'armoire. Pour plus
d'informations, voir «Détermination de l'emplacement».
Résultats
Détermination de l'emplacement
Vous pouvez avoir besoin de déterminer l'emplacement d'installation du boîtier d'unité de disque dans
l'armoire. Cette procédure permet d'effectuer cette tâche.
Avant de commencer
Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
Pourquoi et quand exécuter cette tâche
Avant d'installer le boîtier d'unité de disque dans une armoire, suivez les étapes ci-dessous :
1. Déterminez l'emplacement du boîtier d'unité de disque dans l'armoire par rapport au reste du
matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit :
a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire.
b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire.
c. Enregistrez les emplacements EIA dans votre plan.
Remarque : Le boîtier d'unité de disque 58/86 est de deux unités EIA (Electronic Industries Alliance).
Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Cette unité centrale mesure donc 3,5 po (89 mm) de haut et couvre six
orifices de fixation de l'armoire.
2. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de
l'armoire où vous envisagez de placer le boîtier d'unité de disque.
Installation en armoire d'une unité centrale ou d'une unité d'extension 71
3. Le cas échéant, retirez les volets avant et arrière de l'armoire.
4. Marquez l'emplacement où vous prévoyez d'installer le boîtier d'unité de disque.
a. Si vous possédez un gabarit de montage, utilisez-le pour marquer l'emplacement.
b. Si vous ne possédez pas de gabarit de montage, voir «Marquage de l'emplacement».
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation de l'armoire. Cette procédure permet d'y
parvenir.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
1. Déterminez l'emplacement du boîtier d'unité de disque dans l'armoire par rapport au reste du
matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit :
a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire.
Figure 70. Retrait des panneaux obturateurs
72 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire.
c. Enregistrez les emplacements EIA dans votre plan.
Remarque : Le boîtier d'unité de disque 58/86 est de deux unités EIA (Electronic Industries Alliance).
Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Cette unité centrale mesure donc 3,5 po (89 mm) de haut et couvre six
orifices de fixation de l'armoire.
2. Mettez-vous face à l'avant de l'armoire et travaillez sur son côté droit. Marquez l'orifice du milieu de
l'unité EIA inférieure (A) comme illustré dans la figure suivante. Repérez le trou marqué depuis
l'avant et l'arrière de l'armoire.
3. Marquez le trou du milieu de l'unité EIA inférieure, sur le côté gauche de l'armoire.
4. Placez-vous à l'arrière de l'armoire. Sur le côté droit, localisez l'unité EIA qui correspond à l'unité EIA
inférieure indiquée sur le devant de l'armoire.
Remarque : La figure suivante illustre un exemple de disposition des unités EIA, il se peut que vous
ayez une configuration ou un placement différent.
5. Marquez l'orifice du milieu de l'unité EIA inférieure B.
6. Marquez l'orifice inférieur de l'unité EIA suivante (supérieure) C.
7. Marquez les trous correspondants sur le côté gauche de l'armoire.
Résultats
Fixation du support de montage dans l'armoire
Il se peut que vous deviez fixer le support de montage dans l'armoire. Cette procédure permet d'y
parvenir. Outre les informations de sécurité permettant d'effectuer la procédure de façon fiable et sans
danger, cette section comporte également des illustrations des composants matériel concernés et montre
comment ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
ATTENTION :
L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières,
respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou
d'endommager l'unité centrale.
Pour installer le support de montage dans l'armoire, procédez comme suit :
Figure 71. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 73
1. Desserrez les deux vis de réglage de glissière sur l'une des glissières, juste assez pour permettre aux
glissières de glisser.
2. Repérez les marques précédemment faites sur les bandes EIA (Electronic Industries Alliance) à l'avant
et à l'arrière de l'armoire. Utilisez les lettres L ou R imprimées sur le support avant de votre première
glissière pour l'aligner avec ces marques du côté correspondant de l'armoire. La partie inférieure du
bord du support de glissière doit se situer légèrement au-dessus de la marque U de la bride de
l'armoire.
La glissière est fournie avec deux cales d'espacement dans le support avant et deux cales
d'espacement dans le support arrière. Si l'armoire comporte des orifices circulaires, les cales
d'espacement des supports de glissière s'insèrent dans les orifices circulaires des brides de l'armoire. Si
l'armoire comporte des orifices carrés, remplacez les petites cales à l'arrière et à l'avant de la glissière
rail par les cales plus grosses qui sont incluses dans le kit de glissières.
3. Fixez la glissière à l'avant de l'armoire avec l'une des vis M5, dans l'orifice supérieur du support. Pour
les armoires à orifices carrés, ajoutez une rondelle entre la vis M5 et le support de glissière.
Figure 72. Extension des glissières
Figure 73. Mise en place des cales
74 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Fixez la glissière à l'arrière de l'armoire, avec deux vis M5 dans les orifices inférieur et supérieur du
support de glissière arrière. Pour les armoires à orifices carrés, ajoutez une rondelle entre chaque vis
M5 et le support de glissière.
5. Serrez les vis de réglage de glissière.
6. Répétez l'opération de l'étape 1, à la page 74 à l'étape 5 pour l'autre glissière.
Résultats
Installation du modèle 58/86 sur les glissières
Il se peut que vous deviez installer l'unité d'extension sur les glissières. Cette procédure permet d'y
parvenir. Outre les informations de sécurité, les illustrations des composants matériel associées sont
fournies.
Avant de commencer
1. Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus
et que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière
ne bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations,
voir «Consignes de sécurité relatives aux armoires», à la page 102.
Pourquoi et quand exécuter cette tâche
Pour installer le boîtier d'unité de disque sur les glissières, procédez comme suit :
1. Pour réduire le poids de l'unité et en faciliter l'installation dans l'armoire, retirez les blocs
d'alimentation électrique. Pour retirer un bloc d'alimentation électrique, procédez comme suit :
a. Sur le côté gauche de l'alimentation électrique, appuyez sur la patte de déverrouillage orange vers
la droite, juste assez pour libérer la poignée (pas plus de 6,4 mm (0,24 po)) tout en faisant pivoter
la poignée vers le bas.
Figure 74. Fixation des glissières
Installation en armoire d'une unité centrale ou d'une unité d'extension 75
b. Retirez avec soin l'alimentation électrique de l'unité.
c. Répétez cette étape pour l'autre alimentation électrique, puis mettez les deux unités de côté pour
une installation ultérieure.
2. A l'aide de deux personnes, soulevez le boîtier d'unité de disque et positionnez-le sur l'avant des
glissières.
Avertissement : Deux personnes sont nécessaires pour soulever le boîtier d'unité de disque. Réaliser
cette opération avec moins de deux personnes peut entraîner des blessures.
3. Faites glisser le boîtier d'unité de disque dans l'armoire, en vous assurant qu'il s'insère dans les pattes
situées à l'arrière de la glissière.
Figure 75. Retrait d'alimentations électriques
Figure 76. Installation du boîtier sur les glissières
76 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Fixez l'avant du boîtier de l'unité de disque à l'armoire avec une vis M5 dans l'orifice ouvert de
chaque support (n'pas pas besoin de rondelle).
5. Fixez l'arrière du boîtier de l'unité de disque à chaque glissière avec une vis à tête tronconique M4
dans l'orifice rainuré sur l'extérieur de la paroi du boîtier de stockage.
6. Réinstallez les alimentations électriques et les unités de disque dur.
7. Connectez un cordon d'alimentation à chaque alimentation électrique. Utilisez le guide pour cordon
d'alimentation pour empêcher le cordon d'alimentation de se déconnecter accidentellement.
8. Installez le panneau frontal gauche, doté de voyants d'alimentation, puis le panneau frontal droit.
Placez la découpe de la partie supérieure du panneau frontal sur la patte du rebord du boîtier. Faites
ensuite pivoter le panneau frontal vers le bas jusqu'à ce qu'il s'emboîte. Assurez-vous que la surface
intérieure du panneau frontal est contre le boîtier.
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension après son
installation dans l'armoire, voir Boîtiers et unités d'extension.
Figure 77. Installation des vis de fixation
Figure 78. Fixation des panneaux latéraux
Installation en armoire d'une unité centrale ou d'une unité d'extension 77
Installation en armoire de l'unité d'extension ou 14G/30
Cette rubrique explique comment installer l'unité d'extension ou 14G/30 dans une armoire. La procédure
inclut des informations destinées à promouvoir une opération sécurisée et fiable, et fournit des
illustrations sur la façon dont les composants matériels sont liés les uns aux autres.
v Cette procédure suppose que vous installez l'unité d'extension ou 14G/30 dans une armoire existante.
Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de l'armoire», à la
page 5 avant d'effectuer cette procédure pour installer l'unité d'extension dans l'armoire.
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas
d'installer un modèle autonome dans une armoire.
v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
Pour installer le modèle ou 14G/30 dans une armoire, procédez comme suit :
ATTENTION :
L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte
des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
1. Faites l'inventaire des composants.
v Localisez le rapport des kits dans une boîte d'accessoires.
v Vérifiez que vous avez reçu tous les composants commandés.
2. Vérifiez qu'un emplacement est disponible dans l'unité existante ou que l'espace dans l'armoire est
suffisant pour installer une nouvelle unité.
Mise hors tension du système et coupure de l'alimentation
Il se peut que vous deviez mettre hors tension le système et couper l'alimentation. La procédure de cette
section permet d'effectuer cette tâche.
1. Vérifiez que le système auquel vous effectuez la connexion exécute le dernier niveau de
microprogramme. Pour plus d'informations sur la vérification ou la mise à niveau du
microprogramme ou de votre système, consultez la section des mises à jour de microcode dans les
informations d'installation de votre système.
2. Après avoir vérifié que votre microprogramme est au dernier niveau, arrêtez et mettez hors tension le
système auquel vous effectuez la connexion. Pour plus d'informations sur l'arrêt et la mise hors
tension du système, consultez la documentation fournie avec l'unité d'extension.
3. Pour installer une unité d'extension dans une coque existante, voir «Installation de l'unité d'extension»
, à la page 83. Sinon, consultez la rubrique «Marquage de l'emplacement».
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement. La procédure de cette section permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du
système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit :
a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire.
b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire.
c. Enregistrez les emplacements EIA dans votre plan.
78 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur.
Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour
chaque unité EIA de hauteur. Cette unité centrale mesure donc 7 po (177.8 mm) de haut et couvre 12
orifices de fixation de l'armoire.
La figure suivante montre une unité EIA 1 et une unité EIA 4. Selon le constructeur de l'armoire, le
séparateur des unités EIA peut être symbolisé par une ligne ou par une couleur. L'espacement entre
les trous de la bande EIA est irrégulier. En l'absence d'élément visuel de séparation (couleur ou
ligne) des unités EIA dans l'armoire, chaque unité EIA commence à l'endroit où les trous sont les
plus rapprochés.
A Marque de positionnement
B Fixe-écrou
2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui
sera utilisée par votre unité d'extension. Notez l'emplacement de l'EIA. Utilisez du ruban adhésif, un
marqueur ou un crayon pour marquer l'armoire au niveau de l'orifice supérieur de cette unité EIA
(A). Vérifiez que la marque est visible depuis l'avant et l'arrière de l'armoire.
3. Commencez par l'orifice identifié par votre marque pour (A). Comptez deux orifices et faites une
marque (B) en regard du deuxième orifice. Vérifiez que la marque est visible depuis l'avant et
l'arrière de l'armoire.
4. Commencez par l'orifice identifié par votre marque pour (B). Comptez deux orifices et placez un
écrou clip. L'écrou clip (C) permet de fixer l'unité d'extension à l'armoire.
5. Commencez par l'orifice identifié par votre marque pour (B). Comptez six orifices et placez un écrou
clip. L'écrou clip (D) permet de fixer l'unité d'extension à l'armoire.
6. Répétez les quatre dernières étapes pour faire des marques et placer des écrous clips sur le côté
gauche de l'armoire.
7. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui
sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice
supérieur de cette unité EIA (E).
8. Commencez par l'orifice identifié par votre marque pour (E). Comptez deux orifices et faites une
marque (F) en regard du deuxième orifice.
9. Commencez par l'orifice identifié par votre marque pour (F). Comptez quatre orifices et faites une
marque (G) en regard du quatrième orifice.
10. Répétez les trois dernières étapes pour faire des marques sur le côté gauche de l'armoire.
Figure 79. Unités EIA avec glissières extensibles pour l'unité d'extension 5796
Installation en armoire d'une unité centrale ou d'une unité d'extension 79
Résultats
Mise en place des glissières extensibles et du support d'extension
dans l'armoire
Il se peut que vous deviez mettre en place des glissières dans l'armoire. La procédure de la présente
section vous permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte
également des illustrations des composants matériel concernés et montre comment ces composants sont
liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Avertissement : L'installation des glissières est une procédure délicate. Pour une installation correcte des
glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous
blesser ou d'endommager l'unité centrale.
Pour installer les glissières extensibles dans l'armoire, procédez comme suit :
1. Desserrez les deux vis (B) situées au milieu des deux glissières (A) pour les étendre.
2. A l'arrière de l'armoire, étendez la glissière de gauche.Insérez ses ergots (C) dans les encoches
indiquées afin de l'orienter vers le haut (en forme de L).
3. A l'arrière de l'armoire, étendez la glissière de droite et insérez ses ergots (C) dans les encoches
indiquées.
4. A l'avant de l'armoire, étendez la glissière de gauche et insérez ses ergots (C) dans les encoches
indiquées.
5. Etendez la glissière de droite et insérez les ergots (C) dans les encoches indiquées.
6. A l'aide d'une vis M5 (D), fixez la glissière de gauche à l'avant de l'armoire dans l'emplacement
inférieur (E)
Figure 80. Extension des glissières
80 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Important : Ne placez pas de vis dans l'emplacement supérieur pour l'instant. Sinon, le boîtier sera
fixé à l'assemblage.
.
7. A l'aide d'une vis M5 (D), fixez la glissière de droite à l'avant de l'armoire dans l'emplacement
inférieur (E).
8. Placez-vous à l'arrière de l'armoire et repérez les marques faites sur la glissière arrière gauche. A
l'aide de trois vis M5 (G), fixez le bras et le support (F) de chaque glissière à l'armoire à ces
emplacements.
Remarque : La glissière doit être orientée vers l'intérieur.
Figure 81. Fixation des glissières à l'avant de l'armoire.
Installation en armoire d'une unité centrale ou d'une unité d'extension 81
9. Repérez les marques qui ont été faites sur la glissière arrière droite de l'armoire. A l'aide de trois vis
M5 (G), fixez le bras et le support (F) de chaque glissière à l'armoire à ces emplacements.
10. Serrez les deux vis (B) situées au milieu de la glissière de gauche et les deux vis (B) au milieu de la
glissière de droite. Voir figure 80, à la page 80.
Résultats
Installation du boîtier
Il se peut que vous deviez installer le boîtier. La procédure de cette section permet d'effectuer cette tâche.
Cette section comporte également des illustrations des composants matériel concernés et montre comment
ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Remarque : Avant d'installer le boîtier, vérifiez que les glissières sont alignées horizontalement et
verticalement.
Pour installer la coque, procédez comme suit :
1. A l'avant de l'armoire, placez le boîtier (A) sur les glissières et faites-le glisser dans l'armoire.
Figure 82. Fixation des glissières à l'arrière de l'armoire
82 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
A Boîtier
B Vis de fixation (quatre)
2. Mettez en place et serrez les quatre vis de fixation (B) à l'avant du boîtier (A) afin de fixer ce dernier
à l'armoire.
Résultats
Installation de l'unité d'extension
Il se peut que vous deviez installer l'unité d'extension dans une armoire. La procédure de cette section
permet d'effectuer cette tâche. Cette section comporte également des illustrations des composants matériel
concernés et montre comment ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
Pour installer l'unité d'extension dans une armoire, procédez comme suit :
1. Retirez les capots de transport avant et arrière du modèle 57/96 ou 14G/30. Chacun de ces capots de
transport est assujetti au moyen de quatre vis à serrage à main. Deux vis sont situées au-dessus du
capot et deux sur les côtés du capot.
2. Si vous installez une unité d'extension (H) dans un boîtier existant (G) qui dispose d'un panneau
obturateur (F), retirez ce panneau en appuyant sur les pattes supérieures puis en le tirant vers vous.
Figure 83. Installation du boîtier dans l'armoire sur les glissières extensibles
Installation en armoire d'une unité centrale ou d'une unité d'extension 83
F Panneau obturateur
G Boîtier
H Tiroir
I Vis moletée
J Support de câble
3. Faites glisser l'unité d'extension (H) dans le boîtier.
4. Si vous installez un 5796, installez le support de câble (J) à l'aide d'une vis moletée (I).
5. Si vous avez deux unités d'extension à installer, répétez les quatre dernières étapes pour installer la
deuxième unité d'extension.
6. Si une seule unité d'extension est installée dans le boîtier, installez le panneau obturateur (N) dans
l'emplacement vide du boîtier en faisant glisser le panneau obturateur sous la languette supérieure du
boîtier. Appuyez doucement sur le panneau obturateur jusqu'à ce qu'il s'emboîte dans son logement.
Figure 84. Retrait du panneau obturateur
Figure 85. Retrait du panneau obturateur
84 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
N Panneau obturateur
7. Fixez le carter avant. Voir Installation du carter avant sur un modèle 7314-G30 ou 5796.
Résultats
Fixation des câbles d'alimentation et du réseau de contrôle de
l'alimentation système (SPCN)
Il se peut que vous deviez fixer à l'unité d'extension les câbles d'alimentation et du réseau de contrôle de
l'alimentation système (System Power Control Network, ou SPCN). La procédure de cette section permet
d'effectuer cette tâche.
Pour plus d'instructions sur le câblage et la connexion de vos unités d'extension, voir Boîtiers et unités
d'extension.
Mise sous tension du système
Il se peut que vous deviez mettre l'unité d'extension sous tension. Pour plus d'informations sur la mise
sous tension de l'unité d'extension, voir Démarrage du système ou de la partition logique.
Installation en armoire du modèle 57/86 ou 31D/24
Il se peut que vous deviez installer une unité d'extension dans une armoire. La présente section explique
comment effectuer ces tâches.
Cette procédure suppose que l'installation de l'unité d'extension s'effectue dans une armoire existante. Si
l'armoire n'est pas installée, suivez les instructions du «Installation de l'armoire», à la page 5, puis
revenez à la présente section pour obtenir des instructions sur l'installation de l'unité d'extension dans
l'armoire.
Une fois que vous avez installé l'unité d'extension dans l'armoire, vous devez suivre les instructions
indiquées à la section Unités de disque pour connecter et configurer le modèle 31D/24 ou 57/86.
Figure 86. Installation du panneau obturateur
Installation en armoire d'une unité centrale ou d'une unité d'extension 85
Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire.
Pour installer l'unité d'extension dans une armoire, procédez comme suit :
Avertissement : L'installation des glissières dans l'armoire est une opération complexe. Pour une
installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas,
vous risquez de vous blesser ou d'endommager l'unité d'extension.
1. Consultez la rubrique «Consignes de sécurité relatives aux armoires», à la page 102. Le non-respect de
ces instructions peut endommager le matériel ou provoquer des lésions corporelles.
2. Faites l'inventaire des composants (voir «Inventaire des composants»).
3. Identifiez le kit de montage en armoire (A) et les guides du système (B) fournis avec votre unité
d'extension. Voir la figure 87 pour connaître les emplacements relatifs de ces éléments. Les guides sont
identiques.
Remarque : Le tournevis à douille 8 mm et le tournevis à lame plate représentés dans le diagramme
suivant ne figurent pas dans l'inventaire, ni dans les pièces livrées, mais ils vous permettront
d'effectuer l'installation.
4. Choisissez l'emplacement d'installation de l'unité d'extension dans l'armoire Pour plus d'informations,
voir «Détermination de l'emplacement», à la page 87. Cette unité d'extension fait quatre unités EIA
(Electronic Industries Alliance) de hauteur. Une unité EIA mesure 4,45 mm de hauteur. L'armoire
contient trois trous de montage pour chaque unité EIA sur la hauteur.
5. Pour le marquage de l'emplacement, voir «Marquage de l'emplacement», à la page 88.
Inventaire des composants
Il se peut que vous deviez remplir l'inventaire des composants. La procédure de cette section permet
d'effectuer cette tâche.
Figure 87. Inventaire des composants
86 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Pourquoi et quand exécuter cette tâche
Si vous ne l'avez pas encore fait, effectuez un inventaire des composants avant de poursuivre l'installation
:
Procédure
1. Localisez le rapport des kits dans une boîte d'accessoires.
2. Vérifiez que vous avez reçu tous les composants commandés.
Résultats
En cas de composants manquants, endommagés ou ne correspondant pas à la commande, vous disposez
des contacts suivants :
v Revendeur
Détermination de l'emplacement
Il se peut que vous deviez déterminer l'emplacement de l'unité d'extension dans l'armoire. Cette
procédure permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension dans une armoire, procédez comme suit :
Procédure
1. Consultez la rubrique «Consignes de sécurité relatives aux armoires», à la page 102.
2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus
lourdes dans la partie inférieure de l'armoire. Ces unités d'extension font quatre unités EIA (Electronic
Industries Alliance) de hauteur. L'armoire contient trois trous de montage par unité EIA sur la
hauteur. Chacune de ces unités d'extension mesure donc 177,8 mm de hauteur et couvre 12 trous de
vis sur l'armoire.
3. Enlevez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire.
Installation en armoire d'une unité centrale ou d'une unité d'extension 87
4. Retirez les volets avant et arrière si nécessaire.
5. Pour le marquage de l'emplacement, voir «Marquage de l'emplacement».
Marquage de l'emplacement
Il se peut que vous deviez marquer l'emplacement d'installation de l'armoire. La procédure de cette
section permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez
comme suit :
Procédure
1. Déterminez l'emplacement de l'unité dans l'armoire et notez les numéros d'emplacement. Cette unité
fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA correspond à 1,75 po
Figure 88. Retrait des panneaux obturateurs
88 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
(44,45 mm) de hauteur. L'armoire contient trois trous de montage par unité EIA sur la hauteur. Cette
unité fait donc 7 po (177,8 mm) de haut et correspond à 12 orifices de fixation sur l'armoire.
2. Face à l'avant de l'armoire, et en travaillant depuis le côté gauche, repérez l'unité EIA inférieure qui
sera utilisée par votre unité. Notez l'emplacement de l'EIA. Utilisez du ruban adhésif, un marqueur ou
un crayon pour marquer le trou de vis inférieure (B) de cette unité EIA. Marquez l'armoire de sorte
que la marque reste visible au dos de l'armoire.
3. Sur la glissière gauche, comptez deux trous de vis depuis l'endroit où vous avez placé la marque à
l'étape 2 et placez une autre marque à côté de ce trou de vis (B). Marquez l'armoire de sorte que la
marque reste visible au dos de l'armoire. Vous avez maintenant deux marques sur l'armoire, avec un
trou de vis entre ces marques.
4. Répétez les étapes 2 et 3 pour placer deux marques sur les trous de vis correspondants sur la glissière
avant droite.
5. Placez-vous à l'arrière de l'armoire. Sur le côté gauche, localisez l'unité EIA qui correspond à l'unité
EIA inférieure indiquée sur le devant de l'armoire.
6. Placez une marque au niveau de l'orifice supérieur (C) de l'unité EIA la plus basse. Marquez l'armoire
de sorte que la marque reste visible au dos de l'armoire.
7. Comptez deux trous de vis depuis l'endroit où vous avez placé la la marque à l'étape 6 et placez une
autre marque à côté de ce trou de vis (C).
8. Répétez les étapes 6 et 7 pour placer deux marques sur les trous de vis correspondants sur la glissière
arrière droite.
Que faire ensuite
Fixation du support de montage dans l'armoire
Il se peut que vous deviez fixer le support de montage dans l'armoire. La procédure de cette section
permet d'effectuer cette tâche. Outre les informations de sécurité permettant d'effectuer la procédure de
façon fiable et sans danger, cette section comporte également des illustrations des composants matériel
concernés et montre comment ces composants sont liés les uns aux autres.
Pourquoi et quand exécuter cette tâche
ATTENTION :
L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières,
respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou
d'endommager l'unité centrale.
Figure 89. Marquage des emplacements d'installation
Installation en armoire d'une unité centrale ou d'une unité d'extension 89
Pour installer le support de montage dans l'armoire, procédez comme suit :
Procédure
1. Desserrez les deux vis (B) situées au milieu des deux glissières (A) pour les étendre.
2. A l'arrière de l'armoire, étendez la glissière gauche et insérez ses ergots (C) dans les encoches
indiquées afin de l'orienter vers le haut.
Remarque : Bien que l'installation de la glissière puisse être effectuée par une seule personne, la
procédure sera facilitée si une personne est positionnée à l'avant de l'armoire et une autre à l'arrière.
3. A l'arrière de l'armoire, étendez la glissière de droite et insérez ses ergots (C) dans les encoches
indiquées.
4. A l'avant de l'armoire, étendez la glissière de gauche et insérez ses ergots (C) dans les encoches
indiquées.
5. Etendez la glissière de droite et insérez les ergots (C) dans les encoches indiquées.
6. A l'aide des deux vis M5 (D), fixez la glissière de gauche à l'avant de l'armoire sous chaque ergot de
glissière (C).
Figure 90. Extension des glissières
90 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
7. A l'aide de deux vis M5 (D), fixez la glissière de droite à l'avant de l'armoire.
8. Placez-vous à l'arrière de l'armoire et fixez la glissière de gauche à l'arrière de l'armoire à l'aide des
deux vis M5 (D).
9. Fixez la glissière de droite à l'arrière de l'armoire en utilisant les deux vis M5 (D).
10. Serrez les deux vis sur la partie médiane de la glissière gauche et de la glissière droite. Pour plus
d'informations, voir la figure 91.
11. Insérez deux écrous clip dans l'emplacement indiqué sur la glissière avant gauche.
12. Insérez deux écrous clip dans les emplacements indiqués sur la glissière avant droite.
Résultats
Installation du modèle 57/86 ou 31D/24 sur les glissières
Il se peut que vous deviez installer l'unité d'extension sur les glissières. La procédure de cette section
permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte également des
illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux
autres.
Pourquoi et quand exécuter cette tâche
Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus et
que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière ne
bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations, voir
«Consignes de sécurité relatives aux armoires», à la page 102.
Pour installer l'unité d'extension sur ses glissières, procédez comme suit :
Figure 91. Fixation des glissières
Installation en armoire d'une unité centrale ou d'une unité d'extension 91
Procédure
1. Si le panneau frontal est installé à l'avant de l'unité d'extension, retirez-le en appuyant sur les deux
boutons de déverrouillage.
2. En vous faisant aider par trois autres personnes, soulevez l'unité d'extension et positionnez-la sur
l'avant des glissières.
Avertissement : Trois personnes sont nécessaires pour effectuer cette opération en toute sécurité. Le
non-respect de cette recommandation peut provoquer des blessures.
3. Poussez l'unité d'extension dans l'armoire jusqu'à ce qu'elle soit complètement calée.
4. Vérifiez si les plaques de fixation sont déjà installées à l'arrière de l'unité d'extension. Si ce n'est pas le
cas, fixez-les à droite et à gauche à l'arrière de l'unité. Les plaques de fixation sont illustrées à la
figure 94, à la page 93.
5. Installez les deux vis de fixation M5 A sur le châssis et dans les écrous clip (B) sur les glissières droite
et gauche.
Figure 92. Retrait du panneau frontal
92 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
6. A l'arrière de l'armoire, installez les deux vis de fixation M5 (C) à l'arrière des deux plaques de
fixation droite et gauche et insérez-les sur les glissières.
7. Installation du panneau frontal sur l'unité d'extension.
Figure 93. Installation des vis de fixation
Figure 94. Fixation de l'unité d'extension à l'arrière de l'armoire
Installation en armoire d'une unité centrale ou d'une unité d'extension 93
Résultats
Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois que
vous avez terminé l'installation de 57/86 ou de 31D/24 sur l'armoire, voir Boîtiers et unités d'extension.
Fixation du bras de routage des câbles aux glissières standard
Il se peut que vous deviez fixer le bras de routage des câbles. La procédure de cette section permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Si l'unité d'extension est fournie avec les glissières standard (longueur fixe) et que le bras de routage des
câbles n'y est pas fixé, procédez comme suit. Sinon, passez à la rubrique «Détermination de
l'emplacement», à la page 87.
Procédure
1. Insérez la languette de la glissière du bras de routage des câbles (B) dans la glissière.
2. Faites pivoter le bras de routage des câbles (A) (voir la figure suivante).
Figure 95. Installation du panneau frontal
94 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
3. Fixez le bras de routage des câbles (A) à l'aide de la vis de fixation (C).
4. Répétez les étapes 1 à 3 pour l'autre glissière.
Résultats
Figure 96. Rotation du bras de routage des câbles
Installation en armoire d'une unité centrale ou d'une unité d'extension 95
96 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et remise en place de panneaux latéraux 14T/00 ou
14T/42
Il se peut que vous deviez retirer et remettre en place un panneau latéral sur une armoire. La procédure
de cette section permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Il se peut que l'armoire soit dotée de panneaux latéraux facultatifs. Pour retirer et remettre en place un
panneau latéral, procédez comme suit :
Procédure
1. Déverrouillez les panneaux latéraux en appuyant sur les taquets de verrouillage pour les débloquer.
Remarque : Si votre armoire utilise le kit de renforcement, retirez la vis de fixation pour que chaque
panneau latéral puisse être extrait. Pour plus d'informations, voir «Libération du panneau latéral à
l'aide d'un kit de renforcement», à la page 110.
2. Inclinez légèrement la partie supérieure du panneau latéral vers vous.
3. Dégagez le panneau latéral de la rainure située au bas de l'armoire en le soulevant.
4. Répétez cette procédure pour l'autre panneau latéral.
Figure 97. Retrait du panneau latéral
97
Remise en place d'un panneau latéral 14T/00 ou 14T/42
Il se peut que vous deviez remettre en place un panneau latéral sur l'armoire. La procédure de la
présente section vous permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour remettre en place un panneau latéral, procédez comme suit :
Procédure
1. Inclinez légèrement la partie supérieure du panneau latéral vers vous.
2. Placez la partie inférieure du panneau latéral sur la rainure située au bas de l'armoire.
3. Mettez la partie supérieure du panneau latéral en place en la faisant glisser et fermez les taquets de
verrouillage.
Remarque : Si votre armoire utilise le kit de renforcement, installez une vis de fixation dans chaque
panneau latéral installé. Pour plus d'informations, voir le «Kit de renforcement», à la page 109.
Figure 98. Remise en place du panneau latéral
98 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et remise en place de panneaux d'habillage 14T/00 ou
14T/42
Les boîtiers installés avec des systèmes à tiroir de processeurs multiples peuvent utiliser les panneaux
d'habillage avant au lieu de portes. Pour les armoires qui utilisent les panneaux d'habillage, un type de
panneau à interférences réduites doit être installé lorsque certains modèles d'unité d'extension sont
présents. Utilisez la procédure décrite dans cette section pour retirer les panneaux d'habillage existants de
l'armoire et les remplacer par le type de panneaux d'interférences réduites.
Pourquoi et quand exécuter cette tâche
Pour retirer les panneaux d'habillage armoire existants, exécutez la procédure suivante:
Procédure
1. Placez les deux mains au centre du panneau d'habillage latéral droit.
2. Appuyez fermement vers l'intérieur pour libérer les pinces qui maintiennent le panneau en place.
3. Faites pivoter vos mains légèrement jusqu'à ce que le panneau soi dégagé.
4. Soulevez le panneau, puis mettez-le de côté.
5. Répétez cette procédure pour retirer le panneau d'habillage latéral gauche.
Remise en place de panneaux d'habillage 14T/00 ou 14T/42
Pour les armoires qui utilisent des panneaux d'habillage au lieu de portes, un type de panneau à
interférences réduites doit être installé lorsque certains modèles d'unité d'extension sont présents. La
procédure de cette section permet d'effectuer cette tâche.
Figure 99. Retrait du panneau d'habillage de l'armoire
99
Pourquoi et quand exécuter cette tâche
Pour installer les panneaux d'habillage, procédez comme suit :
Procédure
1. Alignez la plaque inférieure (A) du panneau d'habillage latéral droit avec le bas de l'armoire
2. Alignez le haut du panneau d'habillage (B) et serrez légèrement avec vos doigts.
3. Une fois le panneau d'habillage à l'emplacement approprié, relâchez la pression pour permettre aux
pinces de fixation de maintenir le panneau en place.
4. Répétez cette procédure pour installer le panneau d'habillage latéral gauche.
Figure 100. Installation du panneau d'habillage de l'armoire
100 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Fixation des volets de l'armoire
Il se peut que vous deviez fixer les volets de l'armoire. Cette procédure permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Le volet avant est facultatif pour certains modèles d'armoire. Si votre système est déjà équipé d'un volet
avant ou si vous ne souhaitez pas installer de volet avant, passez directement à l'étape suivante.
Fixation d'un volet avant de type High-perforation
Il se peut que vous deviez fixer un volet avant à l'armoire. La procédure de cette section permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Pour installer le volet avant de type High-perforation, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Retirez les panneaux d'habillage haut, gauche et droit. Pour plus d'informations sur le retrait des
panneaux d'habillage gauche et droit d'une armoire 14T/00 ou 14T/42, voir «Retrait et remise en
place de panneaux d'habillage 14T/00 ou 14T/42», à la page 99.
Figure 101. Fixation du volet de l'armoire
101
1 Boîtier d'armoire 4 Panneau d'habillage droit
2 Panneau d'habillage haut 5 Pince de fixation
3 Panneau d'habillage gauche
3. Installez le loquet du volet sur la droite et les charnières sur la gauche.
4. Pour un volet avant High Perforation, alignez le volet sur la charnière de l'armoire, puis placez l'ergot
au-dessus et descendez-le sur la charnière.
5. Réglez le loquet afin que le volet se ferme correctement.
Consignes de sécurité relatives aux armoires
Vous devez lire les consignes de sécurité relatives aux armoires avant d'installer votre équipement.
Pourquoi et quand exécuter cette tâche
Avant d'installer une armoire, des fonctionnalités d'armoire ou une unité centrale ou d'extension, lisez les
consignes de sécurité suivantes.
Avertissement : Si vous installez le matériel dans une armoire , elle doit être conforme à la spécification
EIA 310D. L'installation du matériel dans l'armoire sans glissière de montage conçue à cet effet risque
d'endommager le matériel ou de provoquer des lésions corporelles.
Figure 102. Retrait des panneaux d'habillage
102 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
DANGER
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Précautions à prendre pour soulever un dispositif :
Fixation des volets de l'armoire 103
104 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Installation du kit de sécurité de l'armoire
Il se peut que vous deviez installer le kit de sécurité de l'armoire. La procédure de la présente section
vous permet d'effectuer cette tâche. Cette section comporte également des illustrations des composants
matériel concernés et montre comment ces composants sont liés les uns aux autres.
Pour installer un kit de sécurité de l'armoire comprenant un verrou et des glissières de sécurité, procédez
comme suit :
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Effectuez l'inventaire du kit de sécurité.
v
1 Deux kits de verrouillage. Chaque kit contient :
- Verrou d'armoire
- Support
- Vis
- Deux clés
2 Deux glissières de sécurité
3 Deux autocollants Verrouillé/Déverrouillé
3. Retirez le loquet du volet.
Figure 103. Inventaire du kit de sécurité de l'armoire
105
a. Ouvrez le volet avant de l'armoire.
b. Sur la partie intérieure du volet, retirez la vis (4) qui fixe le verrou sur le volet de l'armoire (voir la
figure 104).
c. Retirez le taquet (5).
d. Retirez le loquet (6) situé sur la partie extérieure du volet.
Remarque : Si l'armoire est équipée d'un kit de renforcement, retirez le contre-écrou et l'écrou six
pans du loquet du volet et réinstallez-les sur le nouveau taquet de verrouillage.
4. Installez le verrou.
a. Sur la partie avant du volet, insérez le verrou à clé de l'armoire dans le logement du loquet (6)
(voir la figure 104).
b. Installez le verrou en fixant le support (5) à l'aide de la vis (4) sur la partie intérieure du volet.
5. Répétez les étapes 3, à la page 105 et 4 pour installer le second verrou sur le volet arrière de l'armoire.
6. Installez une glissière de sécurité sur le côté droit de l'armoire.
Figure 104. Retrait du loquet du volet
106 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Remarque : Les glissières possèdent chacune deux longs taquets situés sur la partie inférieure. Elles
sont identiques et peuvent s'installer sur le panneau latéral gauche ou droit.
a. Déverrouillez le panneau latéral droit afin de pouvoir accéder à la partie supérieure du panneau.
b. Placez la partie plate de la glissière (7) face à la partie intérieure du panneau (8) (voir la
figure 105) et insérez les deux taquets (9) de la glissière dans les deux canaux de support verticaux
(10) situés sur le panneau latéral.
Remarque : Lorsque la glissière est correctement installée, elle doit coulisser de l'avant vers
l'arrière.
c. Réinstallez le panneau latéral sur l'armoire.
d. Verrouillez les panneaux latéraux en plaçant les glissières à l'avant de l'armoire.
e. Placez un autocollant Verrouillé/Déverrouillé sur la partie intérieure du panneau afin que le
taquet se trouve sur le symbole Verrouillé lorsque la glissière est en position de verrouillage (11)
(voir la figure 106, à la page 108) et sur le symbole Déverrouillé (12), lorsque la glissière est
déverrouillée.
Figure 105. Installation d'une glissière de sécurité
Installation du kit de sécurité de l'armoire 107
f. Répétez cette procédure pour le côté gauche de l'armoire.
Figure 106. Positionnement de l'autocollant Verrouillé/Déverrouillé sur le panneau
108 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Kit de renforcement
Il se peut que vous deviez retirer ou remplacer un composant du kit de renforcement. La présente section
explique comment effectuer ces tâches.
Le kit de renforcement doit être commandé en même temps que l'armoire. Les équerres du kit de
renforcement sont installées par le fabricant.
Remarque : Si vous installez une armoire à l'aide du kit de renforcement et que vous avez besoin de fixer
l'armoire au sol, voir «Installation de l'armoire», à la page 5.
L'illustration suivante décrit le contenu du kit et l'emplacement approximatif des supports et des
charnières pour le cas où vous auriez besoin de désinstaller ou de réinstaller une pièce.
1 Armoire 9 Charnières de l'équerre
2 Entretoise 10 Equerre
3 Charnière de l'équerre 11 Monture de vis de fixation du volet
latéral
4 Goujons de pivot de la charnière 12 Entretoise
5 Taquet de blocage de l'équerre 13 Vis
6 Entretoise 14 Rondelle
7 Vis 15 Vis de fixation du volet latéral
8 Vis moletée de l'équerre
Figure 107. Composants du kit de renforcement
109
Libération de l'équerre renforcée
Il se peut que vous deviez libérer l'équerre renforcée. La procédure de cette section permet d'effectuer
cette tâche.
Pour accéder à la partie arrière d'un système installé dans une armoire équipée d'un kit de renforcement,
effectuez les étapes suivantes pour libérer l'équerre renforcée :
1. Enlevez la vis moletée de l'équerre (8).
2. Sortez l'équerre (10) hors de l'armoire.
3. Accédez au système, puis ré-installez l'équerre.
Libération du panneau latéral à l'aide d'un kit de renforcement
Il se peut que vous deviez libérer le panneau latéral sur l'armoire. Cette procédure permet d'effectuer
cette tâche.
Le kit de renforcement contient des vis qui permettent la fixation des panneaux latéraux à l'armoire. Pour
retirer une vis de serrage, procédez comme suit :
1. Si nécessaire, ouvrez ou retirez le volet arrière de l'armoire.
2. Repérez la monture de vis de fixation (11) du volet latéral à retirer.
Figure 108. Desserrement de l'équerre pour accéder aux systèmes
110 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
3. A l'aide d'un tournevis, retirez la vis de fixation (15) et la rondelle (14) du panneau latéral. Pour
retirer le panneau latéral, voir «Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42», à
la page 97.
Figure 109. Retrait des vis de serrage sur le panneau latéral
Kit de renforcement 111
112 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Connexion de plusieurs armoires à l'aide d'un kit de
connexion multiarmoire
Il se peut que vous deviez relier plusieurs armoires. La procédure de cette section permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
La présente section explique comment connecter plusieurs armoires à l'aide d'un kit de connexion
multiarmoire. Pour ce faire, munissez-vous du kit de connexion multiarmoire.
Pour connecter plusieurs armoires à l'aide d'un kit de connexion multiarmoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. S'ils sont en place, retirez les panneaux latéraux de chaque armoire. Ne retirez les panneaux latéraux
que des côtés qui seront reliés l'un à l'autre. Pour ce faire :
a. Soulevez les deux pattes de déverrouillage du panneau.
b. Retirez le panneau du boîtier de l'armoire en le soulevant afin de le dégager des deux pattes
inférieures J.
c. Placez à l'écart les panneaux latéraux.
3. Retirez les deux pattes Z et les deux pattes J. Ces pattes servent à accrocher les panneaux latéraux.
4. Installez les deux premières vis de butée dans les angles supérieur gauche et inférieur droit de la
première armoire (voir la figure 110, à la page 114).
5. Installez les deux autres vis de butée dans les angles supérieur gauche et inférieur droit de la
deuxième armoire (voir la figure 110, à la page 114).
6. Fixez la mousse comme illustré sur la figure 110, à la page 114. Sur les modèles d'armoire T42, placez
la bande courte de mousse protectrice dans le prolongement de la bande longue et placez l'ensemble
sur la longueur du cadre de l'armoire.
7. Positionnez les armoires ensemble.
8. Alignez les trous des vis de butée. Au besoin, réglez les pieds de nivellement.
9. Placez une vis et une rondelle dans les quatre emplacements correspondants, sans les serrer.
113
10. Une fois toutes les armoires reliées les unes aux autres, ajustez leur niveau.
11. Serrez toutes les vis.
12. Enclipsez les couvre-joints entre les armoires avant et arrière.
13. Enclipsez les couvre-joints supérieurs entre les armoires.
14. A l'aide des panneaux obturateurs, masquez les espaces vides situés à l'avant des armoires. Tous les
espaces situés à l'avant de l'armoire doivent également être bouchés, y compris les espaces entre le
matériel. Cette étape permet de garantir une bonne ventilation dans l'armoire.
15. Connectez les câbles séparant les armoires.
16. Pour installer des plaques de soutien, passez à l'étape 5 de «Installation de l'armoire», à la page 5.
Figure 110. Retrait des panneaux latéraux et des pattes Z et J, mise en place des vis de butée et de la longue bande
de mousse pour relier plusieurs armoires
Figure 111. Emplacement des bandes de mousse protectrices (vue de dessus)
114 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Alarme de statut d'armoire
L'alarme de statut d'armoire se place au sommet d'une armoire et peut être reliée à plusieurs unités
centrales montées dans l'armoire. Le voyant de l'alarme de statut d'armoire est activé par le
microprogramme du serveur en cas de problème de statut, ce qui permet d'identifier le problème dans les
meilleurs délais.
Cette opération incombe au client. Vous pouvez exécuter cette tâche vous-même ou faire appel à un
fournisseur de services. Il vous imputera probablement les frais correspondants.
Ces procédures supposent que les unités centrales ont été installées dans l'armoire et reliées au bloc
d'alimentation. Si tel n'est pas le cas, reportez-vous aux instructions d'installation relatives à l'unité
centrale.
Installation de l'alarme de statut d'armoire
Il se peut que vous deviez installer l'alarme de statut d'armoire. La procédure de cette section permet
d'effectuer cette tâche.
Avant de commencer
Pour installer l'alarme de statut d'armoire, vous devez disposer des éléments suivants :
v Une alarme de statut d'armoire
v Au moins deux câbles d'alarme de statut d'armoire ; un pour connecter l'alarme à la boîte de jonction
et un autre pour chaque unité centrale que vous souhaitez connecter à l'alarme
v Un ou deux cordons d'alimentation
v Une ou plusieurs boîtes de jonction pour l'alarme de statut d'armoire.
v Un ou plusieurs câbles de boîte de jonction.
Pourquoi et quand exécuter cette tâche
Pour installer l'alarme de statut d'armoire, procédez comme suit :
Procédure
1. Placez l'alarme de statut d'armoire (A) sur l'armoire, en dirigeant le voyant vers l'avant de l'armoire.
115
2. Connectez l'une des extrémités du câble de l'alarme de statut d'armoire (1) au port situé à l'arrière de
l'alarme.
3. Faites passer le câble de l'alarme de statut d'armoire par le capot d'accès des câbles de l'armoire (B).
4. Connectez l'autre extrémité du câble de l'alarme de statut d'armoire au port de sortie de la boîte de
jonction (C).
5. Pour relier la boîte de jonction à l'unité centrale, connectez l'une des extrémités d'un câble de l'alarme
de statut d'armoire (1) à un port d'entrée de la boîte de jonction.
Figure 112. Connexion de l'alarme de statut d'armoire à l'unité centrale via la boîte de jonction
Figure 113. Connexion de l'alarme de statut d'armoire et de l'unité centrale à la boîte de jonction
116 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
6. Connectez l'autre extrémité du second câble de l'alarme de statut d'armoire au port d'alarme de l'unité
centrale. Ce port est indiqué par un symbole représentant une alarme.
Remarque : L'orientation du port peut être différente sur certaines unités centrales.
7. Pour connecter d'autres unités centrales à l'alarme de statut d'armoire, répétez les étapes 4 et 5 pour
chaque unité centrale supplémentaire. Vous pouvez connecter plusieurs boîtes de jonction pour
ajouter des unités centrales. Pour plus d'informations, voir «Connexion de plusieurs boîtes de
jonction».
8. Connectez les cordons d'alimentation à l'alarme de statut d'armoire. Un seul cordon d'alimentation
est requis, mais vous pouvez en utiliser deux par sécurité.
9. Branchez l'autre extrémité des cordons d'alimentation au bloc d'alimentation de l'armoire.
Résultats
Si vous installez l'alarme de statut d'armoire lors de la configuration initiale du système, les informations
d'identification de l'armoire seront transmises au démarrage du système. En revanche, si vous installez
l'alarme de statut d'armoire sur des unités centrales en cours d'exécution, vous devrez redémarrer le
système.
Connexion de plusieurs boîtes de jonction
Il se peut que vous deviez connecter plusieurs boîtes de jonction. La présente section explique comment
effectuer ces tâches.
Pourquoi et quand exécuter cette tâche
Vous pouvez connecter plusieurs boîtes de jonction pour pouvoir ajouter d'autres unités centrales à
l'alarme de statut d'armoire.
Pour connecter plusieurs boîtes de jonction, vous avez besoin du matériel supplémentaire suivant :
v Une ou plusieurs boîtes de jonction d'alarme de statut d'armoire
v Un ou plusieurs câbles de boîte de jonction. Vous avez besoin d'un câble de boîte de jonction par boîte
de jonction à ajouter.
Procédure
1. Connectez une extrémité du câble de boîte de jonction (2) dans un port d'entrée de la première boîte
de jonction.
Figure 114. Port d'alarme d'état de l'armoire
Alarme de statut d'armoire 117
2. Connectez un câble de boîte de jonction dans le port de sortie de la boîte de jonction supplémentaire.
3. Raccordez des unités centrales supplémentaires à cette boîte de jonction comme indiqué dans
«Installation de l'alarme de statut d'armoire», à la page 115.
4. Répétez les étapes 1 - 3 pour chaque boîte de jonction supplémentaire.
Figure 115. Connexion de l'alarme de statut d'armoire et de l'unité centrale à la boîte de jonction
118 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Installation ou retrait d'un taquet de verrouillage d'unité
centrale montée en armoire
Il se peut que vous deviez installer ou retirer un taquet de verrouillage. La procédure de cette section
permet d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Ces procédures décrivent l'installation ou le retrait d'un taquet de verrouillage de l'unité centrale montée
en armoire. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Il
vous imputera probablement les frais correspondants.
Pour installer ou enlever un taquet de verrouillage, effectuez les opérations suivantes :
Remarque : Les illustrations de ces procédures peuvent présenter des différences par rapport à votre type
et modèle de machine.
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Ouvrez le volet avant de l'armoire.
3. Poussez le taquet de verrouillage (A) ou (B) pour le mettre en place sur le côté de l'unité centrale.
4. Fixez le taquet de verrouillage à l'aide des deux vis de fixation (C) sur le côté du taquet.
5. Réinstallez le carter avant. Voir «Retrait et installation des carters sur le système 33E/8B ou 36E/8C»,
à la page 143.
6. Fermez le volet avant de l'armoire.
Figure 116. Réinstallation des taquets de verrouillage
119
Résultats
Pour retirer un taquet de verrouillage, procédez comme suit :
1. Retirez le capot situé à l'avant du système. Voir «Retrait et installation des carters sur le système
33E/8B ou 36E/8C», à la page 143.
2. Placez le système en position de maintenance. Voir les informations de maintenance de votre modèle.
3. Ôtez les vis de fixation de verrouillage (C) situés sur le côté du taquet de verrouillage (A) ou (B).
4. Retirez le taquet de verrouillage de l'unité centrale.
Figure 117. Retrait des vis de fixation
120 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et remise en place du capot supérieur sur l'armoire
Il se peut que vous deviez retirer ou remettre en place le capot supérieur de l'armoire. Les procédures de
cette section permettent d'effectuer ces tâches.
Retrait du capot supérieur de l'armoire
Il se peut que vous deviez retirer le capot supérieur de l'armoire. Cette procédure permet d'effectuer cette
tâche.
Pourquoi et quand exécuter cette tâche
Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour retirer les vis du capot.
D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable.
Pour ôter le capot de l'armoire, procédez comme suit :
Procédure
1. Retirez le volet avant.
Remarque : Si l'armoire est verrouillée, déverrouillez les volets. Puis débloquez les panneaux latéraux
en ôtant les barres de verrouillage situées de chaque côté de la partie supérieure de l'arrière de
l'armoire. Faites glisser les barres de verrouillage hors de l'armoire. Les barres de verrouillage peuvent
être ôtées depuis l'avant de l'armoire si l'espace dans la zone des barres est suffisant et que les barres
sont facilement accessibles.
2. Retirez les panneaux d'habillage haut, gauche et droit.
3. Retirez les panneaux latéraux.Pour plus d'informations, voir «Retrait et remise en place de panneaux
latéraux 14T/00 ou 14T/42», à la page 97.
4. Ôtez une vis du côté droit et une vis du côté gauche du capot supérieur. Appliquez la même
procédure à l'avant et à l'arrière.
5. Identifiez les renforts avant (A) et arrière (C) de l'armoire, fournis dans le conteneur de transport.
Fixez chacun des renforts en haut à l'avant et à l'arrière de l'armoire, juste au-dessous du capot
supérieur.
6. Utilisez les quatre vis (B) qui avaient été ôtées du capot supérieur pour fixer chacun des renforts à
l'armoire (voir figure 118, à la page 122).
121
A Renfort avant de l'armoire
B Vis de fixation (2 vis par renfort)
C Renfort arrière de l'armoire
7. Ôtez les six vis restantes des côtés droit et gauche du capot supérieur (D).
Figure 118. Fixation des renforts
122 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
D Capot supérieur
E Capot d'accès aux câbles
F Panneau latéral (quantité : 2)
G étiquette EIA
8. Soulevez le capot supérieur.
Remise en place du capot supérieur sur l'armoire
Il se peut que vous deviez remettre en place le capot supérieur de l'armoire. Cette procédure permet
d'effectuer cette tâche.
Pourquoi et quand exécuter cette tâche
Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour remettre en place les vis du
capot. D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable par la
suite.
Pour remettre en place le capot sur l'armoire, procédez comme suit :
Procédure
1. Positionnez le capot supérieur (D) sur l'armoire.
2. Insérez les six vis des côtés droit et gauche du capot supérieur.
Figure 119. Retrait du capot supérieur
Retrait et remise en place du capot supérieur sur l'armoire 123
D Capot supérieur
E Capot d'accès aux câbles
F Panneau latéral (quantité : 2)
G étiquette EIA
Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour remettre en place les vis du
capot. D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable par la
suite.
3. Retirez les renforts (A) et (C) en les desserrant dans la partie supérieure de l'armoire, juste au-dessous
du capot supérieur. Ôtez les renforts à l'avant et à l'arrière de l'armoire.
Figure 120. Retrait du capot supérieur
124 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
A Renfort avant de l'armoire
B Vis de fixation (quantité : 2 vis par renfort)
C Renfort arrière de l'armoire
4. Insérez les vis (B) que vous avez ôtées des renforts à droite et à gauche du capot supérieur. Insérez les
vis à l'avant et à l'arrière de l'armoire.
5. A l'aide des panneaux obturateurs, masquez les espaces vides situés à l'avant de l'armoire. Comblez
tous les espaces à l'avant de l'armoire, y compris les espaces entre les éléments de l'équipement.Cette
étape permet de garantir une bonne ventilation dans l'armoire.
Figure 121. Retrait du capot supérieur
Retrait et remise en place du capot supérieur sur l'armoire 125
126 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Unité d'alimentation plus
L'unité d'alimentation plus (PDU+) peut être installée dans les armoires 14T/00 et 14T/42. Elle permet de
surveiller les charges électriques des unités qui y sont raccordées.
Installation de l'unité PDU+ sur le côté d'une armoire
Apprenez à installer l'unité d'alimentation plus (PDU+) sur le côté d'une armoire.
Avant de commencer
Conseil : Le retrait des volets et des panneaux latéraux de l'armoire peut faciliter l'installation.
Important : Vous devez utiliser des écrous clip pour installer les étriers de fixation. Les écrous clip sont
fournis avec la PDU+ et s'installent sur les poignées de montage.
Pourquoi et quand exécuter cette tâche
Pour installer la PDU+ dans l'espace de montage 1U sur le côté d'une armoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Alignez les étriers de fixation verticaux sur l'avant de la PDU+. Veillez à fixer les crochets de manière
à ce que les prises électriques soient face à l'arrière de l'armoire.
3. Fixation des crochets sur la PDU+ à l'aide de 5 vis M3 par étrier. Utilisez les vis qui ont été fournies
avec la PDU+.
4. Alignez la PDU+ sur l'ouverture latérale de l'armoire. Puis, en maintenant la PDU+ en place, fixez les
crochets sur les poignées de montage avec quatre écrous à clip et quatre vis M5, fournis avec la
PDU+.
Figure 122. Alignement des étriers de fixation verticaux sur l'avant de la PDU+.
127
Avertissement : Vous devez déconnecter l'alimentation principale avant de brancher ou de
débrancher le cordon d'alimentation de la PDU+.
5. Si la PDU+ a été fournie avec un cordon d'alimentation détaché, branchez maintenant le cordon
d'alimentation. Alignez le connecteur du cordon d'alimentation fourni avec la PDU+ sur le connecteur
situé à l'avant de la PDU+, en tournant si nécessaire, afin que les clés soient alignées. Puis, tournez le
verrou rotatif sur le connecteur dans le sens des aiguilles d'une montre, jusqu'à ce qu'il soit fixé en
place.
Figure 123. Alignement de la PDU+ sur l'ouverture latérale de l'armoire
Figure 124. Alignement du connecteur du cordon d'alimentation sur la PDU+
128 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
6. Guidez le cordon d'alimentation de la PDU+ vers les crochets latéraux de l'armoire. Puis, faites passer
le cordon d'alimentation le long d'un crochet latéral, vers l'arrière de l'armoire et fixez le cordon
d'alimentation avec les brides de câblage fournies avec la PDU+.
7. Connectez le cordon d'alimentation à une source électrique dédiée. Utilisez les brides de câblage
fournies pour fixer le cordon d'alimentation sur toute sa longueur. Utilisez les ouvertures de l'armoire,
si le cordon d'alimentation doit sortir de l'armoire pour se brancher sur une alimentation électrique.
Avertissement : Pour éviter tout dommage à un appareil électrique et autres appareils connectés,
connectez toujours l'appareil électrique à une alimentation électrique autorisée pour cet appareil.
8. Connectez le cordon électrique à une alimentation électrique dédiée correctement câblée et reliée à la
terre. Ensuite, vous pouvez connecter des PDU de serveurs ou d'armoire dans l'armoire aux prises
électriques de la PDU+.
9. Acheminez proprement tous les autres câbles électriques et fixez-les avec des brides de câblage.
Concepts associés
«Configuration du contrôle de l'alimentation via l'unité PDU+», à la page 133
Vous pouvez contrôler le statut de l'alimentation d'une unité connectée à l'unité d'alimentation plus
(PDU+), que ce soit manuellement ou à distance, via l'interface Web PDU+. Vous pouvez utiliser
l'utilitaire de configuration pour configurer initialement l'unité PDU+, ainsi que les paramètres PDU+ tels
que les propriétés du réseau, la table de contrôle d'accès, et la table des récepteurs d'alerte.
Installation horizontale en armoire de la PDU+
Il se peut que vous deviez installer horizontalement la PDU. Suivez alors les instructions de cette section
pour effectuer cette tâche.
Avant de commencer
Conseil : Le retrait des volets et des panneaux latéraux de l'armoire peut faciliter l'installation.
Important : Utilisez des écrous captifs pour les armoires dotées de trous carrés, et utilisez des écrous à
clip pour les armoires dotées de trous ronds. Si votre armoire nécessite des écrous captifs, utilisez un
outil d'installation des écrous captifs ou un tournevis à lame plate pour les installer.
Pourquoi et quand exécuter cette tâche
Pour installer la PDU+ sur l'armoire, procédez comme suit :
Procédure
1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102.
2. Alignez les étriers de fixation verticaux sur l'avant de la PDU+. Veillez à fixer les crochets de
manière à ce que les prises électriques soient face à l'arrière de l'armoire.
Unité d'alimentation plus 129
3. Alignez les supports de fixation sur les trous à l'arrière de la PDU+ et fixez les supports sur la PDU+
à l'aide des deux vis à tête tronconique M3 avec les rondelles de verrou captif par support. Utilisez
les vis fournies avec le kit de montage en armoire.
4. Maintenez-la PDU+ en formant un léger angle, et insérez-la doucement dans l'espace de montage
1-U dans l'armoire. Le fait de pousser légèrement sur les deux étriers de fixation long aidera à
effacer les étriers des brides de soutien de l'armoire.
Figure 125. Alignement des étriers de fixation verticaux sur l'avant de la PDU+.
130 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Avertissement : Vous devez déconnecter l'alimentation principale avant de brancher ou de
débrancher le cordon d'alimentation de la PDU+.
5. Fixez en premier l'extrémité de la PDU+ qui a les étriers de fixation courts à l'armoire. Veillez à ce
que les étriers de fixation courts soient alignés sur l'extérieure des brides de soutien de l'armoire.
Fixez les étriers aux brides de soutien avec deux vis M6 et deux écrous captifs ou écrous à clip par
étrier. Utilisez les écrous captifs ou les écrous à clip ainsi que les vis fournis avec le kit de montage
en armoire.
6. Fixez les étriers de fixation longs et le panneau obturateur blanc 1-U sur l'armoire en procédant
comme suit :
Figure 126. Alignement de la PDU+ sur l'ouverture de l'armoire.
Unité d'alimentation plus 131
a. Ajustez les étriers de fixation longs de manière à ce qu'ils s'ajustent à la profondeur de l'armoire.
b. Veillez à ce que les étriers de fixation longs soient alignés sur l'intérieur des brides de soutien de
l'armoire.
c. Alignez le panneau obturateur blanc sur l'extérieur des brides de soutien.
d. Fixez le panneau obturateur aux brides de soutien de l'armoire puis à l'étrier de fixation long à
l'aide d'une vis M6 par étrier.
e. Serrez les vis à tête tronconique M3 qui fixent les étriers de fixation longs à la PDU.
7. Fixez en premier l'extrémité de la PDU+ qui a les étriers de fixation courts à l'armoire. Veillez à ce
que les étriers de fixation courts soient alignés sur l'extérieure des brides de soutien de l'armoire.
Fixez les étriers aux brides de soutien avec deux vis M6 et deux écrous captifs ou écrous à clip par
étrier. Utilisez les écrous captifs ou les écrous à clip ainsi que les vis fournis avec le kit de montage
en armoire.
8. Si la PDU+ a été fournie avec un cordon d'alimentation détaché, branchez maintenant le cordon
d'alimentation. Alignez le connecteur du cordon d'alimentation fourni avec la PDU+ sur le
connecteur situé à l'avant de la PDU+, en tournant si nécessaire, afin que les clés soient alignées.
Puis, tournez le verrou rotatif sur le connecteur dans le sens des aiguilles d'une montre, jusqu'à ce
qu'il soit fixé en place.
132 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
9. Guidez le cordon d'alimentation de la PDU+ vers les crochets latéraux de l'armoire. Puis, faites
passer le cordon d'alimentation le long d'un crochet latéral, vers l'arrière de l'armoire et fixez le
cordon d'alimentation avec les brides de câblage fournies avec la PDU+.
10. Connectez le cordon d'alimentation à une source électrique dédiée. Utilisez les brides de câblage
fournies pour fixer le cordon d'alimentation sur toute sa longueur. Utilisez les ouvertures de
l'armoire, si le cordon d'alimentation doit sortir de l'armoire pour se brancher sur une alimentation
électrique.
Avertissement : Pour éviter tout dommage à un appareil électrique et autres appareils connectés,
connectez toujours l'appareil électrique à une alimentation électrique autorisée pour cet appareil.
11. Connectez le cordon électrique à une alimentation électrique dédiée correctement câblée et reliée à la
terre. Ensuite, vous pouvez connecter des PDU de serveurs ou d'armoire dans l'armoire aux prises
électriques de la PDU+.
12. Acheminez proprement tous les autres câbles électriques et fixez-les avec des brides de câblage.
Configuration du contrôle de l'alimentation via l'unité PDU+
Vous pouvez contrôler le statut de l'alimentation d'une unité connectée à l'unité d'alimentation plus
(PDU+), que ce soit manuellement ou à distance, via l'interface Web PDU+. Vous pouvez utiliser
l'utilitaire de configuration pour configurer initialement l'unité PDU+, ainsi que les paramètres PDU+ tels
que les propriétés du réseau, la table de contrôle d'accès, et la table des récepteurs d'alerte.
Remarque : Toutes les options de configuration de l'utilitaire de configuration sont disponibles via
l'interface Web une fois l'unité PDU+ configurée sur le réseau local.
Tâches associées
«Installation de l'unité PDU+ sur le côté d'une armoire», à la page 127
Apprenez à installer l'unité d'alimentation plus (PDU+) sur le côté d'une armoire.
Utilisation de l'utilitaire de configuration
Apprenez à utiliser l'utilitaire de configuration pour configurer les paramètres d'unité d'alimentation plus
(PDU+), tels que l'adresse IP, les propriétés du réseau, la table de contrôle d'accès et la table des
récepteurs d'alerte.
Figure 127. Alignement du connecteur du cordon d'alimentation sur la PDU+
Unité d'alimentation plus 133
Pourquoi et quand exécuter cette tâche
Connexion de la console
Pourquoi et quand exécuter cette tâche
Vous pouvez configurer la PDU+ à l'aide d'un poste de travail ou d'un ordinateur bloc-notes connecté à
la PDU+. Connectez le câble DB9-to-RJ-45 fourni avec la PDU+ au connecteur de la console RJ-45 sur la
PDU+, et à un connecteur de série RS-232 (COM) sur un poste de travail ou ordinateur bloc-notes.
Utilisation d'HyperTerminal
Pourquoi et quand exécuter cette tâche
HyperTerminal est un programme de terminal dans un système d'exploitation Microsoft® Windows® qui
vous permet de configurer ou de commander un appareil à l'aide de paramètres de ligne de commande.
Vous pouvez configurer les paramètres PDU+ et ses prises à l'aide de commandes numériques à partir
d'un clavier. Vous pouvez également utiliser Telnet ou tout autre programme de terminal pour configurer
la PDU+ une fois l'adresse IP paramétrée.
Pour démarrer l'HyperTerminal et communiquer avec la PDU+, procédez comme suit :
Procédure
1. Cliquez sur Démarrer ? Programmes ? Accessoires ? Communications ? HyperTerminal. La fenêtre
Description de la connexion s'affiche.
2. Saisissez le nom de la connexion dans la zone Nom et sélectionnez une icône pour la connexion.
3. Cliquez sur OK. La fenêtre Connecter à s'affiche.
4. Dans la zone Connecter à l'aide de, sélectionnez le port COM connecté à la PDU+.
5. Cliquez sur OK. La fenêtre Propriétés s'affiche.
6. Cliquez sur Restaurer les paramètres par défaut pour utiliser les paramètres par défaut. Veillez à ce
que la zone Bits par seconde soit réglée sur 9600 et à ce que la zone Contrôle de flux soit définie
sur Aucun.
7. Cliquez sur OK.
8. Appuyez sur n'importe quelle touche. Le menu principal de l'utilitaire de configuration s'affiche.
9. Saisissez le mot de passe par défaut, passw0rd (tout en lettres minuscules avec un zéro et non un O),
et appuyez sur Entrée.
10. Saisissez l'option de menu que vous souhaitez. Pour obtenir la description des options, voir «Options
du menu de l'utilitaire de configuration».
Options du menu de l'utilitaire de configuration
Pourquoi et quand exécuter cette tâche
Les options suivantes se trouvent dans le menu principal de l'utilitaire de configuration :
Paramètres de
Lorsque vous sélectionnez Paramètres , le fenêtre de l'utilitaire de configuration s'affiche avec les
options suivantes :
Configurez l'adresse IP, l'adresse de la passerelle et le Groupe de systèmes MIB
Affichez et modifiez l'adresse IP, la date, l'heure et les informations du système MIB.
Définissez le Groupe de commandes
Configurez le nom d'utilisateur de l'administrateur, le mot de passe, et les protocoles
d'accès.
Configurez les gestionnaires d'accès en écriture
Configurez une liste des utilisateurs pouvant accéder à et contrôler la PDU+.
134 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Configurez les récepteurs d'alarme
Configurez les serveurs de gestion du réseau à distance (NMS) pour recevoir des alarmes.
Définissez la date et l'heure
Réglez les informations de la date et de l'heure pour la PDU+.
Configurez le nom de superutilisateur et mot de passe
Configurez le nom d'utilisateur et le mot de passe de l'administrateur qui utilisera un
navigateur Web pour configurer la PDU+.
Notification par e-mail
Configurez une liste des utilisateurs qui seront alertés par des messages d'événements si
un événement inhabituel est déclenché sur le système PDU+.
Configurez les multi-utilisateurs
Configurez les autres connexions utilisateurs et mots de passe ainsi que les niveaux
d'accès en lecture et écriture.
Configurez les informations d'
Configurez les zones intervalle de connexion, fréquence de régénération, et nom
personnalisé des groupes de charge PDU+.
Récapitulatif des paramètres et du journal des événements
Affichez tous les paramètres de configuration de la PDU+.
Réinitialisez la configuration aux paramètres par défaut
Réinitialisez tous les paramètres systèmes aux valeurs par défaut en sortie d'usine.
Redémarrez le HD-PDU
Redémarrez la PDU+.
Configuration des adresses IP
Pourquoi et quand exécuter cette tâche
Important : Vous devez configurer l'adresse IP avant de pouvoir utiliser l'interface Web ou d'accéder à la
PDU+ dans un réseau IP (LAN/WAN). Si vous ne connaissez pas l'adresse IP, contactez votre
administrateur système.
Pour configurer l'adresse IP, procédez comme suit :
Procédure
1. Dans le menu principal de l'utilitaire de configuration, saisissez l'option de menu pour les Paramètres.
2. Saisissez l'option de menu pour Configurer l'adresse IP, l'adresse de la passerelle et le groupe
système MIB.
Utilisation de l'interface Web
Apprenez à utiliser l'interface Web pour configurer et surveiller à distance l'unité d'alimentation plus
(PDU+). La PDU+ fournit une interface graphique que vous pouvez afficher à partir d'un navigateur
Web. A l'aide d'un navigateur Web, vous pouvez accéder à et surveiller les prises électriques
d'alimentation des PDU+ à distance depuis un poste de travail ou un ordinateur bloc-notes.
Pourquoi et quand exécuter cette tâche
Démarrage de l'interface Web
Pourquoi et quand exécuter cette tâche
Pour démarrer l'interface Web, procédez comme suit :
Unité d'alimentation plus 135
Procédure
1. Démarrez un navigateur Web à partir d'un poste de travail ou d'un ordinateur bloc-notes, et entrez
l'adresse IP de la PDU+q dans la zone d'adresse. La fenêtre de connexion s'affiche.
Remarque : Pour plus d'informations sur le paramétrage de l'adresse IP du système voir
«Configuration des adresses IP», à la page 135.
2. Dans la zone du nom d'utilisateur, entrez USERID (tout en majuscules).
3. Dans la zone du mot de passe, entrez passw0rd (tout en minuscules avec un zéro à la place du O).
4. Cliquez sur OK. La page de statut principale s'affiche.
Résultats
La page de statut principale affiche une représentation graphique des prises électriques de la PDU+, ainsi
que le statut d'entrée :
v Le panneau à gauche affiche les menus et les sous-menus de la PDU+. Cliquez sur un menu pour en
afficher les options, développer les éléments de menu, et modifier les options de menu selon vos
besoins.
v Le graphique affiché dans le panneau de droite montre le statut des prises électriques, le voltage
d'entrée, le voltage de sortie, la fréquence, la consommation de courant, électricité, en watt/heure, et la
consommation électrique cumulée en watt/heure. Si vous connectez une sonde environnementale
surveillée facultative, les conditions de température et d'humidité de l'environnement s'affichent.
Chaque page de menu fournit une aide en ligne pour vous aider dans la configuration de la PDU+.
Cliquez sur l'icône d'aide en haut de chaque page pour afficher l'aide.
Modification des paramètres de base
Utilisez le menu Système pour configurer les paramètres système de la PDU+ tels que le nom de
superutilisateur, le mot de passe, l'adresse IP, la date et l'heure.
Pourquoi et quand exécuter cette tâche
Modification du nom de superutilisateur et du mot de passe :
Vous pouvez configurer le nom d'utilisateur et le mot de passe de l'administrateur qui utilisera un
navigateur Web pour configurer la PDU+ sur la page Utilitaire de configuration.
Pourquoi et quand exécuter cette tâche
Pour modifier le nom de superutilisateur et le mot de passe, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Système.
2. Cliquez sur Configuration pour afficher et modifier la configuration système et le nom de
superutilisateur et le mot de passe.
Identification de la PDU+ et de la carte Web/SNMP :
Vous pouvez afficher les informations concernant la PDU+ et la carte Web/SNMP sur la page
Identification de la gestion de l'alimentation.
136 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Pourquoi et quand exécuter cette tâche
Pour afficher les informations de gestion de l'alimentation de la PDU+ et de la carte Web/SNMP,
procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Système.
2. Cliquez sur Identification pour afficher les informations de la PDU+ et de la carte Web/SNMP.
Ajout d'utilisateurs :
Sur la page Configuration Multi-utilisateurs, vous pouvez ajouter des utilisateurs qui peuvent accéder à
et surveiller la PDU+.
Pourquoi et quand exécuter cette tâche
Pour créer une liste des utilisateurs qui peuvent accéder à et surveiller la PDU+, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Système.
2. Cliquez sur Multi-Utilisateurs pour ajouter des utilisateurs qui peuvent uniquement afficher le statut
de la PDU+ ou des utilisateurs qui peuvent modifier les paramètres de la PDU+.
Modification de la date et de l'heure :
Vous pouvez modifier la date et l'heure de la PDU+ sur la page Date et Heure.
Pourquoi et quand exécuter cette tâche
Important : La modification de la date et heure de la PDU+ affecte les autres paramètres système, tels
que les e-mails, les interruptions et journaux.
Pour modifier la date et l'heure, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Système.
2. Cliquez sur Date et Heure pour afficher et modifier la date et l'heure du système. Vous pouvez
paramétrer manuellement la date et l'heure, les synchroniser avec l'heure de l'ordinateur, ou les
synchroniser avec un serveur NTP.
Modification des alertes événement :
Vous pouvez modifier les alertes événement sur la page des récepteurs d'alerte SNMP.
Pourquoi et quand exécuter cette tâche
Pour configurer la PDU+ afin d'envoyer des alertes e-mail ou alarmes SNMP à des utilisateurs spécifiés
lors de la survenance d'événements spécifiques, procédez comme suit :
Unité d'alimentation plus 137
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Système.
2. Cliquez sur Récepteurs d'alerte pour créer une liste des utilisateurs ou des postes de travail qui
seront alertés par un message d'alerte SNMP. Vous pouvez spécifier les adresses IP pour un maximum
de huit récepteurs d'alarmes, ainsi que les informations de la communauté, le type d'alarme, la gravité
de l'alarme, et la description des événements qui déclenchent les alarmes.
3. Cliquez sur Notification par e-mail sous Système pour créer une liste d'un maximum de quatre
utilisateurs qui seront alertés par un e-mail. Utilisez ce menu pour spécifier le serveur de mail, le
compte utilisateur, le système de nom de domaine (DNS) et les autres informations nécessaires pour
configurer un serveur de mails pour l'envoi d'alertes mail. Utilisez la table Récepteurs d'e-mail pour
ajouter les adresses e-mail.
Modification des informations réseau
Utilisez le menu Réseau pour modifier les informations réseau pour la PDU+, par exemple, l'adresse IP.
Pourquoi et quand exécuter cette tâche
Modification de la configuration de réseau :
Vous pouvez afficher ou modifier la configuration réseau sur la page Configuration réseau.
Pourquoi et quand exécuter cette tâche
Pour afficher ou modifier la configuration réseau de la PDU+, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Réseau.
2. Cliquez sur Configuration pour paramétrer l'adresse IP, l'adresse de la passerelle, le masque de
sous-réseau, et l'adresse de système de nom de domaine de PDU+.
3. Cliquez sur Commande pour configurer les paramètres TCP/IP.
4. Cliquez sur Commande d'accès pour paramétrer la commande d'accès afin d'empêcher les utilisateurs
non autorisés d'accéder à la PDU+.
Récapitulatifs du journal de l'historique et des événements
Le menu Journaux fournit une description détaillée de tous les événements ainsi qu'un enregistrement du
statut de la PDU+. Les administrateurs système peuvent utiliser cette page pour analyser les incidents
avec l'équipement réseau.
Pourquoi et quand exécuter cette tâche
Affichage du journal de l'historique :
Vous pouvez afficher l'historique complet des entrées et sorties de PDU+ sur la page du Journal de
l'historique.
Pourquoi et quand exécuter cette tâche
Pour afficher l'historique de la PDU+, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Journaux.
138 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2. Cliquez sur Historique. Chaque fichier du journal des événements affiche l'heure, la date et la
description de tous les événements survenant sur la PDU+.
Affichage du journal des événements :
Vous pouvez afficher l'enregistrement complet des événements de la PDU+ sur la page Journal des
événements.
Pourquoi et quand exécuter cette tâche
Pour afficher l'enregistrement complet des événements PDU+, procédez comme suit :
Procédure
1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur
Journaux.
2. Cliquez sur Evénements. Chaque fichier journal affiche un enregistrement de l'alimentation d'entrée et
de sortie de chaque prise.
Unité d'alimentation plus 139
140 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et remplacement du capot ou du volet de l'unité
d'extension
Il se peut que vous deviez retirer, remplacer ou installer des capots ou des portes sur une unité
d'extension pour accéder à des composants ou effectuer une opération de maintenance.
Retrait du capot avant des modèles 14G/30
Il se peut que vous deviez retirer le capot pour effectuer des opérations de maintenance du système.
Pourquoi et quand exécuter cette tâche
Pour retirer le capot avant, procédez comme suit :
Procédure
1. Retirez les deux vis moletées (B) figurant sur la partie gauche et à l'arrière du panneau.
2. Appuyez sur les loquets (C) situés de part et d'autre du panneau pour libérer celui-ci.
3. Soulevez le panneau (A) et retirez-le du châssis.
Conseil : Si un bloc de ventilation est présent dans une partie vide du châssis, ne le retirez que si
vous installez des composants dans cette partie du châssis.
Figure 128. Retrait du capot avant des modèles 14G/30
141
Installation du capot avant des modèles 14G/30
Cette procédure permet d'installer le capot avant après avoir installé le système ou effectué une opération
de maintenance.
Avant de commencer
Si des composants ne sont installés que d'un seul côté du châssis, assurez-vous que le bloc de ventilation
est présent du côté vide.
Procédure
1. Insérez les colonnes situées à l'arrière du panneau (A) dans les crochets du châssis.
2. Appuyez sur les loquets (C) situés de part et d'autre du panneau avant.
3. Insérez le panneau dans le châssis et libérez les loquets. Le panneau s'enclenche dans son logement.
4. Remettez en place les deux vis moletées (B) dans les encoches sur la gauche et la droite du capot
avant.
Figure 129. Capot avant d'un modèle 14G/30
142 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et installation des carters sur le système 33E/8B ou
36E/8C
Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des
composants ou effectuer une opération de maintenance.
Retrait du capot d'accès des modèles 33E/8B ou 36E/8C
Cette procédure permet de retirer le capot d'accès pour effectuer des opérations de maintenance ou
accéder à des composants internes.
Pourquoi et quand exécuter cette tâche
Pour retirer le capot d'accès, procédez comme suit :
Procédure
1. Placez le système en position de maintenance. Pour plus d'informations, voir «Mise en position de
maintenance de l'unité 33E/8B ou 36E/8C», à la page 157.
2. Desserrez les deux vis de serrage à main (A) situées à l'arrière du capot.
3. Placez le capot (B) à l'arrière de l'unité centrale. Une fois l'avant du capot d'accès dégagé du bord
supérieur, soulevez et retirez le capot.
Résultats
Avertissement : Pour garantir un niveau de refroidissement et de ventilation approprié, installez le
capot avant de démarrer le système. Vous risquez d'endommager les composants système si vous utilisez
le système sans son capot pendant plus de 30 minutes.
143
Installation du capot d'accès des modèles 33E/8B ou 36E/8C.
Cette procédure permet d'installer le capot d'accès après avoir effectué une opération de maintenance ou
accédé aux composants internes.
Pourquoi et quand exécuter cette tâche
Pour installer le capot d'accès, procédez comme suit :
Procédure
1. Placez le capot d'accès (A) sur le haut de l'unité centrale, à environ 25 mm du carter avant.
2. Tout en maintenant le capot d'accès contre l'unité centrale, faites glisser le capot vers l'avant de l'unité
centrale.
Les taquets du capot d'accès doivent se trouver en-dessous du bord supérieur et les deux vis doivent
être alignées avec les trous situés à l'arrière de l'unité centrale.
Important : Assurez-vous que les câble LED du ventilateur ne sont pas coincés par l'arête avant du
capot d'accès lorsque vous le déplacez vers l'avant.
3. Serrez les deux vis (B) situées à l'arrière du capot.
Figure 130. Retrait du capot d'accès
144 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait du carter avant des modèles 33E/8B ou 36E/8C.
Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le carter avant, procédez comme suit :
Procédure
1. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (B) (voir figure suivante).
2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus
facilement.
Figure 131. Installation du capot d'accès
Retrait et installation des carters sur le système 33E/8B ou 36E/8C 145
Installation du carter avant des modèles 33E/8B ou 36E/8C
Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué
une opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le carter avant, procédez comme suit :
Procédure
1. Alignez le carter face au système.
2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés
dans leurs montants de fixation respectifs (B) sur le système.
3. Remettez en place les deux vis (C) qui maintiennent le système fixé à l'armoire (A).
Figure 132. Retrait du carter avant
146 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Retrait et remise en place du capot avant sur les modèles 17M/MB et
79M/HB
Ces procédures permettent de retirer et de remettre en place le capot pour accéder à des composants ou
effectuer une opération de maintenance.
Retrait du carter avant d'un modèle 17M/MB ou 79M/HB
Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de
maintenance.
Pourquoi et quand exécuter cette tâche
Pour retirer le carter avant, procédez comme suit :
Procédure
1. Si nécessaire, ouvrez le volet avant de l'armoire.
2. Tirez les taquets (A) situés de chaque côté du capot (voir figure suivante).
Figure 133. Installation du carter avant
Retrait et installation des carters sur le système 33E/8B ou 36E/8C 147
3. Tirez le panneau (B) pour l'ôter de l'unité centrale.
Installation du carter avant sur un modèle 17M/MB ou 79M/HB
Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué
une opération de maintenance.
Pourquoi et quand exécuter cette tâche
Pour installer le carter avant, procédez comme suit :
Procédure
1. Placez le carter (A) en face du système en alignant les quatre broches avec les quatre trous à l'arrière
du carter.
Figure 134. Retrait du capot avant
148 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2. Enfoncez les ergots (B) pour emboîter le carter dans son emplacement.
3. Fermez le volet avant de l'armoire.
Figure 135. Installation du carter avant
Retrait et installation des carters sur le système 33E/8B ou 36E/8C 149
150 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Mise en position de maintenance ou de fonctionnement du
système 33E/8B ou 36E/8C
Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le
cadre d'une opération de maintenance ou pour accéder à des composants internes.
Mise en position de maintenance de l'unité centrale ou de l'unité
d'extension
Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants
internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance.
Pourquoi et quand exécuter cette tâche
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
151
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
152 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Pour mettre une unité centrale ou une unité d'extension montée en armoire en position de maintenance,
procédez comme suit :
Procédure
1. Si nécessaire, ouvrez le volet avant de l'armoire.
2. Retirez les deux vis (A) qui maintiennent l'unité centrale ou l'unité d'extension (B) fixée à l'armoire
(voir figure suivante).
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 153
3. Débloquez les taquets de déverrouillage (A) situés sur les deux côtés de l'unité (voir la figure
suivante).
Figure 136. Retrait des vis du système et de l'armoire
154 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
4. Lisez la remarque suivante, puis tirez doucement l'unité centrale ou l'unité d'extension hors de
l'armoire jusqu'à ce que les glissières soient complètement étendues et bloquées.
A faire :
v Si vous devez débrancher des câbles figurant à l'arrière de l'unité centrale ou de l'unité d'extension
pour exécuter la procédure, faites-le avant de sortir l'unité de l'armoire.
v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité
d'extension en tirant l'unité hors de l'armoire.
v Vérifiez que les glissières sont complètement étendues. Une fois les glissières complètement
étendues, les taquets de sécurité s'enclenchent. Ils évitent de sortir l'unité centrale ou l'unité
d'extension de manière excessive.
Mise en position fonctionnement de l'unité centrale ou de l'unité
d'extension
Cette procédure permet de mettre l'unité centrale ou l'unité d'extension montée en armoire en position de
fonctionnement pour pouvoir l'utiliser.
Pourquoi et quand exécuter cette tâche
Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de fonctionnement,
procédez comme suit :
Figure 137. Déblocage des taquets de sécurité
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 155
Procédure
1. Relâchez simultanément les taquets de sécurité bleus (A) situés à proximité de l'avant de chaque
glissière, puis poussez l'unité centrale ou l'unité d'extension dans l'armoire, comme indiqué dans la
figure suivante.
Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de
l'unité d'extension en poussant l'unité dans l'armoire.
2. Remettez en place et serrez les deux vis (C) qui maintiennent l'unité centrale ou l'unité d'extension (A)
fixée à l'armoire (voir la figure suivante).
Figure 138. Déblocage des taquets de sécurité de la glissière
156 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
3. Fermez le volet avant de l'armoire.
Mise en position de maintenance de l'unité 33E/8B ou 36E/8C
Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants
internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance.
Pourquoi et quand exécuter cette tâche
Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Figure 139. Mise en place de l'unité centrale dans l'armoire et fixation des vis
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 157
DANGER
Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes
suivantes :
Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut
présenter un danger. Pour éviter tout risque de choc électrique :
v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce
dernier avec un autre produit.
v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique.
v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de
reconfiguration de ce produit au cours d'un orage.
v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de
danger électrique, débranchez tous les cordons d'alimentation.
v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et
mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent
aux informations de la plaque d'alimentation électrique du système.
v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce
produit.
v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles
d'interface.
v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence
de dommages matériels.
v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons
d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication
et aux modems (sauf mention contraire dans les procédures d'installation et de configuration).
v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des
périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et
déconnecter les différents cordons.
Pour déconnecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Débranchez les cordons d'alimentation des prises.
3. Débranchez les câbles d'interface des connecteurs.
4. Débranchez tous les câbles des unités.
Pour connecter les cordons :
1. Mettez toutes les unités hors tension (sauf mention contraire).
2. Branchez tous les cordons sur les unités.
3. Branchez les câbles d'interface sur des connecteurs.
4. Branchez les cordons d'alimentation aux prises.
5. Mettez l'unité sous tension.
(D005a)
DANGER
158 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous
travaillez à proximité de ce dernier :
v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels.
v Abaissez toujours les vérins de mise à niveau de l'armoire.
v Installez toujours des équerres de stabilisation sur l'armoire.
v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les
unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les
unités en option en commençant par le bas de l'armoire.
v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet
sur un serveur monté en armoire.
v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler
l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation.
v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la
même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans
une armoire au dispositif d'alimentation installé dans une autre armoire.
v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension
dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il
appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la
terre afin d'éviter tout risque de choc électrique.
ATTENTION
v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la
température ambiante que le fabricant recommande pour toutes les unités montées en armoire.
v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les
côtés, l'avant et l'arrière de l'unité sont correctement ventilés.
v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge
des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les
surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les
étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer
l'alimentation totale requise par le circuit d'alimentation.
v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne
sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous
retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable.
v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être
retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir,
l'armoire risque de devenir instable et le tiroir risque de tomber.
(R001)
Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance,
procédez comme suit :
Procédure
1. Si nécessaire, ouvrez le volet avant de l'armoire.
2. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (voir figure suivante).
3. Débloquez les taquets de sécurité (B) situés sur les deux côtés de l'unité (voir figure suivante).
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 159
4. Si vous devez sortir l'unité centrale ou l'unité d'extension de l'armoire, veillez à débrancher
auparavant les câbles situés à l'arrière de l'unité. Veillez à ne pas bloquer ou plier les câbles situés à
l'arrière de l'unité centrale ou de l'unité d'extension en tirant l'unité hors de l'armoire.
5. Tirez doucement l'unité centrale ou l'unité d'extension hors de l'armoire jusqu'à ce que les glissières
soient complètement étendues et bloquées. Vérifiez que les glissières sont complètement étendues.
Une fois les glissières complètement étendues, les taquets de sécurité s'enclenchent. Ils évitent de
sortir l'unité centrale ou l'unité d'extension de manière excessive.
Mise en position de fonctionnement de l'unité 33E/8B ou 36E/8C
Cette procédure permet de mettre l'unité centrale ou l'unité d'extension en position de fonctionnement
pour pouvoir l'utiliser.
Pourquoi et quand exécuter cette tâche
Pour mettre l'unité centrale en position de fonctionnement, procédez comme suit :
Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à
votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour
l'exécution de chaque tâche.
Procédure
1. Libérez simultanément les taquets de sécurité bleus (B) situés vers l'avant de chaque glissière, puis
poussez l'unité centrale ou l'unité d'extension dans l'armoire (voir la figure suivante).
Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de
l'unité d'extension en poussant l'unité dans l'armoire.
Figure 140. Déblocage des taquets de sécurité
160 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
2. Remettez en place et vissez les deux vis de serrage à main (C) qui maintiennent l'unité centrale ou
l'unité d'extension (A) fixé(e) à l'armoire (voir la figure suivante).
3. Fermez le volet avant de l'armoire.
Figure 141. Déblocage des taquets de sécurité de la glissière
Figure 142. Remise en place des vis de serrage à main
Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 161
162 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Remarques
Le présent document peut contenir des informations ou des références concernant certains produits,
logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents
d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute
référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou
service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint
aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier
lui-même le fonctionnement de tout produit, programme ou service.
Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans
le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou
demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences,
veuillez en faire la demande par écrit au fabricant.
Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait
contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE
GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE
RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN
CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions
n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas
applicable.
Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement.
Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels
décrits dans ce document.
Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information
uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments
figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites
relève de votre seule responsabilité.
Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation
de sa part, tout ou partie des informations qui lui seront fournies.
Les données de performance indiquées dans ce document ont été déterminées dans un environnement
contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement
d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont
pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les
résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont
applicables à leur environnement d'exploitation.
Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de
ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant
n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il
ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question
concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces
produits.
Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être
modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif.
163
Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans
préavis. Les tarifs appliqués peuvent varier selon les revendeurs.
Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées
avant la mise à disposition des produits décrits.
Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans
l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de
marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et
toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite.
Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur
n'apparaissent pas à l'écran.
Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites,
même partiellement, sans l'autorisation écrite d'IBM.
Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est
exploitable dans aucun autre but.
Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou
de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension,
de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des
composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été
sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu
de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la
vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le
cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de
support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés.
Marques
IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou
dans certains autres pays. D'autres sociétés sont propriétaires d'autres marques, noms de produits ou
logos. Une liste à jour des marques IBM est disponible sur Internet : Copyright and trademark
information, à l'adresse www.ibm.com/legal/copytrade.shtml.
Microsoft et Windows sont des marques de Microsoft Corporation aux Etats-Unis et/ou dans certains
autres pays.
Les autres noms de sociétés, de produits et de services peuvent appartenir à des tiers.
Bruits radioélectriques
Remarques sur la classe A
Les avis de conformité de classe A suivants s'appliquent aux serveurs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques
définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité
aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques
dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il
risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du
164 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et
télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. IBM® ne peut pas être tenue pour responsable du
brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés
ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra
annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une
modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones commerciales et industrielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour
en éliminer les causes.
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus :
Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI
(Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle,
Remarques 165
il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures
nécessaires pour en éliminer les causes.
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - République populaire de Chine
Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de
perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer
les causes.
Avis d'interférences électromagnétiques (EMI) - Taïwan
Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus.
Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes
radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut
créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer.
166 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Avis d'interférences électromagnétiques (EMI) - Corée
Cet équipement a obtenu l'agrément EMC en vue de son utilisation commerciale. S'il a été vendu ou
acheté par erreur, veuillez l'échanger avec du matériel conforme à un usage domestique.
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Remarques 167
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Avis d'interférences électromagnétiques (EMI) - Russie
Remarques sur la classe B
Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la
compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs.
Recommandation de la Federal Communications Commission (FCC) [Etats-Unis]
Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la
classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications
de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones
résidentielles.
Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les
communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il
n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière.
Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision,
mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet
incident en appliquant une ou plusieurs des mesures suivantes :
v Réorienter ou repositionner l'antenne de réception.
v Eloigner l'appareil du récepteur.
v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct.
v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de
l'aide.
Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de
rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez
votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions
radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification
non autorisée pourra annuler le droit d'utilisation de cet appareil.
Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC.
Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations
électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles
susceptibles d'occasionner un fonctionnement indésirable.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
168 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Avis de conformité aux exigences de l'Union européenne
Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du
Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité
électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant
d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM.
Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de
l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux
spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de
communication agréés, dans les zones résidentielles.
Dans l'Union européenne, contactez :
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Avis de conformité aux exigences du Voluntary Control Council for Interference
(VCCI) - Japon
Directive relative aux harmoniques confirmée par l'association japonaise JEITA
(Japanese Electronics and Information Technology Industries Association)
(produits inférieurs ou égaux à 20 A par phase)
Directive relative aux harmoniques confirmée avec modifications par l'association
japonaise JEITA (Japanese Electronics and Information Technology Industries
Association) (produits supérieurs 20 A par phase)
Avis d'interférences électromagnétiques (EMI) - Corée
Remarques 169
Avis de conformité pour l'Allemagne
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Dispositions
Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes.
Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial,
sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou
publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès
du fabricant.
Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein
de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez
reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise,
ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant.
170 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension
Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou
autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données,
logiciels ou droits de propriété intellectuelle contenus dans ces publications.
Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si
l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les
instructions susmentionnées n'ont pas été respectées.
Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois
et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à
l'exportation.
LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES
PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE.
LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES
INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A
L'EXECUTION D'UN TRAVAIL DONNE.
E1-700, E3-700, E2-700,
E2-700T, E4-700 or E4-700T
Rack features, and installing
systems or expansion units
into a rack
REFERENCE
86 A1 29FG 01
ESCALA Power7
E1-700, E3-700, E2-700, E2-700T,
E4-700 or E4-700T Rack features,
and installing systems or expansion
units into a rack
This publication concerns the following models:
- Bull Escala E5-700 (Power 750 / 8233-E8B)
- Bull Escala M6-700 (Power 770 / 9117-MMB)
- Bull Escala M7-700 (Power 780 / 9179-MHB)
- Bull Escala E1-700 (Power 710 / 8231-E2B)
- Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B)
- Bull Escala E3-700 (Power 730 / 8231-E2B)
- Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B)
References to Power 755 / 8236-E8C models are irrelevant.
Hardware
September 2010
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 29FG 01
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited
to, copying, distributing, modifying, and making derivative works.
Copyright Bull SAS 2007-2010
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this
book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
The information in this document is subject to change without notice. Bull will not be liable for errors
ontained herein, or for incidental or consequential damages in connection with the use of this material.
c
Contents
Safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Racks, rack features, and installing systems or expansion units into a rack . . . . . . 1
What's new in Racks, rack features, and installing systems or expansion units into a
rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Installing the rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Installing the 7014-T00 or 7014-T42 racks . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Completing a parts inventory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Positioning the rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Leveling the rack. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Attaching the stabilizer brackets. . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Attaching the rack to a concrete floor . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Attaching the rack to the concrete floor beneath a raised floor . . . . . . . . . . . . . . . . . 13
Connecting the power distribution system . . . . . . . . . . . . . . . . . . . . . . . . 17
Checking the ac outlets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Attaching the front or back ac electrical outlet. . . . . . . . . . . . . . . . . . . . . . . 18
Installing the ac outlet-mounting plates with ac outlets. . . . . . . . . . . . . . . . . . . 18
Installing the ac outlet-mounting plate without ac outlets . . . . . . . . . . . . . . . . . . 20
Connecting a dc power source . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Installing a system or expansion unit into a rack . . . . . . . . . . . . . . . . . 27
Installing the model 8202-E4B or 8205-E6B into a rack . . . . . . . . . . . . . . . . . . . . . 27
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Attaching the 8202-E4B or 8205-E6B mounting hardware to the rack . . . . . . . . . . . . . . . 30
Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 32
Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 35
Installing the model 8231-E2B into a rack . . . . . . . . . . . . . . . . . . . . . . . . . 35
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Attaching the 8231-E2B mounting hardware to the rack . . . . . . . . . . . . . . . . . . . 38
Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 40
Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 43
Installing the model 8233-E8B or 8236-E8C into a rack . . . . . . . . . . . . . . . . . . . . . 43
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
Attaching the 8233-E8B or 8236-E8C mounting hardware to the rack . . . . . . . . . . . . . . . 47
Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 54
Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 55
Installing the model 5802 or 5877 into a rack . . . . . . . . . . . . . . . . . . . . . . . . 56
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . . 59
Installing the 5802 or 5877 on the rail assembly . . . . . . . . . . . . . . . . . . . . . . 62
Installing the model 5886 into a rack . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . 67
Installing the 5886 on the rail assembly . . . . . . . . . . . . . . . . . . . . . . . . 69
Installing the 5796 or 7314-G30 expansion unit in a rack . . . . . . . . . . . . . . . . . . . . 71
Powering off the system and removing power. . . . . . . . . . . . . . . . . . . . . . . 72
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
© Copyright IBM Corp. 2010 iii
Installing the expandable rails and extension bracket into the rack . . . . . . . . . . . . . . . . 73
Installing the shell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
Installing the expansion unit. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Attaching the power controller (SPCN), and power cables. . . . . . . . . . . . . . . . . . . 79
Powering on the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Installing the model 5786 or 7031-D24 into a rack. . . . . . . . . . . . . . . . . . . . . . . 79
Completing a parts inventory . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . . 82
Installing the 5786 or 7031-D24 on the rail assembly. . . . . . . . . . . . . . . . . . . . . 84
Attaching the cable-management arm to the standard rails . . . . . . . . . . . . . . . . . . 87
Removing and replacing 7014-T00 or 7014-T42 side panels . . . . . . . . . . . . . 89
Replacing a 7014-T00 or 7014-T42 side panel . . . . . . . . . . . . . . . . . . . . . . . . 89
Removing and replacing 7014-T00 or 7014-T42 trim panels . . . . . . . . . . . . . 91
Replacing the 7014-T00 or 7014-T42 trim panels . . . . . . . . . . . . . . . . . . . . . . . 91
Attaching the rack doors . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
Attaching a high-perforation front door . . . . . . . . . . . . . . . . . . . . . . . . . . 93
Rack safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
Installing the rack security kit. . . . . . . . . . . . . . . . . . . . . . . . . . 97
Ruggedized kit. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Releasing the ruggedized brace . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Releasing the side panel with a ruggedized kit . . . . . . . . . . . . . . . . . . . . . . . 102
Connecting multiple racks with rack-to-rack attachment kit . . . . . . . . . . . . 105
Rack status beacon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Installing the rack status beacon . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Connecting multiple junction boxes . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Installing or removing a rack-mounted system-unit latch bracket . . . . . . . . . . 113
Removing and replacing the rack top cover . . . . . . . . . . . . . . . . . . . 115
Removing the rack top cover . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Replacing the rack top cover . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
Power distribution unit plus . . . . . . . . . . . . . . . . . . . . . . . . . . 121
Installing the PDU+ in the side of a rack . . . . . . . . . . . . . . . . . . . . . . . . . 121
Installing the PDU+ horizontally in a rack. . . . . . . . . . . . . . . . . . . . . . . . 123
Setting up power monitoring using the PDU+ . . . . . . . . . . . . . . . . . . . . . . . 127
Using the IBM DPI Configuration Utility . . . . . . . . . . . . . . . . . . . . . . . . 127
Connecting the console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Using HyperTerminal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Configuration Utility menu options . . . . . . . . . . . . . . . . . . . . . . . . . 128
Setting the IP address . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Using the Web interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Starting the Web interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Modifying the basic settings . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Changing the superuser name and password. . . . . . . . . . . . . . . . . . . . . 130
Identifying the PDU+ and Web/SNMP card . . . . . . . . . . . . . . . . . . . . . 130
Adding users . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
Changing the date and time . . . . . . . . . . . . . . . . . . . . . . . . . . 130
Changing event alerts . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
iv Installing into a rack
Changing the network information . . . . . . . . . . . . . . . . . . . . . . . . . 131
Changing the network configuration . . . . . . . . . . . . . . . . . . . . . . . 131
History and event log summaries . . . . . . . . . . . . . . . . . . . . . . . . . 131
Viewing the history log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
Viewing the event log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 133
Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 133
Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 133
Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . 135
Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 135
Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . 136
Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 136
Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . . 137
Removing and replacing the front cover on a 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . 138
Removing the front cover from the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . 138
Installing the front cover on the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . . 139
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or
operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
Placing the rack-mounted system or expansion unit in the service position. . . . . . . . . . . . . . 141
Placing the rack-mounted system or expansion unit in the operating position . . . . . . . . . . . . . 144
Placing a 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . . 145
Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . . 148
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156
Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159
Contents v
vi Installing into a rack
Safety notices
Safety notices may be printed throughout this guide:
v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to
people.
v CAUTION notices call attention to a situation that is potentially hazardous to people because of some
existing condition.
v Attention notices call attention to the possibility of damage to a program, device, system, or data.
World Trade safety information
Several countries require the safety information contained in product publications to be presented in their
national languages. If this requirement applies to your country, a safety information booklet is included
in the publications package shipped with the product. The booklet contains the safety information in
your national language with references to the U.S. English source. Before using a U.S. English publication
to install, operate, or service this product, you must first become familiar with the related safety
information in the booklet. You should also refer to the booklet any time you do not clearly understand
any safety information in the U.S. English publications.
German safety information
Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der
Bildschirmarbeitsverordnung geeignet.
Laser safety information
IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs.
Laser compliance
IBM servers may be installed inside or outside of an IT equipment rack.
© Copyright IBM Corp. 2010 vii
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
viii Installing into a rack
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Safety notices ix
CAUTION:
Removing components from the upper positions in the rack cabinet improves rack stability during
relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a
room or building:
v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack
cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you
received it. If this configuration is not known, you must observe the following precautions:
– Remove all devices in the 32U position and above.
– Ensure that the heaviest devices are installed in the bottom of the rack cabinet.
– Ensure that there are no empty U-levels between devices installed in the rack cabinet below the
32U level.
v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from
the suite.
v Inspect the route that you plan to take to eliminate potential hazards.
v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the
documentation that comes with your rack cabinet for the weight of a loaded rack cabinet.
v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.).
v Ensure that all devices, shelves, drawers, doors, and cables are secure.
v Ensure that the four leveling pads are raised to their highest position.
v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement.
v Do not use a ramp inclined at more than 10 degrees.
v When the rack cabinet is in the new location, complete the following steps:
– Lower the four leveling pads.
– Install stabilizer brackets on the rack cabinet.
– If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest
position to the highest position.
v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack
cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent.
Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the
pallet.
(R002)
(L001)
(L002)
x Installing into a rack
(L003)
or
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser
product. Consult the label on each part for laser certification numbers and approval information.
CAUTION:
This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive,
DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information:
v Do not remove the covers. Removing the covers of the laser product could result in exposure to
hazardous laser radiation. There are no serviceable parts inside the device.
v Use of the controls or adjustments or performance of procedures other than those specified herein
might result in hazardous radiation exposure.
(C026)
Safety notices xi
CAUTION:
Data processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical
fiber cable or open receptacle. (C027)
CAUTION:
This product contains a Class 1M laser. Do not view directly with optical instruments. (C028)
CAUTION:
Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following
information: laser radiation when open. Do not stare into the beam, do not view directly with optical
instruments, and avoid direct exposure to the beam. (C030)
Power and cabling information for NEBS (Network Equipment-Building System)
GR-1089-CORE
The following comments apply to the IBM servers that have been designated as conforming to NEBS
(Network Equipment-Building System) GR-1089-CORE:
The equipment is suitable for installation in the following:
v Network telecommunications facilities
v Locations where the NEC (National Electrical Code) applies
The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed
wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the
interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as
intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation
from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect
these interfaces metallically to OSP wiring.
Note: All Ethernet cables must be shielded and grounded at both ends.
The ac-powered system does not require the use of an external surge protection device (SPD).
The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal
shall not be connected to the chassis or frame ground.
xii Installing into a rack
Racks, rack features, and installing systems or expansion
units into a rack
Learn about the procedures used to install racks. Also detailed procedures are provided for installing rack
features, systems and expansion units into a rack.
You can perform these tasks or contact a service provider to perform the tasks for you. You might be
charged a fee by the service provider for this service.
© Copyright IBM Corp. 2010 1
2 Installing into a rack
What's new in Racks, rack features, and installing systems or
expansion units into a rack
Read about new or significantly changed information in Racks, rack features, and installing systems or
expansion units into a rack since the previous update of this topic collection.
September 2010
Content updates include the following:
v Added procedures for “Installing the model 8202-E4B or 8205-E6B into a rack” on page 27.
v Added procedures for “Installing the model 8231-E2B into a rack” on page 35.
v Added procedures for “Removing and replacing 7014-T00 or 7014-T42 trim panels” on page 91.
© Copyright IBM Corp. 2010 3
4 Installing into a rack
Installing the rack
This section describes all the tasks required to install 7014-T00 or 7014-T42 racks. The full spectrum of
related tasks is described, from completing a parts inventory to finally connecting to a dc power source.
Installing the 7014-T00 or 7014-T42 racks
Use this information to install the 7014-T00 or 7014-T42 racks.
If you are installing a rack security kit in this rack, see “Installing the rack security kit” on page 97 after
you have installed the rack.
Before installing a rack, read the “Rack safety notices” on page 94.
Completing a parts inventory
Before beginning the rack installation it is a good idea to do a parts inventory. This section guides you in
performing this task.
If you have not done so, complete a parts inventory before installing the unit in the rack:
1. Locate the kitting report in an accessory box.
2. Ensure that you received all of the features that you ordered and all of the parts on the kitting report.
If there are incorrect, missing, or damaged parts, contact:
v Your IBM reseller
v IBM support (see Directory of worldwide contacts website at http://www.ibm.com/planetwide
for contact information for your country)
v IBM Rochester Manufacturing Automated Information Line at 1–800–300–8751 (United States only)
Positioning the rack
Proper rack positioning is needed to comply with safety and regulatory requirements. Use the procedure
in this section to perform this task.
After the rack has been placed into its location on the floor, lock each caster by tightening the locking
screw. See the following illustration for the locking screw location (2). Remove all the tape and packing
materials from the rack.
© Copyright IBM Corp. 2010 5
1 Caster
2 Locking screw
Use the following to determine the next step:
v To bolt the rack to a concrete floor, go to “Attaching the rack to a concrete floor” on page 8.
v To bolt the rack to a concrete floor beneath a raised floor, go to “Attaching the rack to the concrete
floor beneath a raised floor” on page 13.
v To attach the rack to the floor, go to “Leveling the rack.”
Leveling the rack
If you need to level the rack, use the procedure described in this section.
To level the rack, complete the following steps:
1. Loosen the jam nut on each leveling foot.
2. Rotate each leveling foot downward until it contacts the surface on which the rack is placed.
3. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten
the jam nuts against the base.
Figure 1. Tightening the locking screw
6 Installing into a rack
1 Rack Front (base)
2 Leveling Foot (quantity 4)
3 Jam Nut (quantity 4)
Attaching the stabilizer brackets
You might need to attach the stabilizer brackets to the rack. This section helps you determine whether
stabilizer brackets are necessary and describes how to attach them if needed.
If the front or back ac electrical outlets are going to be installed in the rack, you cannot attach the
stabilizer brackets. The rack must be bolted to the floor. Stabilizer brackets are used only if you will not
be bolting the rack to the floor. If you are going to bolt the rack to the floor, go to “Attaching the rack to
a concrete floor” on page 8.
To attach the stabilizer brackets to the bottom of the rack, do the following:
Note: Before installing the stabilizer brackets, see “Attaching the front or back ac electrical outlet” on
page 18 for instruction on installing the ac outlet-mounting plates.
1. Align the slots of one of the stabilizer brackets with the mounting holes at the bottom front of the
rack.
2. Install the two mounting screws.
3. Ensure that the base of the stabilizer bracket rests firmly on the floor. Use the Allen wrench that was
supplied with the rack to tighten the mounting screws alternately until they are tight.
Figure 2. Adjusting the leveling feet
Installing the rack 7
1 Rack front (base) 3 Stabilizer bracket
2 Stabilizer mounting screws 4 Rack rear (base)
4. To install the second stabilizer bracket on the back of the rack, repeat steps 1 through 3.
Attaching the rack to a concrete floor
If you plan to install the front or back ac electrical outlets in the rack, the rack must be bolted to the floor.
This section describes how to perform this task for a concrete floor.
Obtain the services of a mechanical contractor to attach the rack-mounting plates to the concrete floor.
The mechanical contractor must determine that the hardware being used to secure the rack-mounting
plates to the concrete floor is sufficient to meet the requirements for the installation.
To attach the rack to a concrete floor, do the following:
1. Put the rack in its predetermined location, and tighten the locking screws on the casters.
2. If they are installed, remove the top, left, and right trim panels. The trim panels are held in place
with spring clips. See the following illustration.
Figure 3. Attaching the stabilizer brackets
8 Installing into a rack
1 Rack chassis 4 Right-side trim panel
2 Top trim panel 5 Spring clip
3 Left-side trim panel
3. If they are installed, remove the front and back doors. After the rack doors have been removed, go to
the next substep. To remove a rack door:
a. Unlock and open the door.
b. Grasp the door firmly with both hands and pull it away from the hinges.
4. Locate the hardware mounting kit and the two mounting plates. Refer to the following illustration
when reviewing the contents of the hardware mounting kit. The hardware mounting kit contains the
following:
v 4 Rack-mounting bolts
v 4 Thin washers
v 8 Plastic isolator bushings
v 4 Thick washers
v 4 Spacers
5. If you are installing an ac-powered rack, temporarily install the lower plastic isolator bushings to
help you locate the mounting locations for the stabilizer bracket. After the stabilizer bracket has been
Figure 4. Removing the trim panels
Installing the rack 9
correctly located, remove the lower plastic isolator bushings.
1 Rack chassis 7 Jam nut
2 Rack-mounting bolt 8 Leveling foot
3 Thin washer 9 Lower plastic isolator bushing (used
only on dc powered systems)
4 Top plastic isolator bushing AC Typical leveling foot installation for
an ac-powered rack
5 Thick washer DC Typical leveling foot installation for
an dc-powered rack
6 Spacer
6. Position the two mounting plates in the approximate mounting location under the rack.
7. Create a rack-mounting bolt assembly by adding the following items, in the order listed, to each
rack-mounting bolt.
a. Thin washer
b. Top plastic isolator bushing
c. Thick flat washer
d. Spacer
8. Insert a rack-mounting bolt assembly through each of the leveling feet.
9. Reposition the rack-mounting plates under the four rack-mounting bolts so that the mounting bolts
are centered directly over the threaded bolt holes.
10. Turn the rack-mounting bolts four complete turns into the mounting plate's threaded bolt holes.
Figure 5. Installing ac-power mounting plates
10 Installing into a rack
1 Rack-mounting bolt 7 Leveling foot
2 Thin washer 8 Lower plastic isolator bushing (Used
only on dc powered systems)
3 Top plastic isolator bushing 9 Mounting plate
4 Thick washer 10 Threaded hole (Used to secure the
rack to stabilizer bracket.)
5 Spacer 11 Anchor bolt hole
6 Jam nut 12 Traced pattern (Pattern to be traced
onto the floor using the stabilizer
bracket as a template)
11. Mark the floor around the edges of both stabilizer brackets.
12. Mark the plate bolt-down holes that are accessible through the opening in the rear of the rack.
13. Remove the rack-mounting bolt assemblies.
14. If you are installing an ac-powered rack, remove the bottom isolator bushing from each of the
leveling feet.
15. Remove the stabilizer brackets from the marked locations.
16. Loosen each of the locking screws on the casters.
17. Move the rack so that it is clear of both areas that were marked on the floor for the stabilizer bracket
locations.
18. Reposition the stabilizer brackets within the marked areas.
19. Mark the floor at the center of all holes in both stabilizer brackets.
20. Remove the two rack-mounting plates from the marked areas.
21. At the marked location of the threaded rack-mounting bolt holes, drill four clearance holes into the
concrete floor. Each clearance hole should be approximately 1-inch deep. This depth allows the
rack-mounting bolts enough room to protrude past the thickness of the stabilizer brackets.
Figure 6. Securing the rack to the floor
Installing the rack 11
Note: You must use a minimum of two anchor bolts for each rack-mounting plate to securely attach
the plate to the concrete floor. Because some of the holes in each rack-mounting plate might align
with concrete reinforcement rods embedded in the concrete, some of the rack-mounting plate holes
might not be usable.
22. Select at least two suitable hole locations for each stabilizer bracket bolt. The selected locations
should be as close to the threaded bolt holes as possible. Be sure that the holes selected at the back
of the rack are accessible. Drill holes at the selected locations into the concrete floor.
23. Position the stabilizer brackets over the concrete anchors.
24. Securely bolt the front stabilizer bracket to the concrete floor.
25. Position the stabilizer bracket over the concrete anchors.
26. Securely bolt the back stabilizer bracket to the concrete floor.
Note: The size of the anchor bolts and concrete anchors must be determined by the mechanical
contractor who will be installing the rack-mounting plate.
27. Position the rack over the stabilizer bracket.
28. Insert each of the stabilizer bracket bolts through a flat washer, a plastic isolator bushing and a thick
washer, and through a leveling foot.
29. Align the four stabilizer brackets bolts with the four tapped holes in the two mounting plates and
turn three to four rotations.
30. Tighten the locking screw on each caster.
31. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten
the jam nuts against the base of the rack.
1 Rack front (base)
2 Leveling foot (quantity 4)
3 Jam nut (quantity 4)
32. If you have multiple racks that are connected in a suite (bolted to each other), go to “Connecting
multiple racks with rack-to-rack attachment kit” on page 105. Otherwise, torque the four bolts to
40-50 ft-lbs (54-67 nm).
Figure 7. Adjusting the leveling feet
12 Installing into a rack
33. If you are not installing doors on your rack, install the top, left, and right trim panel.
34. Connect the power distribution system as described in “Connecting the power distribution system”
on page 17.
35. After all racks are bolted down, go to “Attaching the front or back ac electrical outlet” on page 18.
36. If you are not going to attach a front electrical outlet and you are installing rack doors, go to
“Attaching the rack doors” on page 93
Attaching the rack to the concrete floor beneath a raised floor
If you plan to install front or back ac electrical outlets in the rack, the rack must be bolted to the floor.
When you attach the rack to a concrete floor, which lays beneath a raised floor, follow the procedure
described in this section.
Obtain the services of a mechanical contractor to attach the rack-mounting plates to the concrete floor.
The mechanical contractor needs to determine that the hardware being used to secure the rack-mounting
plates to the concrete floor is sufficient to meet the requirements for the installation.
To attach the rack to a concrete floor beneath a raised floor, do the following:
1. Put your rack in its predetermined location and tighten the locking screws on the casters.
2. If installed, remove the top, left and right trim panels. The trim panels are held in place with spring
clips. See the following illustration.
1 Rack chassis 4 Right-side trim panel
2 Top trim panel 5 Spring clip
3 Left-side trim panel
Installing the rack 13
3. If installed, remove the front and rear doors. To remove a rack door, go to . After the rack doors
have been removed, go to the next substep.
4. Locate the hardware mounting kit and the two mounting plates. Refer to the following illustration
when reviewing the contents of the hardware mounting kit. The hardware mounting kit contains the
following:
v Four rack-mounting bolts
v Four thin washers
v Eight plastic isolator bushings
v Four thick washers
v Four spacers
5. If you are installing an ac-powered rack, temporarily install the lower plastic isolator bushings to
help you locate the rack-mounting plate. After the mounting plate has been correctly located, remove
the lower plastic isolator bushings.
1 Rack chassis 7 Jam nut
2 Rack-mounting bolt 8 Leveling foot
3 Thin washer 9 Lower plastic isolator bushing (used
only on dc powered systems)
4 Top plastic isolator bushing AC Typical leveling foot installation for
an ac-powered rack
5 Thick washer DC Typical leveling foot installation for
an dc-powered rack
6 Spacer
Figure 8. Removing the trim panels
Figure 9. Installing the ac power-mounting plates
14 Installing into a rack
6. Position the two mounting plates in the approximate mounting location under the rack.
7. Create a rack-mounting bolt assembly by adding the following items, in the order listed, to each
rack-mounting bolt.
a. Thin washer
b. Top plastic isolator bushing
c. Thick flat washer
d. Spacer
8. Insert a rack-mounting bolt assembly through each of the leveling feet.
9. Reposition the rack-mounting plates under the four rack-mounting bolts so that the mounting bolts
are centered directly over the threaded bolt holes.
10. Turn the rack-mounting bolts four complete turns into the mounting plate's threaded bolt holes.
11. Mark the raised-floor panel around the edges of front and back rack-mounting plates.
12. Mark the plate bolt-down holes that are accessible through the opening in the back of the rack.
13. Remove the rack-mounting bolt assemblies.
14. If you are installing an ac-powered rack, remove the bottom isolator bushing from each of the
leveling feet.
15. Remove the rack-mounting plates from the marked locations.
16. Loosen each of the locking screws on the casters.
17. Move the rack so that it is clear of both areas that were marked on the floor for the rack-mounting
plate locations.
18. Reposition the mounting plates within the marked areas.
19. Mark the raised-floor panel at the center of each hole in the rack-mounting plates (including the
tapped holes).
20. Remove the two rack-mounting plates from the marked locations on the raised floor panel.
21. Drill two clearance holes on each end of each rack-mounting plate. The drilled holes should be
approximately 1-inch deep. This depth will accommodate any rack-mounting bolt extending past the
rack-mounting plate when securing the rack to the rack-mounting plate.
22. For each rack-mounting plate, select at least two suitable hole locations. Select the hole locations as
close to the threaded hole areas as possible. Be sure the hole locations selected at the back of the rack
are accessible.
23. Drill pass-through holes in the raised-floor panel. The pass-through holes allow the anchor bolts to
be inserted into the rack-mounting plate and pass through the raised floor panel to the concrete
floor.
Note: You must use a minimum of two anchor bolts for each rack-mounting plate to securely attach
the rack-mounting plate through the raised-floor panel to the concrete floor. Because some of the
holes in each rack-mounting plate may align with concrete reinforcement rods imbedded in the
concrete, some of the rack-mounting plate holes may not be usable.
24. Transfer the locations of the anchor bolt holes (exclude the clearance holes drilled for the
rack-mounting bolts ) from the raised-floor panel to the concrete floor directly beneath, and mark the
hole locations on the concrete floor.
25. Drill holes in the concrete floor to secure the anchor bolts.
26. Position the raised-floor panel back into position over the anchor bolt holes.
27. Position the front stabilizer bracket within the marked area on the raised-floor panel.
28. Using your anchor bolts, secure the front stabilizer brackets on top of the raised floor and through to
the concrete floor.
29. Position the rear stabilizer brackets within the marked area on the raised-floor panel.
Installing the rack 15
1 Rack-mounting bolt 7 Leveling foot
2 Thin washer 8 Lower plastic isolator bushing (used
only on dc-powered systems)
3 Top plastic isolator bushing 9 Stabilizer brackets
4 Thick washer 10 Threaded hole (used to secure the
rack to mounting plate.)
5 Spacer 11 Anchor bolt hole
6 Jam nut 12 Traced pattern (pattern to be traced
onto the floor using the mounting
plate as a template)
30. Using your anchor bolts, secure the back stabilizer bracket on top of the raised floor and through to
the concrete floor.
31. Replace all raised-floor panels that may have been removed when aligning and securing the anchor
bolts to the concrete floor.
32. Align the rack over the front and back stabilizer brackets.
33. Insert each of the bolt assemblies through a leveling foot.
34. Align the rack-mounting bolts with the threaded holes in each stabilizer bracket. Turn each bolt three
to four rotations.
35. Tighten the locking screw on each caster.
36. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten
the jam nuts against the base of the rack.
37. If you have multiple racks that are connected as a suite (bolted to each other), go to “Connecting
multiple racks with rack-to-rack attachment kit” on page 105. Otherwise, torque the four bolts to
40-50 ft-lbs (54-67 nm).
38. If you are not installing doors on your rack, install the top, left, and right trim panel.
Figure 10. Securing the rack to the floor
16 Installing into a rack
39. Connect the power distribution system as described in “Connecting the power distribution system.”
40. After the rack is bolted down and you are going to attach a front electrical outlet, go to “Attaching
the front or back ac electrical outlet” on page 18.
41. If you are not going to attach a front electrical outlet and you are installing rack doors, go to
“Attaching the rack doors” on page 93.
Connecting the power distribution system
You can use a power distribution system to monitor the individual power loads of the devices that are
plugged into it. Use the procedure in this section to connect this system.
To connect a Power Distribution Unit, see “Power distribution unit plus” on page 121.
Checking the ac outlets
To help ensure safety and reliable operation, you should check the ac outlets. Use the procedure in this
section to perform this task.
Before you begin, ensure that you have a multimeter to check voltages and an appropriately approved
ground-impedance tester to test the grounding resistances.
Note: Use only an appropriately approved ground-impedance tester to test the grounding resistances. Do
not use a multimeter to measure grounding resistance.
Before plugging the rack into the ac power source, complete the following checks on the ac power source:
1. Turn off the branch circuit breaker for the ac power outlet that the rack will plug into. To the circuit
breaker switch, attach tag S229-0237, which reads "Do Not Operate."
Note: All measurements are made with the receptacle faceplate in the usual installed position.
2. Some receptacles are enclosed in metal housings. For this type of receptacle, do the following steps:
a. Using a multimeter, check for less than 1 volt from the receptacle case to any grounded metal
structure in the building, such as a raised-floor metal structure, water pipe, building steel, or
similar structure.
b. Using a multimeter, check for less than 1 volt from the receptacle ground pin to a grounded point
in the building.
Note: If the receptacle case or faceplate is painted, be sure the probe tip penetrates the paint and
makes good electrical contact with the metal.
c. Using a multimeter, check the resistance from the receptacle ground pin to the receptacle case.
Check resistance from the ground pin to the building ground. The readings should be less than 1.0
ohm, which indicates the presence of a continuous grounding conductor.
3. If any of the checks made in step 2 are not correct, remove the power from the branch circuit and
make the wiring corrections. Recheck the receptacle after the wiring is corrected.
4. Using a ground-impedance tester, check for infinite resistance between the ground pin of the
receptacle and each of the phase pins. This is a check for a wiring short to ground or a wiring
reversal.
5. Using a ground-impedance tester, check for infinite resistance between the phase pins. This is a check
for a wiring short.
6. Turn on the branch circuit breaker.
7. Using a multimeter, measure for the appropriate voltages between phases. If no voltage is present on
the receptacle case or grounded pin, the receptacle is safe to touch.
8. Using a multimeter, verify that the voltage at the ac outlet is correct.
Installing the rack 17
Attaching the front or back ac electrical outlet
If you need to attach an ac outlet, you can use the procedure in this section to perform this task.
Attention: The front and back ac outlet-mounting plates mount through the same mounting holes in
that secure the stabilizer brackets to the rack chassis. Therefore, if the rack must be bolted to the floor, the
stabilizer brackets must be removed.
Install the ac outlet-mounting plates only after the rack has been bolted to the floor and the stabilizer
brackets have been removed.
The following items are installed at the customer's site:
v The ac outlet-mounting plates for installing customer-supplied ac electrical outlets on the front or back
of the rack. The ac outlet-mounting place provides the mounting location for an ac electrical outlet.
v The brass ground lug for an electrostatic discharge (ESD) connection.
Note: The customer is responsible for providing both the outlets and the power cables that attach to the
power source. The customer is also responsible for connecting the ac outlet correctly. These items are not
field-replaceable units (FRUs).
Installing the ac outlet-mounting plates with ac outlets
If you choose to install ac mounting plates, you can follow the procedure detailed in this section to
perform this task. This section also includes illustrations of the related hardware components and shows
how these components relate to each other.
If you do not want ac outlets installed on the rack, go to “Installing the ac outlet-mounting plate without
ac outlets” on page 20.
If you want ac outlets installed on the front or back ac outlet-mounting plate, do the following:
1. Determine the number of ac outlets that you are installing.
2. Confirm with your contractor that the number and location of ac outlets to be installed are correct.
3. Remove the blank filler plates from the ac outlet-mounting plates for the number of ac outlets being
installed.
4. Install the ac outlets on the ac outlet-mounting plate.
5. Install the ground lug in the ac outlet-mounting plate using only one nut, as shown in the following
illustration.
6. Securely tighten the one nut on the ground lug.
7. Locate the "Y"-shaped ground cable supplied with the mounting plate.
Note: The remaining steps can be used to install ac outlets on the front or the back of the rack.
8. Place the star washer onto the ground lug of the front ac outlet-mounting plate.
9. Place the lug on the long end of the ground cable onto the ground lug.
10. Place a ground lug nut onto the ground lug and securely tighten it.
11. Position the front ac outlet-mounting plate onto the rack frame with the ground lug fully inserted
through the mounting holes in the rack.
12. Route the cable under the rack.
13. Place the star washer onto the ground lug of the back ac outlet-mounting plate.
14. Place the lug on the short end of the ground cable onto the ground lug.
15. Place a ground lug nut onto the ground lug and securely tighten it.
18 Installing into a rack
1 Ground cable lug 7 Ground lug
2 Star washer 8 Ground connector (short end of
ground cable)
3 Front of rack 9 "Y" End of ground cable
4 Power cable from the power source 10 Ground lug nut (quantity 2)
5 Mounting plate 11 Long end of ground cable
6 Long end of ground cable 12 Ground lug nut (quantity 2)
16. Position the back ac outlet-mounting plate onto the rack frame with the ground lug fully inserted
through the mounting holes in the rack.
17. Install the front ac outlet-mounting plate screws (stabilizer mounting screws) into the mounting plate
and through the mounting holes in the rack. Securely tighten the screws.
1 Front or back of rack (as applicable) 4 Mounting plate
2 Power cable from power source 5 Allen wrench
Figure 11. Installing the ground lug
Installing the rack 19
3 Button-head screw 6 Long end of ground cable
18. Connect the "Y"-shaped end of the ground cable to the rack frame, either near the center in the back
of the rack or to the ground bus bar at the back of the rack.
19. Install the back ac outlet-mounting plate screws (stabilizer mounting screws) into the mounting plate
and through the mounting holes in the rack. Securely tighten the screws.
Note: The bus bar might be located at either the top or bottom of the rack.
1 Bus bar mounting plate 3 Hex screws (M5 x 20) (quantity 2)
2 Lock washer (quantity 2) 4 Ground bus bar
Installing the ac outlet-mounting plate without ac outlets
If you choose to install an outlet plate without the outlets, use the procedure in this section to perform
this task.
Figure 12. Installing the front mounting plate
Figure 13. Installing the back mounting plate
20 Installing into a rack
If you do not want any ac outlets installed on the front or rear ac outlet-mounting plate, perform only
“Attaching the rack to the concrete floor beneath a raised floor” on page 13 through “Connecting the
power distribution system” on page 17.
For front or rear ac outlet-mounting plates, refer to “Installing the ac outlet-mounting plates with ac
outlets” on page 18.
Connecting a dc power source
Some rack models (such as the 7014-T00) can support a dc power configuration for systems that require
dc power. If you decide to connect a dc power source to the rack, you can use the procedure in this
section to perform this task. This section also includes illustrations of the related hardware components
and shows how these components relate to each other.
Note: The customer is responsible for providing and connecting the -48 V dc power source and -48 V dc
power return cables from the customer's source -48 V dc to the bus bars in the power distribution panel.
The customer is also responsible for connecting the ground cable to the rack frame. This procedure
provides information about accessing the power distribution panel.
1. Remove the six mounting screws from the top cover of the dc power distribution panel and remove
the top cover.
2. If they are installed, remove the four screws from the cable channel cover.
3. Remove the cable channel cover.
1 Cable channel cover retaining screw 5 Shield
2 Cable channel cover 6 Power distribution panel
3 Power distribution panel top cover
retaining screws
4 Power distribution panel top cover
4. Remove the -48 V dc bus bar shield from the power distribution panel.
Figure 14. Removing the cable channel cover
Installing the rack 21
Attention: The bus bar shield must be correctly reinstalled over the -48 V dc return bus bars to
protect against injury while servicing the power distribution panel.
5. Ensure that the following steps are performed when connecting the dc power source.
a. At -48 V dc power source, turn off any -48 V dc power sources that will be connected to the
power distribution panel.
b. After the -48 V dc power sources are turned off, be sure there is a tag or label over the power
source switches or fuses (lock-out/tag-out) to indicate that the power source is turned off
intentionally.
Note: Ensure that any oxidation on the copper bus bars is removed.
c. If this is a raised-floor installation and you are working at the back of the rack, route the power
cables up the rack's right side.
d. Ensure that the external -48 V dc power cable is connected correctly to the -48 V dc bus bar.
e. Ensure that the external -48 V dc return cable is routed correctly and installed on the return bus
bar.
1 -48 V dc power cable and return power cable
2 Power distribution panel
3 Front of rack
4 -48 V dc power cable and return power cable
Figure 15. Routing the power cables
22 Installing into a rack
1 Front of power distribution panel 6 (B) Return (-) power cable
2 (A) -48 V dc (-) bus bar 7 (B) -48 V dc (-) power cable
3 (A) -48 V dc (-) power cable 8 (B) Return (-) bus bar
4 (A) Return (-) bus bar 9 (B) -48 V dc (-) bus bar
5 (A) Return (-) power cable
f. If you want to install a power status alarm, connect the alarm cable to the terminal board on the
back cover of the dc power distribution panel.
Note: Ensure that you remove the oxidation on the copper bus bars.
g. Ensure that the power-source ground cable is routed correctly and connects the power-source
ground cable to the copper bar at the lower-back or upper-back center of the rack.
h. If the rack is on a raised floor, attach the -48 V dc power source cables to the back of the rack with
cable-restraint straps.
Figure 16. Return bus bar
Installing the rack 23
1 Back view of rack (dc)
2 Power cable, power return cable, and ground
3 Ground cable (Install at either top or bottom of the rack)
6. Reinstall the -48 V dc bus bar shield.
7. Reinstall the top cover on the dc power distribution panel.
8. Reinstall the cable channel cover.
1 Cable channel cover
Figure 17. Cable locations
24 Installing into a rack
2 Terminal block (both sides)
3 Power distribution panel
4 Front of rack
Figure 18. Reinstalling the cable channel cover
Installing the rack 25
26 Installing into a rack
Installing a system or expansion unit into a rack
You might need to install a system or expansion unit into a rack. In addition to important related safety
information, this section provides procedures that you can follow to perform these tasks. This section also
includes illustrations of the related hardware components and shows how these components relate to
each other.
The following procedures describe how to install system units or expansion units into a rack. You can
perform this task or contact a service provider to perform the task for you. You might be charged a fee
by the service provider for this service.
Installing the model 8202-E4B or 8205-E6B into a rack
You might need to install the system into a rack. Use the procedure in this section to perform this task.
This is a customer task. You can perform this task yourself, or contact a service provider to perform the
task for you. You might be charged a fee by the service provider for this service.
Important: To complete this procedure, use two people when attaching the rail assembly to the rack, one
in front of the rack and one at the rear of the rack. Use three people when lifting the system unit into the
rack.
This procedure assumes that you are installing the system into an existing rack. If the rack is not
installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for
instructions on installing the system unit in a rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
To install the model 8202-E4B or 8205-E6B into a rack, complete the following steps:
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
1. Read the “Rack safety notices” on page 94.
2. Complete a parts inventory. The following figure shows the items that you need to install the server
in the rack cabinet.
© Copyright IBM Corp. 2010 27
If there are incorrect, missing, or damaged parts, contact:
v Your IBM reseller
v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only)
v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your
service and support telephone numbers.)
3. Locate the rack-mounting hardware kit (A) and the rack rails (B) that were included with your system
unit as shown in Figure 19.
The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and
right to indicate their placement when you face the front of the rack. The back of each rail has two
large latch assemblies. These latch assemblies go in the back of the rack.
4. Determine where you will locate the system unit in the rack. See “Determining the location.”
5. Attach mounting hardware to the rack. For more information, see Attach the mounting hardware to
the rack.
Determining the location
You might need to determine where to install the system in the rack. Use this procedure to perform this
task.
Before installing the system unit into a rack, complete the following steps:
1. Read the “Rack safety notices” on page 94.
2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack.
This system unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75
in.) in height. The rack contains three mounting holes for each EIA unit of height. This system unit
therefore is 177.8 mm (7 in.) high and covers 12 mounting holes in the rack.
3. If necessary, remove the filler panels to provide access to the inside of the rack enclosure where you
plan to place the unit.
Figure 19. Parts inventory
28 Installing into a rack
4. If necessary, remove the front and rear rack doors.
Marking the location
You might need to mark the installation location. Use this procedure to perform this task.
To mark the installation location and install the nut clips into a rack, complete the following steps:
1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place
larger and heavier units in the lower part of the rack. Record the EIA location. The system is four
Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack
contains three mounting holes for each EIA unit of height. This system therefore is 177.8 mm (7 in.)
high and covers 12 mounting holes in the rack.
2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Record the EIA location. Use tape, a marker, or a pencil to mark the bottom hole
of this EIA unit (A). Mark the rack so that the mark can also be seen from the rear of the rack.
Figure 20. Removing the filler panels
Installing a system or expansion unit into a rack 29
3. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B). Mark
the rack so that the mark can also be seen from the rear of the rack, as shown in Figure 21.
4. Repeat the previous two steps on the left side of the rack.
5. Facing the rear of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Make a mark next to the bottom hole of this EIA unit (A).
6. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B), as
shown in Figure 21.
7. Begin with the hole identified by the mark (B). Count up one hole and place a third mark (C), as
shown in Figure 21.
8. Repeat the previous two steps on the left side of the rack.
Attaching the 8202-E4B or 8205-E6B mounting hardware to the rack
You might need to attach the mounting hardware to the rack. Use the procedure in this section to
perform this task.
Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have
the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread
support flange holes, ensure that the rails and fittings match the support flange holes used on your rack.
Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and
fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in
the following order.
To install the rack-mounting hardware into the rack, complete the following steps:
1. Each slide rail is marked with either an R (right) or an L (left). Select one of the slide rails and push
up on the front movable tab (1). Pull out the front latch (2) to slide out the front side rail. If a
thumbscrew is installed in the slide rail (3), remove it.
Figure 21. Marking holes on the front and rear of the rack frame
30 Installing into a rack
Note: Make sure that the movable tab remains extended and does not click back into place.
2. Align the three pins on the rear of the slide rail with the three holes in the selected EIA unit on the
rear of the rack. Push the rails so that the pins go into the holes (1), and drop the slide rail down (2)
until it latches into place.
3. Pull the slide rail forward and insert the two pins (1) on the front of the rail into the two lower holes
in the U on the front of the rack. Drop the rail into place until it clicks into place. Push the front latch
(2) in all the way.
4. Repeat the previous three steps to install the other rail into the rack. Make sure that each front latch is
fully seated.
Figure 22. Slide rail and movable tab
Figure 23. Align the pins with the holes in the rear of the rack
Figure 24. Rack front rail and pins
Installing a system or expansion unit into a rack 31
5. Pull the slide rails forward (1) until they click, twice, into place. Carefully lift the server and tilt it into
position over the slide rails so that the rear nail heads (2) on the server line up with the rear slots (3)
on the slide rails. Slide the server down until the rear nail heads slip into the two rear slots, and then
slowly lower the front of the server (4) until the other nail heads slip into the other slots on the slide
rails. Make sure that the front latch (5) slides over the nail heads.
6. Lift the blue release latches (1) on the slide rails and push the server (2) all the way into the rack until
it clicks into place.
7. Install the cable-management arm. For more information, see “Installing the cable-management arm.”
Installing the cable-management arm
You might need to install the cable-management arm. Use this procedure to perform this task.
To install the cable-management arm, complete the following steps:
1. Install the cable-management arm on the right side of the server, when you are facing it from the rear.
Connect one end of the support arm (1) to the same slide rail to which you plan to attach the
cable-management arm so that you can swing the other end of the support arm (2) toward the rack.
Figure 25. Slide rails extended and server nail heads aligned with slots in rail
Figure 26. Release latches and server
32 Installing into a rack
2. Install the L-shaped cable-management stop bracket (1) on the unattached end of the support arm.
Turn the bracket (2) to secure it to the support arm.
3. To attach the other side of the support arm to the back side of the slide rail, pull the pin out (1), and
then slide the bracket (2) into the slide rail.
4. Pull out the mounting bracket pin (1) and slide the mounting bracket (2) into the slide rail onto which
you are installing the cable-management arm. Push the bracket into the slide rail until the
Figure 27. Support arm connection
Figure 28. Cable-management stop bracket secured to the support arm
Figure 29. Pin extended and bracket installed into slide rail
Installing a system or expansion unit into a rack 33
spring-loaded pin snaps into place.
5. Place the cable-management arm on the support arm. Pull out the cable-management arm pin (1), and
then slide the cable-management arm tab (2) into the slot on the inside of the slide rail. Push the tab
until it snaps into place. Pull out the other cable-management arm pin (3), and then slide that cable
management arm tab into the slot (4) on the outside of the slide rail. Push the tab until it snaps into
place.
6. Route the power cords and other cables (including keyboard, monitor, and mouse cables, if required)
on the cable-management arm (1). Attach all cables to the rear of the server except for the power cord.
Secure the cord and cables with cable ties or hook-and-loop fasteners.
Note: Allow slack in all cables to avoid tension in the cables while the cable-management arm moves.
Figure 30. Mounting bracket pin extended and mounting bracket installed into slide rail
Figure 31. Cable-management arm connection
34 Installing into a rack
7. Slide the server into the rack until it snaps into place.
Connecting the power cables to the system
You might need to connect power cables to the system. Use this procedure to perform this task.
To connect power cables to the system, follow these steps:
1. While facing the rear of the system unit, insert the system power cord up through the horizontal
cable-restraint bracket.
2. Plug the power cord into the system unit.
Note: If the system is equipped with two power supplies, each must have its own power cord
plugged in.
3. Plug the system power cord and the power cords for any other attached devices into the alternating
current (AC) power source.
4. Power on the system unit and any devices connected to the system unit.
For instructions about how to connect and configure the expansion unit after installation, see Enclosures
and expansion units.
Installing the model 8231-E2B into a rack
You might need to install the system into a rack. Use the procedure in this section to perform this task.
This is a customer task. You can perform this task yourself, or contact a service provider to perform the
task for you. You might be charged a fee by the service provider for this service.
Important: To complete this procedure, use two people when attaching the rail assembly to the rack, one
in front of the rack and one at the rear of the rack. Use three people when lifting the system unit into the
rack.
This procedure assumes that you are installing the system into an existing rack. If the rack is not
installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for
instructions on installing the system unit in a rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
To install the model 8231-E2B into a rack, complete the following steps:
Figure 32. Power cord attachment and routing
Installing a system or expansion unit into a rack 35
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
1. Read the “Rack safety notices” on page 94.
2. Complete a parts inventory. The following figure shows the items that you need to install the server
in the rack cabinet.
If there are incorrect, missing, or damaged parts, contact:
v Your IBM reseller
v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only)
v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your
service and support telephone numbers.)
3. Locate the rack-mounting hardware kit (A) and the rack rails (B) that were included with your system
unit as shown in Figure 33.
The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and
right to indicate their placement when you face the front of the rack. The back of each rail has two
large latch assemblies. These latch assemblies go in the back of the rack.
4. Determine where you will locate the system unit in the rack. See “Determining the location.”
5. Attach mounting hardware to the rack. For more information, see Attach the mounting hardware to
the rack.
Determining the location
You might need to determine where to install the system in the rack. Use this procedure to perform this
task.
Before installing the system unit into a rack, complete the following steps:
1. Read the “Rack safety notices” on page 94.
2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack.
Figure 33. Parts inventory
36 Installing into a rack
This system unit is two Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75
in.) in height. The rack contains three mounting holes for each EIA unit of height. This system unit
therefore is 88.9 mm (3.5 in.) high and covers six mounting holes in the rack.
3. If necessary, remove the filler panels to provide access to the inside of the rack enclosure where you
plan to place the unit.
4. If necessary, remove the front and rear rack doors.
Marking the location
You might need to mark the installation location. Use this procedure to perform this task.
To mark the installation location and install the nut clips into a rack, complete the following steps:
1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place
larger and heavier units in the lower part of the rack. Record the EIA location. The system is two
Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack
contains three mounting holes for each EIA unit of height. This system therefore is 88.9 mm (3.5 in.)
high and covers six mounting holes in the rack.
Figure 34. Removing the filler panels
Installing a system or expansion unit into a rack 37
2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Record the EIA location. Use tape, a marker, or a pencil to mark the bottom hole
of this EIA unit (A). Mark the rack so that the mark can also be seen from the rear of the rack.
3. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B). Mark
the rack so that the mark can also be seen from the rear of the rack, as shown in Figure 35.
4. Repeat the previous two steps on the left side of the rack.
5. Facing the rear of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Make a mark next to the bottom hole of this EIA unit (A).
6. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B), as
shown in Figure 35.
7. Begin with the hole identified by the mark (B). Count up one hole and place a third mark (C), as
shown in Figure 35.
8. Repeat the previous two steps on the left side of the rack.
Attaching the 8231-E2B mounting hardware to the rack
You might need to attach the mounting hardware to the rack. Use the procedure in this section to
perform this task.
Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have
the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread
support flange holes, ensure that the rails and fittings match the support flange holes used on your rack.
Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and
fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in
the following order.
To install the rack-mounting hardware into the rack, complete the following steps:
1. Each slide rail is marked with either an R (right) or an L (left). Select one of the slide rails and push
up on the front movable tab (1). Pull out the front latch (2) to slide out the front side rail. If a
thumbscrew is installed in the slide rail (3), remove it.
Figure 35. Marking holes on the front and rear of the rack frame
38 Installing into a rack
Note: Make sure that the movable tab remains extended and does not click back into place.
2. Align the three pins on the rear of the slide rail with the three holes in the selected EIA unit on the
rear of the rack. Push the rails so that the pins go into the holes (1), and drop the slide rail down (2)
until it latches into place.
3. Pull the slide rail forward and insert the two pins (1) on the front of the rail into the two lower holes
in the U on the front of the rack. Drop the rail into place until it clicks into place. Push the front latch
(2) in all the way.
4. Repeat the previous three steps to install the other rail into the rack. Make sure that each front latch is
fully seated.
Figure 36. Slide rail and movable tab
Figure 37. Align the pins with the holes in the rear of the rack
Figure 38. Rack front rail and pins
Installing a system or expansion unit into a rack 39
5. Pull the slide rails forward (1) until they click, twice, into place. Carefully lift the server and tilt it into
position over the slide rails so that the rear nail heads (2) on the server line up with the rear slots (3)
on the slide rails. Slide the server down until the rear nail heads slip into the two rear slots, and then
slowly lower the front of the server (4) until the other nail heads slip into the other slots on the slide
rails. Make sure that the front latch (5) slides over the nail heads.
6. Lift the blue release latches (1) on the slide rails and push the server (2) all the way into the rack until
it clicks into place.
7. Install the cable-management arm. For more information, see “Installing the cable-management arm.”
Installing the cable-management arm
You might need to install the cable-management arm. Use this procedure to perform this task.
To install the cable-management arm, complete the following steps:
1. Install the cable-management arm on the right side of the server, when you are facing it from the rear.
Connect one end of the support arm (1) to the same slide rail to which you plan to attach the
cable-management arm so that you can swing the other end of the support arm (2) toward the rack.
Figure 39. Slide rails extended and server nail heads aligned with slots in rail
Figure 40. Release latches and server
40 Installing into a rack
2. Install the L-shaped cable-management stop bracket (1) on the unattached end of the support arm.
Turn the bracket (2) to secure it to the support arm.
3. To attach the other side of the support arm to the back side of the slide rail, pull the pin out (1), and
then slide the bracket (2) into the slide rail.
4. Pull out the mounting bracket pin (1) and slide the mounting bracket (2) into the slide rail onto which
you are installing the cable-management arm. Push the bracket into the slide rail until the
Figure 41. Support arm connection
Figure 42. Cable-management stop bracket secured to the support arm
Figure 43. Pin extended and bracket installed into slide rail
Installing a system or expansion unit into a rack 41
spring-loaded pin snaps into place.
5. Place the cable-management arm on the support arm. Pull out the cable-management arm pin (1), and
then slide the cable-management arm tab (2) into the slot on the inside of the slide rail. Push the tab
until it snaps into place. Pull out the other cable-management arm pin (3), and then slide that cable
management arm tab into the slot (4) on the outside of the slide rail. Push the tab until it snaps into
place.
6. Route the power cords and other cables (including keyboard, monitor, and mouse cables, if required)
on the cable-management arm (1). Attach all cables to the rear of the server except for the power cord.
Secure the cord and cables with cable ties or hook-and-loop fasteners.
Note: Allow slack in all cables to avoid tension in the cables while the cable-management arm moves.
Figure 44. Mounting bracket pin extended and mounting bracket installed into slide rail
Figure 45. Cable-management arm connection
42 Installing into a rack
7. Slide the server into the rack until it snaps into place.
Connecting the power cables to the system
You might need to connect power cables to the system. Use this procedure to perform this task.
To connect power cables to the system, follow these steps:
1. While facing the rear of the system unit, insert the system power cord up through the horizontal
cable-restraint bracket.
2. Plug the power cord into the system unit.
Note: If the system is equipped with two power supplies, each must have its own power cord
plugged in.
3. Plug the system power cord and the power cords for any other attached devices into the alternating
current (AC) power source.
4. Power on the system unit and any devices connected to the system unit.
For instructions about how to connect and configure the expansion unit after installation, see Enclosures
and expansion units.
Installing the model 8233-E8B or 8236-E8C into a rack
You might need to install the system into a rack. Use the procedure in this section to perform this task. In
addition to information intended to promote safety and reliable operation, this section also includes
illustrations of the related hardware components and shows how these components relate to each other.
This is a customer task. You can perform this task yourself, or contact a service provider to perform the
task for you. You might be charged a fee by the service provider for this service.
Important: To complete this procedure, it is suggested that you use two people to attach the rail
assembly to the rack, one in front of the rack and one at the back of the rack. You will need three people
to lift the system unit onto the rack.
This procedure assumes that you are installing the system into an existing rack. If the rack is not
installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for
instructions on installing the system unit in a rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
Figure 46. Power cord attachment and routing
Installing a system or expansion unit into a rack 43
To install the model 8233-E8B or 8236-E8C into a rack, complete the following steps:
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
1. Read the “Rack safety notices” on page 94.
2. If you have not already done so, refer to Figure 47 and complete a parts inventory.
v A Rack-mounting hardware kit
– Four large retaining screws
– Two medium-sized screws
– Two nut clips
– Eight small retaining screws
– Two rail support brackets
– Two cable-restraint brackets
v B Left and right rack rails with rack brackets
v C Cable-management arm pins
v D Two cable-management arm brackets (one for left-side installation, one for right-side
installation)
v E Cable-management arm
Figure 47. Rack-mounting kit
44 Installing into a rack
v F Two rack latches
If there are incorrect, missing, or damaged parts, contact:
v Your IBM reseller
v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only)
v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your
service and support telephone numbers.)
3. Locate the rack-mounting hardware kit (A), and the rack rails (B) that were included with your
system unit as shown in Figure 47 on page 44.
The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and
right to indicate their placement when you face the front of the rack. The back of each rail has two
large latch assemblies. These latch assemblies go in the back of the rack.
4. Determine where you will locate the system unit in the rack. See “Determining the location.”
5. Next, you must attach mounting hardware to the rack. For more information, see Attach the mounting
hardware to the rack.
Determining the location
You might need to determine where to install the system in the rack. Use this procedure to perform this
task.
Before installing the system unit into a rack, complete the following steps:
1. Read the “Rack safety notices” on page 94.
2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack.
This system unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45
mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit
therefore is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack.
3. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you
plan to place the unit.
Installing a system or expansion unit into a rack 45
4. If necessary, remove the front and back rack doors.
Marking the location
You might need to mark the installation location. Use this procedure to perform this task.
To mark the installation location and install the nut clips into a rack, complete the following steps:
1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place
larger and heavier units in the lower part of the rack. Record the EIA location. The system is four
Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45 mm) in height. The rack
contains three mounting holes for each EIA unit of height. This system therefore is 7 in (177.8 mm)
high and covers 12 mounting holes in the rack.
2. Face the front of the rack and work from the right side. Use tape, a marker, or a pencil to mark the
holes. Mark the middle hole A of the bottom EIA unit of the four EIA units that you are using for this
system unit. You need to identify the marked hole from both the front and back of the rack.
Figure 48. Removing the filler panels
46 Installing into a rack
3. Mark the middle hole of the bottom EIA unit on the left side of the rack (A) as shown in Figure 49.
4. Go to the back of the rack. On the right side, find the EIA unit that corresponds to the bottom EIA
unit marked on the front of the rack.
5. Mark the middle hole of the bottom EIA unit (B) as shown in Figure 49.
6. Mark the bottom hole of the next (higher) EIA unit (C) as shown in Figure 49.
7. Mark the corresponding holes on the left side of the rack.
Attaching the 8233-E8B or 8236-E8C mounting hardware to the rack
You might need to attach the mounting hardware to the rack. Use the procedure in this section to
perform this task. The information provided is intended to promote safety and reliable operation. This
section also includes illustrations of the related hardware components and shows how these components
relate to each other.
Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have
the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread
support flange holes, ensure that the rails and fittings match the support flange holes used on your rack.
Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and
fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in
the following order.
To install the rack-mounting hardware into the rack, complete the following steps:
1. With the right rail, twist the latch assembly release tab (A) in Figure 50 on page 48. Slide release tab
(B) back to the retracted position and lock the latch assembly. The back-alignment pins must be fully
retracted.
2. After the alignment pins are retracted, insert the right side rail front-alignment pin, as shown in
Figure 50 on page 48, into the rack front flange hole. This hole is identified by the mark that you
previously made. Have a second person hold the rail securely in the front hole.
Figure 49. Marking holes on the front and back of the rack frame
Installing a system or expansion unit into a rack 47
3. Align the back-alignment pins of the rail with the holes at the back of the rack. The holes are
identified by the marks on the back of the rack. Ensure that the rails are level.
4. Slide the release tab (B) to extend the two back-alignment pins into the back of the rack. Ensure that
the pins have passed through the correct holes in the rack frame.
5. From the back of the rack, as shown in Figure 51 on page 49, install the rail support bracket on the
alignment pins. Then insert a large rail-retaining screw into the hole located between the two back
alignment pins. Partially tighten the screw. Do not tighten completely at this time.
Note: The brackets are labeled LEFT and RIGHT.
Figure 50. Front slide rail alignment pin, retaining screws, and latch bracket
48 Installing into a rack
6. Repeat steps 1 on page 47 through 5 on page 48 for the left side rail.
7. Locate the two latch brackets (F) in Figure 47 on page 44. To install the rack latches, use the
following procedure:
a. Rotate the top of the bracket out as shown in Figure 52 on page 50.
Figure 51. Installing the rail support brackets
Installing a system or expansion unit into a rack 49
b. Align the bracket with the slot on the side of the system as shown in Figure 53 on page 51.
Figure 52. Rotate out the top of the bracket
50 Installing into a rack
c. Move the bracket up to engage the tab as shown in Figure 54.
d. Rotate the top of the bracket back so that the top tab is on top of the system. Then install the
screws as shown in Figure 55 on page 52.
Figure 53. Align the bracket with the slot on the side of the system
Figure 54. Move the bracket up to engage the tab
Installing a system or expansion unit into a rack 51
8. Extend the inner rails by pulling out the rails. They must be extended from the frame like the rails
shown in Figure 56.
9. Using three people, grasp the two handles located on each side of the system, and place the system
onto the inner rail. Align tab A with slot B as indicated by the vertical dashed line shown in
Figure 56.
10. Simultaneously press the safety latches and push the system into the rack until it locks as shown in
Figure 57 on page 53.
Figure 55. Rotate the top of the bracket back so that the top tab is on top of the system
Figure 56. Place system onto the rails
52 Installing into a rack
11. Install and position the nut clips before inserting the screws. Use the retaining screws to attach the
system to the rack as shown in Figure 58 on page 54.
Figure 57. Inner rail extended
Installing a system or expansion unit into a rack 53
12. After both rails have been installed, ensure that none of the rail retaining screws are more than
finger tight. The rails must be level from front to back and from left to right.
13. Next, you must install the cable-management arm. For more information, see “Installing the
cable-management arm.”
Installing the cable-management arm
You might need to install the cable-management arm. Use this procedure to perform this task.
To install the cable-management arm, complete the following steps:
1. Determine on which side of the rack you want to install the cable-management arm.
2. Place the correct arm bracket (Left or Right) with the cable-management arm.
3. Use the pin (F) to pin the cable-management arm (E) to the rack frame (D) as shown in Figure 59 on
page 55.
Figure 58. Secure system to rack through rack latches
54 Installing into a rack
Tip: If access to the back of the rack is obscured by many existing cables, it might be easier to remove
the small connecting hinge from the cable-management arm and attach it first. Then, you can attach
the remaining section of the cable-management arm to the connecting hinge.
4. Use the second pin (F) to pin the other end of the cable-management arm to the flange (C) that is
attached to the sliding portion of the left system rail assembly (B) as shown in Figure 59.
Connecting the power cables to the system
You might need to connect power cables to the system. Use this procedure to perform this task.
To connect power cables to the system, follow these steps:
1. While facing the rear of the system unit, note the bottom tab of the cable-restraint bracket. Turn the
cable-restraint bracket counter-clockwise to a horizontal position, and insert the bottom tab of the
cable-restraint bracket into the bottom hole in the rail. Then rotate the cable-restraint bracket
clockwise to the vertical position as shown in Figure 60 on page 56. Insert the system power cord
through the cable-restraint bracket as shown in this figure. Then press down on the top of the bracket
and insert the top tab into the top hole.
Note: There are two sets of holes on the rail where the cable-restraint bracket can be installed. These
sets of holes are about 25.4 mm (1 inch) apart. Install the bracket in the rear most set of holes.
Figure 59. Attaching the cable-management arm.
Installing a system or expansion unit into a rack 55
2. Plug the power cord into the system unit.
Note: If the system is equipped with two power supplies, each must have its own power cord
plugged in.
3. Plug the system power cord and the power cords for any printers or displays into the alternating
current (AC) power source.
4. Power on the system unit and any devices connected to the system unit.
For instructions on how to connect and configure the expansion unit after installation, see Enclosures and
expansion units.
Installing the model 5802 or 5877 into a rack
This procedure can be used to install the 5802 or 5877 disk drive enclosure into a rack. The procedure
includes information intended to promote safety and reliable operation, and provides illustrations on how
hardware components relate to each other.
v This procedure assumes that you are installing the 5796 or 7314-G30 expansion unit into an existing
rack. If the rack is not installed, you must perform the procedure for “Installing the rack” on page 5
before performing this procedure to install the expansion unit into the rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
v Read the “Rack safety notices” on page 94.
v For instructions related to installing, removing, and replacing a disk drive, disk-drive backplane, or
enclosure, see Disk drives.
To install the model 5802 or 5877 into a rack, complete these steps:
Figure 60. Routing system power cord
56 Installing into a rack
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must
perform each task in order. Failure to do so might cause rail failure and potential danger to yourself and
the expansion unit. Also, note that for the 5802 and 5877, three people are required to safely lift the
system. Using fewer than three people to lift the system can result in injury.
1. Refer to the kitting list provided with the rack installation kit, and complete a parts inventory, if you
have not done so.
2. Locate the rack-mounting hardware kit and the system rail assemblies that were included with your
expansion unit.
3. Determine where to locate the expansion unit in the rack. See “Determining the location.” This
expansion unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in (44.45
mm) in height. The rack contains three mounting holes for each EIA unit of height.
4. Mark the location. See “Marking the location” on page 58.
Determining the location
You might need to determine where to install an expansion unit in the rack. Use this procedure to
perform this task.
Before installing the expansion unit into a rack, complete the following steps:
1. Determine where in the rack to place the expansion unit in relation to other system hardware. As you
prepare, organize larger and heavier units into the lower part of the rack, and plan to install units into
the lower part of the rack first. Record the EIA locations in your plan.
Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is
44.50 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This
system unit, therefore, is 177.8 mm (7 in.) high and covers 12 mounting holes in the rack.
2. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you
plan to place the expansion unit.
Installing a system or expansion unit into a rack 57
3. If necessary, remove the front and back rack doors.
4. Mark the location where you intend to install the disk drive enclosure.
a. If you have a rack-mounting template, use it to mark the location.
b. If you do not have a rack-mounting template, see “Marking the location.”
Marking the location
You might need to mark the installation location. Use the procedure in this section to perform this task.
To mark the installation location and the position for installing the nut clips into a rack, complete these
steps:
1. Determine where in the rack to place the expansion unit in relation to other system hardware. As you
plan, keep the following in mind:
a. Organize larger and heavier units into the lower part of the rack.
b. Plan to install units into the lower part of the rack first.
c. Record the EIA locations in your plan.
Figure 61. Removing the filler panels
58 Installing into a rack
Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is
1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height. This
system unit, therefore, is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack.
2. Face the front of the rack and work from the left side. Make a note of both the lowest and highest
EIA units to be used for the expansion unit. Use tape, a marker, or pencil to mark the bottom
mounting hole (A) of the highest EIA unit. Then mark the top mounting hole (B) of the lowest EIA
unit. Mark the rack so that the mark can also be seen from the rear of the rack.
3. Count up two mounting holes from where you placed the mark for mounting hole (B) in the last step,
and place another mark next to that mounting hole (B). Mark the rack so that the mark can also be
seen from the rear of the rack. You now have two (B) marks on the rack, with one mounting hole
between the marks.
4. Repeat the last two steps to place three marks on the corresponding mounting holes on the front-right
side of the rack.
5. Go to the back of the rack. On the left side, find the EIA unit that corresponds to the lowest EIA unit
marked on the front of the rack.
6. Place a mark at the top hole (C) of the lowest EIA unit. Mark the rack so that the mark can be seen
from the rear of the rack.
7. Count up two mounting holes from where you placed the mark in the last step, and place another
mark next to that mounting hole (C).
8. Repeat the last two steps to place two marks on the corresponding mounting holes on the rear-right
side of the rack.
Attaching the mounting hardware to the rack
You might need to attach mounting hardware to the rack. Use the procedure in this section to perform
this task. This section also includes information intended to promote safety and reliable operation, and
illustrations of the hardware components.
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
To install the mounting hardware into the rack, follow these steps:
1. Rotate down the front and back clamps (C) on the left and right rails.
2. On each rail, remove the tape holding the spring (B) to the rail. Hook the end of the spring around
the circular standoff on the rail.
Figure 62. Marking the installation locations
Installing a system or expansion unit into a rack 59
3. Start at the front of the rack. Extend the right rail and insert the rail pins (D) into the marked
locations in the rack so that the rail is facing upward. Rotate up the front clamp (C) to secure the rail
in place.
Notes:
v If your rack has the square mounting holes, remove the rail pins (D) from the rail. Install the
larger rail pins provided with the rack installation kit.
v The rail installation can be completed by one person. However, the installation is easier if one
person is positioned at the front of the rack and one person is positioned at the back of the rack.
4. Extend the left rail and insert the rail pins (D) into the marked locations. Rotate up the front clamp
(C) to secure the rail in place.
5. Move to the back of the rack, extend the right rail, and insert the rail pins (D) into the marked
locations. Rotate up the back clamp (C) to secure the rail in place.
6. Extend the left rail and insert the rail pins (D) into the marked locations. Rotate up the back clamp
(C) to secure the rail in place.
7. At the back of the rack, use two M5 screws (E), one below each rail pin (D), to secure the left rail to
the back of the rack.
8. Use two M5 screws (E), one below each rail pin (D), to secure the right rail to the back of the rack.
9. Move to the front of the rack. Use one M5 screw (E) below the top rail pin to secure the left rail to
the front of the rack.
Figure 63. Installing the rails
60 Installing into a rack
10. Use one M5 screw (E), below the top rail pin (D), to secure the right rail to the front of the rack.
11. Locate the marks previously made on the front Electronic Industries Alliance (EIA) strips of the rack
cabinet. Each side contains one mark. Insert one nut clip (F) at the mark on the right side of the rack
as shown in Figure 65 on page 62.
12. Insert a nut clip (F) at the mark on the left side of the rack as shown in Figure 65 on page 62.
13. From the back of the rack cabinet, locate the marks previously made on the rear Electronic Industries
Alliance (EIA) strips. Each side contains two marks. Insert a nut clip (F) at each of the marks on the
right side of the rack as shown in Figure 65 on page 62.
Figure 64. Attaching the rails
Installing a system or expansion unit into a rack 61
14. Use two M5 screws (H) to attach the retention bracket (G) to the rack on the right side.
15. Insert a nut clip (F) at each of the marks on the left side of the rack as shown in Figure 65.
16. Use two M5 screws (H) to attach the retention bracket (G) to the rack on the left side.
Installing the 5802 or 5877 on the rail assembly
You might need to install the expansion unit on the rail assembly. Use the procedure in this section to
perform this task. In addition to related safety information, this section also includes illustrations of the
related hardware components and shows how these components relate to each other.
1. Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and
the tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when
the rails are pulled out of the rack. Refer to “Rack safety notices” on page 94.
To install the expansion unit on the rail assembly, follow these steps:
1. To remove the shipped cover from the back of the expansion unit, remove the four thumb screws that
are used to secure it. Pull off the cover.
Note: There are two thumb screws at the top of the cover and one on each side.
2. If the bezel is installed on the front of the system, remove it by completing the followingsteps:
a. Remove the two M5 securing screws (A) as shown in Figure 66 on page 63.
b. Remove the service information holder from the slot in the bezel.
c. Using the two blue touch points on each side of the expansion unit, pull the bezel straight out to
remove the bezel from the front of the expansion unit.
d. Remove the two nut clips that were used to secure the bezel to the expansion unit from the EIA
flange.
Figure 65. Installing the retention bracket
62 Installing into a rack
3. Using three people, hold onto the four handles (I), as shown in the following figure, to lift the
expansion unit and position it over the front of the rails.
Attention: Three people are required to safely lift the expansion unit. Using fewer than three people
to lift the expansion unit can result in injury.
4. Slide the expansion unit into the rack.
5. Install the two M5 securing screws (J) through the expansion unit and into the nut clips on the left
and right rails.
6. Install the bezel (L) onto the expansion unit. Press the bezel into place and secure it with two M5
screws (M) on the left and right rails.
Figure 66. Removing the service information holder and bezel
Figure 67. Installing the expansion unit into a rack
Installing a system or expansion unit into a rack 63
7. Insert the service information holder (N) into the slot in the bezel.
8. At the rear of the system, install the four securing screws (K) through the expansion unit and
retention brackets on the left and right rails.
For instructions on how to connect and configure the expansion unit after installation, see Enclosures and
expansion units.
Installing the model 5886 into a rack
This procedure can be used to install the 5886 disk drive enclosure into a rack. The procedure includes
information intended to promote safety and reliable operation, and provides illustrations on how
hardware components relate to each other.
Figure 68. Installing the bezel and service information holder
Figure 69. Installing four screws at back of system
64 Installing into a rack
v This procedure assumes that you are installing the 5886 disk drive enclosure into an existing rack. If
the rack is not installed, you must perform the procedure for “Installing the rack” on page 5 before
performing this procedure to install the disk drive enclosure into the rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
v Read the “Rack safety notices” on page 94.
Installing the disk drive enclosure into a rack is a customer task. You can perform this task yourself, or
contact a service provider to perform the task for you. You might be charged a fee by the service
provider for this service.
Important: To complete this procedure, it is suggested that you use two people to attach the rail
assembly to the rack. Position one person in front of the rack and one person at the back of the rack.
To install the model 5886 into a rack, complete these steps:
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
1. Complete a parts inventory, if you have not done so.
Note: For this installation you will need a 5 mm (3/16-in.) flat-blade screwdriver during the
installation. A #2 Phillips screwdriver and an 8 mm wrench are optional.
Here is a list of items that you need to install the unit in the rack cabinet:
v Rail assembly, right (1)
v Rail assembly, left (1)
v M5 screws (8)
v Small diameter spacers (8) (these come installed, four in each rail)
v Large diameter spacers (8)
Note: These are used with square mount racks only.
v M4 pan-head screws (2)
v 15 mm (0.60 in.) diameter washers (6)
Note: These are used with square mount racks only.
2. Locate the rack-mounting hardware kit and the system rail assemblies that were included with your
disk drive enclosure.
The rails are front-to-back and left-to-right side dependent.
3. Determine where you will locate the disk drive enclosure in the rack. See “Determining the location.”
Determining the location
You might need to determine where to install the disk drive enclosure into the rack. Follow this
procedure to perform this task.
Read the “Rack safety notices” on page 94.
Before installing the disk drive enclosure into a rack, complete the following steps:
1. Determine where in the rack to place the disk drive enclosure in relation to other system hardware.
As you plan, keep the following in mind:
a. Organize larger and heavier units into the lower part of the rack.
Installing a system or expansion unit into a rack 65
b. Plan to install units into the lower part of the rack first.
c. Record the EIA locations in your plan.
Note: The 5886 disk drive enclosure is two Electronic Industries Alliance (EIA) units high. An EIA
unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of
height. This system unit, therefore, is 3.5 in. (89 mm) high and covers six mounting holes in the rack.
2. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you
plan to place the disk drive enclosure.
3. If necessary, remove the front and back rack doors.
4. Mark the location where you intend to install the disk drive enclosure.
a. If you have a rack-mounting template, use it to mark the location.
b. If you do not have a rack-mounting template, see “Marking the location” on page 67.
Figure 70. Removing the filler panels
66 Installing into a rack
Marking the location
You might need to mark the location for rack installation. Follow these steps to perform this task.
To mark the installation location and install the nut clips into a rack, complete the following steps:
1. Determine where in the rack to place the disk drive enclosure in relation to other system hardware.
As you plan, keep the following in mind:
a. Organize larger and heavier units into the lower part of the rack.
b. Plan to install units into the lower part of the rack first.
c. Record the EIA locations in your plan.
Note: The 5886 disk drive enclosure is two Electronic Industries Alliance (EIA) units high. An EIA
unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of
height. This system unit, therefore, is 3.5 in. (89 mm) high and covers six mounting holes in the rack.
2. Face the front of the rack and work from the right side of the rack. Mark the middle hole of the
bottom EIA unit (A) as shown in the following figure. Identify the marked hole from both the front
and back of the rack.
3. Mark the middle hole of the bottom EIA unit on the left side of the rack.
4. Go to the back of the rack. On the right side, find the EIA unit that corresponds to the bottom EIA
unit marked on the front of the rack.
Note: The following figure is an example of how the EIA units appear, you might have a different
configuration or placement.
5. Mark the middle hole of the bottom EIA unit B.
6. Mark the bottom hole of the next (higher) EIA unit C.
7. Mark the corresponding holes on the left side of the rack.
Attaching the mounting hardware to the rack
You might need to attach mounting hardware to the rack. Follow these steps to perform this task. In
addition to information intended to promote safety and reliable operation, this section also includes
illustrations of the related hardware components and shows how these components relate to each other.
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
Figure 71. Marking holes on the front and back of the rack frame
Installing a system or expansion unit into a rack 67
To install the mounting hardware into the rack, follow these steps:
1. Loosen the two rail-adjustment screws on one rail just enough to enable the rails to slide.
2. Locate the marks previously made on the Electronic Industries Alliance (EIA) strips at the front and
back of the rack cabinet. Use the L or R imprinted on the front bracket of your first rail to align it
with these marks within the matching side of the rack cabinet. The bottom of the rail support ledge
should be slightly above the U mark on the rack flange.
The rail comes with two spacers in the front bracket and two spacers in the rear bracket. If the rack
has round holes, the spacers on the rail brackets fit into the round holes in the rack flanges. If the rack
has square holes, replace the set of small spacers on the front and rear of the rail with the larger
spacers that are included with the rail kit.
3. Secure the rail to the front of the rack cabinet with one M5 screw in the top hole of the bracket. For
racks with square holes, add one washer between the M5 screw and the rail bracket.
Figure 72. Extending the rails
Figure 73. Setting the spacers in place
68 Installing into a rack
4. Secure the rail to the rear of the rack cabinet with two M5 screws in the top and bottom holes in the
rear rail bracket. For racks with square holes, add one washer between each M5 screw and the rail
bracket.
5. Tighten the rail-adjustment screws.
6. Repeat step 1 on page 68 through step 5for the other rail.
Installing the 5886 on the rail assembly
You might need to install the expansion unit on the rail assembly. Use the steps detailed in this procedure
to perform this task. In addition to related safety information, illustrations of the related hardware
components are provided.
1. Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and
the tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when
the rails are pulled out of the rack. Refer to “Rack safety notices” on page 94.
To install the disk drive enclosure on the rail assembly, follow these steps:
1. To reduce the weight of the unit for easier installation in the rack cabinet, remove the power supplies.
To remove a power supply, complete the following steps:
a. On the left side of a power supply, press the orange release tab to the right just enough to release
the handle (no more than 6.4 mm (0.24 in.)) as you rotate the handle downward.
Figure 74. Attaching the rails
Installing a system or expansion unit into a rack 69
b. Carefully slide the power supply out of the unit.
c. Repeat this step for the other power supply, and then, set both units aside for later installation.
2. Using two people, lift the disk drive enclosure and position it over the front of the rails.
Attention: Two people are required to safely lift the disk drive enclosure. Using fewer than two
people to lift the disk drive enclosure can result in injury.
3. Slide the disk drive enclosure into the rack cabinet, making sure that it slides into the tabs on the rear
of the rail.
4. Secure the front of the disk drive enclosure to the rack cabinet with one M5 screw in the open hole in
each bracket (no washer is required).
Figure 75. Removing power supplies
Figure 76. Installing the enclosure on the rails
70 Installing into a rack
5. Secure the rear of the disk drive enclosure to each rail assembly with one M4 pan-head screw in the
slotted hole on the outside of the storage enclosure chassis wall.
6. Reinstall the power supplies and hard disk drives.
7. Connect a power cord to each power supply. Use the power-cord guide for strain relief to prevent the
power cord from disconnecting accidentally.
8. Install the left bezel that has the LEDs, and then install the right bezel. Fit the cutout on the top of the
bezel over the tab on the chassis flange. Rotate the bezel down until it snaps into place. Make sure
that the inside surface of the bezel is flush with the chassis.
For instructions on how to connect and configure the expansion unit when you have completed installing
the expansion unit into the rack, see Enclosures and expansion units.
Installing the 5796 or 7314-G30 expansion unit in a rack
This procedure can be used to install the 5796 or 7314-G30 expansion unit into a rack. The procedure
includes information intended to promote safety and reliable operation, and provides illustrations on how
hardware components relate to each other.
Figure 77. Installing the securing screws
Figure 78. Attaching the side bezels
Installing a system or expansion unit into a rack 71
v This procedure assumes that you are installing the 5796 or 7314-G30 expansion unit into an existing
rack. If the rack is not installed, you must perform the procedure for “Installing the rack” on page 5
before performing this procedure to install the expansion unit into the rack.
Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not
attempt to install a stand-alone model in a rack.
v Read the “Rack safety notices” on page 94.
To install the model 5796 or 7314-G30 into a rack, complete these steps:
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
1. Complete a parts inventory.
v Locate the kitting report in an accessory box.
v Ensure that you received all the parts that you ordered.
2. Ensure that there is an empty slot in an existing shell or that there is enough room in the rack to
install a new shell.
Powering off the system and removing power
You might need to power off the system and remove power. Use the procedure in this section to perform
this task.
1. Ensure that the system you are attaching to is running the latest level of firmware. For information on
checking or upgrading the firmware level or your system, refer to the firmware updates section of
your system's installation information.
2. After ensuring that your firmware is at the latest level, shut down and power off the system that you
are attaching to. For information about shutting down and powering off the system, see the
documentation provided with your expansion unit.
3. If you are installing an expansion unit into an existing shell, continue with “Installing the expansion
unit” on page 77. Otherwise, refer to “Marking the location.”
Marking the location
You might need to mark the location. Use the procedure in this section to perform this task.
To mark the installation location and install the nut clips into a rack, complete the following steps:
1. Determine where in the rack to place the expansion unit in relation to other system hardware. As
you plan, keep the following in mind:
a. Organize larger and heavier units into the lower part of the rack.
b. Plan to install units into the lower part of the rack first.
c. Record the EIA locations in your plan.
Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is
1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height.
This system unit, therefore, is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack.
The following illustration shows a 1-EIA unit and a 4-EIA unit. Depending on the rack manufacturer,
the EIA units might be separated either by color or by a line. The holes along the EIA strip are not
evenly spaced. If your rack has no color or line separation between EIAs, assume that each EIA
section begins where the hole spacing is closest together.
72 Installing into a rack
A Placement mark
B Nut clip
2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Make a note of the EIA location. Use tape, a marker, or a pencil to mark the
rack next to the top hole of this EIA unit (A). Assure that the mark can be seen from both the front
and rear of the rack.
3. Begin with the hole identified by your mark for (A). Count up two holes and make a mark (B) next
to the second hole. Assure that the mark can be seen from both the front and rear of the rack.
4. Begin with the hole identified by your mark for (B). Count up two holes and place a nut clip. The
nut clip (C) aids in securing your expansion unit to the rack.
5. Begin with the hole identified by your mark for (B). Count up six holes and place a nut clip. The nut
clip (D) aids in securing your expansion unit to the rack.
6. Repeat the last four steps to make marks and place nut clips on the left side of the rack.
7. Facing the back of the rack and working from the right side, locate the bottom EIA unit that your
expansion unit uses. Mark the rack next to the top hole of this EIA unit (E).
8. Begin with the hole identified by your mark for (E). Count up two holes and make a mark (F) next
to the second hole.
9. Begin with the hole identified by your mark for (F). Count up four holes and make a mark (G) next
to the fourth hole.
10. Repeat the last three steps to make marks on the left side of the rack.
Installing the expandable rails and extension bracket into the rack
You might need to install rails into the rack. Use the procedure in this section to perform this task. In
addition to related safety information, this section also includes illustrations of the related hardware
components and shows how these components relate to each other.
Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must
perform each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
To install the expandable rails into the rack, complete the following steps:
1. Loosen the two screws (B) in the middle of both rails (A) to allow each rail to extend.
Figure 79. EIA units with expandable rails for the 5796 expansion unit
Installing a system or expansion unit into a rack 73
2. Starting at the back of the rack, extend the left rail. Insert the rail pins (C) into the marked locations
on the rack so that it is facing upward (L shape).
3. From the back of the rack, extend the right rail and insert the rail pins (C) into the marked locations.
4. Move to the front of the rack, extend the left rail and insert the rail pins (C) into the marked
locations.
5. Extend the right rail and insert the rail pins (C) into the marked locations.
6. Using one M5 screw (D), secure the left rail to the front of rack into the lower location (E)
Important: Do not install a screw in the top location at this time. It will secure the shell to the
assembly.
Figure 80. Extending the rails
74 Installing into a rack
.
7. Using one M5 screw (D), secure the right rail to the front of the rack into the lower location (E).
8. Move to the back of the rack and locate the marks that were previously made on the back-left rail.
Use three M5 screws (G) to secure the rail arm and bracket (F) to the rack at these locations.
Note: The bracket will face inward.
Figure 81. Attaching the rails to the front of the rack
Installing a system or expansion unit into a rack 75
9. Locate the the marks that were previously made on the back-right rail of the rack. Use three M5
screws (G) to secure the rail arm and bracket (F) to the rack at these locations.
10. Tighten the two screws (B) in the middle of the left rail and the two screws (B) in the middle of the
right rail. Refer to Figure 80 on page 74.
Installing the shell
You might need to install the shell. Use the procedure in this section to perform this task. This section
also includes illustrations of the related hardware components and shows how these components relate to
each other.
Note: Before installing the shell, ensure that the rails are level from side to side and from front to back.
To install the shell, complete the following steps:
1. From the front of the rack, place the shell (A) on the rails and slide it into the rack.
Figure 82. Attaching the rails to the back of the rack
76 Installing into a rack
A Shell
B Retaining screws (four)
2. Install and tighten the four retaining screws (B) into the front of the shell, securing the shell (A) to the
rack.
Installing the expansion unit
You might need to install the expansion unit into the rack. Use the procedure in this section to perform
this task. This section also includes illustrations of the related hardware components and shows how
these components relate to each other.
To install the expansion unit into a rack, complete the following steps:
1. Remove the front and back shipping covers of the 5796 or 7314-G30. These shipping covers are
secured by four thumb screws each. Two screws are located on the top of the cover and two are
located on the sides of the cover.
2. If you are installing a expansion unit (H) into an existing shell (G) that has a filler plate (F), remove
the filler plate by pressing down the top tabs and pulling it out.
Figure 83. Installing the shell into the rack on the expandable rails
Installing a system or expansion unit into a rack 77
F Filler plate
G Shell
H Drawer
I Thumb screw
J Cable retention bracket
3. Slide the expansion unit (H) into the shell.
4. If you are installing a 5796, use a thumb screw (I) to install the cable retention bracket (J).
5. If you have two expansion units to install, repeat the last four steps to install the second expansion
unit.
6. If there is only one expansion unit installed in the shell, install the filler plate (N) in the blank slot in
the shell by sliding the filler plate up under the top tab of the shell. Gently press the filler plate into
place.
Figure 84. Removing the filler plate
Figure 85. Removing the filler plate
78 Installing into a rack
N Filler plate
7. Attach the front cover. See http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/
p7haj/frontcoverin7314g30.htm (http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/
p7haj/frontcoverin7314g30.htm).
Attaching the power controller (SPCN), and power cables
You might need to attach the System Power Control Network (SPCN) and power cables to the expansion
unit. Use the procedure in this section to perform this task.
For instructions on cabling and connecting your expansion units, see Enclosures and expansion units.
Powering on the system
You might need to power on the expansion unit. For information about powering on the expansion unit,
refer to Start the system or logical partition.
Installing the model 5786 or 7031-D24 into a rack
You might need to install the expansion unit into a rack. This section includes procedures so that you can
perform these tasks.
This procedure assumes that you are installing the expansion unit into an existing rack. If the rack is not
installed, follow the instructions in “Installing the rack” on page 5. Return to this procedure for
instructions on installing the expansion unit into the rack.
When you have installed the expansion unit into the rack, you will need the instructions in Disk drives,
to connect and configure the 7031-D24 or 5786.
Note: This procedure applies only to the models that are designed to be mounted into a rack.
To install the expansion unit into a rack, perform the following steps:
Figure 86. Installing the filler plate
Installing a system or expansion unit into a rack 79
Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must
perform each task in order. Failure to do so might cause rail failure and potential danger to yourself and
the expansion unit.
1. Read the “Rack safety notices” on page 94. Failure to read these notices could cause damage to the
equipment or injury to the installer.
2. Complete a parts inventory (See “Completing a parts inventory”).
3. Locate the rack-mounting hardware kit (A) and the system rail assemblies (B) that were included with
your expansion unit. See Figure 87 for the relative locations of these items. The system rails are
identical.
Note: An 8 mm nut driver and a flat-blade screwdriver, represented in the following diagram, are not
listed in the inventory or included in the parts shipped. These tools assist you in completing the
installation.
4. Determine where you will locate the expansion unit in the rack. See “Determining the location” on
page 81. This expansion unit is 4 Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in
(44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height.
5. To mark the location, see “Marking the location” on page 82.
Completing a parts inventory
You might need to complete a parts inventory. Use the procedure in this section to perform this task.
If you have not done so, complete a parts inventory before proceeding with the installation:
1. Locate the kitting report in an accessory box.
2. Ensure that you received all the parts that were ordered.
If there are incorrect, missing or damaged parts, contact:
v Your IBM reseller
Figure 87. Parts inventory
80 Installing into a rack
v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only)
v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your service
and support telephone numbers.)
Determining the location
You might need to determine where to install an expansion unit in the rack. Use this procedure to
perform this task.
Before installing the expansion unit into a rack, complete the following steps:
1. Read the “Rack safety notices” on page 94.
2. Plan where you will place the units. Place the larger and heavier units in the lower part of the rack.
These expansion units are four Electronic Industries Alliance (EIA) units high. The rack contains three
mounting holes for each EIA unit of height. Each of these units therefore is 7 in. (177.8 mm) high and
covers 12 mounting holes in the rack.
3. Remove the filler panels to allow access to the inside of the rack enclosure where you plan to place
the unit.
Installing a system or expansion unit into a rack 81
4. Remove the front and back rack doors if necessary.
5. To mark the location, see “Marking the location.”
Marking the location
You might need to mark the location for the rack installation. Use the procedure in this section to
perform this task.
To mark the installation location and install the nut clips into a rack, complete these steps:
1. Determine where in the rack to place the unit and make note of the location numbers. This unit is
four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45 mm) in height. The
rack contains three mounting holes for each EIA unit of height. This unit therefore is 7 in. (177.8 mm)
high and covers 12 mounting holes in the rack.
2. Facing the front of the rack and working from the left side, locate the lowest EIA unit that your unit
uses. Make a note of the EIA location. Use tape, a marker, or a pencil to mark the top mounting hole
(B) of this EIA unit. Mark the rack so that the mark can be seen from the rear of the rack.
3. On the left rail, count up two mounting holes from where you placed the mark in step 2 and place
another mark next to that mounting hole (B). Mark the rack so that the mark can be seen from the
rear of the rack. You now have two marks on the rack, with one mounting hole between the marks.
4. Repeat steps 2 and 3 to place two marks on the corresponding mounting holes on the front-right rail.
5. Go to the back of the rack. On the left side, find the EIA unit that corresponds to the lowest EIA unit
marked on the front of the rack.
6. Place a mark at the top hole (C) of the lowest EIA unit. Mark the rack so that the mark can be seen
from the rear of the rack.
7. Count up two mounting holes from where you placed the mark in step 6 and place another mark
next to that mounting hole (C).
8. Repeat steps 6 and 7 to place two marks on the corresponding mounting holes on the rear-right rail.
Attaching the mounting hardware to the rack
You might need to attach mounting hardware to the rack. Use the procedure in this section to perform
this task. In addition to information intended to promote safety and reliable operation, this section also
includes illustrations of the related hardware components and shows how these components relate to
each other.
Figure 88. Removing the filler panels
Figure 89. Marking the installation locations
82 Installing into a rack
CAUTION:
Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform
each task in the following order. Failure to do so might cause rail failure and potential danger to
yourself and the system unit.
To install the mounting hardware into the rack, follow these steps:
1. Loosen the two screws (B) in the middle of both rails (A) to allow each rail to extend.
2. Starting at the back of the rack, extend the left rail and insert the rail pins (C) into the marked
locations the rack so that the rail is facing upward.
Note: Although the rail installation can be completed by one person, the installation will be easier if
one person is positioned at the front of the rack and one person is positioned at the back of the rack.
3. From the back of the rack, extend the right rail and insert the rail pins (C) into the marked locations.
4. Moving to the front of the rack, extend the left rail and insert the rail pins (C) into the marked
locations.
5. Extend the right rail and insert the rail pins (C) into the marked locations.
6. Using two M5 screws (D), secure the left rail to the front of rack below each rail pin (C).
Figure 90. Extending the rails
Installing a system or expansion unit into a rack 83
7. Using two M5 screws (D), secure the right rail to the front of the rack.
8. Moving to the back of the rack and using two M5 screws (D), secure the left rail to the back of the
rack.
9. Using two M5 screws (D), secure the right rail to the back of the rack.
10. Tighten the two screws in the middle of the left rail and the two screws in the middle of the right
rail. Refer to Figure 91.
11. Insert two nut clips at the marked location on the front-left rail.
12. Insert two nut clips at the marked locations on the front-right rail.
Installing the 5786 or 7031-D24 on the rail assembly
You might need to install the expansion unit on the rail assembly. Use the procedure in this section to
perform this task. In addition to related safety information, this section also includes illustrations of the
related hardware components and shows how these components relate to each other.
Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and the
tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when the
rails are pulled out of the rack. Refer to “Rack safety notices” on page 94.
To install the expansion unit on the rail assembly, follow these steps:
1. If the bezel is installed on the front of the expansion unit, remove it by pressing in on the two release
buttons.
Figure 91. Attaching the system rails
84 Installing into a rack
2. Using three people, lift the expansion unit and position it over the front of the rails.
Attention: Three people are required to safely lift the expansion unit. Using fewer than three people
to lift the expansion unit can result in injury.
3. Push the expansion unit into the rack until it is completely installed into the rack.
4. Determine whether the securing plates are preinstalled on the back of the expansion unit. If not,
attach the plates to the back left and back right of the expansion unit. The securing plates are
illustrated in Figure 94 on page 86.
5. Install the two M5 securing screws A through the chassis and into the nut clips (B) on the left and
right rail.
Figure 92. Removing the bezel
Figure 93. Installing the securing screws
Installing a system or expansion unit into a rack 85
6. At the back of the rack, install two M5 securing screws (C) through the back of both the left and right
securing plates and into the rack rail.
7. Install the bezel onto the expansion unit.
For instructions on how to connect and configure the expansion unit when you have completed installing
the 5786 or 7031-D24 into the rack, see Enclosures and expansion units.
Figure 94. Securing the expansion unit to the back of the rack
Figure 95. Installing the bezel
86 Installing into a rack
Attaching the cable-management arm to the standard rails
You might need to attach the cable-management arm. Use the procedure in this section to perform this
task.
If standard (fixed length) rails were included with the expansion unit, and the cable-management arm is
not attached to the rails, complete the following steps. Otherwise, go to “Determining the location” on
page 81.
1. Insert the cable-management arm rail tab (B) into the rail.
2. Rotate the cable-management arm (A) as shown in the following illustration.
3. Secure the cable-management arm (A) with the retaining screw (C).
4. Repeat steps 1 through 3 for the other rail.
Figure 96. Rotating the cable-management arm
Installing a system or expansion unit into a rack 87
88 Installing into a rack
Removing and replacing 7014-T00 or 7014-T42 side panels
You might need to remove and replace a side panel on a rack. Use the procedure in this section to
perform this task.
The rack might have optional side panels. To remove and replace a side panel, complete the following
procedure:
1. Unlock the side panels by pressing down on both locking latches to release the latches.
Note: If your rack uses a ruggedized kit, you must remove the securing screw to allow each side
panel to be removed. Refer to “Releasing the side panel with a ruggedized kit” on page 102.
2. Tilt the top of the side panel slightly toward you.
3. Lift the side panel away from the ridge on the bottom of the rack.
4. Repeat this procedure for the other side panel.
Replacing a 7014-T00 or 7014-T42 side panel
You might need to replace a side panel on a rack. Use the procedure in this section to perform this task.
To replace a side panel, complete the following procedure:
1. Tilt the top of the side panel slightly toward you.
Figure 97. Removing the side panel
© Copyright IBM Corp. 2010 89
2. Place the bottom of the side panel onto the ridge on the bottom of the rack.
3. Slide the top of the side panel into place and close the locking latches.
Note: If your rack uses ruggedized kit, you must install a securing screw into each side panel that
was installed. Refer to “Ruggedized kit” on page 101.
Figure 98. Replacing the side panel
90 Installing into a rack
Removing and replacing 7014-T00 or 7014-T42 trim panels
Racks that are installed with multiple processor drawer systems can use front trim panels instead of
doors. For racks that use trim panels, a reduced-interference panel type must be installed when certain
expansion unit models are present. Use the procedure in this section to remove the existing trim panels
from the rack and replace them with the reduced-interference panel type.
To remove the existing rack trim panels, complete the following procedure:
1. Place both hands on the center of the right side trim panel.
2. Squeeze inward firmly with your fingertips to release the spring clips that hold the panel in place.
3. Rotate your hands in slightly until the panel is disengaged.
4. Lift the panel out and set it aside.
5. Repeat this procedure to remove the left side trim panel.
Replacing the 7014-T00 or 7014-T42 trim panels
For racks that use trim panels instead of doors, a reduced-interference panel type must be installed when
certain expansion unit models are present. Use the procedure in this section to perform this task.
To install the trim panels, complete the following procedure:
Figure 99. Removing the rack trim panel
© Copyright IBM Corp. 2010 91
1. Align the bottom plate (A) of the right side trim panel to the bottom of the rack
2. Align the top of the trim panel (B) and squeeze slightly with your fingertips.
3. After the trim panel is in the correct location, release the pressure to allow the spring clips to hold the
panel in place.
4. Repeat this procedure to install the left side trim panel.
Figure 100. Installing the rack trim panel
92 Installing into a rack
Attaching the rack doors
You might need to attach the rack doors. Use this procedure to perform this task.
Depending on the model of the rack, the front door of a rack might be an optional feature. If your system
already has the front door installed, or does not have a front door to install, skip this step.
Attaching a high-perforation front door
You might need to attach a front door to the rack. Use the procedure in this section to perform this task.
To install the high-perforation front door, do the following:
1. Read the “Rack safety notices” on page 94.
2. Remove the top, left, and right trim panels. For details on removing the left and right trim panels
from a7014-T00 or 7014-T42 rack, see “Removing and replacing 7014-T00 or 7014-T42 trim panels” on
page 91
Figure 101. Attaching the rack door
© Copyright IBM Corp. 2010 93
page 91.
1 Rack chassis 4 Right-side trim panel
2 Top trim panel 5 Spring clip
3 Left-side trim panel
3. Install the door latch on the right and the door hinges on the left.
4. For a high-perforation front door, align the door over the rack hinge, then move up the hinge pin on
the door, and lower the hinge pin into the hinge.
5. Adjust the latch so the door latches securely.
Rack safety notices
You need to read the rack safety notices before installing equipment.
Before installing a rack, rack features, or a system or expansion unit into a rack, read the following safety
information.
Attention: If you are installing equipment into a non-IBM rack, the rack must comply with the
Electronics Industries Association (EIA) 310D specifications. If you do not have a rail kit designed for the
equipment in the non-IBM rack, do not install the equipment into the rack as damage to the equipment
or personal injury could occur.
Figure 102. Removing the trim panels
94 Installing into a rack
DANGER
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
Lift precautions:
Attaching the rack doors 95
96 Installing into a rack
Installing the rack security kit
You might need to install the rack security kit. Use the procedure in this section to perform this task. This
section also includes illustrations of the related hardware components and shows how these components
relate to each other.
To install a rack security kit (feature 6580) that consists of the security lock and security slide bars,
complete the following procedure:
1. Read the “Rack safety notices” on page 94.
2. Verify the inventory in the rack security kit.
v
1 Two lock hardware kits. Each kit contains:
- Rack lock
- Bracket
- Screw
- Two keys
2 Two security slide bars
3 Two locked/unlocked stickers
3. Remove the existing door latch.
Figure 103. Rack security kit inventory
© Copyright IBM Corp. 2010 97
a. Open the front rack door.
b. On the inside of the door, remove the screw (4) in Figure 104, that secures the lock to the rack
door.
c. Remove the bracket (5).
d. From the outside of the door, remove the door latch (6).
Note: If the rack is equipped with the ruggedized kit, remove the jam nut and hex nut from the
existing door latch and reinstall both nuts on the new door lock latch.
4. Install the locking latch.
a. Insert the keyed rack lock into the latch slot on the front of the door (6) in Figure 104.
b. Secure the lock by attaching the lock bracket (5) with the screw (4), on the inside of the door.
5. Repeat steps 3 on page 97 and 4 to install the second lock on the back rack door.
6. Install a security slide bar on the right side of the rack.
Figure 104. Removing the existing door latch
98 Installing into a rack
Note: Each slide bar rail has two long tabs on the bottom of the rail. The slide bar rails are identical
and can be installed on either the right or left side cover panel.
a. Unlatch right-side cover panel and lean the panel back so that you can access the top of the panel.
b. With the flat side of the slide bar rail (7) in Figure 105, facing the inside of the cover panel (8),
insert the two tabs (9), on the slide bar rail into the two vertical support channels (10) on the side
cover panel.
Note: When installed correctly, the slide rail moves from front to back.
c. Reinstall the side panel cover on to the rack.
d. Lock the side panel covers by sliding the bars to the front of the rack.
e. Place a locked/unlocked sticker on the inside of the cover panel so that when the slide bar is in
the locked position, the tab is over the locked symbol (11), as shown in Figure 106 on page 100,
and over the unlocked symbol (12), when the slide bar is unlocked.
Figure 105. Installing a security slide bar
Installing the rack security kit 99
f. Repeat the procedure for the left side of the rack.
Figure 106. Placing the locked/unlocked sticker on the cover panel
100 Installing into a rack
Ruggedized kit
You might need to remove or replace a part in the ruggedized kit. This section includes procedures so
that you can perform these tasks.
The ruggedized kit, feature code 6080, should be ordered at the same time the rack is ordered. The
ruggedized kit brackets are installed at the manufacturer.
Note: If you are installing a rack with the ruggedized kit and need to secure the rack to the floor, refer to
“Installing the rack” on page 5.
The following illustration highlights the contents of the kit and the approximate location of each bracket
and hinges in the event that you need to uninstall and reinstall a part.
1 Rack 9 Brace hinges
2 Spacer 10 Brace
3 Brace hinge 11 Side-door securing screw mount
4 Hinge pivot studs 12 Spacer
5 Brace latch bracket 13 Screw
6 Spacer 14 Washer
7 Screw 15 Side door securing screw
8 Brace thumbscrew
Figure 107. Ruggedized kit parts
© Copyright IBM Corp. 2010 101
Releasing the ruggedized brace
You might need to release the ruggedized brace. Use the procedure in this section to perform this task.
To access the back of a system that is installed in a rack with a ruggedized kit, complete the following
steps to release the ruggedized brace:
1. Remove the brace thumbscrew (8).
2. Swing the brace (10) out of the rack.
3. Access the system and reinstall the brace.
Releasing the side panel with a ruggedized kit
You might need to release the side panel on the rack. Use this procedure to perform this task.
The ruggedized kit contains securing screws that secure the side panels to the rack. To remove a securing
screw, complete the following steps:
1. If necessary, open or remove the back rack door.
2. Locate the securing screw mount (11) for the side door that will be removed.
Figure 108. Releasing the brace to access systems
102 Installing into a rack
3. Using a screwdriver, remove the securing screw (15) and washer (14) from the side panel. To remove
the side panel, see “Removing and replacing 7014-T00 or 7014-T42 side panels” on page 89.
Figure 109. Removing the securing screws on the side panel
Ruggedized kit 103
104 Installing into a rack
Connecting multiple racks with rack-to-rack attachment kit
You might need to connect multiple racks together. Use the procedure in this section to perform this task.
This topic describes how to connect multiple racks to each other using a rack-to-rack attachment kit. To
do this, you will need the rack-to-rack attachment kit (feature 7840).
To connect multiple racks with the rack-to-rack attachment kit follow this procedure:
1. Read the “Rack safety notices” on page 94.
2. If they are installed, remove the side panels from each rack. Remove side panels only from the sides
that will be attached to each other. To do this:
a. Lift up the two panel-release tabs.
b. Pull the panel up and away from the rack chassis. This motion will release the panel from the
two lower J brackets.
c. Store the side panels.
3. Remove the two Z brackets and the two J brackets. These brackets are used to hang the side panels.
4. Install the first two standoffs in the upper-left and lower-right corners of the first rack as shown in
Figure 110 on page 106.
5. Install the second two standoffs in the upper-left and lower-right corners of the second rack as
shown in Figure 110 on page 106.
6. Attach the long foam as shown in Figure 110 on page 106. For a model T42 rack, join the short foam
to the end of the long foam, and adhere it to the frame length of the rack.
7. Position the racks together.
8. Align the standoff holes. You might need to adjust the leveling feet to do this.
9. Install a screw and washer into all four positions, but do not tighten.
© Copyright IBM Corp. 2010 105
10. After all racks are bolted together, level the racks.
11. Tighten all four screws.
12. Snap on the trim pieces that go between the front and back racks.
13. Snap on the trim piece that goes on top and between the racks.
14. Install rack filler panels to cover the open areas at the front of the racks. All the gaps in the front of
the rack must also be sealed, including the gaps between equipment. This step ensures that proper
airflow within the rack is maintained.
15. Connect the cables that go between the racks.
Figure 110. Removing the side panels, Z and J brackets, and installing standoffs and long foam to connect multiple
racks
Figure 111. Location of foam strips (top view)
106 Installing into a rack
16. If you are installing tip plates, go to step 5 in “Installing the rack” on page 5.
Connecting multiple racks with rack-to-rack attachment kit 107
108 Installing into a rack
Rack status beacon
The rack status beacon is placed on top of a rack and cabled to multiple system units inside the rack. The
server firmware illuminates the rack status beacon in response to any detected problems in the status.
This helps you locate the problem more quickly.
This is a customer task. You can perform this task yourself, or contact an service provider to perform the
task for you. The service provider might charge you for this service.
These procedures assume that the system units have been installed into the rack and have been
connected to the power supply. If the system units have not been installed, refer to the installation
instructions for your system unit.
Installing the rack status beacon
You might need to install the rack status beacon. Use the procedure in this section to perform this task.
To install the rack status beacon, you need the following:
v Rack status beacon assembly.
v Two or more rack status beacon cables. You need one cable to connect the beacon to the junction box
and one for each system unit that you want to connect to the beacon.
v One or two power cords.
v One or more junction boxes for the rack status beacon.
v One or more junction box cables.
To install the rack status beacon, follow these steps:
1. Place the rack status beacon (A), on top of the rack so that the light is pointing to the front of the
rack.
© Copyright IBM Corp. 2010 109
2. Plug one end of the rack status beacon cable (1) into the port on the back of the rack status beacon
assembly.
3. Thread the rack status beacon cable through the rack's cable access cover, (B).
4. Plug the other end of the rack status beacon cable into the output port on the junction box, (C).
5. To connect the junction box to the system unit, plug one end of a rack status beacon cable (1), into an
input port on the junction box.
Figure 112. Connecting the rack status beacon to the system unit through the junction box
Figure 113. Connecting the rack status beacon and system unit to the junction box
110 Installing into a rack
6. Plug the other end of the second rack status beacon cable into the beacon port on the system unit.
This port is labeled with a beacon symbol.
Note: On some system units, the orientation of the port is rotated.
7. To add additional system units to this rack status beacon, repeat steps 4 and 5 for each additional
system unit. You can connect multiple junction boxes to add additional system units. Refer to
“Connecting multiple junction boxes.”
8. Plug the power cords into the rack status beacon assembly. Only one power cord is required, but you
can use two power cords for backup.
9. Plug the other end of the power cords into the rack power supply.
If you install the rack status beacon during initial system setup, the initial program load (IPL) will read
the rack identification information. If you install the rack status beacon on system units that are running,
you will need to start the system again.
Connecting multiple junction boxes
You might need to connect multiple junction boxes. This section includes procedures so that you can
perform these tasks.
You can connect multiple junction boxes so that additional system units can be added to the rack status
beacon.
To connect multiple junction boxes, you need the following additional equipment:
v One or more rack status beacon junction boxes.
v One or more junction box cables. You need a junction box cable for each junction box that you want to
add.
1. Plug the one end of the junction box cable (2), into an input port on the first junction box.
Figure 114. Rack status beacon port
Rack status beacon 111
2. Plug a junction box cable into the output port of the additional junction box.
3. Connect additional system units to the additional junction box as described in “Installing the rack
status beacon” on page 109.
4. Repeat steps 1 - 3 for each additional junction box.
Figure 115. Connect the rack status beacon and system unit to the junction box
112 Installing into a rack
Installing or removing a rack-mounted system-unit latch
bracket
You might need to install or remove a latch bracket. Use the procedure in this section to perform this task
These procedures describe how to install or remove the rack-mounted system-unit latch bracket. You can
perform this task yourself, or contact a service provider to perform the task for you. The service provider
might charge you for this service.
To install or remove a latch bracket, complete the following steps:
Note: The illustrations in these procedures may differ from your machine type and model.
1. Read the “Rack safety notices” on page 94.
2. Open the front rack door.
3. Push the latch bracket (A) or (B) into place on the side of the system unit.
4. Secure the latch bracket with two retaining screws (C) on the side of the latch bracket.
5. Replace the front cover. See “Removing and replacing covers on the 8233-E8B or 8236-E8C system” on
page 135.
6. Close the front rack door.
To remove a latch bracket, complete the following steps:
1. Remove the cover from the front of the system. See “Removing and replacing covers on the 8233-E8B
or 8236-E8C system” on page 135.
2. Place the system in the service position. See the service information for your model.
Figure 116. Reinstalling the latch brackets
© Copyright IBM Corp. 2010 113
3. Remove the retaining screws (C) located on the side of the latch bracket (A) or (B).
4. Remove the latch bracket from the system unit.
Figure 117. Removing the retaining screws
114 Installing into a rack
Removing and replacing the rack top cover
You might need to remove or replace the rack top cover. User these procedures to perform these tasks.
Removing the rack top cover
You might need to remove the rack top cover. Use this procedure to perform this task.
Note: A 10 mm box socket with an extension bar is required to remove the screws from the top cover.
Other tools might cause the screw heads to become rounded and unremovable.
To remove the rack cover from the rack, do the following steps:
1. Remove the front door.
Note: If the rack is locked, unlock the doors. Then unlock the side panels by removing the locking
bars from both sides of the top rear of the rack. Slide the locking bars out from the rack at the top of
each side of the rack from the rear. The locking bars can be removed from each side of the front of the
rack if the rack space in the locking bars area is open and the bars are easily accessible.
2. Remove the top, left, and right trim panels.
3. Remove the side panels. For more information, see “Removing and replacing 7014-T00 or 7014-T42
side panels” on page 89.
4. Remove one screw from the right side of the top cover and one screw from the left side of the top
cover. Do for both front and rear.
5. Locate the front (A) and back (C) rack braces that were provided in the shipping container. Fasten
each rack brace at the top of the front and rear of the rack, just below the top cover.
6. Use the four screws (B) that were removed from the top cover to secure each rack brace to the rack,
as shown in Figure 118 on page 116.
© Copyright IBM Corp. 2010 115
A Front rack brace
B Retaining screws (2 screws for each brace)
C Back rack brace
7. Remove the remaining six screws from the left and right sides of the top cover (D).
Figure 118. Securing the rack brace
116 Installing into a rack
D Top cover
E Cable access cover
F Side cover (quantity 2)
G EIA label
8. Lift off the top cover.
Replacing the rack top cover
You might need to replace the rack top cover. Use this procedure to perform this task.
Note: A 10 mm box socket with an extension bar is required to replace the screws into the rack top cover.
Other tools might cause the screw heads to become rounded and unable to be removed again.
To replace the rack top cover onto the rack, do the following:
1. Position the rack top cover (D) on the rack.
2. Install the six screws at the left and right sides of the rack top cover.
Figure 119. Removing the top cover
Removing and replacing the rack top cover 117
D Top cover
E Cable access cover
F Side cover (quantity 2)
G EIA label
Note: A 10 mm box socket with an extension bar is required to replace the screws into the rack top
cover. Other tools might cause the screw heads to become rounded and unable to be removed again.
3. Remove the rack braces (A) and (C) by unfastening them at the top of the rack, just below the top
cover. Remove the braces at the front and rear of the rack.
Figure 120. Removing the top cover
118 Installing into a rack
A Front rack brace
B Retaining screws (quantity 2 for each brace)
C Back rack brace
4. Install the screws (B) you removed from the rack braces at the right and left side of the top cover.
Install the screws for both the front and rear of the rack.
5. Install rack filler panels to cover open areas at the front of the rack. Seal all gaps in the front of the
rack, including the gaps between pieces of equipment. This step ensures that proper airflow within
the rack is maintained.
Figure 121. Removing the top cover
Removing and replacing the rack top cover 119
120 Installing into a rack
Power distribution unit plus
The power distribution unit plus (PDU+) can be installed in the 7014-T00 and 7014-T42 racks. It allows
you to monitor the individual power loads of the devices that are plugged into it.
Installing the PDU+ in the side of a rack
Learn how to install the power distribution unit plus (PDU+) in the side of a rack.
Tip: Removing the rack doors and side panels might make installation easier.
Important: You must use clip nuts to install the mounting brackets. Clip nuts are provided with the
PDU+ and install on the rack-mounting flanges.
To install the PDU+ in the 1U mounting space in the side of a rack, complete the following steps
1. Read the “Rack safety notices” on page 94.
2. Align the vertical-mounting brackets to the front of the PDU+. Make sure that you attach the brackets
so that the power outlets face the rear of the rack.
3. Attach the brackets to the PDU+ with two M3x5 screws per bracket. Use the screws that were
provided with the PDU+.
4. Align the PDU+ with the opening in the side of the rack. Then, while holding the PDU+ in place,
attach the brackets to the rack-mounting flanges with four clip nuts and four M5 screws that were
provided with the PDU+.
Figure 122. Aligning the vertical-mounting brackets to the front of the PDU+
© Copyright IBM Corp. 2010 121
Attention: You must disconnect the main input power before connecting or disconnecting the input
power cord from the PDU+.
5. If the PDU+ was provided with a detached power cord, connect the power cord now. Align the
connector on the power cord that was provided with the PDU+ with the connector on the front of the
PDU+, turning as necessary for key alignment. Then, turn the twist-lock on the connector clockwise
until it locks into place.
Figure 123. Aligning the PDU+ with the opening in the side of the rack
Figure 124. Aligning the connector on the power cord with the PDU+
122 Installing into a rack
6. Route the power cord from the PDU+ toward the rack side braces. Then, route the power cord along
a side brace toward the back of the rack and secure the power cord with the cable straps that are
provided with the PDU+.
7. Route the power cord toward a dedicated power source. Use the provided cable straps to secure the
power cord along the way. Use the openings in the rack, if the power cord must exit the rack to
connect to a power source.
Attention: To prevent damage to a power device and other connected devices, always connect the
power device to an authorized power source for that device.
8. Connect the power cord to a properly wired and grounded dedicated power source. Then, you can
connect servers or rack PDUs in the rack to the power outlets on the PDU+.
9. Route all of the other power cables neatly and secure the power cables with cable straps.
Related concepts
“Setting up power monitoring using the PDU+” on page 127
You can monitor the power status for any device that is connected to the power distribution unit plus
(PDU+), either manually or remotely, through the PDU+ Web interface. You can use the IBM DPI®
Configuration Utility to initially set up the PDU+ and to configure PDU+ settings such as network
parameters, access control table, and trap receivers table.
Installing the PDU+ horizontally in a rack
You might need to install the PDU horizontally, use the instructions in this section to complete this task.
Tip: Removing the rack doors and side panels might make installation easier.
Important: Use cage nuts for rack cabinets with square holes, and use clip nuts for rack cabinets with
round holes. If your rack cabinet requires cage nuts, use a cage-nut-insertion tool or a flat-blade
screwdriver to install them..
To install the PDU+ in the rack, complete the following steps
1. Read the “Rack safety notices” on page 94.
2. Align the vertical-mounting brackets to the front of the PDU+. Make sure that you attach the
brackets so that the power outlets face the rear of the rack.
3. Align the long mounting-brackets with the holes in the rear of thePDU+ and attach the brackets to
the PDU+ with two M3 pan-head screws with captive lock washers per bracket. Use the screws that
Figure 125. Aligning the vertical-mounting brackets to the front of the PDU+.
Power distribution unit plus 123
come with the rack-mounting kit.
4. Hold the PDU+ at a slight angle and carefully insert it into the 1-U mounting space in the rack
cabinet. Pushing in slightly on both of the long mounting-brackets helps clear the brackets from the
rack flanges.
124 Installing into a rack
Attention: You must disconnect the main input power before connecting or disconnecting the input
power cord from the PDU+.
5. Secure the end of the PDU+ that has the short mounting-brackets to the rack cabinet first. Make sure
that the short mounting-brackets are aligned with the outside of the rack flanges. Attach the brackets
to the rack flanges with two M6 screws and two cage nuts or clip nuts per bracket. Use the cage
nuts or clip nuts and the screws that come with the rack-mounting kit.
6. Secure the long mounting-brackets and the 1-U blank filler panel to the rack cabinet by doing the
following steps:
Figure 126. Aligning the PDU+ with the opening in the rack.
Power distribution unit plus 125
a. Adjust the long mounting-brackets to fit the depth of the rack cabinet.
b. Make sure that the long mounting-brackets are aligned with the inside of the rack flanges.
c. Align the blank filler panel on the outside of the rack flanges.
d. Attach the filler panel to the rack flanges and then to the long mounting-bracket with one M6
screw per bracket.
e. Tighten the M3 pan-head screws that secure the long mounting-brackets to the PDU.
7. Secure the end of the PDU+ that has the short mounting-brackets to the rack cabinet first. Make sure
that the short mounting-brackets are aligned with the outside of the rack flanges. Attach the brackets
to the rack flanges with two M6 screws and two cage nuts or clip nuts per bracket. Use the cage
nuts or clip nuts and the screws that come with the rack-mounting kit.
8. If the PDU+ was provided with a detached power cord, connect the power cord now. Align the
connector on the power cord that was provided with the PDU+ with the connector on the front of
the PDU+, turning as necessary for key alignment. Then, turn the twist-lock on the connector
clockwise until it locks into place.
126 Installing into a rack
9. Route the power cord from the PDU+ toward the rack side braces. Then, route the power cord along
a side brace toward the back of the rack and secure the power cord with the cable straps that are
provided with the PDU+.
10. Route the power cord toward a dedicated power source. Use the provided cable straps to secure the
power cord along the way. Use the openings in the rack, if the power cord must exit the rack to
connect to a power source.
Attention: To prevent damage to a power device and other connected devices, always connect the
power device to an authorized power source for that device.
11. Connect the power cord to a properly wired and grounded dedicated power source. Then, you can
connect servers or rack PDUs in the rack to the power outlets on the PDU+.
12. Route all of the other power cables neatly and secure the power cables with cable straps.
Setting up power monitoring using the PDU+
You can monitor the power status for any device that is connected to the power distribution unit plus
(PDU+), either manually or remotely, through the PDU+ Web interface. You can use the IBM DPI
Configuration Utility to initially set up the PDU+ and to configure PDU+ settings such as network
parameters, access control table, and trap receivers table.
Note: All of the Configuration Utility configuration options are available through the Web interface after
the PDU+ is set up on the local network.
Related tasks
“Installing the PDU+ in the side of a rack” on page 121
Learn how to install the power distribution unit plus (PDU+) in the side of a rack.
Using the IBM DPI Configuration Utility
Learn how to use the IBM DPI Configuration Utility to configure the power distribution unit plus
(PDU+) settings, such as the IP address, network parameters, access control table, and trap receivers
table.
Figure 127. Aligning the connector on the power cord with the PDU+
Power distribution unit plus 127
Connecting the console
You can configure the PDU+ using a workstation or notebook computer that is connected to the PDU+.
Connect the DB9-to-RJ-45 cable that is shipped with the PDU+ to the RJ-45 console connector on the
PDU+, and to a RS-232 serial (COM) connector on a workstation or notebook computer.
Using HyperTerminal
HyperTerminal is a terminal program in a Microsoft® Windows® operating system that enables you to
configure or control a device using command line parameters. You can configure the PDU+ parameters
and its outlets using numeric commands from a keyboard. You can also use Telnet or any other terminal
program to configure the PDU+ after the IP address is set.
To start HyperTerminal and communicate with the PDU+, complete the following steps:
1. Click Start ? Programs ? Accessories ? Communications ? HyperTerminal. The Connection
Description window is displayed.
2. Type the name for the connection in the Name field and select an icon for the connection.
3. Click OK. The Connect To window is displayed.
4. In the Connect using field, select the COM port that is connected to the PDU+.
5. Click OK. The Properties window is displayed.
6. Click Restore Defaults to use the default settings. Make sure that the Bits per second field is 9600
and that the Flow control field is None.
7. Click OK.
8. Press any key. The Configuration Utility main menu is displayed.
9. Type the default password, passw0rd (all lowercase letters with a zero, not O), and press Enter.
10. Enter the menu option that you want. For descriptions of the options, see “Configuration Utility
menu options.”
Configuration Utility menu options
The following options are on the Configuration Utility main menu:
IBM DPI Settings
When you select IBM DPI Settings, the IBM DPI Configuration Utility window is displayed with
the following options:
Set the IP Address, Gateway Address and MIB System Group
View and change the IP address, date, time, and MIB system information.
Set IBM DPI Control Group
Set the administrator user name, password, and access protocols.
Set Write Access Managers
Set up a list of users who can access and control the PDU+.
Set Trap Receivers
Configure remote network management system (NMS) servers to receive traps.
Set Date and Time
Adjust the date and time information for the PDU+.
Set Superuser Name and Password
Set the user name and password of the administrator who will use a Web browser to
configure the PDU+.
E-mail Notification
Set up a list of users who will be alerted with event messages if an unusual event is
triggered on the PDU+ system.
128 Installing into a rack
Set Multi-Users
Configure other user and password logins and the read and write access levels.
Set IBM DPI Information
Configure the PDU+ logging interval, refresh rate, and custom name fields for the load
groups.
Settings and Event Log Summary
View all PDU+ configuration settings.
Reset Configuration to Default
Reset all system settings to their factory default values.
Restart HD-PDU
Restart the PDU+.
Setting the IP address
Important: You must set the IP address before you can use the Web interface or access the PDU+ in an IP
network (LAN/WAN). Contact your system administrator if you do not know the IP address.
To set the IP address, complete the following steps:
1. In the Configuration Utility main menu, enter the menu option for IBM DPI Settings.
2. Enter the menu option for Set the IP Address, Gateway Address and MIB System Group.
Using the Web interface
Learn how to use the Web interface to configure and monitor the power distribution unit plus (PDU+)
remotely. The PDU+ provides a graphical user interface that you can view from a Web browser. Using a
Web browser, you can access and monitor the PDU+ power outlets and output devices remotely from a
workstation or notebook computer.
Starting the Web interface
To start the Web interface, complete the following steps:
1. Start a Web browser from a workstation or notebook computer, and enter the IP address of the PDU+
in the address field. The Connect to window is displayed.
Note: For more information about setting the IP address of the system, see “Setting the IP address.”
2. In the User name field, type USERID (all uppercase letters).
3. In the Password field, type passw0rd (all lowercase letters with a zero, not O).
4. Click OK. The main status page is displayed.
The main status page displays a graphical representation of the PDU+ power outlets and input status:
v The left pane displays the menus and submenus for the PDU+. Click a menu to display the menu
options, expand the menu items, and modify the menu options as required.
v The graphic displayed in the right pane shows the status of the outlets, input voltage, output voltage,
frequency, current and power, watt-hour consumption, and cumulative kilowatt hour power
consumption. If you connect an optional environmental monitored probe, the temperature and
humidity environment conditions are displayed.
Each menu page provides online help to assist you with configuring the PDU+. Click the Help icon at
the top of each page to view the help.
Modifying the basic settings
Use the System menu to configure the PDU+ system parameters such as the superuser name, password,
IP address, date, and time.
Power distribution unit plus 129
Changing the superuser name and password:
You can set the user name and password of the administrator who will use a Web browser to configure
the PDU+ on the Configuration Utility page.
To change the superuser name and password, complete the following steps:
1. From the main status page, in the left navigation pane, click System.
2. Click Configuration to view and modify the system configuration and superuser user name and
password.
Identifying the PDU+ and Web/SNMP card:
You can view the PDU+ and Web/SNMP card information on the Identification of Power Management
page.
To view the power management information of the PDU+ and Web/SNMP card, complete the following
steps:
1. From the main status page, in the left navigation pane, click System.
2. Click Identification to view the PDU+ and Web/SNMP card information.
Adding users:
You can add users who can access and control the PDU+ on the Multi-User Configuration page.
To create a list of users who can access and control the PDU+, complete the following steps:
1. From the main status page, in the left navigation pane, click System.
2. Click Multi-User to add users who can only view the PDU+ status or users who can change the
PDU+ settings.
Changing the date and time:
You can change the date and time of the PDU+ on the Date and Time page.
Important: Changing the PDU+ date and time affects other system settings such as e-mail, traps, and
logs.
To change the date and time, complete the following steps:
1. From the main status page, in the left navigation pane, click System.
2. Click Date and Time to view and modify the system date and time. You can set the date and time
manually, synchronize it with the computer time, or synchronize it with an NTP server.
Changing event alerts:
You can change event alerts on the SNMP Trap Receivers page.
To configure the PDU+ to send e-mail or SNMP trap alerts to specified users when specific events occur,
complete the following steps:
1. From the main status page, in the left navigation pane, click System.
2. Click Trap Receivers to create a list of users or workstations who will be alerted with an SNMP trap
message. You can specify the IP addresses of up to eight trap receivers, the community information,
type of trap, severity of trap, and description of the events that cause the traps.
130 Installing into a rack
3. Click Email Notification under System to create a list of up to four users who will be alerted with an
e-mail. Use this menu to specify the mail server, user account, DNS, and other information necessary
to set up a mail server for sending mail alerts. Use the Email Receivers Table to add the e-mail
addresses.
Changing the network information
Use the Network menu to change the network information for the PDU+, for example, the IP address.
Changing the network configuration:
You can view or change the network configuration on the Network Configuration page.
To view or change the network configuration of the PDU+, complete the following steps:
1. From the main status page, in the left navigation pane, click Network.
2. Click Configuration to set the PDU+ IP address, gateway address, subnet mask, and Domain Name
System (DNS) address.
3. Click Control to configure TCP/IP settings.
4. Click Access Control to set access control to prevent unauthorized users from accessing the PDU+.
History and event log summaries
The Logs menu provides a detailed description of all events and a record of the PDU+ status. System
administrators can use this page to analyze problems with network equipment.
Viewing the history log:
You can view the complete history of the PDU+ inputs and outputs on the History Log page.
To view the history of the PDU+, complete the following steps:
1. From the main status page, in the left navigation pane, click Logs.
2. Click History. Each event log file shows the time, date, and description of all the events occurring on
the PDU+.
Viewing the event log:
You can view the complete record of the PDU+ events on the Event Log page.
To view the complete record of the PDU+ events, complete the following steps:
1. From the main status page, in the left navigation pane, click Logs.
2. Click Events. Each log file shows a record of the input power and output power of each outlet.
Power distribution unit plus 131
132 Installing into a rack
Removing and replacing the expansion unit cover or door
You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing
components or performing a service action.
Removing the front cover on the 7314-G30 or 5796
You might need to remove the front cover to perform service to the system.
To remove the front cover, follow these steps:
1. Remove the two thumbscrews (B) located on the left and right of the cover.
2. Press in on the two latches (C) located left and right of the cover to release the cover.
3. Lift out the cover (A) and away from the chassis.
Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you
are populating that side of the chassis.
Installing the front cover on the 7314-G30 or 5796
Use this procedure to install the front cover after installing the system or performing a service action.
If you only have one side of the chassis populated, ensure that the airflow block is present on the
unpopulated side.
1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure.
Figure 128. Remove the front cover for model 7314-G30 or 5796
© Copyright IBM Corp. 2010 133
2. Press in on the two latches (C) on the right and left of the front cover.
3. Press the cover onto the chassis and release the latches. The cover securely snaps into place.
4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover.
Figure 129. Front cover on a model 7314-G30 or 5796
134 Installing into a rack
Removing and replacing covers on the 8233-E8B or 8236-E8C
system
Use these instructions to remove, replace, or install covers to access components or perform service.
Removing the service access cover from an 8233-E8B or 8236-E8C
system
Use this procedure to remove the service access cover to perform service or to gain access to internal
components.
To remove the service access cover, follow these steps:
1. Place the system into the service position. For instructions, see “Placing a 8233-E8B or 8236-E8C in the
service position” on page 145.
2. Loosen the two thumbscrews (A) located at the back of the cover.
3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover
clears the upper frame ledge, lift the cover up and off the system unit.
Attention: For proper cooling and airflow, install the cover before starting the system. Operating the
system without the cover for more than 30 minutes could damage the system components.
Figure 130. Removing the service access cover
© Copyright IBM Corp. 2010 135
Installing the service access cover on an 8233-E8B or 8236-E8C
system
Use this procedure to install the service access cover after performing service or accessing internal
components.
To install the service access cover, follow these steps:
1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the
front of the system unit.
2. Hold the service access cover against the system unit, and slide it toward the front of the system.
The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align
with the screw holes at the back of the system unit.
Important: Ensure that the fan LED cables do not get caught on the front edge of the service access
cover as you move it forward.
3. Tighten the screws (B) located at the back of the cover.
Removing the front cover from an 8233-E8B or 8236-E8C system
Use this procedure to remove the cover to access components or perform service.
To remove the front cover, follow these steps:
1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure.
2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more
easily.
Figure 131. Installing the service access cover
136 Installing into a rack
Installing the front cover on an 8233-E8B or 8236-E8C system
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Align the cover with the system.
2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B)
on the system.
3. Replace the two screws (C) that secure the system to the rack (A).
Figure 132. Removing the front cover
Removing and replacing covers on the 8233-E8B or 8236-E8C system 137
Removing and replacing the front cover on a 9117-MMB or 9179-MHB
Use these procedures to remove and replace the cover to access components or perform service.
Removing the front cover from the 9117-MMB or 9179-MHB
Use this procedure to remove the cover to access components or perform service.
To remove the front cover follow these steps:
1. If necessary, open the front rack door.
2. Pull out the latches (A) located on both sides of the cover as shown in the following figure.
Figure 133. Installing the front cover
138 Installing into a rack
3. Pull off the cover (B) to remove it from the system unit.
Installing the front cover on the 9117-MMB or 9179-MHB
Use this procedure to install the cover after accessing components or performing service.
To install the front cover, follow these steps:
1. Position the cover (A) on the front of the system unit so that the four pins on the system match the
four holes on the back of the cover.
Figure 134. Removing the front cover
Removing and replacing covers on the 8233-E8B or 8236-E8C system 139
2. Press the tabs (B) to snap the cover into position.
3. Close the front rack door.
Figure 135. Installing the front cover
140 Installing into a rack
Placing the rack-mounted 8233-E8B or 8236-E8C system into
the service position or operating position
Use these procedures to place a system into the service position or operating position to perform service
or to gain access to internal components.
Placing the rack-mounted system or expansion unit in the service
position
Use this procedure to perform service or gain access to internal components by placing the rack-mounted
system or expansion unit in the service position.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
© Copyright IBM Corp. 2010 141
DANGER
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
To place a rack-mounted system or expansion unit into the service position, follow these steps:
1. If necessary, open the front rack door.
2. Remove the two screws (A) that secure the system or expansion unit (B) to the rack as shown in the
following figure.
142 Installing into a rack
3. Release the rack latches (A) on both the left and right sides as shown in the following figure.
Figure 136. Removing the screws from the system and rack
Figure 137. Releasing the rack latches
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 143
4. Read the following note, and then slowly pull the system or expansion unit out from the rack until
the rails are fully extended and locked.
Remember:
v If the procedure you are performing requires you to unplug cables from the back of the system or
expansion unit, do so before you pull the unit out from the rack.
v Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you pull
the unit out from the rack.
v Ensure the rails are fully extended. When the rails are fully extended, the rail safety latches lock
into place. This action prevents the system or expansion unit from being pulled out too far.
Placing the rack-mounted system or expansion unit in the operating
position
Use this procedure to place the rack-mounted system or expansion unit in the operating position to make
the unit available for use.
Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
To place the rack-mounted system or expansion unit into the operating position, follow these steps:
1. Simultaneously release the blue rail safety latches (A), located near the front of each rail, and push the
system or expansion unit into the rack as shown in the following figure.
Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you
push the unit back into the rack.
Figure 138. Releasing the rail safety latches
144 Installing into a rack
2. Replace and tighten the two screws (C) that secure the system or expansion unit (A) to the rack as
shown in the following figure.
3. Close the front rack door.
Placing a 8233-E8B or 8236-E8C in the service position
Use this procedure to perform service or gain access to internal components by placing the rack-mounted
system or expansion unit in the service position.
Note: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
Figure 139. Pushing the system into the rack and attaching the screws
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 145
DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To
avoid a shock hazard:
v Connect power to this unit only with the IBM provided power cord. Do not use the IBM
provided power cord for any other product.
v Do not open or service any power supply assembly.
v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration
of this product during an electrical storm.
v The product might be equipped with multiple power cords. To remove all hazardous voltages,
disconnect all power cords.
v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet
supplies proper voltage and phase rotation according to the system rating plate.
v Connect any equipment that will be attached to this product to properly wired outlets.
v When possible, use one hand only to connect or disconnect signal cables.
v Never turn on any equipment when there is evidence of fire, water, or structural damage.
v Disconnect the attached power cords, telecommunications systems, networks, and modems before
you open the device covers, unless instructed otherwise in the installation and configuration
procedures.
v Connect and disconnect cables as described in the following procedures when installing, moving,
or opening covers on this product or attached devices.
To Disconnect:
1. Turn off everything (unless instructed otherwise).
2. Remove the power cords from the outlets.
3. Remove the signal cables from the connectors.
4. Remove all cables from the devices
To Connect:
1. Turn off everything (unless instructed otherwise).
2. Attach all cables to the devices.
3. Attach the signal cables to the connectors.
4. Attach the power cords to the outlets.
5. Turn on the devices.
(D005)
DANGER
146 Installing into a rack
Observe the following precautions when working on or around your IT rack system:
v Heavy equipment–personal injury or equipment damage might result if mishandled.
v Always lower the leveling pads on the rack cabinet.
v Always install stabilizer brackets on the rack cabinet.
v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest
devices in the bottom of the rack cabinet. Always install servers and optional devices starting
from the bottom of the rack cabinet.
v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top
of rack-mounted devices.
v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in
the rack cabinet when directed to disconnect power during servicing.
v Connect all devices installed in a rack cabinet to power devices installed in the same rack
cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power
device installed in a different rack cabinet.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
CAUTION
v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the
manufacturer's recommended ambient temperature for all your rack-mounted devices.
v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not
blocked or reduced on any side, front, or back of a unit used for air flow through the unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of the circuits does not compromise the supply wiring or overcurrent protection. To
provide the correct power connection to a rack, refer to the rating labels located on the
equipment in the rack to determine the total power requirement of the supply circuit.
v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets
are not attached to the rack. Do not pull out more than one drawer at a time. The rack might
become unstable if you pull out more than one drawer at a time.
v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless
specified by the manufacturer. Attempting to move the drawer partially or completely out of the
rack might cause the rack to become unstable or cause the drawer to fall out of the rack.
(R001)
To place the rack-mounted system or expansion unit into the service position, follow these steps:
1. If necessary, open the front rack door.
2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure.
3. Release the rack latches (B) on both the left and right sides as shown in the following figure.
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 147
4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit
out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or
bind as you pull the unit out from the rack.
5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and
locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety
latches lock into place. This action prevents the system or expansion unit from being pulled out too
far.
Placing the 8233-E8B or 8236-E8C in the operating position
Use this procedure to place the system or expansion unit in the operating position to make the unit
available for use.
To place the system into the operating position follow these steps:
Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit
that you have. However, the steps to perform the task are the same.
1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the
system or expansion unit into the rack as shown in the following figure.
Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you
push the unit back into the rack.
Figure 140. Releasing the rack latches
148 Installing into a rack
2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack
as shown in the following figure.
3. Close the front rack door.
Figure 141. Releasing the rail safety latches
Figure 142. Replacing the thumbscrews
Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 149
150 Installing into a rack
Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer's representative for information on the products and services
currently available in your area. Any reference to the manufacturer's product, program, or service is not
intended to state or imply that only that product, program, or service may be used. Any functionally
equivalent product, program, or service that does not infringe any intellectual property right of the
manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any product, program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not grant you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication.
The manufacturer may make improvements and/or changes in the product(s) and/or the program(s)
described in this publication at any time without notice.
Any references in this information to Web sites not owned by the manufacturer are provided for
convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at
those Web sites are not part of the materials for this product and use of those Web sites is at your own
risk.
The manufacturer may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Any performance data contained herein was determined in a controlled environment. Therefore, the
results obtained in other operating environments may vary significantly. Some measurements may have
been made on development-level systems and there is no guarantee that these measurements will be the
same on generally available systems. Furthermore, some measurements may have been estimated through
extrapolation. Actual results may vary. Users of this document should verify the applicable data for their
specific environment.
Information concerning products not produced by this manufacturer was obtained from the suppliers of
those products, their published announcements or other publicly available sources. This manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products not produced by this manufacturer. Questions on the capabilities of products
not produced by this manufacturer should be addressed to the suppliers of those products.
All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal
without notice, and represent goals and objectives only.
© Copyright IBM Corp. 2010 151
The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are
subject to change without notice. Dealer prices may vary.
This information is for planning purposes only. The information herein is subject to change before the
products described become available.
This information contains examples of data and reports used in daily business operations. To illustrate
them as completely as possible, the examples include the names of individuals, companies, brands, and
products. All of these names are fictitious and any similarity to the names and addresses used by an
actual business enterprise is entirely coincidental.
If you are viewing this information in softcopy, the photographs and color illustrations may not appear.
The drawings and specifications contained herein shall not be reproduced in whole or in part without the
written permission of the manufacturer.
The manufacturer has prepared this information for use with the specific machines indicated. The
manufacturer makes no representations that it is suitable for any other purpose.
The manufacturer's computer systems contain mechanisms designed to reduce the possibility of
undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience
unplanned outages, system failures, power fluctuations or outages, or component failures must verify the
accuracy of operations performed and data saved or transmitted by the system at or near the time of the
outage or failure. In addition, users must establish procedures to ensure that there is independent data
verification before relying on such data in sensitive or critical operations. Users should periodically check
the manufacturer's support websites for updated information and fixes applicable to the system and
related software.
Trademarks
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business
Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be
trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at
Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Microsoft and Windows are trademarks of Microsoft Corporation in the United States, other countries, or
both.
Other company, product, or service names may be trademarks or service marks of others.
Electronic emission notices
Class A Notices
The following Class A statements apply to the IBM servers that contain the POWER7 processor and its
features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
Federal Communications Commission (FCC) statement
Note: This equipment has been tested and found to comply with the limits for a Class A digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference when the equipment is operated in a commercial environment. This equipment
generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with
the instruction manual, may cause harmful interference to radio communications. Operation of this
equipment in a residential area is likely to cause harmful interference, in which case the user will be
required to correct the interference at his own expense.
152 Installing into a rack
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. IBM is not responsible for any radio or television interference caused by using other than
recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class A digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for
commercial and industrial environments to provide reasonable protection against interference with
licensed communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Warning: This is a Class A product. In a domestic environment, this product may cause radio
interference, in which case the user may be required to take adequate measures.
VCCI Statement - Japan
The following is a summary of the VCCI Japanese statement in the box above:
This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a
domestic environment, radio interference may occur, in which case, the user may be required to take
corrective actions.
Notices 153
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Electromagnetic Interference (EMI) Statement - People's Republic of China
Declaration: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may need to perform practical action.
Electromagnetic Interference (EMI) Statement - Taiwan
The following is a summary of the EMI Taiwan statement above.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
IBM Taiwan Contact Information:
154 Installing into a rack
Electromagnetic Interference (EMI) Statement - Korea
Please note that this equipment has obtained EMC registration for commercial use. In the event that it
has been mistakenly sold or purchased, please exchange it for equipment certified for home use.
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse A ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden:
"Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich
Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene
Maßnahmen zu ergreifen und dafür aufzukommen."
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
Notices 155
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A.
Electromagnetic Interference (EMI) Statement - Russia
Class B Notices
The following Class B statements apply to features designated as electromagnetic compatibility (EMC)
Class B in the feature installation information.
Federal Communications Commission (FCC) statement
This equipment has been tested and found to comply with the limits for a Class B digital device,
pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against
harmful interference in a residential installation.
This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in
accordance with the instructions, may cause harmful interference to radio communications. However,
there is no guarantee that interference will not occur in a particular installation.
If this equipment does cause harmful interference to radio or television reception, which can be
determined by turning the equipment off and on, the user is encouraged to try to correct the interference
by one or more of the following measures:
v Reorient or relocate the receiving antenna.
v Increase the separation between the equipment and receiver.
v Connect the equipment into an outlet on a circuit different from that to which the receiver is
connected.
v Consult an IBM-authorized dealer or service representative for help.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for
any radio or television interference caused by unauthorized changes or modifications to this equipment.
Unauthorized changes or modifications could void the user's authority to operate this equipment.
156 Installing into a rack
This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
Industry Canada Compliance Statement
This Class B digital apparatus complies with Canadian ICES-003.
Avis de conformité à la réglementation d'Industrie Canada
Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada.
European Community Compliance Statement
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on
the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot
accept responsibility for any failure to satisfy the protection requirements resulting from a
non-recommended modification of the product, including the fitting of non-IBM option cards.
This product has been tested and found to comply with the limits for Class B Information Technology
Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for
typical residential environments to provide reasonable protection against interference with licensed
communication equipment.
European Community contact:
IBM Deutschland GmbH
Technical Regulations, Department M456
IBM-Allee 1, 71139 Ehningen, Germany
Tele: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
VCCI Statement - Japan
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline (products less than or equal to 20 A per phase)
Japanese Electronics and Information Technology Industries Association (JEITA)
Confirmed Harmonics Guideline with Modifications (products greater than 20 A per
phase)
Notices 157
IBM Taiwan Contact Information
Electromagnetic Interference (EMI) Statement - Korea
Germany Compliance Statement
Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur
Elektromagnetischen Verträglichkeit
Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der
Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die
Grenzwerte der EN 55022 Klasse B ein.
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu
betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM
übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne
Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne
Empfehlung von IBM gesteckt/eingebaut werden.
Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten
Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten
(EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland.
Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von
Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen
- CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
Tel: 914-499-1900
Der verantwortliche Ansprechpartner des Herstellers in der EU ist:
IBM Deutschland GmbH
Technical Regulations, Abteilung M456
158 Installing into a rack
IBM-Allee 1, 71139 Ehningen, Germany
Tel: +49 7032 15-2937
e-mail: tjahn@de.ibm.com
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B.
Terms and conditions
Permissions for the use of these publications is granted subject to the following terms and conditions.
Personal Use: You may reproduce these publications for your personal, noncommercial use provided that
all proprietary notices are preserved. You may not distribute, display or make derivative works of these
publications, or any portion thereof, without the express consent of the manufacturer.
Commercial Use: You may reproduce, distribute and display these publications solely within your
enterprise provided that all proprietary notices are preserved. You may not make derivative works of
these publications, or reproduce, distribute or display these publications or any portion thereof outside
your enterprise, without the express consent of the manufacturer.
Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either
express or implied, to the publications or any information, data, software or other intellectual property
contained therein.
The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its
discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer,
the above instructions are not being properly followed.
You may not download, export or re-export this information except in full compliance with all applicable
laws and regulations, including all United States export laws and regulations.
THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF
ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED
WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR
PURPOSE.
Bull
D10 and D20 I/O Drawers
Service Guide
86 A1 38EG 01
ORDER REFERENCE
Bull
D10 and D20 I/O Drawers
Service Guide
Hardware
June 2003
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 38EG 01
ORDER REFERENCE
The following copyright notice protects this book under the Copyright laws of the United States of America
and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and
making derivative works.
Copyright Bull S.A. 1992, 2003
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of
this book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
AIX is a registered trademark of International Business Machines Corporation, and is being used under
licence.
UNIX is a registered trademark in the United States of America and other countries licensed exclusively through
the Open Group.
Linux is a registered trademark of Linus Torvalds.
The information in this document is subject to change without notice. Groupe Bull will not be liable for errors
contained herein, or for incidental or consequential damages in connection with the use of this material.
Contents
Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Laser Compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
About This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv
Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . . 1
D10 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
D10 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
I/O Subsystem PCI-X Slot LED Definitions. . . . . . . . . . . . . . . . . . . . . . . 2
D10 Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Subsystem Positioning and Cabling . . . . . . . . . . . . . . . . . . . . . . . . . 6
D10 I/O Subsystem Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
D10 I/O Subsystem Front Locations. . . . . . . . . . . . . . . . . . . . . . . . . 10
D10 I/O Subsystem Rear Locations. . . . . . . . . . . . . . . . . . . . . . . . . 11
D10 I/O Backplane Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
D10 I/O Subsystem PHB. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
D10 Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 14
D10 I/O Subsystem Location Code Table. . . . . . . . . . . . . . . . . . . . . . . 15
D20 Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
D20 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
D20 PCI-X Slots. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
D20 Power. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
D20 Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 31
D20 Location Code Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Powering the System On and Off . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Power-On Self-Test. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
D10 and D20 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
D10 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
D20 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
External AC Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Service Inspection Guide. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Chapter 2. Diagnostics and Maintenance Procedures . . . . . . . . . . . . . . . . . . 39
Power and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Resetting the LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . . 40
Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Using the Service Processor . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
iii
Using System Management Services . . . . . . . . . . . . . . . . . . . . . . . . . 40
Chapter 3. Removal and Replacement Procedures . . . . . . . . . . . . . . . . . . . 41
Safety Considerations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Stopping and Starting the System . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Removal and Replacement Procedures for the D10 I/O Subsystem . . . . . . . . . . . . . . 42
D10 FRU Replacement Procedure List . . . . . . . . . . . . . . . . . . . . . . . 42
D10 Service Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
D10 Operating Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
D10 Front Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
D10 Covers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
D10 PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
PCI Hot-Plug Manager Access . . . . . . . . . . . . . . . . . . . . . . . . . . 53
Removing and Replacing a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . 55
PCI Adapter or Blank Filler Removal from a Cassette Assembly . . . . . . . . . . . . . . 59
Replacing an Adapter in a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . . 70
Short Adapter or Blank Filler Installation . . . . . . . . . . . . . . . . . . . . . . . 74
Long Adapter Installation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
D10Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
D10 RIO Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
D10 RIO Bus Adapter Assembly . . . . . . . . . . . . . . . . . . . . . . . . . 105
D10 I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . . . . . 106
D10 Power Supply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Removal and Replacement Procedures for the D20 I/O Subsystem . . . . . . . . . . . . . 108
D20 FRU Replacement Procedure List . . . . . . . . . . . . . . . . . . . . . . . 108
D20 Service Position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
D20 Operating Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
D20 Service Access Cover. . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
D20 Front Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
D20 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
D20 Blowers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
D20 Cooling Blower Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
D20 RIO Bus Adapter Assembly . . . . . . . . . . . . . . . . . . . . . . . . . 120
D20 PCI Adapters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
D20 Hot-Plug Disk Drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
D20 Disk Drive Cage. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
D20 Disk Drive Backplane. . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
D20 Power Supplies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
D20 Power Supply Bulkhead. . . . . . . . . . . . . . . . . . . . . . . . . . . 148
D20 I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . . . . . 151
Chapter 4. Parts Information . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
D10 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
D10 I/O Subsystems Parts . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
D10 I/O Subsystem Parts (continued) . . . . . . . . . . . . . . . . . . . . . . . 159
D10 SPCN, RIO, and Rack Beacon Cables (These parts are not shown.) . . . . . . . . . . 161
D10 Rack Mounting Enclosure (These parts are not shown.) . . . . . . . . . . . . . . . 161
D20 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162
D20 I/O Backplane and Cabling. . . . . . . . . . . . . . . . . . . . . . . . . . 162
D20 PCI Slots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
D20 Power, RIO Adapter, Cabling . . . . . . . . . . . . . . . . . . . . . . . . . 164
D20 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
D20 Fans and Disk Drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
D20 Covers and Brackets . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
iv D10 and D20 Service Guide
D20 RIO, SPCN, and Rack Beacon Cables . . . . . . . . . . . . . . . . . . . . . 167
Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Chapter 5. Fault and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . 169
Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
Resetting the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . . 173
Product Recycling and Disposal. . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
D10 Declared Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . 174
D20 Declared Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . 174
Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177
Contents v
vi D10 and D20 Service Guide
Safety Notices
A danger notice indicates the presence of a hazard that has the potential of causing death or serious
personal injury. Danger notices appear on the following pages:
v viii
v 41
v 146
A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor
personal injury. Caution notices appear on the following pages:
v viii
v viii
v 41
v 41
v ix
Note: For a translation of these notices, see System Unit Safety Information, order number SA23-2652.
Rack Safety Instructions
v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C.
v Do not install this unit in a rack where the air flow is compromised. Any side, front or back of the unit
used for air flow through the unit must not be in direct contact with the rack.
v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical
loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before
installing or removing this unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the
correct power connection to the rack, refer to the rating labels located on the equipment in the rack to
determine the total power requirement for the supply circuit.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the
system or the devices that attach to the system. It is the responsibility of the customer to ensure that
the outlet is correctly wired and grounded to prevent an electrical shock.
vii
Electrical Safety
Observe the following safety instructions any time you are connecting or disconnecting devices attached to
the workstation.
In the system you are about to setup or service:
v The ac power interface connector is considered the main power disconnect device.
v This system has redundant power supply capabilities, meaning that it has the ability to have two power
supplies running simultaneously in the same system unit. When instructed to disconnect the power
source, ensure that all power cables have been unplugged.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for the system unit
and all attached devices are unplugged.
When adding or removing any additional devices to or from the system, ensure that the power
cables for those devices are unplugged before the signal cables are connected. If possible,
disconnect all power cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communications lines.
D05
CAUTION:
This product is equipped with a three-wire power cable and plug for the user’s safety. Use this
power cable with a properly grounded electrical outlet to avoid electrical shock.
C01
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect
two power supply cords before servicing.
C21
viii D10 and D20 Service Guide
Laser Safety Information
CAUTION:
This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1
laser products.
C30
Laser Compliance
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition
1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and
approval information.
CAUTION:
All laser modules are designed so that there is never any human access to laser radiation above a
class 1 level during normal operation, user maintenance, or prescribed service conditions. Data
processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than class 1 power levels. For this reason, never look into the end of an
optical fiber cable or open receptacle. Only trained service personnel should perform the
inspection or repair of optical fiber cable assemblies and receptacles.
C25, C26
Safety Notices ix
x D10 and D20 Service Guide
Data Integrity and Verification
These computer systems contain mechanisms designed to reduce the possibility of undetected data
corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages,
system failures, power fluctuations or outages, or component failures must verify the accuracy of operations
performed and data saved or transmitted by the system at or near the time of the outage or failure. In
addition, users must establish procedures to ensure that there is independent data verification before relying
on such data in sensitive or critical operations. Users should periodically check our support websites for
updated information and fixes applicable to the system and related software.
xi
xii D10 and D20 Service Guide
About This Book
This book provides maintenance information that is specific to the D10 and D20 I/O subsystems, as well
as adapters and attached devices that do not have their own service information. In this book, the I/O
subsystems are referred to as the D10 I/O subsystem or the D20 I/O subsystem.
MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus
Systems.
This book is used by the service representative to repair system failures. This book assumes that the
service representative has had training on the processor subsystem to which an I/O subsystem is
attached, in addition to training on the I/O subsystem drawer.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing of this product.
Highlighting
The following highlighting conventions are used in this book:
Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items
whose names are predefined by the system. Also identifies graphical objects such as buttons,
labels, and icons that the user selects.
Italics Identifies parameters whose actual names or values are to be supplied by the user.
Monospace Identifies examples of specific data values, examples of text similar to what you might see
displayed, examples of portions of program code similar to what you might write as a
programmer, messages from the system, or information you should actually type.
xiii
Related Publications
The following publications provide additional information about your system:
v The D10 I/O Drawer Installation Guide, order number 86 A1 32EG, contains information on how to install
the D10 I/O subsystem.
v The Installation Guide, order number 86 A1 39EG, contains information on how to install the D20 I/O
subsystem.
v The PL420T and PL420R User’s Guide, order number 86 A1 41EG, contains information to help users use
the system, use the service aids, and solve minor problems.
v The Diagnostic Information for Multiple Bus Systems, order number 86 A1 26HX, contains diagnostic
information, service request numbers (SRNs), and failing function codes (FFCs).
v The Adapters Information for Multiple Bus Systems, order number 86 A1 27HX, contains information about
adapterss for your system. This manual is intended to supplement the service information found in the
Diagnostic Information for Multiple Bus Systems.
v The Site Preparation for Rack Systems Guide, order number 86 A1 30PX, contains information to help you plan
your installation.
v The System Unit Safety Information, order number 86 X1 11WD, contains translations of safety
information used throughout this book.
xiv D10 and D20 Service Guide
Chapter 1. Reference Information
This chapter provides reference information for the D10 and D20 I/O subsystems.
D10 Overview
The D10 is a 19-inch, rack-mountable I/O subsystem that is attached to a processor subsystem drawer to
extend the system’s capacity for I/O adapters. The I/O drawer includes redundant concurrently
maintainable power and cooling.
Five PCI-X slots and one PCI slot are available for PCI adapters. PCI adapters are installed through the
rear of the I/O subsystem using PCI adapter cassettes. Because the PCI slots support hot-pluggable
adapters, the I/O subsystem can have adapters installed without turning off power or removing covers. The
D10 is 4 EIA units high. Two D10s can be installed side-by-side in a 19-inch rack enclosure.
D10 I/O Subsystem Features
The following figure shows the front view of I/O subsystem.
2
3
1
5
4
1 Cooling Fan 4 Power Supply 1
2 Power cord channel 5 Power cord receptacles
3 Power Supply 2
1
The following figure shows the rear view of the I/O subsystem.
1 2 3 4 5 6
1 2 3 4 5 6
8 7
9
11
10
12
1 PCI adapter slot 1 7 RIO bus adapter card
Upper connector 0
Lower connector 1
2 PCI-X adapter slot 2 8 SPCN connector card
3 PCI-X adapter slot 3 9 Lower SPCN connector J16
4 PCI-X adapter slot 4 10 Upper SPCN connector J15
5 PCI-X adapter slot 5 11 Rack beacon connector
6 PCI-X adapter slot 6 12 Power cord channel
D10 Operator Indicators
LEDs are used on the D10 as operator indicators. Tables found in “I/O Subsystem PCI-X Slot LED
Definitions” and “D10 Power” on page 4 describes the LED states and their definitions. For more
information on LEDs and their definitions, see Chapter 5, “Fault and Attention LEDs”, on page 169. LEDs
can be found on or near the following FRUs:
v LED indicators visible on each PCI adapter cassette
v LED indicators on the I/O subsystem backplane
v Attention/Identify LEDs for power supplies and fans
D10 PCI-X Slots
The D10 has five PCI-X slots and one PCI slot. Adapters are installed and removed using a PCI adapter
cassette, which allows adapters to be installed without turning off the power or opening the I/O subsystem
covers.
PCI adapters are installed using an adapter cassette, each adapter cassette shows two LEDs for each
adapter. There is a green power indicator LED (upper) and an amber fault/identify LED (lower). The slots
are numbered on the rear of the chassis from left to right 1 through 6. Slot 1 is a 5V PCI slot. Slots 2
through 6 are 3.3V PCI-X.
I/O Subsystem PCI-X Slot LED Definitions
The green and amber LEDs (viewed on the rear of the PCI adapter cassette when installed) are used to
indicate the state of the PCI slot during removal and replacement of an adapter. The following table
2 D10 and D20 Service Guide
describes the LED states and their definitions.
3
2
1
1 PCI Adapter Cassette
2 Power LED (Green )
3 Fault/Identify LED (Amber)
Green PCI Slot
LED Indication
Amber PCI Slot
LED
PCI Slot
Power/Status
Definition
On Off On/No action The PCI slot power is on and the slot is operational. Do
not remove or install adapters.
On Flashing On/Identify The PCI slot power is on, and the slot has been
identified by the software or firmware. Do not remove or
install adapters.
Off Flashing Off/Action The PCI slot power is off, and the slot has been
identified by the software or firmware. If directed by the
PCI hot-plug procedures, perform PCI adapter removal,
replacement, or installation.
Off Off Off/No power The PCI slot power is off. If the system power is off,
perform non-hot-plug PCI adapter removal, replacement,
or installation procedures.
Chapter 1. Reference Information 3
D10 Power
The following figure shows the front view of the pocessor subsystem and two I/O subsystems. Each unit
has two power supplies and two power cords for redundancy. Connect the power cords from one side of
the system to one power distribution bus in the rack. Connect the power cords from the other side of the
system to a different power distribution bus in the rack. To ensure that the systems in a rack keep running
if power is interrupted, a separate branch power line for each side of the rack must be used.
1 1
2
1
3
1 Power supplies, power receptacles
2 Two D10 I/O subsystems mounted side-by-side.
3 One 386/50 processor subsystem
The D10 I/O subsystem has two power supplies. Either power supply is capable of providing the
necessary voltages and currents, independent of the other power supply. Each power supply provides 5V
dc, 3.3V dc, -12V dc, and 5V dc standby. The power supplies are hot-pluggable and may be changed one
at a time while the system is operational.
4 D10 and D20 Service Guide
The following table describes power-supply LED indicators that are available on the D10.
LED Status LED Definition
Off Power source not connected
Blinking green LED System power source connected, but power is not turned
on
Blinking green LED, visibly begins to blink faster after the
power button has been pressed.
System power source connected, the power-on button
has been pressed and power-on initiated
Solid (not blinking) green LED, (There is approximately a
30-second transition period from the time the power-on
button is pressed to the time the power LED is on solid.)
System power source connected and turned on
The following figure shows the rear view of the processor subsystem and two I/O subsystems.
1 D10 I/O subsystem 3 Processor subsystem power supplies,
power receptacles
2 386/50 processor subsystem 4 I/O backplane power (green) and
fault/identify (amber) LEDs
D10 Cooling
Fans mounted inside each I/O subsystem power supply and an additional fan mounted on the front of the
subsystem provide cooling. The power supplies and the fan can be removed and replaced with the power
turned on, as long as only one fan is removed from the I/O subsystem at a time.
D10 Input/Output Ports
The connector ports on the rear of the D10 I/O subsystem are used to connect the RIO-2 cables, the
SPCN cables, and the rack-beacon LED.
Chapter 1. Reference Information 5
Subsystem Positioning and Cabling
I/O subsystems can be installed in a standard 19-inch EIA rack in any location. The cables that connect
the subsystems allow some flexibility in drawer placement, but the I/O subsystems should be located
above the processor subsystem to which they are connected in the same rack.
Up to eight I/O subsystems can be connected to a processor subsystem. Use the following guidelines:
v Each I/O subsystem connects to the processor subsystem using one system power control network
(SPCN) cable loop.
v Each I/O subsystem connects to the processor subsystem using one or more remote I/O (RIO) cable
loops.
v Lowest cost configurations allow up to four I/O subsystems connected to the processor subsystem using
a single RIO loop.
v Optimum performance is achieved when each I/O drawer is connected to the processor subsystem
through its own RIO loop.
SPCN Cabling
I/O subsystem SPCN cables are connected to the processor subsystem unit in a loop configuration. The
SPCN cables are connected in a loop so that the system has two paths to each I/O subsystem. Up to
eight I/O subsystems are supported on one SPCN loop.
One I/O Subsystem, Two I/O Subsystems: The illustrations on the left show one I/O subsystem
connected to the processor subsystem. The illustrations on the right show two I/O subsystems connected
to the processor subsystem using one SPCN loop.
Processor Subsystem Connected to One or Two D10 I/O Subsystems:
1 Processor Subsystem
2 I/O Subsystem
6 D10 and D20 Service Guide
Four I/O Subsystems, Eight I/O Subsystems: The illustration on the left shows four I/O subsystems
connected to the processor subsystem using one SPCN loop. The illustration on the right shows eight I/O
subsystems connected to the processor subsystem using one SPCN loop.
Processor Subsystem Connected to Four or Eight D10 I/O Subsystems:
1 Processor Subsystem
2 I/O Subsystem
RIO Cabling
I/O subsystems are connected to the processor subsystem through remote I/O (RIO) cable loops. The
cable loops are connected to ports that are on the rear of the processor subsystem. The RIO cables are
connected in loops so that the system has two paths to each I/O subsystem when more than one I/O
drawer is in a loop.
If only one I/O subsystem is in a RIO loop and both the processor subsystem and the I/O subsystem are
RIO-2 capable, the RIO-2 cables are used as one two-cable path to the I/O subsystem. In this cable
configuration, the data rate is increased by a factor of 2.
v A maximum of eight D10 drawers are supported on one processor subsystem.
v If both D10 I/O drawers are connected to the same processor subsystem, the total number of I/O
subsystems supported is eight.
v The maximum number of I/O subsystems supported on one RIO loop is four.
v The D10 I/O drawers must be on separate RIO loops.
v The D10 with RIO cabling and D10 with RIO-2 cabling can share one RIO loop.
The system can have up to four RIO loops. A total of eight I/O drawers can be connected to the processor
subsystem in a variety of cabling configurations. For optimum performance, connect the RIO loops in a
configuration that distributes multiple I/O drawers among as many RIO loops as possible. A dedicated RIO
loop for each I/O drawer can provide optimum performance. Some examples of valid cabling examples
follow.
Chapter 1. Reference Information 7
One I/O Subsystem or Two I/O Subsystems: The illustration on the left shows one I/O subsystem
drawer connected to the processor subsystem. The illustration on the right shows two I/O subsystem
drawers connected to the system unit using one RIO loop.
Processor Subsystem Connected to One or Two D10 I/O Subsystems:
A0 A0
A1 A1
0
1 1
0 0
1
1
2
1 Processor Subsystem
2 I/O Subsystem
Three I/O Subsystems or Four I/O Subsystems: The illustration on the left shows three I/O subsystem
drawers connected to the system unit using one RIO loop. The illustration on the right shows four I/O
subsystem drawers connected to the system unit using one RIO loop.
Processor Subsystem Connected to Three or Four D10 I/O Subsystems:
1 Processor Subsystem
2 I/O Subsystem
Eight I/O Subsystems with Two RIO Loops: The following illustrations show eight I/O subsystem
drawers connected to the processor subsystem unit using two RIO loops.
8 D10 and D20 Service Guide
Processor Subsystem Connected to Eight D10 I/O Subsystems:
1 Processor Subsystem
2 I/O Subsystem
Chapter 1. Reference Information 9
D10 I/O Subsystem Locations
This system uses physical location codes to provide mapping of the failing field replaceable units. The
location codes are produced by the processor subsystem’s firmware and AIX. For information about how to
read a location code, see the service guide for the processor subsystem to which your I/O subsystem is
connected.
D10 I/O Subsystem Front Locations
1 2
4 3
1 Cooling fan 3 Power Supply 2
U0.dd-V2
2 Power cord channel 4 Power Supply 1
U0.dd-V1
10 D10 and D20 Service Guide
D10 I/O Subsystem Rear Locations
1 2 3 4 5 6
1 2 3 4 5 6
8 7
9
11
10
12
1 PCI adapter slot 1
U0.dd-P1-I1
7 RIO bus adapter card
U0.dd-P1.1
2 PCI-X adapter slot 2
U0.dd-P1-I2
8 SPCN connector card
U0.dd-P1 (part of the I/O backplane
FRU)
3 PCI-X adapter slot 3
U0.dd-P1-I3
9 SPCN Connector 2/J16
U0.ddP1/Q5
4 PCI-X adapter slot 4
U0.dd-P1-I4
10 SPCN Connector 1/J15
U0.ddP1/Q4
5 PCI-X adapter slot 5
U0.dd-P1-I5
11 Rack beacon connector
6 PCI-X adapter slot 6
U0.dd-P1-I6
12 Power cord channel
Note: In the preceding table, dd is equal to the number assigned to each I/O subsystem by the connected
processor subsystem.
Chapter 1. Reference Information 11
D10 I/O Backplane Locations
The following illustration of the I/O backplane identifies the primary connectors used in your subsystem.
6 5 4 3 2 1
7
8
9
10
11
12
1 PCI adapter connector 1 7 SPCN connector
2 PCI-X adapter connector 2 8 VPD module
3 PCI-X adapter connector 3 9 VPD module pin 1 orientation
4 PCI-X adapter connector 4 10 RIO bus adapter connector
5 PCI-X adapter connector 5 11 Fan connector
6 PCI-X Adapter connector 6 12 Power supply connectors
12 D10 and D20 Service Guide
D10 I/O Subsystem PHB
Each D10 I/O Subsystem has two PCI host bridges (PHBs) through which the PCI slots are connected.
1 2 3 4 5 6
Slot PHB Slot Characteristics
1 1 64-bit 5V, 33 MHz
2 1 64-bit 3.3V, 131.5 MHz
3 1 64-bit 3.3V, 131.5 MHz
4 2 64-bit 3.3V, 131.5 MHz
5 2 64-bit 3.3V, 131.5 MHz
6 2 64-bit 3.3V, 131.5 MHz
Chapter 1. Reference Information 13
D10 Location Codes
This system (processor subsystem and attached I/O subsystems) use physical location codes in
conjunction with AIX location codes to provide mapping of failing field replaceable units (FRUs). The
location codes are produced by the processor subsystem’s firmware and AIX. For information about how to
read a location code, see the service guide for the processor subsystem to which your I/O subsystem is
connected.
AIX and Physical Location Code Table
This section covers the AIX and Physical Location Code tables for the D10.
The tables in this section contain the location codes for I/O subsystems when they are attached to the
system unit. In the tables, the location code for the I/O subsystem number is represented by dd. The first
time that an installed system is powered on, the I/O subsystems are numbered. For example, if a system
is first powered on with eight I/O subsystems connected, the dd value for the subsystems should be
numbered from 2 through 9.
Note: If the system was powered on with I/O subsystems connected before delivery to the customer, the
I/O subsystem location codes for the connected I/O subsystems are permanently set.
If at a later time, an I/O subsystem is removed from the system and a different I/O subsystem is
substituted, the substitute is assigned the next higher number available for its value of dd, which in the
earlier example, is 10. If the original I/O subsystem is reinstalled in the system configuration, the system
uses the original dd value for the I/O subsystem.
To keep the system from renumbering the I/O drawer when an I/O backplane is replaced, the VPD module
from the old I/O backplane must be moved over to the new I/O backplane.
The following diagram defines each part of a location code.
Ux.dd-xx-yy
| | | |
| | | |
| | | yy This code is used for the next component (yy, yy, yy, ...)
| | xx This code is used for the next component (Px, Fx, Vx, ...)
| dd This code is the location code for the I/O drawer (2 to 63)
Ux This code normally identifies the rack in which a drawer is installed.
The x will always be zero (0) for this system.
14 D10 and D20 Service Guide
D10 I/O Subsystem Location Code Table
The following table lists the location codes for a D10 I/O subsystem. The location code for the I/O
subsystem connected to a system unit is U0.dd (where dd can be any number from 2 through 63).
FRU Name Physical Location Code
D10 I/O Drawer U0.dd (dd = any number from 2 through 63)
I/O subsystem 1 drawer U0.dd
I/O backplane U0.dd-P1
RIO connector riser card U0.dd-P1.1
RIO port 0 (upper connector) U0.dd-P1.1/Q1
RIO port 0 cable U0.dd-P1.1/Q1#
RIO port 1 (lower connector) U0.dd-P1.1/Q2
RIO port 1 cable U0.dd-P1.1/Q2#
PHB1 U0.dd-P1
EADS_X under PHB1 U0.dd-P1
PCI slot 1 U0.dd-P1/I1
PCI slot 1 adapter U0.dd-P1-I1
PCI slot 2 U0.dd-P1/I2
PCI slot 2 adapter U0.dd-P1-I2
PCI slot 3 U0.dd-P1/I3
PCI slot 3 adapter U0.dd-P1-I3
PHB2 U0.dd-P1
EADS_X under PHB2 U0.dd-P1
PCI slot 4 U0.dd-P1/I4
PCI slot 4 adapter U0.dd-P1-I4
PCI slot 5 U0.dd-P1/I5
PCI slot 5 adapter U0.dd-P1-I5
PCI slot 6 U0.dd-P1/I6
PCI slot 6 adapter U0.dd-P1-I6
Power supply 1 (with 2 fans) U0.dd-V1
Power supply 2 (with 2 fans) U0.dd-V2
Cooling fan 5 U0.dd-F5
Rack indicator connector (4-pins) U0.dd-P1/Q3
SPCN connector 1/J15 U0.dd-P1/Q4
SPCN connector 2/J16 U0.dd-P1/Q5
Subsystem VPD U0.dd-P1
Chapter 1. Reference Information 15
D20 Overview
The D20 is a 19-inch, rack-mountable I/O subsystem that is attached to a processor subsystem to extend
the system’s capacity for I/O adapters and disk drives. The I/O subsystem includes redundant concurrently
maintainable power and cooling.
Seven PCI slots are available for PCI adapters. Because the PCI slots support hot-pluggable adapters, the
I/O subsystem can have adapters installed without turning off power. The D20 is 4 EIA units high and is
compatible with 19-inch EIA rack enclosures.
D20 I/O Subsystem Features
The features of the D20 enable this I/O subsystem to extend the input and output capabilities of the
processor subsystem. Features such as PCI-X adapters and internal SCSI disk drives are supported.
D20 Operator Indicators
LEDs are used on the D20 as operator indicators. Tables found in “D20 PCI-X Slots” and “D20 Power” on
page 18 describes the LED states and their definitions. For more information on LEDs and their definitions,
see Chapter 5, “Fault and Attention LEDs”, on page 169. LEDs can be found on or near the following
FRUs:
v LED indicators visible on each PCI adapter light pipe
v LEDs for power on, attention, SCSI activity
v Attention/Identify LEDs
v Rack Beacon
D20 PCI-X Slots
Seven PCI-X slots are available. The slots are 64-bit capable at up to 131.5 Mhz, 3.3 volts. The slots are
numbered on the rear of the chassis from left to right 1 through 7. The I/O backplane and each I/O slot
have green power indicator LEDs and amber identify LEDs. The following illustration shows the LED
locations when viewing from the rear of the I/O drawer.
1 PCI-X Expansion Slot 1 6 PCI-X Expansion Slot 6
2 PCI-X Expansion Slot 2 7 PCI-X Expansion Slot 7
3 PCI-X Expansion Slot 3 A Green Power LEDs
4 PCI-X Expansion Slot 4 B Amber Identify LEDs
5 PCI-X Expansion Slot 5
16 D10 and D20 Service Guide
The green LEDs for the PCI-X adapter slots (viewed on the rear of the I/O subsystem) indicate the state of
the PCI slot during removal and replacement of an adapter.
PCI LED (Green) Indication PCI Slot Status Definition
Off Off Slot power is Off. It is safe to remove
or replace adapters.
On (not flashing) On Slot power is On. Do not remove or
replace adapters.
Flashing slowly (one flash per
second)
Slot has been identified by the
software. Do not remove or replace
adapters at this time.
Flashing rapidly (six to eight flashes
per second)
Slot is ready removing or replacing an
adapter.
D20 Disk Drives
There are 12 hot-plug disk-drive bays provided. The bays are located behind the D20 front bezel. The D20
has from 18.2 GB to 1.7 terabytes of disk storage capacity. The following disk drive sizes and speeds are
available:
v 18.2 GB Ultra3 10K RPM 1 inch
v 36.4 GB Ultra3 10K RPM 1 inch
v 73.4 GB Ultra3 10K RPM 1 inch
v 146.8 GB Ultra3 10K RPM 1 inch
For an updated listing of hot-plug disk-drive sizes, contact your sales representative.
Chapter 1. Reference Information 17
D20 Power
The D20 is standard with one power supply. If the customer wants redundant power in their I/O
subsystem, an optional power supply can be ordered. Either power supply, in a D20 that is configured with
two power supplies, can provide the necessary voltages and currents, independent of the other power
supply. The left and right power-supply output voltages are connected and monitored by the power
distribution board contained in the I/O subsystem.
The left and right power supplies are hot-pluggable and may be changed one at a time while the system is
operational. Each power supply is capable of converting available 110/220 V ac to the proper internal
voltages used by the system components. The internal voltage range is 5 volts system standby, 3.3 volts,
5 volts, 12 volts and -12 volts. When operating correctly, each supply will share the current load when
powered on. When in a fault state, the operating power supply will supply standby current to the faulted
supply for the DEVROS module. The following table describes power-related LED indicators that are
available on the D20. The operator panel LED referred to in the following table is on the front of the
processor subsystem to which the I/O subsystem is attached.
Status of LED Operator Panel LED (processor subsystem)
Off Power Source not connected
Blinking green LED System power source connected, but power is not turned
on
Blinking green LED, visibly begins to blink faster after the
power-button has been pressed.
System power source connected, the power-on button
has been pressed and power-on initiated
Solid (not blinking) green LED, (There is approximately a
30-second transition period from the time the power on
button is pressed to the time the power LED is on solid.)
System power source connected and turned on
D20 Cooling
Four blowers are mounted on the top of the I/O subsystem to provide cooling.
Attention: The four cooling blowers mounted on top of the D20 are hot-plug. The one exception is when
your subsystem has only one power supply either installed or functioning, in which case you must shut
down the system and subsystem. Failure to do so will automatically shut down the system.
D20 Input/Output Ports
The connector ports on the rear of the D20 I/O subsystem are used to connect the RIO-2 cables, the
SPCN cables, and the rack-beacon LED.
18 D10 and D20 Service Guide
D20 I/O Subsystem Locations
The system uses physical location codes to provide mapping of the failing field replaceable units. The
location codes are produced by the processor subsystem’s firmware and AIX. For information about how to
read a location code, see the service guide for the processor subsystem to which your I/O subsystem is
connected.
D20 Front View
1 Serial Number 3 D20 Front Bezel
2 Operator Panel 4 Disk Drives (maximum quantity 12 )
D20 Rear View
1 PCI-X Expansion Slot 1 9 Primary Power Supply
2 PCI-X Expansion Slot 2 10 D20 I/O Drawer
3 PCI-X Expansion Slot 3 11 Connector J11 (Not Used)
4 PCI-X Expansion Slot 4 12 Connector J14 (Not Used)
5 PCI-X Expansion Slot 5 13 Rear Serial Number Label
6 PCI-X Expansion Slot 6 14 Connector J15 (SPCN 3)
7 PCI-X Expansion Slot 7 15 Connector J16 (SPCN 4)
8 Redundant Power Supply Receptacle 16 RIO Connectors
Upper Connector = 1
Lower Connector = 0
Chapter 1. Reference Information 19
D20 Power Supply Locations
1
2 3 4
6
5
1 Power Supplies 4 Green AC Power LED
2 Amber Fault/Identify LED 5 Power Supply Filler
3 Green DC Good LED 6 D20 I/O Drawer
20 D10 and D20 Service Guide
D20 Blower Locations
The following illustration identifies the I/O subsystem cooling blowers. Each blower has a green power
LED and an amber identify LED located on the front blower housing. The green LED indicates that a
blower has power and is operating correctly. The amber LED is in the identify state when blinking.
1 Green LED 4 Blower Connector
2 Amber LED 5 Hot-Plug Blower Assembly
3 Snap Button 6 Docking Connector
Chapter 1. Reference Information 21
D20 I/O Backplane Locations
The following illustration of the I/O backplane identifies the primary connectors used in your subsystem.
Note: Before replacing the I/O backplane, note the position of pin 1 on the VPD module.
1 PCI-X Adapter Connector 1 11 Disk Drive Power Connector
2 PCI-X Adapter Connector 2 12 Cooling Blower Connector
3 PCI-X Adapter Connector 3 13 VPD Module
4 PCI-X Adapter Connector 4 14 Redundant Power Supply Connector
5 PCI-X Adapter Connector 5 15 Primary Power Supply Connector
6 PCI-X Adapter Connector 6 16 Connector J11 (Not Used)
7 PCI-X Adapter Connector 7 17 Connector J14 (Not Used)
8 Operator Panel Connector 18 Connector J15 (SPCN 3)
9 RIO Bus Adapter Connector 19 Connector J16 (SPCN 4)
10 Disk Drive System Power Control Network
(SPCN) Connector
20 4-pin Connector, Rack Beacon Connector
22 D10 and D20 Service Guide
D20 Reading the Operator Panel and I/O Backplane LEDs
The following illustration shows the component location on the operator panel
1 Power-On LED
2 Attention LED
3 Operator Panel
4 Snap Buttons
The following table defines each LEDs function on the operator panel.
Number Component Name Component Description
1 Green Power LED Standby State - Blinking LED
(System connected to power source but not powered on.)
Powered On - LED is turned on solid (not blinking).
2 Amber Attention LED Normal State - LED is off.
Chapter 1. Reference Information 23
D20 SCSI IDs and Bay Locations
Index Bay Location Drive Name SCSI Bus ID
8 DB1 D01 Hot-Plug Disk Drive 8 - DB1
9 DB1 D02 Hot-Plug Disk Drive 9 - DB1
A DB1 D03 Hot-Plug Disk Drive A - DB1
B DB1 D04 Hot-Plug Disk Drive B - DB1
C DB1 D05 Hot-Plug Disk Drive C - DB1
D DB1 D06 Hot-Plug Disk Drive D - DB1
8 DB2 D07 Hot-Plug Disk Drive 8 - DB2
9 DB2 D08 Hot-Plug Disk Drive 9 - DB2
A DB2 D09 Hot-Plug Disk Drive A - DB2
B DB2 D10 Hot-Plug Disk Drive B - DB2
C DB2 D11 Hot-Plug Disk Drive C - DB2
D DB2 D12 Hot-Plug Disk Drive D - DB2
Note: The SCSI bus IDs are the recommended values and indicate how the IDs are set when the system
is shipped from the factory. Field installations might not comply with these recommendations.
24 D10 and D20 Service Guide
Cabling the D20 to a 286/C4
The D20 subsystem can be installed in any location in a standard 19-inch EIA rack. The cables that
connect the subsystems allow some flexibility in drawer placement. Up to two D20s can be connected to a
286/C4. Each I/O subsystem is connected to the processor subsystem using a system power control
network (SPCN) cable loop and a remote I/O (RIO) cable loop. One SPCN cable loop is needed to
connect the I/O subsystems to one processor subsystem.
Before connecting the RIO-2 and SPCN cables to the D20, do the following:
1. Ensure that your 286/C4 is running the latest level firmware. Refer to the ²Firmware Updates² section
of the ²Using the Service Processor² chapter in the 286/C4 and 286/E4 Service Guide, order number
SA23-1277. For information about checking or upgrading the firmware level of your 286/C4.
2. After ensuring that your firmware is at the latest level, turn off the power and disconnect the 286/C4
power cables from the power source.
3. Attach the RIO-2 and SPCN cables. For cabling diagrams, see “RIO-2 (Remote Input Output) Cabling”
on page 26.
4. After connecting the RIO-2 and SPCN cables from the D20 to your system, reconnect the system’s
power cables to the power source and refer to “Stopping and Starting the System” on page 42.
5. Run system verification. Refer to the ²Verifying the Hardware² chapter in the 286/C4 and 286/E4
Installation Guide, order number SA23-1277.
Your 286/C4 is expandable when connected up to two D20 I/O drawers. Each D20 can contain up to 7 I/O
PCI adapter slots, and two 6-pack disk drive (DASD) bays.
Chapter 1. Reference Information 25
RIO-2 (Remote Input Output) Cabling
The following rules apply to RIO-2 cable connections:
Notes:
1. To connect to a D20, you will be using the RIO-2 (Remote Input Output) and SPCN (System Power
Control Network) connectors and cables. The RIO-2 and SPCN cables provide two functions, remote
data bus connection, and power control.
2. The I/O drawers must be connected in a loop for both the RIO-2 and SPCN cables. The loop
connection provides redundant paths so that if a failure occurs in part of a cable, the system will
continue to operate. If a failure does occur, a message is displayed on the system console; however,
the system does continue to operate. See the following illustrations for possible cabling configurations.
To connect the RIO-2 cables to the rear of the 286/C4, do the following:
1. Locate the RIO-2 cable-support bracket.
2. Using two M3 x 6 retaining screws, secure the RIO-2 cable-support bracket to the rear of the 286/C4
chassis. See the following illustration.
1 286/C4
2 RIO-2 Cable-Support Bracket
3 M3 x 6 Retaining Screws (Quantity 2)
3. Pull the sliding portion of the RIO-2 cable-support bracket back from the 286/C4 chassis. The sliding
portion of the cable support bracket will stop before being fully separated.
4. Align the RIO-2 cable connector with the bottom of the sliding portion of the RIO-2 cable support
bracket.
26 D10 and D20 Service Guide
5. Lift the RIO-2 cable and cable connector into the bottom of the RIO-2 cable-support bracket.
1 286/C4
2 RIO-2 Cable-Support Bracket
3 Slide Portion of RIO-2 Cable-Support Bracket
4 RIO-2 Connector Support Lever
6. Pull the RIO-2 connector back until the connector is seated against the support hook.
7. Push the RIO-2 connector forward, ensuring that the RIO-2 connector lever slides over the top of the
connector hooks.
8. Slowly push the assembly toward the chassis.
9. Secure the RIO-2 cable-support bracket with the captive thumbscrew, the RIO-2 connector and slide
portion of the RIO-2 support bracket to the back of the 286/C4 chassis.
1 Captive Thumbscrews
2 RIO-2 Cable and Connector
3 RIO-2 Connector Support Lever
Chapter 1. Reference Information 27
286/C4 Attached to One I/O Drawer:
1 286/C4
2 D20
3 Cable from 286/C4 Connector RIO-2 0 to D20 Connector RIO-2 0
4 Cable from 286/C4 Connector RIO-2 1 to D20 Connector RIO-2 1
286/C4 Attached to Two I/O Drawers:
1 286/C4
2 D20 I/O Drawer
3 D20 I/O Drawer
4 Cable from 286/C4 Connector RIO-2 0 to D20 Connector RIO-2 0
5 Cable from 286/C4 Connector RIO-2 1 to D20 Connector RIO-2 1
6 Cable from D20 Connector RIO-2 1 to D20 Connector RIO-2 0
28 D10 and D20 Service Guide
SPCN (System Power Control Network) Cabling
The following figures provide cabling examples for all valid cabling configurations. Match your configuration
to the correct figure and connect your SPCN cables as shown.
286/C4 Attached to One I/O Drawer:
1 286/C4
2 D20
3 Cable from 286/C4 Connector SPCN 2 to D20 Connector SPCN 4 (J16)
4 Cable from 286/C4 Connector SPCN 1 to D20 Connector SPCN 3 (J15)
286/C4 Attached to Two I/O Drawers:
1 286/C4
2 D20 I/O Drawer
3 D20 I/O Drawer
4 Cable from 286/C4 Connector SPCN 1 to D20 Connector SPCN 3 (J15)
5 Cable from 286/C4 Connector SPCN 2 to D20 Connector SPCN 4 (J16)
6 Cable from D20 Connector SPCN 4 (J16) to D20 Connector SPCN 3 (J15)
Chapter 1. Reference Information 29
D20 Internal Cabling
1 Cooling Blowers (Quantity 4) 6 Disk Drive Power Connector
2 Operator Panel 7 Disk Drive SPCN Connector
3 Disk Drive Backplanes 8 Disk Drive Power/Signal Cable
53P0416
4 Cooling Blower Cable
53P0419
9 Operator Panel Connector
5 Cooling Blower Connector 10 Operator Panel Cable
53P0414
30 D10 and D20 Service Guide
D20 Location Codes
This system (processor subsystem and attached I/O subsystems) uses physical location codes in
conjunction with AIX location codes to provide mapping of a failing field replaceable units (FRUs). The
location codes are produced by the processor subsystem’s firmware and AIX. For information about how to
read a location code, see the service guide for the processor subsystem to which your I/O subsystem is
connected.
AIX and Physical Location Code Table
This section covers the AIX and Physical Location Code tables for the D20 I/O subsystem.
The tables in this section contain the location codes for I/O subsystems when they are attached to the
system unit. In the tables, the location code for the I/O subsystem number is represented by dd. The first
time that an installed system is powered on, the I/O subsystems are numbered. For example, if a system
is first powered on with two I/O subsystems connected, then the dd value for the connected I/O
subsystems should be numbered from 2 and 3.
Note: If the system was powered on with I/O subsystems connected before delivery to the customer, the
I/O subsystem location codes for the drawers connected at that time are already assigned.
If at a later time, an I/O subsystem is removed from the system and a different I/O subsystem is
substituted, the substitute is assigned the next higher number available for its value of dd, which, in the
earlier example, is 4. If the original I/O subsystem is reinstalled in the system configuration, the system
uses the original dd value for the I/O subsystem.
To keep the system from renumbering the I/O drawer when an I/O backplane is replaced, the VPD module
from the old I/O backplane must be moved over to the new I/O backplane.
The following diagram defines each part of a location code.
Ux.dd-xx-yy
| | | |
| | | |
| | | yy This code is used for the next component (yy, yy, yy, ...)
| | xx This code is used for the next component (Px, Fx, Vx, ...)
| dd This code is the location code for the I/O drawer (2 to 63)
Ux This code normally identifies the rack in which a drawer is installed.
The x will always be zero (0) for this system.
Chapter 1. Reference Information 31
D20 Location Code Table
The following table lists the location codes for a D20 I/O subsystem. The location code for the I/O
subsystem connected to a system unit is U0.dd (where dd can be any number from 2 through 63).
FRU Name Physical Location Code
D20 I/O Drawer U0.dd (dd = any number from 2 through 63)
Rack frame U0
I/O drawer backplane U0.dd-P1
I/O subsystem drawer U0.dd
RIO Bus Adapter
RIO bus adapter U0.dd-P1.1
RIO port 1 upper connector U0.dd-P1.1/Q1
RIO port 1 cable U0.dd-P1.1/Q1#
RIO port 0 lower connector U0.dd-P1.1/Q2
RIO port 0 cable U0.dd-P1.1/Q2#
PCI Bus Controller
PCI bus controller PHB0 U0.dd-P1
PCI to PCI bridge EADS-X U0.dd-P1
PCI Bus Controller PHB2
PCI bus controller PHB2 U0.dd-P1
PCI to PCI bridge EADS-X U0.dd-P1
Pluggable Adapter Slots
PCI slot 1 U0.dd-P1/I1
PCI card in slot 1 U0.dd-P1-I1
PCI slot 2 U0.dd-P1/I2
PCI card in slot 2 U0.dd-P1-I2
PCI slot 3 U0.dd-P1/I3
PCI card in slot 3 U0.dd-P1-I3
PCI slot 4 U0.dd-P1/I4
PCI card in slot 4 U0.dd-P1-I4
PCI slot 5 U0.dd-P1/I5
PCI card in slot 5 U0.dd-P1/I5
PCI slot 6 U0.dd-P1/I6
PCI card in slot 6 U0.dd-P1/I6
PCI slot 7 U0.dd-P1/I7
PCI card in slot 7 U0.dd-P1/I7
Power Supply
Power supply 1 (no internal blower) U0.dd-V1
Power supply 2 (no internal blower) U0.dd-V2
Cooling Blowers
Cooling blower 1 U0.dd-F1
Cooling blower 2 U0.dd-F2
Cooling blower 3 U0.dd-F3
32 D10 and D20 Service Guide
FRU Name Physical Location Code
Cooling blower 4 U0.dd-F4
SPCN Connectors
SPCN connector 1 U0.dd-P1/Q3
SPCN connector 2 U0.dd-P1/Q4
SPCN connector 3 U0.dd-P1/Q5
SPCN connector 4 U0.dd-P1/Q6
Rack Indicator
Rack beacon (4–pin) U0.dd-P1/Q7
Operator Panel
Operator panel U0.dd-L1
Temperature Sensor
Temperature sensor (on operator panel) U0.dd-L1
Disk Drives Backplanes
Disk drives backplane 1 U0.dd-P3
Disk drives backplane 2 U0.dd-P4
Disk Drive Locations
Hot-swap disk drive bay 1 U0.dd-P1-Ix/Zn-A8
Hot-swap disk drive bay 2 U0.dd-P1-Ix/Zn-A9
Hot-swap disk drive bay 3 U0.dd-P1-Ix/Zn-AA
Hot-swap disk drive bay 4 U0.dd-P1-Ix/Zn-AB
Hot-swap disk drive bay 5 U0.dd-P1-Ix/Zn-AC
Hot-swap disk drive bay 6 U0.dd-P1-Ix/Zn-AD
Hot-swap disk drive bay 7 U0.dd-P1-Iy/Zn-A8
Hot-swap disk drive bay 8 U0.dd-P1-Iy/Zn-A9
Hot-swap disk drive bay 9 U0.dd-P1-Iy/Zn-AA
Hot-swap disk drive bay 10 U0.dd-P1-Iy/Zn-AB
Hot-swap disk drive bay 11 U0.dd-P1-Iy/Zn-AC
Hot-swap disk drive bay 12 U0.dd-P1-Iy/Zn-AD
SCSI
SES0 U0.dd-P1-Ix/Zn-AF
SES1 U0.dd-P1-Iy/Zn-AF
Chapter 1. Reference Information 33
Powering the System On and Off
Power for the I/O subsystems is controlled by the attached processor subsystem and the System Power
Control Network (SPCN).
For information about procedures to power the system on and off, refer to the service guide of the
processor subsystem to which the I/O subsystem is attached.
Power-On Self-Test
After power is turned on and before the operating system is loaded, the system does a power-on self-test
(POST). The RIO loop that connects the I/O subsystems to the system unit is tested. Tests are also
performed on the installed adapters and devices in the I/O subsystems. If an error occurs during the
POST, an error code is displayed on the system operator panel LCD that indicates which part is failing and
which subsystem contains the error.
34 D10 and D20 Service Guide
D10 and D20 Specifications
This section contains system specifications for both the D10 and D20 I/O subsystems.
D10 Specifications
Dimensions 7311-D10 Two 7311-D10s with
Enclosure
Height 170 mm (6.6 in) 178 mm (7.0 in)
Width 220 mm (8.7 in) 445 mm (17.5 in)
Depth 711 mm (28.0 in) 711 mm (28.0 in)
Weight 16.8 kg (37 lbs) 39.1 kg (86 lbs)
Electrical
Power source loading for two 7311-D10 (max.) 0.21 kVA
Voltage range 200 to 240 V ac, (dc not supported)
Frequency 50 or 60 Hz
Thermal output (typical) 461 Btu/hr
Thermal output (max.) 683 Btu/hr
Power requirements (typical) 135 watts
Power requirements for two 7311-D10 (max.) 200 watts
Power factor 0.91
Inrush currentæ 64 amps
Maximum altitude3, 4 3048 m (10000 ft.)
Temperature Requirementsç Operating
10 to 38°C
50 to 100°F)
Non-Operating
1 to 60°C
(34 to 140°F)
Storage
1 to 60°C
(34 to 140°F)
Humidity Requirements4 Operating Non-Operating Storage
(Noncondensing) 8 to 80% 8 to 80% 8 to 80%
Wet Bulb 23°C (73°F) 27°C (81°F) 29°C (84°F)
Noise Emissions1, 4 Operating Idle
LWAd, one 7311-D10
LWAd, two 7311-D10
LWAd, four 7311-D10
5.6 bels
5.9 bels
6.2 bels
5.6 bels
5.9 bels
6.2 bels
m, one 7311-D10
m, two 7311-D10
m, four 7311-D10
40 dBA
43 dBA
46 dBA
40 dBA
43 dBA
46 dBA
Install/Air Flow: Maintaining service clearance allows proper air flow.
Service Clearances: See 14T/0 and 14T/4 Installation and Service Guide, order number SA23-2544 for T00 or T42
rack service clearances.
1. See page 174 for definitions of noise emissions positions. See noise emissions note 4.
2. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle.
3. The upper limit of the dry bulb temperature must be derated 1°C per 137 m (450 ft.) above 915 m (3000 ft.).
4. The upper limit of the wet bulb temperature must be derated 1°C per 274 m (900 ft. ) above 305 m (1000 ft.).
Chapter 1. Reference Information 35
D20 Specifications
Dimensions
Height 178 mm (7.0 in)
Width 445 mm (17.5 in)
Depth 610 mm (24.0 in)
Maximum Weight 45.9kg (101 lbs)
Electrical
Power source loading (max.) 0.358 kVA
Voltage range 100 to 240 V ac, V dc not supported
Frequency 50 or 60 Hz
Thermal output (typical) 774 Btu/hr
Thermal output (max.) 1161 Btu/hr
Power requirements (typical) 227 watts
Power requirements for two
7311-D20 (max.)
340 watts
Power factor 0.91
Inrush currentæ 60 amps
Maximum altitude3, 4 3048 m (10,000 ft.)
Temperature
Requirementsç
Operating
5 to 35°C
41 to 95°F)
Non-Operating
1 to 43°C
(34 to 109°F)
Storage
1 to 60°C
(34 to 140°F)
Humidity Requirements4 Operating Non-Operating Storage
(Noncondensing) 8 to 80% 8 to 80% 5 to 80%
Wet Bulb 23°C (73°F) 27°C (81°F) 29°C (84°F)
Noise Emissions1, 5 Operating Idle
LWAd 6.1 bels 6.0 bels
m 44 dBA 43 dBA
Install/Air Flow Maintaining service clearance allows proper air flow.
Service Clearances: See 14T/0 and 14T/4 Installation and Service Guide, order number SA23-2544 for T00 or T42
rack service clearances.
1. See 174 for definitions of noise emissions positions. See noise emissions note 4.
2. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle.
3. The upper limit of the dry bulb temperature must be derated 1 degree C per 137 m (450 ft.) above 915 m (3000
ft.).
4. The upper limit of the wet bulb temperature must be derated 1 degree C per 274 m (900 ft. ) above 305 m (1000
ft.).
36 D10 and D20 Service Guide
External AC Power Cables
To avoid electrical shock, a power cable with a grounded attachment plug is provided. Use only properly
grounded outlets.
Power cables used in the United States and Canada are listed by Underwriter’s Laboratories (UL) and
certified by the Canadian Standards Association (CSA). These power cords consist of the following:
v Electrical cables, Type SVT or SJT.
v Attachment plugs complying with National Electrical Manufacturers Association (NEMA) 5-15P, that is:
²For 115 V operation (D20 only), use a UL listed cable set consisting of a minimum 18 AWG, Type SVT or
SJT three-conductor cord a maximum of 15 feet in length and a parallel blade, grounding type attachment
plug rated at 15 A, 125 V.²
²For 230 V operation in the United States use a UL listed cable set consisting of a minimum 18 AWG,
Type SVT or SJT three-conductor cable a maximum of 15 feet in length, and a tandem blade, grounding
type attachment plug rated at 15 A, 250 V.²
v Appliance couplers complying with International Electrotechnical Commission (IEC) Standard 320, Sheet
C13.
Power cables used in other countries consist of the following:
v Electrical cables, Type HD21.
v Attachment plugs approved by the appropriate testing organization for the specific countries where they
are used.
²For units set at 230 V (outside of U.S.): use a cable set consisting of a minimum 18 AWG cable and
grounding type attachment plug rated 15 A, 250 V. The cable set should have the appropriate safety
approvals for the country in which the equipment will be installed and should be marked `HAR’.²
For information about the power cables that are available, refer to Chapter 4, “Parts Information”, on
page 157.
Chapter 1. Reference Information 37
Service Inspection Guide
Perform a service inspection on the system (processor subsystem and connected I/O subsystems) when:
v The system is inspected for a maintenance agreement.
v Service is requested and service has not recently been performed.
v An alterations and attachments review is performed.
v Changes have been made to the equipment that may affect the safe operation of the equipment.
v External devices with their own power cables have those cables attached.
If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone
can service the machine.
Note: The owner of the system is responsible to correct any unsafe conditions.
Perform the following checks:
1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the
system.
2. Check the covers for proper fit to the system. They should be in place and secure.
3. Gently rock the system from side to side to determine if it is steady.
4. Set the power button of the system to Off.
5. Remove the covers.
6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as
broken wires, sharp edges, or broken insulation.
7. Check the internal cables for damage.
8. Check for dirt, water, and any other contamination within the system.
9. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet.
10. Check the external power cable for damage.
11. With the external power cable connected to the system, check for 0.1 ohm or less resistance between
the ground lug on the external power cable plug and the metal frame.
12. Perform the following checks on each device that has its own power cables:
a. Check for damage to the power cable.
b. Check for the correctly grounded power cable.
c. With the external power cable connected to the device, check for 0.1 ohm or less resistance
between the ground lug on the external power cable plug and the metal frame of the device.
13. Install the covers.
38 D10 and D20 Service Guide
Chapter 2. Diagnostics and Maintenance Procedures
The system uses an integrated set of software diagnostic procedures to help isolate failing components
and system maintenance. This book, along with the Diagnostic Information for Multiple Bus Systems, is the
basis of the diagnostic procedures. Refer to the system service guide when running diagnostics on your
I/O subsystem.
Power and Attention LEDs
The Power and Attention LEDs provide a means to identify failing components in your subsystem. When a
failing component is detected in your system, the system’s Attention LED is turned on.
Component LEDs
To further help you identify the failing component, all system components have individual LEDs that
indicate a failure when lit. The LEDs are either on the component itself or on the carrier of the component
(memory card, fan, memory module, or CPU).
The LEDs are either green or amber in color. A lit green LED indicates that the system or component is
receiving power. A lit amber LED identifies a system or component on the system. The amber LED can
also indicate when a system or component on a system has a problem or fault. For more information on
LEDs and their definitions, see Chapter 5, “Fault and Attention LEDs”, on page 169.
Resetting the LEDs
To reset the LEDs, do the following:
1. Replace the failing component with the new component.
2. Log in as root user.
3. At the command line, type diag.
4. Select Task Selection.
5. Select Log Repair Action.
6. Select the device that was repaired. (If the device is not listed, select sysplanar0.)
Checkpoints
The system uses various types of checkpoints, error codes, and SRNs, which are referred to throughout
your system’s service guide. These codes can appear in the service processor boot progress log, the AIX
error log, and the operator panel display. Understanding the definition and relationships of these codes is
important to the service personnel who are installing or maintaining the system.
Service Request Numbers (SRNs) are listed in the Diagnostic Information for Multiple Bus Systems, order
number SA23-2769.
FRU Isolation
For a list of error codes and recommended actions for each code, see your system’s service guide. These
actions can refer to Chapter 4, “Parts Information”, on page 157, “Maintenance Analysis Procedures
(MAPs)” on page 40, or provide informational message and directions. If a replacement part is indicated,
direct reference is made to the part name. The respective AIX and physical location codes are listed for
each occurrence as required. For a list of locations codes, see your system’s service guide.
39
Maintenance Analysis Procedures (MAPs)
The maintenance analysis procedures (MAPs) provide the service representative a step-by-step procedure
to analyze a problem with the system and I/O subsystem hardware. Be prepared to record code numbers
and other data while using the MAPs.
Because the D10 and D20 are subsystems, refer to your system’s service guide. The system’s service
guide contains all of the MAPs needed for servicing the D10 and D20 I/O subsystems.
Checkpoints
Checkpoints display on the system operator panel and the virtual terminal while the system unit is
powering on and going through the initial program load (IPL). See the system’s service guide for
descriptive information and a complete listing of all checkpoints used by the D10 and D20 I/O subsystems.
Error Code to FRU Index
The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed
first. Use this index to help you decide which FRUs to replace when servicing the D10 or D20 I/O
subsystem. See your system’s service guide for more information and a complete listing of all error codes
and FRUs used by the D10 and D20 I/O subsystem.
Using the Service Processor
The service processor is in the processor subsystem to which your I/O subsystem is connected and runs
on its own power boundary. The service processor continually monitors hardware attributes and the
environmental conditions within the system and connected I/O subsystems. The service processor is
controlled by firmware and does not require the operating system to be operational to perform its tasks.
The service processor menus allow you to configure service processor options, as well as enable and
disable functions.
Service processor menus are available using an ASCII terminal or an HMC virtual terminal window when
OK is displayed on the operator panel or when the service processor has detected a system problem (such
as a surveillance failure).
For more information about the service processor for the system to which your I/O subsystem is
connected, see the system’s service guide.
Using System Management Services
Use the system management services menus to view information about the processor subsystem to which
the I/O subsystem is connected. For more information about the service processor, see the service guide
for the attached system.
40 D10 and D20 Service Guide
Chapter 3. Removal and Replacement Procedures
Before performing any of the removal or replacement procedures in this chapter, read the following danger
and caution notices.
Safety Considerations
Observe the following safety precautions anytime you work with these I/O subsystems.
For the I/O subsystem you are about to setup or service:
v The ac power interface connector is considered the main power disconnect device.
v The I/O subsystems have independent redundant power supply capabilities, meaning that each unit
might be configured to have two power supplies running simultaneously in the same I/O subsystem.
When instructed to disconnect the power source, ensure that all power cables that run to each serviced
subsystem are disconnected from the power distribution bus.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for the system unit
and all attached devices are unplugged.
When adding or removing any additional devices to or from the system, ensure that the power
cables for those devices are unplugged before the signal cables are connected. If possible,
disconnect all power cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communications lines.
D05
CAUTION:
This product is equipped with a three-wire power cable and plug for the user’s safety. Use this
power cable with a properly grounded electrical outlet to avoid electrical shock.
C01
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect
two power supply cords before servicing.
C21
41
Handling Static-Sensitive Devices
Attention: Electronic boards and disk drives are sensitive to static electricity discharge. These devices
are wrapped in antistatic bags to prevent this damage.
Take the following precautions:
v If you have an antistatic wrist strap available, use it while handling the device.
v Do not remove the device from the antistatic bag until you are ready to install the device in the system.
v With the device still in its antistatic bag, touch it to a metal frame of the system.
v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or
pins.
v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag. Before
picking it up again, touch the antistatic bag and the metal frame of the system at the same time.
v Handle the devices carefully to prevent permanent damage.
Stopping and Starting the System
For procedures to power the system on and off, refer to the service guide of the processor subsystem to
which the I/O subsystem is attached.
Removal and Replacement Procedures for the D10 I/O Subsystem
The following procedures cover the removal and replacement of the D10 I/O Subsystem FRUs.
Note: Before performing any of the removal or replacement procedures in this chapter, read the danger
and caution notices on “Safety Considerations” on page 41.
D10 FRU Replacement Procedure List
Field Replaceable Unit (FRU) FRU Procedure Name and Page Location
Adapters “D10 PCI Adapters” on page 45
Bezel “D10 Front Bezel” on page 43
Cover “D10 Covers” on page 44
Blower “D10 Fan” on page 104
I/O Backplane “D10 I/O Backplane Assembly” on page 106
Operating Position “D10 Operating Position” on page 43
Power Supplies “D10 Power Supply” on page 107
RIO Bus Adapter “D10 RIO Bus Adapter Assembly” on page 105
Service Position “D10 Service Position” on page 43
Static-Sensitive Devices “Handling Static-Sensitive Devices”
Stopping and Starting the System “Stopping and Starting the System”
42 D10 and D20 Service Guide
D10 Service Position
To perform a removal or replacement procedure that requires access to the inside of the I/O subsystem,
the subsystem must be removed from the rack and placed on a stable work surface.
To put the drawer into the service position, do the following:
1. Shut down the system unit to which the I/O subsystem is connected.
2. From the rear of the rack, disconnect the I/O subsystem’s power cables from the power distribution
bus.
3. Remove the retaining screws located on the rear of the I/O subsystem drawer.
4. Label and disconnect all of the cables connected to the rear of the I/O subsystem.
5. From the rear of the rack, pull the I/O subsystem straight back until the subsystem stops.
6. To allow the subsystem to move further to the rear, press the stop latch on the side of the enclosure.
7. Support the I/O subsystem as you pull it toward the rear of the rack.
8. When the I/O subsystem is out of the rack, place it on a stable work surface.
D10 Operating Position
To return the subsystem to the operating position, do the following:
1. From the rear of the rack, insert the I/O subsystem into the position from which it was removed. The
end of the I/O subsystem that has the power supplies goes toward the front of the rack.
2. Support the I/O subsystem as you push it toward the front of the rack.
3. Install the retaining screws in the rear of the I/O subsystem drawer.
4. Reconnect the cables to the rear of the I/O subsystem.
5. Reconnect the power cables.
6. Restart the system.
D10 Front Bezel
Before performing the following procedure, read the “Safety Notices” on page vii.
Removal
To remove the front bezel, do the following:
1. Open the front rack door.
2. Simultaneously press in both bezel-release tabs.
3. Pivoting the bezel from the top, swing the top forward.
4. Pull the bottom of the bezel up, then away from the subsystem chassis. This action releases the two
tab hooks located on the bottom of the I/O subsystem chassis.
Front Bezel Replacement
To replace the front bezel, do the following:
1. Open the front rack door.
2. Insert the two tabs located on the bottom edge of the bezel into their locking hooks, located on the
chassis.
3. Pivot the front bezel up toward the top of the chassis.
4. Align the release tabs to the matching slots located on the front of the subsystem chassis.
5. Gently push the tabs into the slots until the bezel seats against the front of the subsystem.
6. If the subsystem is not completely in the operating position, push the subsystem back into the
operating position as described in “D10 Operating Position”.
7. Close the rack door.
Chapter 3. Removal and Replacement Procedures 43
D10 Covers
Before performing the following procedure, read the “Safety Considerations” on page 41.
Service Access Cover Removal
To remove the service access cover, do the following:
1. Put the I/O subsystem into the service position as described in “D10 Service Position” on page 43.
2. Loosen the two captive thumbscrews located on the rear of the cover.
3. From the rear of the I/O subsystem, lift the cover and slide it backwards until the front disengages. Lift
the cover off the I/O subsystem drawer.
1
2
1 Service access cover 2 Thumbscrews
Service Access Cover Replacement
To replace the service access cover, do the following:
1. Position the cover over the rear of the I/O subsystem.
2. Align the service access cover with the I/O subsystem so that the front portion of the cover engages
with the front part of the I/O subsystem chassis. The flanges on the left and right sides of the cover
should be on the outside of the I/O subsystem chassis.
3. Hold the service access cover down and slide it forward toward the front of the I/O subsystem. The
front edge of the service access cover engages the front portion of the I/O subsystem.
4. Push in to engage and then tighten the thumbscrews located on the rear of the cover.
44 D10 and D20 Service Guide
D10 PCI Adapters
Most PCI adapters can be removed and replaced, or installed in the system without turning off the power
to the entire system. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are
not hot-pluggable and power must be removed from the system for adapter removal, replacement, or
installation. Though some adapters are not hot-pluggable, they must still be installed in the cassette.
Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly
before it is reinstalled in a system unit or an I/O drawer.
Before you remove or replace an adapter, determine if the PCI adapter you are working with is
hot-pluggable. See the PCI Adapter Placement Reference, order number SA23-2504.
If you are removing a PCI adapter that is:
v Not hot-pluggable, go to “Removing a Non-Hot-Pluggable PCI Adapter”.
v Hot-pluggable, go to “Hot-Pluggable PCI Adapter” on page 46.
Removing a Non-Hot-Pluggable PCI Adapter
To remove a non-hot-pluggable adapter, do the following:
1. Turn off power and remove the power from the system as described in “Stopping and Starting the
System” on page 42.
2. Determine the slot from which you are removing the adapter.
3. Disconnect any cables that are connected to the adapter being removed.
4. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
5. When the handle is completely lowered, push the gray locking cross bar.
6. Carefully pull the PCI card cassette straight out from the I/O subsystem as shown in the following
illustration. Take care not to pull EMC gaskets from neighboring cassettes.
1 I/O Subsystem
2 PCI adapter Cassette
7. If you are installing another adapter in this slot, follow the instructions given in “Replacing a
Non-Hot-Pluggable PCI Adapter” on page 46.
8. If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot.
9. Connect power to the system and turn the power on.
10. Return the system to normal operations.
Chapter 3. Removal and Replacement Procedures 45
Replacing a Non-Hot-Pluggable PCI Adapter
Replace the adapter using the following procedure:
Note: Because the adapter you are installing is not hot-pluggable, shut down the system and remove
power before performing this procedure.
1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
2. When the handle is completely lowered, push the gray locking cross bar.
3. Hold the assembly straight on and level with the slot.
4. Align the bottom edge of the PCI cassette cover with the PCI card guide rail on the I/O backplane.
Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with
the ²tick² in the notch of the neighboring cassette.
5. Slide the cassette partially into the guide.
6. Ensure the dovetail on the top track aligns with its mating component(s) on both sides.
7. When the cassette is fully inserted, prepare to activate the handle by lowering the gray locking bar.
Lift the handle all the way up until you hear a click. The PCI card should be completely seated.
Note: To insert the adapter correctly, some minor forward or backward movement of the PCI adapter
cassette might be necessary.
8. Set the color slide to blue.
9. Connect power to the system, and turn on the power as described in “Stopping and Starting the
System” on page 42.
10. Ensure that the adapter is configured when the system completes the boot process.
Hot-Pluggable PCI Adapter
Before performing the following procedure, read “Safety Notices” on page vii.
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
The following hot-plug procedures take you through removing and replacing hot-plug PCI adapters using
software that presents procedures on your display. The LEDs on a PCI adapter cassette are described in
“I/O Subsystem PCI-X Slot LED Definitions” on page 2.
46 D10 and D20 Service Guide
Removing and Replacing a Hot-Pluggable PCI Adapter
Note: Use this procedure only when you are replacing an adapter with an identical adapter. If you are
replacing an adapter with an adapter that is not identical to the adapter removed, go to “Removing
a Hot-Pluggable PCI Adapter” on page 49, and then to “Installing a Hot-Pluggable PCI Adapter” on
page 51.
To replace an adapter, perform the following steps:
1. Determine the slot from which you are removing the adapter.
2. Ensure that any processes or applications that might use the adapter are stopped.
Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI
adapter offline before performing any PCI adapter hot-plug procedures. Before taking an
adapter offline, the devices attached to the adapter must be taken offline as well. This action
prevents a service representative or user from causing an unexpected outage for system
users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
3. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has
the adapter assigned to it.
4. At the command line, type smitty.
5. Select Devices.
6. Select PCI Hot Plug Manager.
7. Select Unconfigure a Device and press Enter.
8. Press F4 to display the Device Names menu.
9. From the menu, select the adapter you are removing.
10. Answer YES to Keep Definition. Press Enter.
11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfigure is
indicated by the OK message displayed next to the Command field at the top of the screen.
12. Press F3 to return to the PCI Hot-Plug Manager menu.
13. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI
Hot-Plug Adapter menu displays.
14. Move the cursor to select the adapter that you are removing and press Enter.
15. Press the Tab key until the entry field displays the replace operation and then press the Enter key.
Follow the instructions that display on the screen until you are instructed to remove the adapter.
16. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
17. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
18. When the handle is completely lowered, push the gray locking cross bar.
Chapter 3. Removal and Replacement Procedures 47
19. Carefully pull the PCI card cassette straight out from the rear of the subsystem. Take care not to pull
EMC gaskets from neighboring cassettes.
1 I/O Subsystem 2 PCI Adapter Cassette
20. Install the replacement adapter in the PCI adapter cassette, see “PCI Adapter or Blank Filler Removal
from a Cassette Assembly” on page 59.
21. When the adapter is installed in the PCI adapter cassette, install the adapter in the adapter slot, as
follows:
a. Lower the black handle completely.
b. Push the gray locking cross bar until you hear a clicking sound.
c. Hold the assembly straight on and level with the slot.
1 I/O Subsystem 2 PCI Adapter Cassette
d. Align the bottom edge of the PCI cassette cover with the PCI adapter guide rail on the I/O
backplane.
Note: If there is a cassette to the left of the one that you are installing, align the ridge on the
cover with the tick in the notch of the neighboring cassette.
e. Slide the cassette partially into the guide.
f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides.
g. When the cassette is fully inserted, prepare to activate the handle, lower the gray locking bar. Lift
the handle up completely until you hear a click. The PCI adapter should be completely seated.
Note: To enable proper insertion of the adapter, some minor forward or backward movement of
the PCI adapter cassette might be necessary.
h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable.
22. Connect appropriate cables and devices to the adapter.
48 D10 and D20 Service Guide
23. Continue to follow the screen instructions until you receive a message that the replacement is
successful. Successful replacement is indicated by the OK message displayed next to the Command
field at the top of the screen.
24. Press the F3 key to return to the PCI Hot-Plug Manager menu.
25. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on
the screen. Successful replacement is indicated by the OK message displayed next to the Command
field at the top of the screen.
26. If you do not have other adapters to replace, continue with the next step.
OR
If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 13 on page 47.
27. Press F10 to exit the Hot-Plug Manager.
If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
28. Install the covers that you removed earlier and return the drawer to the operating position.
Removing a Hot-Pluggable PCI Adapter
If you are permanently removing an adapter from a system, this procedure should be used.
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Note: If you are removing an adapter and replacing it with an adapter that is identical to the adapter that
was removed, use the “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 47
procedure.
To remove an adapter, perform the following steps:
1. Open the door of the rack.
2. Determine the slot from which you are removing the adapter.
3. Ensure that any processes or applications that might use the adapter are stopped.
Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI
adapter offline before performing any PCI adapter hot-plug procedures. Before taking an
adapter offline, the devices attached to the adapter must also be taken offline as well. This
action prevents a service representative or user from causing an unexpected outage for
system users.
For more information on taking the PCI adapter online, see the AIX operating system
documentation. Documentation for the AIX operating system is available from the IBM
Eserver pSeries Information Center at
http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base. Select AIX documentation.
The AIX Documentation CD contains the base set of publications for the operating system,
including system-management and end-user documentation.
Chapter 3. Removal and Replacement Procedures 49
4. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has
the adapter assigned to it.
5. At the command line, type smitty.
6. Select Devices.
7. Select PCI Hot Plug Manager.
8. Select Unconfigure a Device and press Enter.
9. Press F4 to display the Device Names menu.
10. Select the adapter you are removing from the menu.
11. Use the Tab key to answer NO to Keep Definition. Press Enter.
12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfigure is
indicated by the OK message displayed next to the Command field at the top of the screen.
13. Press F3 to return to the PCI Hot-Plug Manager menu.
14. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI
Hot-Plug Adapter menu displays.
15. Move the cursor to select the adapter that you are removing and press Enter. (The description entry
displays as unknown).
16. Press the Tab key until the entry field displays the remove operation and then press the Enter key.
Follow the instructions that display on the screen until you are instructed to remove the adapter.
17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
18. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
19. When the handle is completely lowered, push the gray locking cross bar.
20. Carefully pull the PCI adapter cassette straight out from the I/O subsystem, as shown in the following
illustration. Take care not to pull EMC gaskets from neighboring cassettes.
1 I/O Subsystem 2 PCI adapter Cassette
21. If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot.
22. Continue to follow the screen instructions until you receive a message that the adapter removal is
successful. Successful removal is indicated by the OK message displayed next to the Command field
at the top of the screen.
23. If you do not have other adapters to remove, continue with the next step.
OR
If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 14.
24. Press F10 to exit the Hot-Plug Manager.
25. Install any covers that you removed earlier, and return the drawer to the normal operations.
50 D10 and D20 Service Guide
Installing a Hot-Pluggable PCI Adapter
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Note: Before installing a new adapter, ensure the device driver for the adapter is installed. Check with the
system administrator if the device driver is not already installed.
To install an adapter, perform the following steps:
1. Install the new adapter into a PCI adapter cassette, see “PCI Adapter or Blank Filler Removal from a
Cassette Assembly” on page 59.
2. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has
the adapter assigned to it.
3. At the command line, type smitty.
4. Select Devices.
5. Select PCI Hot Plug Manager.
6. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The
Add a Hot-Plug Adapter window displays.
7. See the PCI Adapter Placement Reference, order number SA23-2504, for adapter placement
information. Select an empty PCI slot for the adapter.
8. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter.
9. Follow the instructions on the screen to install the adapter until the visual indicator (LED) for the
specified PCI slot is set to the Action state.
10. Install the new adapter in the adapter slot by performing the following:
a. Lower the black handle completely.
b. Push the gray locking cross bar until you hear a clicking sound.
c. Hold the assembly straight on and level with the slot.
d. Align the bottom edge of the PCI cassette cover with the PCI card guide rail on the I/O
backplane.
Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover
with the ²tick² in the notch of the neighboring cassette.
Chapter 3. Removal and Replacement Procedures 51
e. Slide the cassette partially into the guide.
f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides.
1 I/O Subsystem 2 PCI adapter Cassette
g. When the cassette is fully inserted, prepare to activate the handle, lower the gray locking bar. Lift
the handle all the way up until you hear a click. The PCI Card should be completely seated.
h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable.
11. Connect appropriate cables and devices to the adapter.
12. Continue to follow the screen instructions until you receive a message that the installation is
successful. Successful installation is indicated by the OK message displayed next to the Command
field at the top of the screen.
13. Press the F3 key to return to the PCI Hot-Plug Manager menu.
14. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on
the screen. Successful installation is indicated by the OK message displayed next to the Command
field at the top of the screen.
15. If you do not have other adapters to install, continue with the next step.
OR
If you have other adapters to install, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 6 on page 51.
16. Press F10 to exit the Hot-Plug Manager.
If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
17. Install any covers that you removed earlier, and return the drawer to normal operations.
52 D10 and D20 Service Guide
PCI Hot-Plug Manager Access
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is
appropriate to perform them.
Note: A PCI adapter is only hot-pluggable if the PCI adapter supports hot-plug applications. See the PCI
Adapter Placement Reference, order number SA23-2504.
Accessing Hot-Plug Management Functions
Note: Removing or installing a Hot-Pluggable PCI adapter requires the system administrator to take the
PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices
attached to the adapter must be taken offline as well. This action prevents a service representative
or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
To access the hot-plug menus, do the following:
1. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has
the adapter assigned to it.
2. At the command line, type smitty.
3. Select Devices.
4. Select PCI Hot Plug Manager and press Enter.
5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. For a
description of the menu options, see below.
PCI Hot-Plug Manager Menu
The following options are available from the PCI Hot Plug Manager menu:
Note: For information about the PCI slot LED states, see “I/O Subsystem PCI-X Slot LED Definitions” on
page 2.
Chapter 3. Removal and Replacement Procedures 53
List PCI Hot-Plug Slots
Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it
holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the
adapter in that slot.
Add a PCI Hot-Plug Adapter
Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will
be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot
will go into the Action state and finally into the On state.
Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure
Devices Added After IPL option to configure the adapter.
Replace/Remove a PCI Hot-Plug Adapter
Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For
this option to work, the adapter must be in the Defined state (see “Unconfigure a Device” option below).
You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the
Action state.
Identify a PCI Hot-Plug Slot
Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “I/O
Subsystem PCI-X Slot LED Definitions” on page 2.
Unconfigure a Device
Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use.
This step must be completed successfully before starting any removal or replacement operation. If this
step fails, the customer must take action to release the device.
Configure a Defined Device
Allows a new PCI adapter to be configured into the system if software support is already available for the
adapter. The selected PCI slot will go into the On state.
54 D10 and D20 Service Guide
Install/Configure Devices Added After IPL
The system attempts to configure any new devices and tries to find and install any required software from
a user-selected source.
The add, remove, and replace functions return information to the user indicating whether the operation
was successful. If additional instructions are provided on the screen, complete the recommended actions.
If the instructions do not resolve the problem, see the following:
v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to
configure the adapter.
v If you receive a warning indicating that needed device packages are not installed, the system
administrator must install the specified packages before you can configure or diagnose the adapter.
v If you receive a failure message indicating a hardware error, the problem might be either the adapter or
the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different
adapter in the slot. If you determine that you have failing hardware, call your service representative.
Removing and Replacing a PCI Adapter Cassette
Perform these procedures only when you are directed to by the “Removing a Non-Hot-Pluggable PCI
Adapter” on page 45 or “Hot-Pluggable PCI Adapter” on page 46.
Removing a PCI Adapter Cassette
Attention: Perform this procedure only when you are instructed to by the removal and replacement
procedures for a “Removing a Non-Hot-Pluggable PCI Adapter” on page 45 or “Hot-Pluggable PCI
Adapter” on page 46.
Note: An adapter or a blank filler must be installed into the PCI adapter cassette assembly before it is
reinstalled in a processor subsystem or an I/O subsystem.
The following illustration shows the PCI adapter cassette with the handle lowered.
1
2
3
4
1 dove-tail 3 Handle in the lowered position
2 Gray locking crossbar 4 Center of handle
1. Determine the slot from which you are removing the PCI adapter cassette.
2. Disconnect any cables that are connected to the adapter cassette being removed.
Chapter 3. Removal and Replacement Procedures 55
3. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
1
2
3
1 Center of handle 3 Gray locking crossbar
2 Handle in the lowered position
4. When the handle is completely lowered, push the gray locking cross bar. This action prevents the
handle from pulling up while the adapter is being removed from the unit.
5. Carefully pull the PCI card cassette straight out from the subsystem as shown in the following
illustration. Take care not to pull EMC gaskets from neighboring cassettes.
Attention: Do not remove more than one adapter cassette at at time.
1 Processor subsystem 2 PCI adapter cassette
Replacing a PCI Adapter Cassette
Attention: Perform this procedure only when instructed to by “Removing a Non-Hot-Pluggable PCI
Adapter” on page 45 or “Hot-Pluggable PCI Adapter” on page 46. Check the following:
v An adapter or blank filler must be installed in each PCI adapter cassette and the top cover for the unit
must be installed before installing PCI adapter cassette in the system.
v Check the adapter in the cassette that you are about to install and make sure that the adapter is held
firmly by the retaining arms and rachet clips on all four corners.
v Review the procedures beginning with “PCI Adapter or Blank Filler Removal from a Cassette Assembly”
on page 59 to make sure that the adapter is correctly installed in the cassette.
Replace the PCI adapter cassette in the unit using the following procedure:
56 D10 and D20 Service Guide
1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle
completely.
1
2
3
4
1 Dove-tail 3 Handle in the lowered position
2 Gray locking crossbar 4 Center of handle
2. Study the PCI adapter slot opening carefully and note the location the dove-tail channels and the
cassette guide slot. Use a flashlight to better see the dove-tail channels and the cassette guide slot
1 Dove-tail
channels
2 PCI cassette guide slot
Chapter 3. Removal and Replacement Procedures 57
3. Hold the assembly straight on and level with the slot.
1 Processor subsystem 2 PCI adapter cassette
4. Align the bottom edge of the PCI cassette cover with the PCI cassette guide rail on the of the system
or I/O unit.
Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with
the ²tick² in the notch of the neighboring cassette.
5. Slide the cassette partially into the guide.
Attention: If the cassette does not slide in smoothly, the cover might not have engaged PCI cassette
guide rail. Carefully pull the adapter back out and start again. Also, the cassette might bind if it hits the
top cover as it is inserted.
6. As you slide the cassette into the slot, ensure the dove-tails on the top of the cassette align with the
mating components on both the left and right sides.
7. Push the cassette in until its bezel is even with the next cassettes (to the left or right).
Note: Viewed from the rear, the left and right dove tail channels must be properly engaged. Check the
following on the cassette after it is completely inserted into the unit.
v There should be no freedom of movement of the cassette when pressure is applied to the
handle in a back and fourth motion.
v The PCI cassette bezel should be even with the other cassettes that are already installed.
v Visually check to ensure that the cassette is parallel with the cassettes to the left or right. If
the cassette is not parallel with the neighboring cassettes, the cassette dove-tail channels
might not have engaged, pull the cassette out a few inches, reinsert it, and check the
alignment again. You might need to apply light pressure to the left or right neighboring
cassettes against the new cassette in order to engage both of the dove-tails correctly.
8. When the cassette is fully inserted, ensure that the gray locking bar is in the unlocked position (pulled
out).
Attention: Do not force the handle into the raised postion. To enable proper insertion of the adapter,
press down on the adapter tailstock (metal tab just below the handle hinge) at the rear of the adapter
cassette when lifting the handle. Some minor forward or backward movement of the PCI adapter
cassette might be necessary.
9. Carfully lift the handle all the way up until you hear and feel a click. The PCI adapter is pushed into the
PCI adapter slot when the handle is raised.
58 D10 and D20 Service Guide
PCI Adapter or Blank Filler Removal from a Cassette Assembly
Use this procedure when you are preparing to install or replace an adapter or an adapter blank filler in the
system unit or an I/O drawer.
Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly
before it is reinstalled in a system unit or an I/O drawer.
Determine the slot of the system or I/O drawer in which you plan to install or remove an adapter, and refer
to the procedures for removing an adapter from the system. Remove the PCI adapter cassette assembly
from the system unit or I/O drawer before beginning this procedure.
Note: It may take approximately 30 to 40 minutes to perform this procedure the first time. This time
includes using the instructions in this guide and performing the steps. Thereafter, performing this
procedure usually takes approximately 10 minutes.
Before performing the following steps, familiarize yourself with the entire procedure.
To remove a PCI adapter or blank filler from a cassette assembly, do the following:
1. Place the PCI adapter cassette assembly on a flat work surface with the cover facing up, and the top
of the adapter facing you. See the following illustration.
1 Cover 3 Top of Cover
2 Handle
Chapter 3. Removal and Replacement Procedures 59
2. Using two fingers, remove the bushing-lock pin from the bushing. The pin can be removed by pulling it
out of the bushing with your fingernails.
1 Bushing
2 Bushing-Lock Pin
1 Bushing-Lock Pin
60 D10 and D20 Service Guide
3. Remove the bushing. The bushing can be removed by pulling it out of the PCI adapter cassette
assembly with your fingernails.
1 Bushing
4. Turn over the PCI adapter cassette assembly so that the top is facing away from you.
1 Top of Adapter
2 Handle
Chapter 3. Removal and Replacement Procedures 61
5. Remove the bezel, as follows:
a. Locate the plastic latch fingers in the top part of the cassette.
1 Top of Cassette
2 Bezel
3 Plastic Latch Fingers
b. Using one hand, pinch the plastic latch fingers, and with your other hand, carefully lift the top part
of the bezel extension out until the tabs clear the slots in the PCI adapter cassette assembly.
1 Slots
2 Tab
3 Bezel Extension
62 D10 and D20 Service Guide
c. While holding the bezel extension out, push the plastic cover latch out of the bezel hook, as shown
in the following illustration. This action allows the bezel to be removed.
1 Bezel Extension
2 Plastic Cover Latch in Bezel Hook
d. On the opposite side of the cassette (cover side), push the cover latch to release the bezel.
1 Cover Side
2 Cover Latch
Chapter 3. Removal and Replacement Procedures 63
e. While holding the bezel extension out, carefully slide the bezel off. You might have to work from
both sides to loosen the bezel assembly from the cassette assembly. When the bezel is free, slide
it completely off the cassette assembly, and set it aside.
1 Bezel Extension
2 Bezel
1 Bezel
64 D10 and D20 Service Guide
6. Turn over the cassette so that the cover is facing up, as shown in the following illustration. Remove the
cover from the cassette as follows:
a. Slide the cover until it releases from the cassette assembly.
Attention: The cover might be tight and difficult to slide. If you grasp the left end (handle end) of
the cassette and the right end of the cover, you can use enough force to pull the cover off the PCI
adapter cassette assembly.
1 Left End of the Cassette
2 Cassette Cover
b. Lift the cover off the assembly, and set it aside.
c. By pulling on both sides of the gray plastic locking bar, which is located on the handle, ensure that
the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly
until it locks into the up position (the blank filler or adapter moves downward).
1 Handle
2 Gray Plastic Locking Bar
Chapter 3. Removal and Replacement Procedures 65
7. Remove the metal EMC shield from the top of the tailstock.
1 Metal EMC Shield
8. Remove the blank filler or adapter that is installed in the cassette linkage assembly.
Note: If there is a blank filler in the cassette linkage assembly, as shipped from the manufacturer,
there are two adapter arms. One adapter arm is used with short adapters (short adapter arm)
and the other adapter arm is used with long adapters (long adapter arm).
If you are removing a short adapter or blank filler, see “Short Adapter or Blank Filler Removal” on
page 67.
If you are removing a long adapter, see “Long Adapter Removal” on page 69.
66 D10 and D20 Service Guide
Short Adapter or Blank Filler Removal
To remove a short adapter or blank filler, do the following:
1. Slide the long and short adapter arms away from the adapter or blank filler by doing the following:
a. Each adapter arm has a release tab that allows the arm to be moved away from the adapter or
blank filler in the cassette assembly. Use your fingernail to lift the tab, to allow each arm to be
moved away from the adapter or blank filler. Lift the release tab on the short adapter arm, and
push on the slotted tab to release the end of the blank filler.
Note: If you plan to install a short adapter, leave the long adapter arm on the cassette linkage
assembly. If you plan to install a long adapter, remove both the long and short adapter arms
from the cassette linkage assembly in the next step.
b. Slide the long and short adapter arms away from the blank filler or adapter.
1 Long Adapter Arm 4 Short Adapter Arm
2 Release Tab 5 Slotted Tab
3 Release Tab
Chapter 3. Removal and Replacement Procedures 67
2. Remove the adapter or blank filler from the cassette linkage assembly by rotating the bottom of the
tailstock out, as shown in the following illustration. Store the adapter or blank filler in a safe place.
1 Cassette Linkage
Assembly
3 Handle
2 Bottom of Tailstock
3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or
blank filler in the cassette, go to “Installing a Hot-Pluggable PCI Adapter” on page 51.
68 D10 and D20 Service Guide
Long Adapter Removal
To remove a long adapter, do the following:
1. Each adapter arm has a release tab that allows the arm to be moved away from the adapter in the
cassette assembly. Use your fingernail to lift the tab, to allow the arm to be moved away from the
adapter. Lift the release tab on the long adapter arm, and slide it off the cassette linkage assembly.
1 Release Tab
2. Remove the adapter from the cassette linkage assembly by rotating the bottom of the tailstock out, as
shown in the following illustration. Store the adapter in a safe place.
1 Cassette Linkage
Assembly
3 Bottom of Tailstock
2 Adapter 4 Handle
3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or
blank filler in the cassette, go to “Installing a Hot-Pluggable PCI Adapter” on page 51.
Chapter 3. Removal and Replacement Procedures 69
Replacing an Adapter in a PCI Adapter Cassette
This procedure is performed when preparing to install a new adapter or a blank filler in the processor
subsystem or an I/O drawer. Familiarize yourself with the entire procedure before performing the following
steps.
1. Place the empty PCI adapter cassette linkage on a flat work surface in front of you. Position the
cassette so that the handle is in the raised (up position), and on the left, with the top of the linkage
facing away from you.
Note: Your PCI adapter cassette linkage might not have the short or long adapter retaining arms
installed if you removed them during the remove procedure. In the following figure, both arms
are shown.
Note: If you are installing an adapter that is in the following list, use the additional brackets listed to
help when you seat the adapter during installation. There are unique brackets (1) for each
adapter type (2):
1
2
53P5450 Bracket for Gigabit Ethernet PCI-X Adapter, FC 5700 or 10/100/1000 Base-TX
Ethernet PCI-X Adapter, FC 5701
44P2661 Bracket for Dual Channel Ultra3 SCSI PCI Adapter, FC 6203 Type 4Y
44P0321 Bracket for 10/100 BaseT Ethernet PCI Adapter, FC 4962 Type AF
44P2675 Bracket for 2 Gigabit Fibre Channel Adapter, FC 6228 Type 4W
44P2676 Bracket for Gigabit Ethernet - SX PCI Adapter, FC 2969 Type 9U, or
10/100/1000 Base-T Ethernet PCI Adapter, FC 2975 Type A-A
70 D10 and D20 Service Guide
2. Use the following procedure to determine if you are installing a long adapter or a short adapter. If you
know the length of your adapter, skip the following steps and proceed to step 3 on page 73.
a. Remove the adapter from its protective packaging and, if present, remove the plastic extension
handle from the end of the adapter.
b. Place the adapter or a blank filler over the cassette with the upper left corner aligned into the top
adapter-retaining clip.
5
4
2
3
1
1 Top Adapter-Retaining Clip 3 Adapter
2 Cassette Linkage Assembly 4 Bottom of Tailstock
5 Handle
Chapter 3. Removal and Replacement Procedures 71
c. The cassette linkage is marked to show long or short adapters similar to the adapter shown in the
following figure. Check the right end of the adapter to determine if it is long enough to fall into the
long area of the cassette linkage. Ensure that the corner on the left end of the adapter is still
aligned into the top adapter-retaining clip and determine the length of the adapter.
LONG
ARM ARM
SHORT
1
1 Top Adapter-Retaining Clip
72 D10 and D20 Service Guide
3. Remove the adapter or blank filler and ensure that the handle is in the up position. If you need to
move the handle, ensure that the gray locking bar is pulled into the unlocked position, and then rotate
the adapter handle until the handle is in the up position (the cassette linkage will extend down beyond
the cassette top).
If you are installing a short adapter or blank filler, proceed to “Short Adapter or Blank Filler Installation”
on page 74. If you are installing a long adapter, proceed to “Long Adapter Installation” on page 89.
2
1
1 Handle
2 Lock
Chapter 3. Removal and Replacement Procedures 73
Short Adapter or Blank Filler Installation
Use the following procedure to install a short adapter or blank filler.
1. Install the adapter or blank filler by doing the following:
a. Place the adapter or blank filler into the cassette so that the upper-left corner of the adapter
engages the adjustable top adapter-retaining clip as shown in the following figure.
4
3
1
2
1 Cassette Linkage
Assembly
3 Retaining Clip
2 Bottom of Tailstock 4 Handle
74 D10 and D20 Service Guide
b. Rotate the adapter so that the adapter engages the slot in the bottom adapter-retaining clip and
the top corner of the adapter is seated into the adjustable top adapter-retaining clip.
Note: If the adapter is not a full-height adapter, you must slide the adjustable top
adapter-retaining clip downward until the lower edge of the adapter is seated into the slot
on the bottom adapter-retaining clip.
3
2
1
1 Adapter 3 Bottom Retaining Clip
2 Slot
Chapter 3. Removal and Replacement Procedures 75
c. Slide the short-adapter retaining arm toward the adapter or blank filler on the cassette linkage rail.
1) If the short-adapter retaining arm has been removed from the cassette linkage rail, install the
short-adapter retaining arm and then slide the arm until it contacts the adapter or blank filler
as shown in the following figure.
d. Ensure that the adjustable top retainer clip catches the corner of the adapter as shown in the
following figure:
1
1 Short Adapter Top Retaining Clip
76 D10 and D20 Service Guide
e. Use the lower short adapter-retaining clip to engage and hold the bottom of the adapter.
Note: It might be necessary to apply pressure to engage and hold the bottom of the adapter.
1
1 Lower Short Adapter Arm Clip
f. To hold the top of the adapter, slide down the adjustable top adapter-retaining clip on the retaining
arm. Ensure that the bottom edge of the adapter is held by the lower part of the adapter retaining
arm.
1
1 Short Adapter Adjustable Retaining Clip
Chapter 3. Removal and Replacement Procedures 77
g. Press the lock on the handle, and rotate the adapter handle until it is in the down position
(adapter or blank filler moves up into the cassette assembly).
2
1
1 Handle
2 Lock
Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as
shown in the following figure:
1
1 Right End of Cassette Linkage
78 D10 and D20 Service Guide
h. Position the adapter and cassette assembly with the handle on the left (in the down position) and
the top facing away from you.
2
1
1 Top of Cassette
2 Handle
Chapter 3. Removal and Replacement Procedures 79
i. Install the cover on the cassette assembly:
1) Place the cassette cover on the cassette assembly as shown below. Slide the cover toward the
handle until the hole in the cover aligns with the hole in the cassette assembly.
1 2
1 Handle
2 Holes Aligned
80 D10 and D20 Service Guide
j. Install the bezel assembly using the following procedure.
1) Carefully slide the bezel onto the cassette assembly.
1
1 Bezel
2) Align and insert the cover arm latch in the hooked notch in the bezel.
Chapter 3. Removal and Replacement Procedures 81
3) Align the top of the bezel assembly into the grooves on the top of the cassette assembly. Push
the bezel onto the cassette linkage until the tab on the top of the bezel is seated in the recess
of the cassette assembly.
1 2 3
4
1 Recess 3 Tab
2 Groove 4 Bezel Assembly
4) Insert the two tabs on the bezel extension into the two slots on the cassette assembly.
1
1 Tabs
82 D10 and D20 Service Guide
k. Check for the following:
1) Ensure that the extension arm engages the pins on the cassette. You should be able to see
the three pins in the holes in the arm. The bezel tab should be seated as shown in the
following figure.
1
1
1
1 Pin
2) Ensure the cover arm latch is completely pressed into the hooked slot on the bezel as shown.
Chapter 3. Removal and Replacement Procedures 83
3) Turn the cassette assembly over so the cover is up. Check the cover latch to ensure it is
holding the bezel to the cover as shown.
1
1 Cover Latch
l. Ensure the holes are aligned, and insert the bushing as shown.
1
1 Bushing
84 D10 and D20 Service Guide
m. Insert the bushing lock pin into the hole in the bushing and push it in until it seats.
1
1 Bushing Lock Pin
n. By pulling on both sides, ensure the gray plastic locking bar on the handle is pulled into the
unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the
blank filler or adapter moves downward).
1 Handle
2 Gray Plastic Locking Bar
Chapter 3. Removal and Replacement Procedures 85
o. To install the metal EMC shield on the adapter bracket, do the following:
1) Ensure that the shield slides up inside the top of the cassette.
1 Metal EMC Shield
86 D10 and D20 Service Guide
2) The metal shield has clips that slide over the top of the tailstock. Ensure that these clips are
holding the EMC shield to the tailstock.
1
1 Top of Tailstock
p. Press the lock on the handle and rotate the adapter handle until the handle is in the down
position (adapter or blank filler moves up into the cassette assembly).
1
1 Handle
Chapter 3. Removal and Replacement Procedures 87
q. Using the system documentation, determine if the adapter you are installing is hot-swappable. If
the adapter is hot-swappable, move the slider on the color indicator to allow the orange to be
visible. If the adapter is not hot-swappable, the blue color is visible.
1 2
1 Orange for Hot-Swappable 2 Blue for Not Hot-Swappable
r. The adapter is ready to be installed into a system or an I/O drawer.
88 D10 and D20 Service Guide
Long Adapter Installation
Use the following procedure to install a long adapter.
Note: Ensure that both the retaining arms for the long and short adapters are removed from the cassette
linkage. Before continuing, see step 1a on page 67 to remove the arms.
1. Install the long adapter by doing the following:
a. Place the adapter into the cassette so that the upper left corner of the adapter engages the top
adjustable adapter-retaining clip as shown in the following figure:
5
4
2
3
1
1 Top Adapter-Retaining Clip 3 Adapter
2 Cassette Linkage Assembly 4 Bottom of Tailstock
5 Handle
Chapter 3. Removal and Replacement Procedures 89
b. Rotate the adapter so that the adapter engages the slot in the bottom retaining clip and the top
corner of the adapter is seated into the top adjustable adapter-retaining clip.
Note: If the adapter is not a full-height adapter, slide the top adjustable adapter-retaining clip
downward until the lower edge of the adapter is seated into the slot on the bottom retaining
clip.
2
1
3
1 Adapter 3 Bottom Retaining Clip
2 Slot
90 D10 and D20 Service Guide
c. Install the long adapter-retaining arm onto the cassette linkage rail, as shown in the following
figure, and slide the arm toward the adapter.
1 2
3
1 Cassette Linkage Assembly 3 Top Adapter Retaining Clip
2 Adapter Retaining Arm
Chapter 3. Removal and Replacement Procedures 91
d. Ensure that the top adjustable adapter-retaining clip catches the corner of the adapter as shown
in the following figures.
Long Arm Adjustable Retaining Clip
1 Long Arm Adjustable Retaining Clip
e. Ensure that the bottom edge of the adapter is held by the groove in the lower part of the
adapter-retaining arm.
1
2
1 Long Adapter Arm
Adjustable Retaining Clip
2 Long Adapter Retaining Arm Slot
92 D10 and D20 Service Guide
f. To hold the top of the adapter, slide down the top adjustable adapter-retaining clip on the retaining
arm.
1
1 Long Adapter Retaining Arm
g. Press the lock on the handle and rotate the adapter handle until it is in the down position (adapter
or blank filler moves up into the cassette assembly).
2
1
1 Handle
2 Lock
Chapter 3. Removal and Replacement Procedures 93
Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as
shown below:
1
1 Right End of Cassette Linkage
h. Position the adapter and cassette assembly with the handle on the left (in the down position) and
the top facing away from you.
1
2
1 Handle
2 Top of Cassette
94 D10 and D20 Service Guide
i. Install the cover on the cassette assembly by placing the cassette cover on the cassette assembly
as shown below. Slide the cover toward the handle until the hole in the cover aligns with the hole
in the cassette assembly.
1 2
1 Top of Cassette
2 Handle
Chapter 3. Removal and Replacement Procedures 95
j. Install the bezel assembly using the following procedure:
1) Carefully slide the bezel onto the cassette assembly.
1
1 Bezel
2) Align and insert the cover arm latch in the hooked notch in the bezel.
96 D10 and D20 Service Guide
3) Align the top of the bezel assembly into the grooves on the top of the cassette assembly and
then push the bezel onto the cassette linkage until the tab on the top of the bezel is seated in
the recess of the cassette assembly.
1 2 3
4
1 Recess 3 Tab
2 Groove 4 Bezel
4) Insert the two tabs on the bezel extension into the two slots on the cassette assembly.
1
1 Tabs
Chapter 3. Removal and Replacement Procedures 97
k. Check for the following:
1) Ensure that the extension arm engages the three pins on the cassette. You should be able to
see the pins in the holes in the arm. The bezel tab should be seated as shown in the following
figure.
1
1
1
1 Pins
2) Ensure the cover arm latch is completely pressed into the hooked slot on the bezel as shown.
98 D10 and D20 Service Guide
3) Turn the cassette assembly over so the cover is up and check the cover latch to ensure it is
holding the bezel to the cover as shown.
1
1 Cover Latch
l. Position the adapter with the cover side up. Ensure the holes are aligned and insert the bushing
as shown.
1
1 Bushing
Chapter 3. Removal and Replacement Procedures 99
m. Insert the bushing lock pin into the hole in the bushing. Push on the pin until it seats.
1
1 Bushing Lock Pin
n. By pulling on both sides, ensure the gray plastic locking bar on the handle is pulled into the
unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the
blank filler or adapter moves downward).
1 Handle
2 Gray Plastic Locking Bar
100 D10 and D20 Service Guide
o. To install the metal EMC shield on the adapter bracket, do the following:
1) Ensure that the shield slides up inside the top of the cassette.
1 Metal EMC Shield
Chapter 3. Removal and Replacement Procedures 101
2) The metal shield has clips that slide over the top of the tailstock. Ensure that these clips are
holding the EMC shield to the tailstock.
1
1 Top of Tailstock
p. Press the lock on the handle and rotate the adapter handle until the handle is in the down
position (adapter or blank filler moves up into the cassette assembly).
1
1 Handle
q. Using your system documentation, determine if the adapter you are installing is hot-swappable. If
the adapter is hot-swappable, move the slider on the color indicator to allow the orange to be
102 D10 and D20 Service Guide
visible. If the adapter is not hot-swappable, the blue color is visible.
1 2
1 Orange for Hot-Swappable
2 Blue for Not Hot-Swappable
r. The adapter is ready to be installed into a system or an I/O drawer. Refer to your system
documentation for more information.
Chapter 3. Removal and Replacement Procedures 103
D10 Fan
Before performing the following procedure, read “Safety Considerations” on page 41.
Hot-Swap Fan Assembly Removal
To remove a hot-swap fan, do the following:
1. Locate the cooling fan assembly on the front of the I/O subsystem.
2. Unscrew the thumbscrew that holds the fan into the I/O subsystem.
3. Pull the fan assembly straight out until it is clear of the I/O subsystem.
2 1
1 Fan 2 Thumbscrew
Hot-Swap Fan Assembly Replacement
To replace your subsystem’s hot-swap fan, do the following:
1. Position the fan with the connector (located on the fan housing) facing in toward the I/O subsystem.
2. Insert the replacement fan assembly into the I/O subsystem as far as possible.
3. Screw in the thumbscrew to hold the fan in position.
104 D10 and D20 Service Guide
D10 RIO Cable
Removal
1. Turn off the power to the system.
2. Release the RIO cable retainers.
3. Disconnect the RIO cable.
Replacement
Replace in reverse order.
D10 RIO Bus Adapter Assembly
Removal
Before performing the following procedure, read the “Safety Considerations” on page 41.
1. Turn off the system power.
2. Perform the procedure to put the I/O subsystem into the service position as described in “D10 Service
Position” on page 43.
3. Remove the service access cover. (See “D10 Covers” on page 44).
4. Remove the screws that attach the RIO riser card to the I/O subsystem chassis.
5. Carefully pull the card straight up and out of the slot.
Replacement
Replace in reverse order.
Chapter 3. Removal and Replacement Procedures 105
D10 I/O Backplane Assembly
Before performing the following procedure, read the “Safety Considerations” on page 41.
Note: The I/O backplane and the SPCN riser card are replaced as a pair.
Removal
1. Turn off the system power.
2. Put the I/O subsystem into the service position as described in “D10 Service Position” on page 43.
3. Label and remove the PCI adapters. (See “D10 PCI Adapters” on page 45).
4. Remove the service access cover. (See “D10 Covers” on page 44).
5. Disconnect the fan cable from the I/O backplane.
6. Remove the power supplies. (See “D10 Power Supply” on page 107).
7. Remove the RIO bus adapter. (See “D10 RIO Bus Adapter Assembly” on page 105).
8. Remove the two screws that hold the SPCN connector card, and remove it from the I/O backplane.
9. Remove the screws that hold the PCI adapter mounting-guide, and remove the guides from the I/O
backplane.
10. Remove the screws that secure the I/O backplane to the subsystem chassis.
11. Lift the I/O subsystem backplane straight up and out of the subsystem chassis.
Note: If you are replacing the I/O subsystem backplane, mark and remove the VPD module (see
“D10 I/O Backplane Locations” on page 12 for VPD module location) from the old backplane
and move it to the replacement I/O subsystem backplane.
Replacement
Note: If the power supplies are not removed from the I/O subsystem chassis, remove them before
replacing the I/O subsystem backplane.
1. Place the I/O subsystem backplane assembly into the chassis. Align the backplane with the mounting
screw holes.
Note: If you are replacing an I/O subsystem backplane with a new one, remove the VPD module
from the old backplane and move it to the replacement I/O subsystem backplane. For VPD
module location, see “D10 I/O Backplane Locations” on page 12. Put the module from the new
backplane onto the old backplane.
2. Position the PCI adapter mounting-guides on the I/O backplane, and install the screws to secure the
mounting-guides.
3. Secure the backplane with the remaining mounting screws removed earlier.
4. Reconnect the fan cable to the I/O backplane.
5. Install the new SPCN connector card on the new I/O subsystem backplane using the screws that you
removed earlier.
Note: The I/O backplane and the SPCN connector card are replaced as a pair.
6. Reinstall the RIO bus adapter card. (See “D10 RIO Bus Adapter Assembly” on page 105).
7. Reinstall the power supplies. (See “D10 Power Supply” on page 107).
8. Reinstall the cover. (See “D10 Covers” on page 44).
9. Reinstall all PCI adapters. (See “D10 PCI Adapters” on page 45).
10. Return the I/O subsystem to the operating position (see “D10 Operating Position” on page 43) and
connect RIO, SPCN, and adapter cables.
11. Connect the power cables.
12. Power the system on.
106 D10 and D20 Service Guide
D10 Power Supply
Before performing the following procedure, read the “Safety Considerations” on page 41.
Removal
Attention: Do not remove two power supplies at the same time if performing hot-plug procedures. Power
supplies are considered as hot-pluggable FRUs only if you remove one power supply at a time. The power
supplies can be removed from the front of the I/O subsystem.
1. Disconnect the power cord from the power supply.
2. Unlatch the power supply handle and rotate the handle downward to unseat the power supply.
Attention: Do not remove a power supply for more than four minutes. If you cannot replace the
power supply in less than four minutes, shut down the system and then remove the power supply.
3. Pull the power supply straight out from the I/O subsystem.
Replacement
To replace the power supply, perform the steps in the removal procedure in reverse order.
Chapter 3. Removal and Replacement Procedures 107
Removal and Replacement Procedures for the D20 I/O Subsystem
Notes:
1. All of the parts in the D20 I/O Subsystem can be replaced with the subsystem installed in a rack. The
power supplies, blowers, disk drives, and PCI adapters are hot-pluggable. Under normal conditions,
during the swapping of these FRUs, there is no need to shut down or unplug the system.
2. Before performing any of the removal or replacement procedures in this chapter, read the danger and
caution notices on “Safety Considerations” on page 41.
D20 FRU Replacement Procedure List
Field Replaceable Unit (FRU) FRU Procedure Name and Page Location
Adapters “D20 PCI Adapters” on page 123
Bezel “D20 Front Bezel” on page 112
Cover “D20 Service Access Cover” on page 111
Disk Drives “D20 Hot-Plug Disk Drives” on page 137
Disk Drive Backplanes “D20 Disk Drive Backplane” on page 144
Disk Drive 12 Pack Cage “D20 Disk Drive Cage” on page 142
Disk Drive Configuration or
Deconfiguration
“Deconfiguring (Removing) or Configuring a Disk Drive” on page 137
Blowers “D20 Blowers” on page 115
I/O Backplane “D20 I/O Backplane Assembly” on page 151
Operating Position “D20 Operating Position” on page 110
Operator Panel “D20 Operator Panel” on page 113
Power Supplies “D20 Power Supplies” on page 146
Power Bulkhead “D20 Power Supply Bulkhead” on page 148
RIO Bus Adapter “D20 RIO Bus Adapter Assembly” on page 120
Service Position “D20 Service Position” on page 109
Static-Sensitive Devices “Handling Static-Sensitive Devices” on page 42
Stopping and Starting the System “Stopping and Starting the System” on page 42
108 D10 and D20 Service Guide
D20 Service Position
Attention: When placing your D20 subsystem into the service position, all stability plates must be firmly
in position to prevent the rack from toppling. Ensure that only one system drawer is in the service position
at a time.
Before doing any service actions inside your D20 subsystem, put the D20 subsystem into the service
position.
To place the drawer into the service position, do the following:
1. Open the front rack door.
2. If your D20 subsystem is equipped with two blue thumbscrews securing it to the rack, remove the
thumbscrews at this time. The screws are located on the bezel, just above each subsystem release
latch.
1 Front Bezel
2 Transport Retaining Screw for System Drawer
3. Release the subsystem release latches located on the left and right side.
4. Pull the system drawer out from the rack until the rails are fully extended.
Note: When the system rails are fully extended, safety latches on the slide rails lock into place. This
action prevents the system from being accidentally pulled out too far and dropped. The following
illustration shows a system drawer in the service position.
Chapter 3. Removal and Replacement Procedures 109
1 D20 Subsystem 4 Stabilizing Bar
2 19-Inch Rack 5 Front Rack Door
3 Extension Rail Safety Release Latch 6 System Drawer Release Latch
After completing the service actions, return the D20 subsystem to the operating position.
D20 Operating Position
To return your D20 subsystem to the operating position, do the following:
1. Release the rail release latches located on each rail. See the illustration on page 110 for the location
of the release tabs.
Note: To release the left rail, pull the release latch up. To release the right rail, push the right latch
down.
2. Release latches must be in the open position. See the illustration on page 110 for the location of the
subsystem release latches.
3. Push the system drawer straight back into the rack until both release latches on the subsystem have
locked into position.
4. If you removed two thumbscrews from the front bezel, replace them at this time.
5. Close the front rack door.
110 D10 and D20 Service Guide
D20 Service Access Cover
Before performing the following procedure, read the “Safety Considerations” on page 41.
Opening the Service Access Cover
To open the service access cover, do the following:
1. Open the front rack door and place the D20 subsystem into the service position as described in “D20
Service Position” on page 109.
2. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration
for thumbscrew locations.
Note: The service access cover pivots on a piano hinge located directly behind the four cooling
blowers.
3. To open the service access cover, lift the cover up from the back edge. The hinges allow the service
access cover to swing open to about 170 degrees. Do not force the cover to come to a rest on top of
the cooling blowers.
Note: When you open the service access cover, ensure you have enough height clearance.
4. Close and then secure the service access cover with the three thumbscrews located on its back edge.
Attention: For proper cooling and airflow, close the cover before turning on the system. Operating the
system for extended periods of time (over 30 minutes) with the cover opened might damage the system
components.
1 Thumbscrews 3 D20 Subsystem
2 Service Access Cover
Service Access Cover Removal
If you need to remove the service access cover from the subsystem, do the following:
1. Open the cover as described in the previous procedure.
2. Remove the four retaining screws that secure the cover hinge to the chassis.
3. Remove the cover.
Service Access Cover Replacement
To install the service access cover, do the following:
1. Open the hinge, located on the service access cover.
2. Align the four holes located on the hinge with the four screw holes located on the top of the chassis.
3. Start and then tighten the four retaining screws that secure the service access cover to the chassis.
4. Close and then secure the service access cover with the three thumbscrews located on its back edge.
Chapter 3. Removal and Replacement Procedures 111
D20 Front Bezel
Before performing the following procedure, read the “Safety Considerations” on page 41.
Front Bezel Removal
To remove the front bezel, do the following:
1. Open the front rack door. If necessary, place the D20 subsystem into the service position as described
in “D20 Service Position” on page 109.
2. Simultaneously press in both bezel-release tabs. See the following illustration for bezel-release tab
locations.
3. Pivoting the bezel from the bottom, swing the top of the bezel out.
4. Pull the bottom of the bezel up, and then away from the subsystem chassis. This action releases the
two tabs located on the bottom of the bezel.
5. Put the bezel in a safe place.
1 Rack Door 3 Bezel-Release Tab (Located
on each side of bezel)
2 Bezel 4 19-Inch Rack
Front Bezel Replacement
To replace the front bezel, do the following:
1. Open the front rack door. If necessary, put the D20 subsystem into the service position as described in
“D20 Service Position” on page 109.
2. Insert the two tabs located on the bottom edge of the bezel into their locking slots, located on the
chassis.
3. Pivot the front bezel up toward the top of the chassis.
4. Align the release tabs to the matching slots located on the front of the subsystem chassis.
5. Gently push the tabs into the slots until the bezel seats against the front of the subsystem.
6. If the subsystem is in the service position, put the subsystem back into the operating position as
described in “D20 Operating Position” on page 110.
7. Close the rack door.
112 D10 and D20 Service Guide
D20 Operator Panel
Before performing the following procedure, read “Safety Considerations” on page 41.
1 Power LED (Green)
2 Attention LED (Amber)
3 Operator Panel
4 Snap Buttons
Operator Panel Removal
To remove the operator panel from the subsystem, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. If necessary, put the D20 subsystem into the service position as described in “D20 Service Position”
on page 109.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Remove the front bezel as described in “Front Bezel Removal” on page 112.
6. Locate the operator panel, as shown in the following illustration.
7. Pull out the two snap buttons located on each side of the operator panel.
8. Remove the operator panel by grasping its edges and slowly pulling it out of its bay.
9. Disconnect the power/signal cable connector from the back of the operator panel.
10. Put the operator panel in a safe place.
Chapter 3. Removal and Replacement Procedures 113
1 Rack Front Door 4 Snap Buttons
2 D20 5 Front Bezel
3 Operator Panel
Operator Panel Replacement
To replace the operator panel, do the following:
1. Connect the power/signal cable connector to the back of the operator panel.
2. Carefully slide the operator panel back into the system bay.
3. Ensure that the two snap buttons are in the unlocked position.
Note: Snap buttons are in the unlocked position when the button component is pulled back as far as
possible.
4. Push the operator panel into the bay until it comes to rest against the subsystem chassis.
5. Push in each snap button, locking the operator panel against the subsystem chassis.
6. Replace the front bezel as described in “Front Bezel Replacement” on page 112.
7. If the system is in the service position, return it to the operating position as described in “D20
Operating Position” on page 110.
8. Reconnect the power source to the system.
9. Power on the system as described in “Stopping and Starting the System” on page 42.
10. Close the rack front door.
114 D10 and D20 Service Guide
D20 Blowers
Before performing the following procedure, read “Safety Considerations” on page 41.
Each cooling blower assembly has a green LED and an amber LED. The green LED indicates when the
cooling blower is receiving power. A blinking amber LED indicates when the cooling blower needs attention
from a potential cooling problem. When you are standing in front of and facing the I/O subsystem, the
cooling blowers are labeled from left to right, 1 through 4, across the top blower retaining bracket.
Attention: If the I/O subsystem does not have two functional power supplies installed, do not remove a
blower with the power turned on. If only one power supply is functional, removing of a blower will cause
the system to turn power off.
Hot-Swap Blower Removal
To remove a hot-swap blower, do the following:
1. Put the D20 subsystem into the service position as described in “D20 Service Position” on page 109.
2. Locate the cooling blower assemblies, as shown in the following illustration.
3. Pull out the snap button located on the front flange of the failing blower.
4. Using the snap button as a handle, pull the front of the blower up, allowing the back edge to pivot on a
recessed ledge located just below the back edge of the blower.
Note: Using the snap button, lift the blower from the chassis. The blower connector will disconnect
from its docking connector, which is located on the underside of the blower.
5. Pull the blower away from the system. Ensure that the two back tabs, located on the bottom back edge
of the blower assembly, have cleared their retaining slots.
Chapter 3. Removal and Replacement Procedures 115
1 Green LED 4 Blower Connector
2 Amber LED 5 Hot-Plug Blower Assembly
3 Snap Button 6 Docking Connector
116 D10 and D20 Service Guide
Hot-Swap Blower Replacement
Note: If a blower assembly is being replaced for a redundant failure, after the service repair action is
completed, ask the customer to check the crontab file for any power/cooling warning messages.
When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a
warning message every 12 hours, to alert or remind the customer of the problem. Replacing the
faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this
entry, the customer continues to be reminded of the failure despite its having been repaired. The
crontab -l command reads the crontab file to determine if an entry exists. The crontab -e
command edits the file.
To replace a hot-swap blower, do the following:
1. Grasp the blower by its snap button, and ensure that the snap button is in the unlocked position.
Note: Snap buttons are in the unlocked position when the button component is pulled back as far as
possible.
2. Install the two alignment tabs located on the lower back edge of the blower assembly into the two
retaining slots located on the chassis ledge.
3. Pivoting from the back, lower the front of the blower onto its docking connector.
4. Push down on the front of the blower until the front flange of the blower is seated on top of the chassis
ledge.
5. Push down on the snap button, locking the blower in place.
6. Return the subsystem back to the operating position as described in “D20 Operating Position” on
page 110.
7. Close the rack door.
Chapter 3. Removal and Replacement Procedures 117
D20 Cooling Blower Cable
This procedure must be performed with the power turned off.
Cooling Blower Cable Removal
To remove the cooling blower cable, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Put the D20 subsystem into the service position as described in “D20 Service Position” on page 109.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Remove the four cooling blowers as described in “Hot-Swap Blower Removal” on page 115.
6. Open the service access cover as described in “Opening the Service Access Cover” on page 111.
7. Remove the power supplies and, if applicable, the power supply filler.
8. If necessary, remove the power supply bulkhead as described in “Power Supply Bulkhead Removal”
on page 148.
9. Disconnect the blower cable from the I/O board.
10. Unclip the three cable ties from the chassis and the three cable ties from the blower
connector-support bracket.
11. Remove the blower docking connectors from the chassis.
To remove each blower docking connector from the chassis, do the following:
a. Depress the connector release tab next to the blower docking connector. The connector release
tab has a small tab retaining clip that locks the blower docking connector into the blower
connector support bracket. Depress the connector release tab far enough so that the tab retaining
clip clears the bottom of the blower connector-support bracket.
b. With the connector release tab still depressed, slide the blower docking connector toward the
depressed tab.
c. Align the blower docking connector’s four retaining ears with the four small slots located on the
blower connector-support bracket.
d. Push down on the blower docking connector to free it from the blower connector-support bracket.
12. Remove the cable from the subsystem.
118 D10 and D20 Service Guide
1 Cooling Blowers 6 Connector Retaining Ears
2 Cable Ties 7 Blower Docking Connector
3 Power Supply 8 D20
4 Tab Retaining Clip 9 Blower Connector Support
5 Connector Release Tab
Cooling Blower Cable Replacement
Replace in reverse order.
Chapter 3. Removal and Replacement Procedures 119
D20 RIO Bus Adapter Assembly
Before performing this procedure, read “Safety Considerations” on page 41.
In the following RIO bus adapter removal and replacement procedures, the RIO bus adapter assembly is
referred to as the RIO adapter.
RIO Bus Adapter Removal
To remove the RIO adapter from the subsystem, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Put the D20 subsystem into the service position as described in“D20 Service Position” on page 109.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Open the service access cover as described in “Opening the Service Access Cover” on page 111.
6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem.
7. Identify, and then disconnect and label all cables that cross over the top of the RIO adapter. These
cables might interfere with the removal and installation of the RIO adapter.
8. Release the release latches located on top of the RIO adapter.
9. Simultaneously lift both retention handles.
10. Pivot the release handles up until they are perpendicular (90 degrees) to the top of the RIO adapter.
Note: By placing the handles perpendicular to the top of the RIO adapter, the base or hinged portion
of each handle acts as a cam and will gently pry the RIO adapter up, disconnecting it from its
docking connector.
11. Remove the RIO adapter from the subsystem chassis, and put it in a safe place.
120 D10 and D20 Service Guide
1 RIO Bus Adapter Release Handle 4 Service Access Cover
2 RIO Bus Adapter Release Latch 5 D20 Subsystem
3 RIO Bus Adapter
Chapter 3. Removal and Replacement Procedures 121
RIO Bus Adapter Replacement
To replace the RIO bus adapter, do the following:
1. Grasp the two RIO adapter release handles.
2. Pivot both handles upward to 90 degrees, ensuring that the handles are perpendicular to the RIO
adapter. The handle cams have now been placed into the correct position to assist you when seating
the RIO adapter into its docking connector.
3. Before inserting the RIO adapter into its bay, observe the alignment bracket. The alignment bracket is
secured to the power bulkhead.
4. Insert the RIO adapter into its bay. Ensure that the power cable receptacle located on the back of the
RIO adapter is facing the back of the subsystem chassis.
5. Lower the RIO adapter through the alignment bracket. The alignment bracket will catch the back edge
of the RIO adapter closest to it.
The RIO adapter should now be resting on the top of its docking connector. The docking connector
has two large alignment pins located on each end. These alignment pins will ensure alignment of the
RIO adapter to its docking connector when seated.
6. Lower the RIO adapter locking handles, carefully seating the RIO adapter into the docking connector.
The plastic latch located beneath each handle clicks when the RIO adapter is fully seated. This click
also indicates that the handle is locked in the closed position.
7. Reconnect the RIO-2 cables to the RIO adapter connectors located on the back of the chassis.
8. Reconnect the cables that were disconnected during the RIO adapter removal.
9. Reconnect the power source to the system.
10. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
11. Return the D20 to the operating position as described in “D20 Operating Position” on page 110.
12. Power on the system as described in “Stopping and Starting the System” on page 42.
13. Close the rack front door.
122 D10 and D20 Service Guide
D20 PCI Adapters
Before performing this procedure, read “Safety Considerations” on page 41.
Notes:
1. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support along the
back edge of the card. In this system unit, you must remove the blue handle or support from the card.
2. With this system, you can install PCI adapters with the power on. These adapters are referred to as
hot-pluggable PCI adapters. Some adapters are not hot-pluggable, and the system must be powered
off for this type of adapter installation.
Before you install the adapter, determine if the PCI adapter you are installing is hot-pluggable. See the PCI
Adapter Placement Reference, order number SA23-2504.
If you are removing a PCI adapter that is:
v Non-hot-pluggable, go to “Non-Hot-Pluggable PCI Adapter Removal” on page 126.
v Hot-pluggable, go to “Hot-Pluggable PCI Adapter Removal” on page 129.
PCI Hot-Plug Manager Access
The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is
appropriate to perform them.
Note: A PCI adapter is only hot-pluggable if the PCI adapter is supported for hot-plug applications. For
complete information regarding your PCI adapter, see the PCI Adapter Placement Reference, order
number SA23-2504.
Accessing Hot-Plug Management Functions:
Note: Removing or installing a hot-pluggable PCI adapter requires the system administrator to take the
PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices
attached to the adapter must be taken offline as well. This action prevents a service representative
or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the
CD-ROM drive.
Chapter 3. Removal and Replacement Procedures 123
To access the hot-plug menus, do the following:
1. Log in as root user.
2. At the command line, type smitty.
3. Select Devices.
4. Select PCI Hot Plug Manager and press Enter.
5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The
following section describes the menu options.
PCI Hot-Plug Manager Menu: The following options are available from the PCI Hot Plug Manager menu:
Note: For information about the PCI slot LED states, see “D20 PCI-X Slots” on page 16.
List PCI Hot-Plug Slots
Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it
holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the
adapter in that slot.
Add a PCI Hot-Plug Adapter
Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will
be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot
will go into the Action state and finally into the On state.
Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure
Devices Added After IPL option to configure the adapter.
Replace/Remove a PCI Hot-Plug Adapter
Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For
this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option).
You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the
Action state.
Identify a PCI Hot-Plug Slot
Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “D20 PCI-X
Slots” on page 16.
124 D10 and D20 Service Guide
Unconfigure a Device
Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use.
This step must be completed successfully before starting any removal or replacement operation. If this
step fails, the customer must take action to release the device.
Configure a Defined Device
Allows a new PCI adapter to be configured into the system if software support is already available for the
adapter. The selected PCI slot will go into the On state.
Install/Configure Devices Added After IPL
The system attempts to configure any new devices and tries to find and install any required software from
a user-selected source.
The add, remove, and replace functions return information to the user indicating whether the operation
was successful. If additional instructions are provided on the screen, complete the recommended actions.
If the instructions do not resolve the problem, do the following:
v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to
configure the adapter.
v If you receive a warning indicating that needed device packages are not installed, the system
administrator must install the specified packages before you can configure or diagnose the adapter.
v If you receive a failure message indicating a hardware error, the problem might be either the adapter or
the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different
adapter in the slot. If you determine that you have failing hardware, call your service representative.
v Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP
clustering. Consult with your system administrator or software support to determine the correct method
to configure the replacement device.
Chapter 3. Removal and Replacement Procedures 125
Non-Hot-Pluggable PCI Adapter Removal
To remove a PCI adapter, do the following:
1. Open the front and rear rack doors, and place the subsystem into the service position as described in
“D20 Service Position” on page 109.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Disconnect the power source to the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the system power source has been completely disconnected.
4. Open the service access cover as described in “Service Access Cover Removal” on page 111.
5. Determine which adapters you plan to remove, and then label and disconnect all cables attached to
that adapter.
6. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the subsystem unit.
7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
1 Hot-Plug PCI Adapters
2 Service Access Cover
3 D20 Subsystem
126 D10 and D20 Service Guide
8. Rotate the adapter retainer clip counterclockwise, as shown in the following illustration.
9. Lift the adapter retaining seat that is resting on the adapter retention bracket.
10. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system.
11. Store the adapter in a safe place.
12. If you do not plan to install another adapter into the vacated slot, seal the expansion slot using an
expansion-slot cover. If you are installing another adapter, go to “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 128.
13. If you have other options to install, refer to “D20 FRU Replacement Procedure List” on page 108. If
you do not have other options to install, continue on to the next step.
14. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
15. Connect the adapter cables.
16. Reconnect the power source to the system.
17. Route the cables through the cable-management arm.
18. Push the system drawer back into the operating position as described in “D20 Operating Position” on
page 110.
19. Power on the system as described in “Stopping and Starting the System” on page 42.
20. Close the rack doors.
1 D20
2 Service Access Cover
3 Retainer Clip
4 Retainer Seat
5 PCI Adapter
Chapter 3. Removal and Replacement Procedures 127
Adding or Replacing a Non-Hot-Pluggable PCI Adapter
To add or replace an adapter, do the following:
1. Open the front and rear rack doors, and place the system into the service position as described in
“D20 Service Position” on page 109.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Disconnect the power source to the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the system has been completely disconnected from its power source.
4. Open the service access cover as described in “Service Access Cover Removal” on page 111.
5. Refer to the PCI Adapter Placement Reference for information regarding slot restrictions for adapters
that can be used in this subsystem.
6. If necessary, remove the adapter expansion slot shield.
7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
8. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
9. Place the adapter, component-side up, on a flat, static-protective surface.
10. Set any jumpers or switches as instructed by the adapter’s manufacturer.
11. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the PCI riser card.
12. Press the adapter firmly into its connector.
Attention: When you install an adapter into your subsystem, be sure that it is completely and
correctly seated in its connector.
13. Lower the adapter retaining seat onto the PCI adapter EMC shield/connector faceplate. Rotate the
adapter retainer clip over the adapter retaining seat until it comes to a stop.
14. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
15. Connect the adapter cables.
16. Reconnect the power source to the system.
17. Route the cables through the cable-management arm.
18. Push the system drawer back into the operating position as described in “D20 Operating Position” on
page 110.
19. Power on the system as described in “Stopping and Starting the System” on page 42.
20. Close the rack doors.
128 D10 and D20 Service Guide
Hot-Pluggable PCI Adapter Removal
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter
offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the
devices attached to the adapter must also be taken offline. This action prevents a service
representative or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the
CD-ROM drive.
To remove a hot-pluggable PCI adapter, do the following:
1. Open the front and rear rack doors, and place the system into the service position as described in
“D20 Service Position” on page 109.
2. Open the service access cover as described in “Service Access Cover Removal” on page 111.
3. Determine which adapters you plan to remove.
4. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system unit.
5. Ensure that any processes or applications that might use the adapter are stopped.
6. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
7. Select Unconfigure a Device and press Enter.
8. Press F4 to display the Device Names menu.
9. Select the adapter you are removing.
10. Use the Tab key to answer NO to Keep Definition and YES to Unconfigure Child Devices. Press
Enter.
11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful
unconfiguration is indicated by the OK message displayed next to the Command field at the top of the
screen.
12. Label and disconnect all cables attached to that adapter.
13. Press F3 to return to the PCI Hot-Plug Manager menu.
Chapter 3. Removal and Replacement Procedures 129
14. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI
Hot-Plug Adapter menu displays.
Note: If the PCI Hot-Plug Adapter menu indicates that the slot for the just replaced adapter is
reading empty, do the following:
a. From the Replace/Remove a PCI Hot-Plug Adapter menu, select F3 to cancel.
b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI
slot is empty.
c. On the display screen, select the indicated empty slot. Look at the adapter you intend to replace.
A slow blinking amber LED located next to the adapter on the PCI riser card is indicating that the
slot has been identified. Press Enter. The adapter, has been put into the action state.
d. Completely remove the hot-plug adapter from the slot. The LED goes off.
e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter.
f. Select F3 to return to the PCI Hot-Plug Adapter Menu.
g. Select Install/Configure Devices Added after IPL.
h. At the display screen, identify the PCI hot-plug slot that your adapter is using. If it still shows as
being empty, redo steps a through g. However, before you reinstall the adapter, test it. If the
adapter fails the test, redo steps a through g with a new adapter.
15. Move the cursor to select the adapter that you are removing and press Enter. (The description entry
displays as unknown).
16. Press the Tab key until the entry field displays the remove operation and then press the Enter key.
Follow the instructions that display on the screen until you are instructed to remove the adapter.
17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
18. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
1 Hot-Plug PCI Adapters
2 Service Access Cover
3 D20 Subsystem
130 D10 and D20 Service Guide
19. Turn the retainer clip and lift the adapter retaining seat off the adapter.
1 D20
2 Service Access Cover
3 Retainer Clip
4 Retainer Seat
5 PCI Adapter
20. Carefully grasp the adapter by the edges and pull it straight out from the PCI riser card.
21. If you are not installing another adapter in this slot, place an expansion slot cover in the adapter slot
opening.
22. Lower the plastic retainer seat over the PCI adapter faceplate.
23. Rotate the locking latch clockwise until it clicks into the locked position.
24. Continue to follow the screen instructions until you receive a message that the adapter removal is
successful. Successful removal is indicated by the OK message displayed next to the Command field
at the top of the screen.
25. If you do not have other adapters to remove, continue with the next step.
OR
If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu,
and then return to step 14 on page 130.
26. Press F10 to exit the Hot-Plug Manager.
27. If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
Chapter 3. Removal and Replacement Procedures 131
28. If you have other options to install, refer to “D20 FRU Replacement Procedure List” on page 108. If
you do not have other options to install, continue on to the next step.
29. Replace the service access cover as described in “Service Access Cover Replacement” on page 111.
30. Push the system drawer back into the operating position as described in “D20 Operating Position” on
page 110.
31. Connect the adapter cables.
32. Route the cables through the cable-management arm.
33. Close the rack doors.
Replacing a Hot-Pluggable PCI Adapter
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Notes:
1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are
replacing an adapter with an adapter that is not identical to the adapter removed, go to “Hot-Pluggable
PCI Adapter Removal” on page 129 and “Installing a Hot-Pluggable PCI Adapter” on page 135.
2. Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter
offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the
devices attached to the adapter must be taken offline as well. This action prevents a service
representative or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system configuration,
see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM
drive.
To replace a hot-plug PCI adapter, do the following:
1. Open the front and rear rack doors, and place the system into the service position as described in
“D20 Service Position” on page 109.
2. Remove the service access cover as described in “Service Access Cover Removal” on page 111.
3. Refer to the PCI Adapter Placement Reference, order number SA23-2504, for information regarding
slot restrictions for adapters that can be used in this system.
4. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
5. Determine the slot from which you are removing the adapter.
6. Ensure that any processes or applications that might use the adapter are stopped.
7. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
132 D10 and D20 Service Guide
8. Select Unconfigure a Device and press Enter.
Note: If the adapter or device has a problem that prevents it from being replaced by hot-plug, it must
be replaced by shutting down the system. Go to “Adding or Replacing a Non-Hot-Pluggable
PCI Adapter” on page 128.
9. Press F4 to display the Device Names menu.
10. From the menu, select the adapter you are removing.
11. Use the tab key to answer YES to Keep Definition and YES to set Unconfigure Child Devices.
Press Enter.
12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful
unconfiguration is indicated by the OK message displayed next to the Command field at the top of the
screen.
13. Press F3 to return to the PCI Hot-Plug Manager menu.
14. Select List PCI Hot-Plug Slots and press Enter.
Note: If the List PCI Hot-Plug Slots menu indicates that the slot for the adapter you intend to replace
or reseat is empty, do the following:
a. Press F3 to cancel.
b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI
slot is empty.
c. On the display screen, select the indicated empty slot. Examine at the adapter you intend to
replace or reseat. A slow blinking amber LED located on the PCI riser card is indicating that the
slot has been identified. Press Enter. The adapter is now in the action state.
d. Completely remove the hot-plug adapter from the slot. The LED goes off.
e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter.
f. Press F3 to return to the ²PCI Hot-Plug Adapter Menu².
g. Connect the appropriate cables and devices to the adapter.
h. Select List PCI Hot-Plug Slots, then press Enter.
i. Return to the error code or procedure that sent you here, if the ²List PCI Hot-Plug Slots² menu
indicates that the slot for the just-replaced or reseated adapter is empty.
15. Press F3 to cancel.
16. Select Replace/Remove a PCI Hot-Plug Adapter, then press Enter. The ²Replace/Remove a PCI
Hot-Plug Adapter menu displays.
Note: If the PCI Hot-Plug Adapter menu indicates that the slot for the just replaced adapter is
reading empty, do the following:
a. From the Replace/Remove a PCI Hot-Plug Adapter menu, press F3 to cancel.
b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI
slot is empty.
c. On the display screen, select the indicated empty slot. Examine at the adapter you intend to
replace. A slow blinking amber LED located on the PCI riser card is indicating that the slot has
been identified. Press Enter. The adapter has been put into the action state.
d. Completely remove the hot-plug adapter from the slot. The LED goes off.
e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter.
f. Press F3 to return to the PCI Hot-Plug Adapter Menu.
g. Select Install/Configure Devices Added after IPL.
h. At the display screen, identify the PCI hot-plug slot that your adapter is using. If it still shows as
being empty, redo steps a through g. However, before you reinstall the adapter, test it. If the
adapter fails the test, redo steps a through g with a new adapter.
Chapter 3. Removal and Replacement Procedures 133
17. Move the cursor to select the adapter that you are removing, and press Enter.
18. Press the Tab key until the entry field displays the replace operation, and then press Enter. Follow the
instructions that display on the screen until you are instructed to remove the adapter.
19. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
20. Turn the retainer clip, and lift the adapter retaining seat off the top of the adapter.
21. Carefully grasp the adapter by the edges, and pull it straight up and out of the subsystem.
22. If necessary, remove the replacement adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
1 D20
2 Service Access Cover
3 Retainer Clip
4 Retainer Seat
5 PCI Adapter
23. Place the adapter, component-side up, on a flat, static-protective surface.
24. Set any jumpers or switches as instructed by the adapter manufacturer.
25. Replace the adapter into the adapter slot. Carefully grasp the adapter by the edges and align the
adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing
a full-length adapter, ensure that both ends of the adapter engage the card guides.
26. Lower the adapter retaining seat over the PCI adapter faceplate. Rotate the retainer clip until it stops.
27. Connect appropriate cables and devices to the adapter.
28. Continue to follow the screen instructions until you receive a message that the replacement is
successful. Successful replacement is indicated by the OK message displayed next to the Command
field at the top of the screen.
29. Press F3 to return to the PCI Hot-Plug Manager menu.
134 D10 and D20 Service Guide
30. Select Install/Configure Devices Added After IPL and press Enter. Follow the instructions on the
screen. Successful replacement is indicated by the OK message displayed next to the Command field
at the top of the screen.
Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP
clustering. Consult with your system administrator or software support to determine the correct
method to configure the replacement device.
31. If you do not have other adapters to replace, continue with the next step.
OR
If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 14 on page 133.
32. Press F10 to exit the Hot-Plug Manager.
If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
33. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
34. Connect the adapter cables.
35. Route the cables through the cable-management arm.
36. Push the system drawer back into the operating position as described in “D20 Operating Position” on
page 110.
37. Close the rack doors.
Installing a Hot-Pluggable PCI Adapter
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Note: This system supports PCI hot-pluggable adapters, so the system does not need to be shut down
and the power cables removed before adding or replacing an adapter.
To add or install an adapter, do the following:
1. Open the front and rear rack doors, and place the system into the service position as described in
“D20 Service Position” on page 109.
2. Open the service access cover as described in “Service Access Cover Removal” on page 111.
3. Refer to the PCI Adapter Placement Reference, order number SA23-2504, for information regarding
slot restrictions for adapters that can be used in this system.
4. If necessary, remove the adapter expansion slot shield.
5. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
6. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
7. Place the adapter, component-side up, on a flat, static-protective surface.
8. Set any jumpers or switches as described by the adapter’s manufacturer.
Chapter 3. Removal and Replacement Procedures 135
9. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
10. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The
Add a Hot-Plug Adapter window displays.
11. See the PCI Adapter Placement Reference, order number SA23-2504, for adapter placement
information. Then select an empty PCI slot for the adapter.
12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter.
13. Rotate the retainer clip, lift the adapter retainer seat, and remove the expansion slot cover.
1 D20
2 Service Access Cover
3 Retainer Clip
4 Retainer Seat
5 PCI Adapter
14. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is
set to the Action state. See “D20 PCI-X Slots” on page 16.
15. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the
edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector.
If you are installing a full-length adapter, ensure that both ends of the adapter engage the card
guides.
16. Lower the adapter retainer seat onto the PCI adapter faceplate. Rotate the adapter retainer clip until it
stops.
17. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
18. Connect the adapter cables.
19. Route the adapter cables through the cable-management arm.
20. Push the system drawer back into the operating position as described in “D20 Operating Position” on
page 110.
21. Close the rack doors.
136 D10 and D20 Service Guide
D20 Hot-Plug Disk Drives
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Before performing the following procedure, read “Safety Considerations” on page 41.
Note: You do not have to power off the system to remove a hot-plug disk drive. Before you perform these
procedures, ensure that the customer has taken appropriate actions to back up the data for the
drive you are removing, and that the drive has been removed from the configuration. Physically
removing a hot-plug disk drive from the system before it has been removed from the system
configuration can cause unrecoverable data corruption.
Deconfiguring (Removing) or Configuring a Disk Drive
Use this procedure if you are removing or installing a drive while the system power is turned on.
Deconfiguring (Removing)
Before removing a disk drive from a volume group, do the following:
1. Log in as root user.
2. Before a disk can be removed from a volume group, all data must be backed up and the data
removed from the disk to be replaced. Use AIX Logical Volume Manager to perform this task.
3. At the command line, type smitty.
4. Select System Storage Manager (Physical and Logical Storage) and press Enter.
5. Select Logical Volume Manager and press Enter.
6. Select Volume Groups and press Enter.
7. Select Set Characteristics of a Volume Group and press Enter.
8. Select Remove a Physical Volume from a Volume Group.
9. Press F4 to list the available volume groups, then select the volume group name and press Enter.
10. Press F4 to select a physical volume, and follow the instructions on the screen to select the physical
volume. Then press Enter.
11. Press F3, Cancel, to go back to the first menu and select System Storage Management (Physical
and Logical Storage).
12. Select Removable Disk Management.
13. Select Remove a Disk.
14. Select the desired disk from the list on the screen and press Enter.
Chapter 3. Removal and Replacement Procedures 137
15. Follow the instructions on the screen to remove the drive.
16. When you are asked ²Are you sure?², press Enter. The power LED on the drive that you selected will
remain on.
17. Remove the disk drive by pulling the disk drive lever toward you until it is completely open.
18. Remove the disk drive from the slot. The LED on the top of the slot will turn off when the disk drive is
removed.
19. Press F10 to exit smitty.
Configuring (Replacing)
1. Remove the disk drive from its protective packaging, and open the drive latch handle.
2. Install the disk drive in the drive slot. Align the disk drive with the drive slot rails, and slide the disk
drive into the slot until it contacts the backplane at the rear of the drive bay. The drive should be in
far enough for the latch handle to engage the latch. Push the disk drive lever up and to the rear to
lock the disk drive. The LED above the slot will turn on.
3. Log in as root user.
4. At the command line, type smitty.
5. Select Devices.
6. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is
indicated by the OK message displayed next to the Command field at the top of the screen
7. Press F3, Cancel, to go back to the first menu and select System Storage Management (Physical
and Logical Storage) and press Enter.
8. Select Logical Volume Manager and press Enter.
9. Select Volume Groups and press Enter.
10. Select Set Characteristics of a Volume Group and press Enter.
11. Select Add a Physical Volume to a Volume Group.
12. Fill in the fields for the drive you are adding to the system. Press F4 for a list of selections.
13. See the AIX System Management Guide: Operating System and Devices to finish the drive
configuration.
14. Press F10 to exit smitty.
138 D10 and D20 Service Guide
Hot-Plug Disk Drive Removal
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Attention: Before you perform the following procedure, ensure that the customer has taken appropriate
actions to back up the data for the drive you are removing, and that the drive has been removed from the
configuration. Physically removing a hot-plug drive from the system before it has been removed from the
system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 137.
To remove a hot-plug disk drive, do the following:
1. Before removing the disk, ensure that all data has been saved.
2. Deconfigure the drive that you are removing from the system. See “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 137.
3. Open the rack front door.
4. Remove the front bezel as described in “Front Bezel Removal” on page 112.
5. Locate the hot-plug disk drive you plan to remove. If you are removing a faulty disk drive, a lit amber
LED located at the front right side of each disk drive will help you isolate the faulty drive. Before
physically removing any disk drive, do the following:
a. Log in as root user.
b. At the command line, type smitty.
c. Select System Storage Management (Physical and Logical Storage).
d. Select Removable Disk Management.
e. Select Remove a Disk.
f. Select the desired disk from the list on the screen and press Enter.
6. Remove the disk drive by placing the handle on the disk drive into the open position (perpendicular to
the drive).
7. Pull the hot-plug disk drive out of the disk drive bay. The lit LED on the disk drive will turn off when the
disk drive is removed.
8. Press F10 to exit smitty.
Chapter 3. Removal and Replacement Procedures 139
1 Rack Front Door 3 Disk Drive Release Handle
2 D20 4 Front Bezel
140 D10 and D20 Service Guide
Hot-Plug Disk Drive Replacement
Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not
support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more
partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux
partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition
with Linux installed.
Attention: Before you perform these procedures, ensure that the customer has taken appropriate
actions to back up the data for the drive you are removing, and that the drive has been removed from the
configuration. Physically removing a hot-plug drive from the system before it has been removed from the
system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 137.
Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering.
Consult with your system administrator or software support to determine the correct method to configure
the replacement disk.
1. To install a hot-plug disk drive, do the following:
a. Ensure the tray handle is open (perpendicular to the drive).
b. Align the drive/tray assembly so that it engages the guide rails in the bay.
c. Push the drive assembly into the bay until the tray handle engages the lock mechanism.
d. Push the tray handle in until it locks. The lit LED on the disk drive will turn on.
2. Log in as root user.
3. At the command line, type smitty.
4. Select Devices.
5. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is
indicated by the OK message displayed next to the Command field at the top of the screen.
6. Press F10 to exit smitty.
7. Replace the front bezel as described in “Front Bezel Replacement” on page 112.
8. Close the rack doors.
Chapter 3. Removal and Replacement Procedures 141
D20 Disk Drive Cage
Before performing the following procedure, read “Safety Considerations” on page 41.
Disk Drive Cage Removal
To remove the disk drive cage, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Place the system into the service position as described in “D20 Service Position” on page 109.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Remove the front bezel as described in “Front Bezel Removal” on page 112.
6. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to
minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
7. Label and then remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on
page 139.
8. Remove the 8 retaining screws that secure the disk drive cage to the front of the system chassis. See
the following illustration.
9. Pull the disk drive cage assembly out of the subsystem.
Note: When pulling the disk drive cage out of the subsystem, use the center post and pull straight
back toward you. Do Not grab, hold or pull on the light pipes.
142 D10 and D20 Service Guide
1 D20 5 Retaining Screws (Quantity 8)
2 Disk Drive Cage 6 Light Pipes
3 Disk Drives 7 Light Pipe Support Bracket
4 Front Bezel
Disk Drive Cage Replacement
To replace the disk drive cage, go the following:
1. Insert the disk drive cage assembly into the subsystem.
2. Insert the 8 retaining screws that secure the disk drive cage to the front of the system chassis.
3. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 141.
4. Replace the front bezel as described in “Front Bezel Replacement” on page 112.
5. Reconnect the power source to the system. If the system has two power supplies, two power cables
need to be reconnected.
6. Put the system back into the operating position as described in “D20 Operating Position” on page 110.
7. Start the system as described in “Stopping and Starting the System” on page 42.
8. Close the rack door.
Chapter 3. Removal and Replacement Procedures 143
D20 Disk Drive Backplane
Before performing the following procedure, read “Safety Considerations” on page 41.
Disk Drive Backplane Removal
To remove the disk drive backplane, do the following:
1. Remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on page 139.
2. Remove the four cooling blowers as described in “Hot-Swap Blower Removal” on page 115.
Note: Removing the cooling blowers enables you to view the disk drive backplanes.
3. Disconnect the cables connected to the disk drive backplane.
4. Remove the 8 retaining screws that secure the disk drive cage as described in “Disk Drive Cage
Removal” on page 142.
5. Pull the disk drive cage assembly out of the system.
6. Remove the 2 retaining screws that secure the disk drive backplane to the subsystem chassis.
7. Grasp the disk drive backplane from the bottom of one of the connectors and pull it out of the chassis,
bottom first.
1 Cooling blowers 4 Disk Drive Cage
2 D20 5 Disk Drives
3 Disk Drive Backplane 6 Front Bezel
144 D10 and D20 Service Guide
Disk Drive Backplane Replacement
To replace the disk drive backplane, do the following:
1. Reconnect the power/signal cable to the connector located on the top edge of the disk drive
backplane.
2. Grasp a bottom edge of one of the connectors located on the disk drive backplane, and insert the
backplane into the disk drive bay.
3. Using the 3 supplied retaining screws, secure the disk drive backplane to the subsystem chassis.
4. Carefully insert the disk drive cage assembly into the system.
5. Align and then secure the disk drive cage to the subsystem chassis with the 7 previously removed
retaining screws.
6. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
7. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 141.
8. Reconnect the power source to the system.
9. Replace the front bezel as described in “Front Bezel Replacement” on page 112.
10. Power on the system as described in “Stopping and Starting the System” on page 42.
11. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
12. Return the subsystem into the operating position as described in “D20 Operating Position” on
page 110.
13. Close the rack front door.
Chapter 3. Removal and Replacement Procedures 145
D20 Power Supplies
DANGER
Do not attempt to open the covers of the power supply. Power supplies are not serviceable and
are to be replaced as a unit.
D02
The D20 subsystem has power supplies that have hot-swap capabilities. To use this feature, you must
have two power supplies in place.
Power Supply Removal
D20 subsystems configured at the factory with one power supply have a power supply filler panel. The
filler panel protects the vacant power supply connector from dust or damage. If you plan to remove one of
the power supplies from a subsystem configured with two power supplies, obtain the power supply filler
panel. The part number for the power supply filler panel is 53P0233.
Note: You need not shut down the system to change a subsystem power supply.
Before performing any of the following procedure, read and understand all of the safety notices beginning
with “Safety Considerations” on page 41.
To remove a power supply from your subsystem, do the following:
1. Put the subsystem into the service position as described in “D20 Service Position” on page 109.
2. Open the service access cover as described in “D20 Service Access Cover” on page 111.
3. If your subsystem is configured with two power supplies, visually check the power supply LEDs. If one
of the power supplies has an amber LED that is lit or blinking, suspect that power supply to be faulty.
4. Remove the power cable from the rear of the power supply. Wait at least 10 seconds before
performing the next step.
5. Release, and then lift the power supply handles.
6. Lift the handles until they are perpendicular (90 degrees) to the top of the power supply. By placing the
handles perpendicular to the top of the power supply, the base or hinged portion of each handle acts
as a cam and will gently pry the power supply from its docking connector located on the subsystem
system board.
146 D10 and D20 Service Guide
7. Carefully remove the power supply from the subsystem.
1
2 3 4
6
5
1 Power Supplies 4 Green DC Good LED
2 Green Power LED 5 Power Supply Filler
3 Amber Fault/Identify LED 6 D20 Subsystem
Note: During normal operation, each power-supply bay must have either a power supply or filler panel
installed for proper cooling.
Power Supply Replacement
To replace a power supply, do the following:
1. Ensure the handles are in the unlocked position and perpendicular to the power supply.
2. Carefully insert the new power supply into the power supply bay.
3. Press the power supply onto the docking connectors.
4. Put the power supply handles into the locked position.
Note: The handles act as a cam and will correctly seat the power supply.
5. Reconnect the power source to the replaced power supply. The green LED located on the power
supply comes on solid.
6. Close and then secure the service access cover with the three thumbscrews located on its back edge.
7. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on
page 110.
8. If your system has successfully powered on, close the rack doors.
Chapter 3. Removal and Replacement Procedures 147
D20 Power Supply Bulkhead
Before performing the following procedure, read the “Safety Considerations” on page 41.
Power Supply Bulkhead Removal
To remove the power supply bulkhead, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Put the D20 subsystem into the service position.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Open the service access cover as described in “Opening the Service Access Cover” on page 111.
6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem.
7. Disconnect the power cables from the power supplies
8. Label and then disconnect all PCI adapter cables from their respective adapters.
9. Remove the power supplies or power supply filler as described in “Power Supply Removal” on
page 146.
10. Remove the RIO bus adapter as described in “RIO Bus Adapter Removal” on page 120.
11. Remove all PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 126.
Note: Alternately remove the PCI dividers as you are removing the PCI adapters.
12. Remove the three upper-retaining screws that secure the top edge of the power supply bulkhead to
the subsystem chassis.
13. Remove the three lower-retaining screws that secure the bottom portion of the power supply bulkhead
to the subsystem chassis.
14. Remove the power supply bulkhead by lifting it straight up and out of the subsystem.
148 D10 and D20 Service Guide
1 Power Supply Filler 5 PCI-X Adapters
2 Power Supply Bulkhead 6 D20
3 PCI Dividers 7 Power Supplies
4 RIO Bus Adapter
Chapter 3. Removal and Replacement Procedures 149
Power Supply Bulkhead Replacement
To replace the power supply bulkhead, do the following:
1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
2. Insert the power supply bulkhead into the subsystem.
3. Align the three screw holes located on the top of the power supply bulkhead with the three threaded
screw holes located on the subsystem chassis.
4. Align the three screw holes located on the bottom of the power supply bulkhead with the three
threaded screw holes located on the subsystem chassis base.
5. Tighten the six retaining screws for the power supply bulkhead.
6. Replace the power supplies or the power supply filler as described in “Power Supply Replacement”
on page 147.
7. Reconnect the power cables to the power supplies.
8. Replace the RIO bus adapter as described in “RIO Bus Adapter Replacement” on page 122.
9. Reconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem.
10. Replace all of the previously removed PCI adapters as described in “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 128.
Note: When you are reinstalling the PCI adapters, alternately reinstall the PCI dividers.
11. Reconnect all PCI adapter cables to their respective adapters.
12. Reconnect the power source to the system. The green LED located on the power supply comes on
solid.
13. Close then secure the service access cover with the three thumbscrews located on its back edge.
14. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on
page 110.
15. After the system has successfully powered on, close the rack doors.
150 D10 and D20 Service Guide
D20 I/O Backplane Assembly
Before performing the following procedure, read the “Safety Considerations” on page 41.
I/O Backplane Removal
To remove the I/O backplane from the subsystem, do the following:
1. Open the rack front door.
2. Shut down the system as described in “Stopping and Starting the System” on page 42.
3. Put the D20 subsystem into the service position.
4. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Open the service access cover as described in “Opening the Service Access Cover” on page 111.
6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem.
7. Disconnect the power cables from the I/O subsystem power supplies.
8. Disconnect the SPCN connectors from the back of the subsystem chassis.
9. Label, and then disconnect all PCI adapter cables from their respective adapters.
10. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
11. Remove the RIO bus adapter as described in “RIO Bus Adapter Removal” on page 120.
12. Remove all of the power supplies or power supply filler as described in “Power Supply Removal” on
page 146.
13. Remove all PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 126.
Note: When you are removing the PCI adapters, alternately remove the seven PCI dividers.
14. Disconnect the following cables in the order listed from the connectors located on the front of the I/O
backplane. See the following illustration for connector locations.
v Operator panel cable
v Disk drive SPCN cable
v Disk drive power cable
v Cooling blower cable
1 Operator Panel Connector 3 Disk Drive Power Connector
2 Disk Drive System Power Control Network
(SPCN) Connector
4 Cooling Blower Connector
Chapter 3. Removal and Replacement Procedures 151
15. Remove the plastic membrane from the top of the I/O backplane by first removing the five retaining
screws that secure the plastic membrane to the I/O backplane.
Note: When removing the plastic membrane, slide the membrane under the light pipes that identify
the green and amber power LEDs.
16. Remove the three retaining screws that secure the front edge of the I/O backplane to the subsystem
chassis.
17. Go to the rear of the subsystem, and remove the two retaining screws that secure the back edge of
the I/O backplane to the back chassis bulkhead.
Note: The two retaining screws that secure the back edge of the I/O backplane to the rear chassis
bulkhead are located just below the left SPCN connector and the rack indicator connector.
18. Remove the two retaining screws that secure the SCSI cable bracket to the subsystem-chassis
bracket.
19. Remove the SCSI cable bracket by wiggling it up and down, then pulling it straight out and away from
the PCI adapter-headstock bracket and the subsystem chassis bracket.
20. Remove the four retaining screws that secure the PCI adapter-headstock bracket to the chassis and
power supply bulkhead.
21. Remove the PCI adapter-headstock bracket by lifting it up, then out of the subsystem chassis.
22. Remove the four retaining screws that secure the power-supply bulkhead to the chassis
23. Remove the power-supply bulkhead by doing the following:
a. Lift the slotted tab off the alignment pin. The slotted tab and alignment pin are located at the
bottom edge of the power-supply bulkhead.
b. Move the lower portion of the power-supply bulkhead out and away from the alignment pin.
Notes:
1) Ensure that the back edge of the power-supply bulkhead clears the disk drive backplane’s
plastic alignment pin. The disk drive backplane’s plastic alignment pin is located on the disk
drive-backplane support bracket between the power-supply bulkhead and the PCI
adapter-headstock bracket.
2) The disk drive backplane-support bracket is not shown in the following illustration; however, it
is easily located. This is the bracket that secures the service access cover to the chassis. It
also supports the disk drive backplanes, cooling blowers, power supply bulkhead, and PCI
adapter headstock bracket.
c. Push the power-supply bulkhead toward the center of the I/O subsystem chassis.
Note: The upper-right corner of the power-supply bulkhead might get caught on the underside of
the chassis support. If the power-supply bulkhead gets caught on the underside of the
chassis support, wiggle the bulkhead until it is freed from under the chassis support.
24. Remove the I/O backplane by sliding the backplane toward the front of the I/O subsystem.
25. Slowly lift the I/O backplane straight up and out of the subsystem.
152 D10 and D20 Service Guide
26. Locate the VPD module.
1 I/O Backplane
2 VPD Module
Attention: Note the location of Pin 1 before removing.
27. Note the location of pin 1 on the VPD module.
28. Gently remove the VPD module and put it in a safe place.
Note: You will be installing the old VPD module onto the new I/O backplane.
Chapter 3. Removal and Replacement Procedures 153
1 PCI Adapter Dividers (maximum
quantity 7)
6 Plastic Membrane
2 RIO Bus Adapter 7 D20
3 PCI-X Adapters (maximum quantity 7) 8 SCSI Cable Bracket
4 Power Supplies 9 PCI Adapter-Headstock Bracket
5 I/O Backplane 10 Power-Supply Bulkhead
5a VPD Module
154 D10 and D20 Service Guide
I/O Backplane Replacement
To replace the I/O backplane, do the following:
1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42.
2. Note the location of pin 1 on the VPD module.
3. Remove the VPD module that was installed on the new I/O backplane.
4. Install the VPD module from the old I/O backplane onto the new I/O backplane.
Note: When installing the VPD module, ensure that pin 1 is in the correct orientation.
5. Carefully align the I/O backplane to the alignment pins located on the chassis base.
6. Replace the three front retaining screws that secure the I/O backplane to the chassis.
7. Correctly align and then place the plastic membrane that protects the I/O backplane over the I/O
backplane. Secure it in place with five retaining screws.
8. Reconnect the following cables, in the order listed, to their respective connectors located on the front
of the I/O backplane:
v Cooling blower cable
v Disk drive power cable
v Disk drive SPCN cable
v Operator panel cable
9. Go to the rear of the subsystem chassis, and replace the two rear retaining screws.
Note: The two threaded screw holes used to secure the rear of the I/O backplane to the rear of the
subsystem chassis are located just below the left SPCN connector and the rack-indicator
connector.
10. Replace the power-supply bulkhead by doing the following:
a. Rotate the power-supply bulkhead, placing the top right corner under the chassis support bracket.
b. Align the slotted tab, located on the lower edge of the power supply bulkhead, with the alignment
pin, located on the subsystem chassis.
c. Align the four clearance holes located on the power-supply bulkhead with the four screw holes
located on the upper and lower subsystem chassis.
Note: When aligning the power-supply bulkhead, wiggle it until properly seated.
d. Secure the power-supply bulkhead to the subsystem chassis with four retaining screws.
11. Replace the PCI adapter-headstock bracket.
12. Align the four clearance holes on the bracket with the four screw holes located on the chassis.
13. Secure the PCI adapter-headstock bracket to the chassis with four retaining screws.
14. Replace the SCSI cable bracket, aligning the two screw holes and securing it with two retaining
screws to the chassis retaining bracket.
15. Reconnect the SPCN connectors from the back of the subsystem chassis.
16. Replace the power supplies or the power supply filler as described in “Power Supply Replacement”
on page 147.
17. Replace the RIO bus adapter as described in “RIO Bus Adapter Replacement” on page 122.
18. Reconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem.
Chapter 3. Removal and Replacement Procedures 155
19. Replace all of the previously removed PCI adapters as described in “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 128.
Note: Alternately reinstall the PCI dividers, when reinstalling the PCI adapters.
20. Reconnect all PCI adapter cables to their respective adapters.
21. Reconnect the power cables to the power supplies.
22. Reconnect the power source to the system. The green LED located on the power supply comes on
solid.
23. Close and then secure the service access cover with the three thumbscrews located on its back
edge.
24. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on
page 110.
25. After the system has successfully powered on, close the rack doors.
156 D10 and D20 Service Guide
Chapter 4. Parts Information
This chapter contains parts information for the D10 and D20 I/O drawers.
D10 I/O Subsystem
This section lists part numbers for the D10 I/O Subsystem. This I/O subsystem is used in conjunction with
a system unit. Depending on your configuration, the quantities of parts listed may increase by the actual
number of I/O subsystems connected to the system unit.
D10 I/O Subsystems Parts
1
2
3
4
5
6
7
8
9
10
12
13
14
11
157
Index Part Number Units Per Assy Description
1 00P2752 1 Bracket, Bulkhead
2 09P4656 1 Tray, cable
3 09P5417 1 Cable, fan
4 09P3128 2 Guide, mounting
5 1624749 8 Screw, guide mounting
6 ******* See index number 9
7 1621829 2 Screw, tray mounting
8 1624743 6 Screw, backplane mounting
9 09P5921 1 I/O backplane assembly
(includes SPCN connector card )
10 1621829 1 Screw, tailstock cover mounting
11 00P4053 1 Dual port RIO bus adapter
00P4080 1 Dual port RIO-2 bus adapter
12 1621829 2 Screw, midplate cover mounting
13 00P2751 1 Drawer chassis assembly
14 09P4664 1 Fan assembly
158 D10 and D20 Service Guide
D10 I/O Subsystem Parts (continued)
1
2
3
4
5
6
7
Chapter 4. Parts Information 159
Index Part Number Units Per Assy Description
1 00P2753 1 Cover, access
09P5159 Front bezel, white
2 1624743 4 Screw, guide mounting
3 00P2750 2 Guide, adapter cassette top
44P0239 Empty PCI adapter cassette kit
00P3842 PCI adapter cassette blank filler, long
00P3843 PCI adapter cassette blank filler, short
53P5450 PCI bracket (used in PCI adapter cassette to install
Gigabit Ethernet PCI-X adapter)
44P2661 PCI clip (used in PCI adapter cassette to install Ultra
SCSI adapter)
44P0321 PCI clip (used in PCI adapter cassette to install 10/100
Eternet adapter)
44P2675 PCI clip for (used in PCI adapter cassette to install
FCAL 2GBS adapter)
44P2676 Clip for FC 2969 bracket (used in PCI adapter cassette
to install Gigabit Ethernet PCI adapter)
4 09P3185 5 Guide, power cable support
5 09P2401 2 Cable, power
6 09P4665 1 Cover, power cable channel
7 09P3354 2 Power supply
09P4778 1 Front Bezel
8 09P5159 1 Front Bezel
160 D10 and D20 Service Guide
D10 SPCN, RIO, and Rack Beacon Cables (These parts are not shown.)
Index Part Number Units Per Assy Description
1 53P2231 Up to 12 Rack beacon junction box, 7-port mini connector (4693)
2 53P2237 Up to 12 Rack beacon connector cable, subsystem to junction
box, 4-pin std to 4-pin mini connector, 3 meter (4691)
07H6655 As needed Hook-and-loop fastner
3 53P2854 Up to 12 Rack beacon connector cable, 4-pin mini to 4-pin mini
0.25 meter (4692)
4 87G6235 Up to 9 SPCN cable, 2 meter
09P1251 Up to 9 SPCN cable, 3 meter
21F9469 Up to 9 SPCN cable, 6 meter
21F9358 Up to 9 SPCN cable, 15 meter
5 21P5454 Up to 10 RIO-2 cable, 1.2 meter
53P2676 Up to 10 RIO-2 cable, 3.5 meter
21P5456 Up to 10 RIO-2 cable, 10 meter
D10 Rack Mounting Enclosure (These parts are not shown.)
Index Part Number Units Per Assy Description
1 09P4746 1 Two position I/O subsystem mounting assembly
2 03N3845 1 Left rail
03N3847 1 Right rail
3 00P2402 2 Cable support bracket
4 09P4911 1 Mounting bracket
5 09P4912 1 Rear filler for empty I/O subsystem space in mounting
assembly
Chapter 4. Parts Information 161
D20 I/O Subsystem
This section lists part numbers for the D20 I/O Subsystem. This I/O subsystem is used in conjunction with
a system unit. Depending on your configuration, the quantities of parts listed may increase by the actual
number of I/O subsystems connected to the system unit.
D20 I/O Backplane and Cabling
1
2
3
4
5
6
Index Part Number Units Per Assy Description
1 53P0416 1 DASD cable
2 53P0419 1 Blower cable
3 53P0414 1 Operator panel cable
4 53P0220 1 Chassis assembly
5 44H7366 5 Mounting screw, stiffener
6 53P3472 1 I/O Backplane
162 D10 and D20 Service Guide
D20 PCI Slots
1
2
3
4
5
6
7
Index Part Number Units Per Assy Description
1 See Note 7 (maximum
quantity)
PCI adapter
2 53P0248 1 Power supply bulkhead
3 44H7366 4 Screw, power bulkhead mounting
4 44H7366 3 Screw, PCI plate mounting
5 53P0249 1 PCI adapter headstock bracket
6 53P2728 5 PCI dividers
7 53P2729 2 PCI dividers
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
Chapter 4. Parts Information 163
D20 Power, RIO Adapter, Cabling
Index Part Number Units Per Assy Description
1 53P4832 2 (maximum
quantity)
Power supply
2
53P3800 1 RIO bus adapter
97H2459 1 RIO-2 bus adapter
3 53P2690 3 Cable bracket, SCSI
4 44H7366 1 Screw, SCSI cable
5 53P0639 5 RIO-2 cable bracket
6 44H7366 2 Screw, RIO-2 cable bracket
7 53P0417 Configuration
dependent
SCSI bus cable
8 53P0418 Configuration
dependent
SCSI bus cable
9 53P0233 1 Power supply filler
164 D10 and D20 Service Guide
D20 Operator Panel
Index Part Number Units Per Assy Description
1 53P0257 2 Disk-drive-backplane (disk drive cage included)
2 53P0321 4 Screw, disk-drive-backplane mounting
3 53P0330 1 Operator panel
4 53P0275 1 Processor cover
5 44H7366 4 Screw, cover-mounting
Chapter 4. Parts Information 165
D20 Fans and Disk Drives
Index Part Number Units Per Assy Description
1 53P0262 4 Blower
2 D20
3 53P0257 1 Disk-drive cage 12 pack (disk drive back planes
included)
4 53P0319 8 Screw, DASD-cage mounting
5 Disk Drives
6 53P0234 4 Screw, filler-plate mounting
7 44H7366 1 (optional) Disk-drive filler plate
8 53P3451 1 Left Rail
9 53P3452 1 Right Rail
166 D10 and D20 Service Guide
D20 Covers and Brackets
Index Part Number Units Per Assy Description
1 53P1457 1 Front bezel
2 44H7366 2 Screw, latch bracket
3 53P2572 1 Latch bracket assembly left
4 53P0295 4 Cable arm bracket
5 44H7366 1 Screw, cable arm bracket
6 53P2573 1 Latch bracket assembly right
D20 RIO, SPCN, and Rack Beacon Cables
Index Part Number Units Per Assy Description
1 53P2231 Up to 12 Rack beacon cable junction box
2 53P2237 Up to 12 Rack beacon connector cable, 4-pin connector
3 53P2854 Up to 12 Rack beacon connector cable, 4-pin
87G6235 SPCN cable, 2 meter
09P1251 SPCN cable, 3 meter
21F9469 SPCN cable, 6 meter
21F9358 SPCN cable, 15 meter
4 53P2676 RIO-2 cable, 3.5 meter
21P5456 RIO-2 cable, 10 meter
Chapter 4. Parts Information 167
Power Cables
Index Part Number Country or Region
1 1838574 Bahamas, Barbados, Bolivia, Brazil, Canada, Costa Rica, Dominican Republic, El
Salvador, Ecuador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Netherlands
Antilles, Panama, Peru, Philippines, Taiwan, Thailand, Trinidad, Tobago, U.S.A. (except
Chicago), Venezuela
2 86G7648 Bahamas, Barbados, Bermuda, Bolivia, Brazil, Canada, Cayman Islands, Colombia, Costa
Rica, Dominican Republic, Ecuador, El Salvador, Guatemala, Guyana, Haiti, Honduras,
Jamaica, Japan, Korea (South), Mexico, Netherlands Antilles, Nicaragua, Panama, Peru,
Philippines, Puerto Rico, Saudi Arabia, Suriname, Trinidad, Taiwan, U.S.A. (except
Chicago), Venezuela
2 62X1045 Chicago, U.S.A.
3 6952311 Argentina, Australia, New Zealand
4 13F9979 Abu Dhabi, Austria, Belgium, Bulgaria, Botswana, China (Macau S.A.R), Egypt, Finland,
France, Germany, Greece, Iceland, Indonesia, Korea (South), Lebanon, Luxembourg,
Netherlands, Norway, Portugal, Saudi Arabia, Spain, Sudan, Sweden, Turkey, Yugoslavia
5 13F9997 Denmark
6 14F0015 Bangladesh, Burma, Pakistan, South Africa, Sri Lanka
7 14F0033 Bahrain, Bermuda, Brunei, Channel Islands, Cyprus, China (Hong Kong S.A.R), Ghana,
India, Iraq, Ireland, Jordan, Kenya, Kuwait, Malawi, Malaysia, Nigeria, Oman, People’s
Republic of China, Qatar, Sierra Leone, Singapore, Tanzania, Uganda, United Arab
Emirates (Dubai), United Kingdom, Zambia
8 14F0051 Liechtenstein, Switzerland
9 14F0069 Chile, Ethiopia, Italy
10 14F0087 Israel
11 6952291 Paraguay, Colombia, Uruguay
42F6841 AC Power Cable, (Power distribution bus to AC power supply, 286/C4 only)
1 2 3 4
5 6 7
8 9 10 11
168 D10 and D20 Service Guide
Chapter 5. Fault and Attention LEDs
This chapter contains information about fault and attention LEDs, which assist in identifying failing
components in your system.
Component LEDs
Individual LEDs are located on or near the failing components. The LEDs are located either on the
component itself or on the carrier of the component (for example, PCI adapters and disk drives). LEDs are
either green or amber.
Green LEDs indicate one of two things:
v Electrical power is present.
v Activity is occuring on a link. (The system could be sending or receiving information.)
Amber-colored LEDs indicate a fault or identify condition. If your system or one of the components on your
system has an amber-colored LED turned on or blinking, identify the problem and take the appropriate
action to restore the system to normal.
The following table identifies the location, color and status of your system component LEDs.
169
Unit
(FRU)
LED
Function
Location Viewable
Location
LED Color OFF ON Blink
Power and Packaging LEDs
Rack
Indicator
Rack
Summary Top of Rack
External
Amber Normal Fault Identify
AC Input External Front Green No Input AC Good Fault
I/O
Subsystem
Drawer
Attention Operator
Panel
External Front Amber Normal Fault Identify
Power External Front Green No AC System On Standby
Fans
Identify FRU External Amber Normal Identify N/A
Power FRU External Green Power Off Power On N/A
Power
Supply
Input
Source
FRU External Back Green No Input AC Input
Good
N/A
Identify/
COM Fail
FRU External Back Amber Normal N/A 1 Hz=Identify
.25 Hz=Comm.
Failed
DC Output
Good
FRU External Back Green
Standby Off System On Standby
Disk Drives
(DASD)
Slot Power
DASD
Backplane
External Front Green Power Off Power On N/A
Activity
DASD
Backplane
External Front Green Not Active N/A Activity
Identify DASD
Backplane
External Front Amber Normal Action
Remove
Identify
Media
Slot Power Backplane External Front Green Power Off Power On N/A
Activity/
Identify
Backplane External Front Amber Normal Action Identify
Electronic Component LEDs
RIO
Connector
Card
Identify
FRU (Top) Internal Only Amber Normal Identify N/A
RIO
Extender
Card
Identify
FRU (Top) Internal Only Amber Normal Identify N/A
I/O
Backplane
Identify FRU Internal Only Amber Normal Identify N/A
PCI Adapter
Connectors
(Slots)
Slot Power Backplane External Back Green Power Off Power On N/A
Action/
Identify
Backplane External Back Amber Normal
N/A Grn off - Action
Grn on -
Identify
Disk Drive
Backplane
No LED
Port LED Indicators
RIOPorts Identify
Top of Each
External
Connector
External Back Amber Normal N/A Identify
170 D10 and D20 Service Guide
Resetting the LEDs
After the repair action is completed, do the following:
1. Log in as root user.
2. At the command line, type diag.
3. Select Task Selection.
4. Select Log Repair Action.
5. Select the device that was repaired.
6. Press F10 to exit diagnostics.
If the attention LED remains on after you have completed the repair action and reset the LEDs, call for
service support.
Chapter 5. Fault and Attention LEDs 171
172 D10 and D20 Service Guide
Appendix A. Environmental Notices
Product Recycling and Disposal
Components of the system unit, such as structural parts and circuit boards, can be recycled where
recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic
products. Contact your account representative for more information. This system unit contains parts such
as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead
and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is
disposed, these materials must be removed and recycled or discarded according to applicable regulations.
This book contains specific information on each battery type where applicable.
This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or
lithium ion battery. Consult your user manual or service manual for specific battery information.
The battery must be recycled or disposed of properly. Recycling facilities may not be available in
your area. For information on disposal of batteries, contact your local waste disposal facility.
173
Acoustical Noise Emissions
The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure
level at the 1-meter bystander positions) does not exceed 70 dB(A).
Der Geräuschpegel der Einheit ist kleiner oder gleich 70 db(A).
D10 Declared Acoustical Noise Emissions
Acoustical Characteristic
Product Configuration Declared A-Weighted Sound Power
Level, LWAd (B)
Declared A-Weighted Sound Pressure
Level, LpAm (dB)
Operating Idle Operating Idle
One D10 I/O subsystem 5.65 5.65 406 406
Two D10 I/O subsystems 5.95 5.95 436 436
Four D10 I/O subsystems 6.25 6.25 466 466
Notes:
1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media
drawer, and two I/O drawers).
2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4
reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of
four A-Frames with acoustical covers.
3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the
1-meter bystander positions; 1 B = 10 dB.
4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296.
D20 Declared Acoustical Noise Emissions
Acoustical Characteristic
Product Configuration Declared A-Weighted Sound Power
Level, LWAd (B)
Declared A-Weighted Sound Pressure
Level, LpAm (dB)
Operating Idle Operating Idle
D20 I/O subsystem 6.15 6.05 446 436
Notes:
1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media
drawer, and two I/O drawers).
2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4
reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of
four A-Frames with acoustical covers.
3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the
1-meter bystander positions; 1 B = 10 dB.
4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296.
174 D10 and D20 Service Guide
Appendix B. Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer’s representative for information on the products and services currently
available in your area. Any reference to the manufacturer’s product, program, or service is not intended to
state or imply that only that product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any intellectual property right of the manufacturer may be used
instead. However, it is the user’s responsibility to evaluate and verify the operation of any product,
program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not give you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any country where such
provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ²AS IS² WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions; therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication. The
manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described
in this publication at any time without notice.
Information concerning products made by other than the manufacturer was obtained from the suppliers of
those products, their published announcements, or other publicly available sources. The manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products made by other than the manufacturer. Questions on the capabilities of products
made by other than the manufacturer should be addressed to the suppliers of those products.
175
176 D10 and D20 Service Guide
Index
A
acoustical
noise
emissions 174
adapter cassette
removing 55
replacing 56
adapter, PCI removal and replacement, D10 45
adapters, PCI D20 123
AIX documentation 47, 53, 123, 129, 132, 135, 137,
138
AIX location codes 14, 31
attention LEDs 169
B
backplane, I/O removal and replacement D20 151
battery
disposal, recycling 173
bay locations 24
beacon, parts 161
bezel removal, D20 112
bezel, front 43, 112
blower locations 21
blower, cable removal D20 118
blower, cooling 18
blowers, cooling 115
blowers, hot-swap removal, D20 115
blowers, hot-swap replacement, D20 117
brackets, parts 159, 167
bulkhead, power supply removal and replacement,
D20 148
C
cables, external parts 167
cabling, parts 162
cage, disk drive D20 142
checkpoints 39, 40
code, error 40
codes, location 14, 31
component LEDs 39
cooling 5, 18
cooling blower cable replacement, D20 119
cooling blowers cable, D20 118
cooling, parts 159, 163
cover removal and replacement D10 44
cover, service access 111
covers 44
covers, parts 159, 167
D
D10 location code table 15
D10 location codes 14
D10 locations 10
codes, D10 14
D10 locations (continued)
D10 15
D10 I/O subsystem, front 10
D10 I/O subsystem, rear 11
I/O backplane 12
PHB 13
D10 replacement procedure list 42
D20 location code table 32
D20 location codes 31
D20 locations 19
blowers 21
codes, D20 31
D20 32
D20 I/O subsystem, front 19
D20 I/O subsystem, rear 19
I/O backplane 22
power supply 20
SCSI bay locations 24
D20 replacement procedure list 108
definitions, LED 2
diagnostics overview 39
disk drive backplane 144
disk drive backplane replacement, D20 145
disk drive cage removal D20 142
disk drive cage replacement, D20 143
disk drive, backplane removal D20 144
disk drive, configuring 138
disk drive, deconfiguring 137
disk drive, hot-plug removal D20 139
disk drive, hot-plug replacement, D20 141
disk drive, parts 166
disk drive, removing 137
disk drive, replacing 138
disk drives, hot-plug 137
documentation
AIX 47, 53, 123, 129, 132, 135, 137, 138
E
electrical safety vii
laser compliance statement ix
error code to FRU index 40
external cables, parts 167
F
fan, cooling 5
fan, parts 166
fan, removal and replacement D10 104
features, subsystem 1
cooling 5
LED definitions 2
operator indicators 2
pci slot 2
PCI-X slots 2
ports, input/output 5
ports, serial 5
177
features, subsystem (continued)
power 4
features, system 16
cooling 18
disk drives 17
operator panel 16
PCI-X slots 16
ports, input/output 18
ports, serial 18
power 18
slots, PCI-X 16
front bezel removal, D10 43
front bezel replacement, D10 43
front bezel replacement, D20 112
front, bezel 43
FRU
index 40
FRU isolation 39
H
handling static-sensitive devices 42
highlighting xiii
hot-plug management functions 123
hot-plug manager 123
hot-plug manager menu 124
hot-pluggable PCI adapter installation overview 53
hot-pluggable PCI adapter removal and replacement,
D10 46
hot-swap fan assembly D10 104
hot-swap fan replacement, D10 104
I
I/O backplane assembly replacement, D10 106
I/O backplane assembly, removal and replacement,
D10 106
I/O backplane locations 12, 22
I/O backplane removal D20 151
I/O backplane replacement, D20 155
I/O backplane, parts 157, 162
index, error code to FRU 40
information, reference 1
input/output ports 5, 18
IO backplane assembly removal, D10 106
isolation, FRU 39
L
laser compliance statement ix
laser safety information ix
LED definitions 2
LED definitions, PCI-X 16
LED, D10 power supply 5
LEDs 169
attention LEDs 169
component 39, 169
component LEDs 169
fault LEDs 169
resetting 171
resetting LEDs 171
location codes 14, 31
location codes, physical 14, 31
locations, D10 10
locations, D20 19
M
MAP rules 40
N
noise emissions
acoustical 174
non-hot-pluggable PCI adapter, removal D20 126
O
operating position 110
operating position, D10 43
operator panel 23, 113
operator panel removal, D20 113
operator panel replacement, D20 114
operator panel, parts 165
overview
hot-pluggable PCI adapter 53
overview, D10 1
overview, D20 16
overview, diagnostics 39
P
parts information
covers and brackets 167
external cables 167
fans and disk drives 166
I/O backplane
PCI slots 157
I/O backplane and cabling 162
operator panel 165
PCI slots 163
power and cooling 159
power, RIO 164
rack mount and beacon connect 161
RIO cables 167
SPCN cables 167
subsystem 157
parts, beacon 161
PCI adapter cassette
removing 55
replacing 56
PCI adapter removal and replacement, D10 45
PCI adapter, hot-plug removal D20 129
PCI-X definitions, LED 16
PHB 13
physical location codes 14, 31
ports, input/output 5, 18
power and lightpath LEDs 39
power cables 37
power supplies, removal and replacement 146
power supply 4, 18
178 D10 and D20 Service Guide
power supply bulkhead removal D20 148
power supply bulkhead replacement, D20 150
power supply LED 5
power supply locations 20
power supply removal D20 146
power supply removal, D10 107
power supply replacement, D10 107
power supply, hot-swap replacement, D20 147
power supply, removal and replacement, D10 107
Power-On Self-Test 34
power, D10 4
power, D20 18
power, parts 159, 163, 164
powering on and off 34
product disposal 173
R
recycling 173
reference information 1
related publications xiv
removal and replacement procedures 41
adapter, adding or replacing a non-hot-plug 128
adapter, hot-plug management functions 123
adapter, hot-plug manager 123
adapter, install hot-plug 135
adapter, replace hot-plug 132
adapters, PCI D20 123
bezel 43
bezel, front 112
blowers, cooling 115
cooling blowers cable, D20 118
cover, service access 111
covers 44
devices, handling static-sensitive 42
disk drive backplane 144
disk drive cage, D20 142
disk drive, configuring 137
disk drive, deconfiguring 137
disk drives, hot-plug 137
fan, D10 104
hot-plug manager menu 124
hot-pluggable PCI adapter, 46
I/O backplane assembly, D10 106
I/O backplane D20 151
operating position 110
operator panel 113
PCI adapter, D10 45
power supplies 146
power supply bulkhead D20 148
power supply, D10 107
RIO bus adapter, D10 105
RIO bus adapter, D20 120
RIO cable, D10 105
safety considerations 41
service position 109
stopping the system 42
removal procedure list D10 42
removal procedure list D20 108
removal procedures 41
adapter, PCI D20 126
removal procedures (continued)
bezel, D20 112
blower cable, D20 118
blowers, hot-swap, D20 115
caution and danger 41
disk drive backplane, D20 144
disk drive cage D20 142
front bezel, D10 43
hot-plug adapter, D20 129
hot-plug disk drive, D20 139
hot-swap fan assembly D10 104
I/O backplane, D20 151
IO backplane assembly, D10 106
operator panel, D20 113
power supply bulkhead, D20 148
power supply D20 146
power supply, D10 107
RIO bus adapter, D10 105
RIO bus adapter, D20 120
RIO cable, D10 105
service access cover, D10 44
service access cover, D20 111
removing a PCI adapter cassette 55
replacement procedures 41
blowers, hot-swap, D20 117
cooling blower cable, D20 119
disk drive backplane, D20 145
disk drive cage, D20 143
disk drive, hot-plug, D20 141
front bezel, D10 43
front bezel, D20 112
hot-swap fan, D10 104
I/O backplane, D20 155
operator panel, D20 114
power supply bulkhead, D20 150
power supply, D10 107
power supply, hot-swap, D20 147
RIO bus adapter, D10 105
RIO bus adapter, D20 122
RIO cable, D10 105
service access cover, D10 44
service access cover, D20 111
rereplacing a PCI adapter cassette 56
resetting the LEDs 171
RIO bus adapter removal D20 120
RIO bus adapter removal, D10 105
RIO bus adapter replacement, D10 105
RIO bus adapter replacement, D20 122
RIO bus adapter, D10 removal and replacement 105
RIO bus adapter, D20 120
RIO cable removal D10 105
RIO cable replacement, D10 105
RIO cable, removal and replacement D10 105
RIO-2 cables 5
RIO, parts 164
S
safety considerations 41
safety notices vii
SCSI IDs 24
Index 179
Self-Test, Power-On 34
serial port configurations 18
service access cover removal, D10 44
service access cover removal, D20 111
service access cover replacement, D10 44
service access cover replacement, D20 111
service inspection guide 38
service position 109
service position, D10 43
SPCN cables 5
SPCN cables, parts 167
specifications 35
specifications, power cables 37
stopping the system 42
subsystem cables 5
subsystem, power 4, 18
system
stopping 42
system management services 40
system, powering on and off 34
T
trademarks xiv
W
Web sites
AIX documentation 47, 53, 123, 129, 132, 135,
137, 138
180 D10 and D20 Service Guide
Vos remarques sur ce document / Technical publication remark form
Titre / Title : Bull D10 and D20 I/O Drawers Service Guide
Nº Reférence / Reference Nº : 86 A1 38EG 01 Daté / Dated : June 2003
ERREURS DETECTEES / ERRORS IN PUBLICATION
AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION
Vos remarques et suggestions seront examinées attentivement.
Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète.
Your comments will be promptly investigated by qualified technical personnel and action will be taken as required.
If you require a written reply, please furnish your complete mailing address below.
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
Remettez cet imprimé à un responsable BULL ou envoyez-le directement à :
Please give this technical publication remark form to your BULL representative or mail to:
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Technical Publications Ordering Form
Bon de Commande de Documents Techniques
To order additional publications, please fill up a copy of this form and send it via mail to:
Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à :
BULL CEDOC
ATTN / Mr. L. CHERUBIN
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Phone / Téléphone : +33 (0) 2 41 73 63 96
FAX / Télécopie +33 (0) 2 41 73 60 19
E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr
Or visit our web sites at: / Ou visitez nos sites web à:
http://www.logistics.bull.net/cedoc
http://www–frec.bull.com http://www.bull.com
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
[ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
PHONE / TELEPHONE : FAX :
E–MAIL :
For Bull Subsidiaries / Pour les Filiales Bull :
Identification:
For Bull Affiliated Customers / Pour les Clients Affiliés Bull :
Customer Code / Code Client :
For Bull Internal Customers / Pour les Clients Internes Bull :
Budgetary Section / Section Budgétaire :
For Others / Pour les Autres :
Please ask your Bull representative. / Merci de demander à votre contact Bull.
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 38EG 01
ORDER REFERENCE
PLACE BAR CODE IN LOWER
LEFT CORNER
Utiliser les marques de découpe pour obtenir les étiquettes.
Use the cut marks to get the labels.
86 A1 38EG 01
D10 and D20 I/O
Drawers
Service Guide
86 A1 38EG 01
D10 and D20 I/O
Drawers
Service Guide
86 A1 38EG 01
Bull ESCALA PL420T and PL420R
Service Guide
86 A1 42EG 00
ORDER REFERENCE
Bull ESCALA PL420T and PL420R
Service Guide
Hardware
February 2003
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 42EG 00
ORDER REFERENCE
The following copyright notice protects this book under the Copyright laws of the United States of America
and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and
making derivative works.
Copyright Bull S.A. 1992, 2003
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of
this book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the right of proprietors of trademarks mentioned in this book.
AIX is a registered trademark of International Business Machines Corporation, and is being used under
licence.
UNIX is a registered trademark in the United States of America and other countries licensed exclusively through
the Open Group.
Linux is a registered trademark of Linus Torvalds.
The information in this document is subject to change without notice. Groupe Bull will not be liable for errors
contained herein, or for incidental or consequential damages in connection with the use of this material.
Contents
Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii
Laser Safety Information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Laser Compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . . xv
About This Book. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xviii
Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
System Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
PCI-X Slots and Integrated Adapters. . . . . . . . . . . . . . . . . . . . . . . . . 4
I/O Expansion Drawer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Hardware Management Console (HMC). . . . . . . . . . . . . . . . . . . . . . . . . 6
Powering the System on and Off . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Console Strategy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Power-On Self-Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
POST Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
POST Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
System Unit Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
286/C4 Rear View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Power Supply Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Fan Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
CEC Backplane Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
PCI Riser Card Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Memory DIMM Location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Reading the Operator Panel and System LEDs . . . . . . . . . . . . . . . . . . . . 19
SCSI IDs and Bay Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
System Logic Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Systems Configured with a GP Processor and Four-Slotted PCI Riser Card . . . . . . . . . . 21
Systems Configured with a GQ Processor and Six-Slotted PCI Riser Card . . . . . . . . . . 22
Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Physical Location Codes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Location Code Format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
AIX Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Mapping AIX and Physical Location Codes . . . . . . . . . . . . . . . . . . . . . . . 28
Four-Slotted PCI Riser Card Location Codes . . . . . . . . . . . . . . . . . . . . . 29
Six-Slotted PCI Riser Card Location Codes . . . . . . . . . . . . . . . . . . . . . . 30
Disk Drive Backplane Location Codes . . . . . . . . . . . . . . . . . . . . . . . . 30
AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 31
System Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Subsystem Positioning and Cabling. . . . . . . . . . . . . . . . . . . . . . . . . . 39
SPCN Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
RIO Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Specifications. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
External AC Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Service Inspection Guide. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
iii
Chapter 2. Diagnostic Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . . 45
Power and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
Resetting the LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
System Attention LED. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Service Focal Point. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Testing Error Reporting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Service Focal Point Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Working With Serviceable Events . . . . . . . . . . . . . . . . . . . . . . . . . 51
Hardware Service Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
System Error Interrogation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
The System is Attached to an HMC. . . . . . . . . . . . . . . . . . . . . . . . . 55
The System is not Attached to an HMC and AIX is Installed. . . . . . . . . . . . . . . . 55
The System is Attached to an HMC and Linux is Installed . . . . . . . . . . . . . . . . 55
Resetting the System Attention LED . . . . . . . . . . . . . . . . . . . . . . . . . 56
Systems Attached To an HMC. . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Systems not Attached to an HMC with AIX Installed. . . . . . . . . . . . . . . . . . . 56
Systems not Attached to an HMC with Linux Installed . . . . . . . . . . . . . . . . . . 56
Chapter 3. Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . 57
Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Quick Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
MAP 1020: Problem Determination . . . . . . . . . . . . . . . . . . . . . . . . . . 64
MAP 1240: Memory Problem Resolution . . . . . . . . . . . . . . . . . . . . . . . . 68
MAP 1320: Service Focal Point Procedures. . . . . . . . . . . . . . . . . . . . . . . 75
MAP 1321: Quick Entry MAP for Systems with Service Focal Point . . . . . . . . . . . . . . 76
MAP 1322: End of Call MAP for Systems with Service Focal Point . . . . . . . . . . . . . . 84
MAP 1520: Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
MAP 1521: Base System Drawer With Power Failure and No Error Codes . . . . . . . . . . . 91
Map 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On. . . . . . . 98
Map 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x . . . 105
MAP 1540: Problem Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . 108
FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
MAP 1541: JTAG Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . 110
MAP 1542: I/O Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . 111
MAP 1548: Processor Subsystem Problem Isolation . . . . . . . . . . . . . . . . . . . 123
MAP 1549: Attention Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . 137
MAP 154C: SPCN Cable Problem Isolation . . . . . . . . . . . . . . . . . . . . . . 139
Chapter 4. Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
IPL Flow with an HMC Attached to the System . . . . . . . . . . . . . . . . . . . . . 141
IPL Flow Without an HMC Attached to the System. . . . . . . . . . . . . . . . . . . . 142
Service Processor Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Firmware Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150
Boot Problems/Concerns . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Chapter 5. Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . . 179
Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
Four-Character Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
Replacing the Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
iv Service Guide
Replacing the Network Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
Determining Location Code . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
Checkpoint and Error Code Index . . . . . . . . . . . . . . . . . . . . . . . . . . 181
Operator Panel Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
SPCN Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192
Service Processor Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
Memory DIMM Present Detect Bits (PD-Bits) . . . . . . . . . . . . . . . . . . . . . . 285
Error Log Utility Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . 285
System Firmware Update Messages . . . . . . . . . . . . . . . . . . . . . . . . . 286
Scan Dump Messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 286
Common Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . 286
Scan Log Dump Progress Codes . . . . . . . . . . . . . . . . . . . . . . . . . . 294
Problem Determination Generated Error Codes . . . . . . . . . . . . . . . . . . . . . 295
Chapter 6. Using the Online and Standalone Diagnostics . . . . . . . . . . . . . . . . 297
Operating Considerations for Online and Standalone Diagnostics . . . . . . . . . . . . . . 297
Identifying the Terminal Type to the Diagnostics. . . . . . . . . . . . . . . . . . . . 298
Undefined Terminal Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
Resetting the Terminal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
Online Diagnostics Modes of Operation. . . . . . . . . . . . . . . . . . . . . . . . 298
Service Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
Concurrent Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299
Maintenance Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300
Standalone Diagnostics Operations . . . . . . . . . . . . . . . . . . . . . . . . . 301
Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301
Considerations for Running Standalone Diagnostics on a Partitioned Systems . . . . . . . . 301
Considerations for Running Standalone Diagnostics from CD-ROM. . . . . . . . . . . . . 301
NIM Server Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
Client Configuration and Booting Standalone Diagnostics from the NIM Server . . . . . . . . 303
Running the Online Diagnostics with No HMC Attached . . . . . . . . . . . . . . . . . . 304
Chapter 7. Using the Service Processor. . . . . . . . . . . . . . . . . . . . . . . 305
Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
Accessing the Service Processor Menus Locally . . . . . . . . . . . . . . . . . . . 305
Accessing the Service Processor Menus Remotely. . . . . . . . . . . . . . . . . . . 306
Saving and Restoring Service Processor Settings . . . . . . . . . . . . . . . . . . . 306
Menu Inactivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306
General User Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306
Privileged User Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
Main Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
Service Processor Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . 309
Passwords . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309
System Power Control Menu. . . . . . . . . . . . . . . . . . . . . . . . . . . 314
System Information Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317
Language Selection Menu. . . . . . . . . . . . . . . . . . . . . . . . . . . . 324
Call-In/Call-Out Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 325
Modem Configuration Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 326
Serial Port Selection Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 326
Serial Port Speed Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . 327
Telephone Number Setup Menu. . . . . . . . . . . . . . . . . . . . . . . . . . 327
Call-Out Policy Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 328
Customer Account Setup Menu. . . . . . . . . . . . . . . . . . . . . . . . . . 329
Call-out Test Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
Service Processor Parameters in Service Mode (Full System Partition) . . . . . . . . . . . . 330
Contents v
System Power-On Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330
Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . . . . . . . . . . 331
Boot (IPL) Speed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 331
Failure During Boot Process . . . . . . . . . . . . . . . . . . . . . . . . . . . 331
Failure During Normal System Operation . . . . . . . . . . . . . . . . . . . . . . 331
Service Processor Reboot/Restart Policy Controls . . . . . . . . . . . . . . . . . . . 331
System Firmware Updates. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332
General Information on System Firmware Updates. . . . . . . . . . . . . . . . . . . 333
Determining the Level of Firmware on the System . . . . . . . . . . . . . . . . . . . 334
Updating System Firmware From the Service Processor Menus . . . . . . . . . . . . . . 334
Updating System Firmware from a NIM Server . . . . . . . . . . . . . . . . . . . . 335
Recovery Mode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 335
Configuring and Deconfiguring Processors or Memory . . . . . . . . . . . . . . . . . . 335
Run-Time CPU Deconfiguration (CPU Repeat Gard) . . . . . . . . . . . . . . . . . . 336
Service Processor System Monitoring - Surveillance . . . . . . . . . . . . . . . . . . . 336
System Firmware Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . . 336
Operating System Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . . 336
Call-Out (Call-Home). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337
Console Mirroring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338
System Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338
Service Processor Error Logs . . . . . . . . . . . . . . . . . . . . . . . . . . . 339
LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
Service Processor Operational Phases . . . . . . . . . . . . . . . . . . . . . . . . 341
Pre-Standby Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341
Standby Phase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341
Bring-Up Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342
Runtime Phase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 343
Chapter 8. Using System Management Services . . . . . . . . . . . . . . . . . . . 345
Select Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 346
Change Password Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
View Error Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
Setup Remote IPL (Initial Program Load) . . . . . . . . . . . . . . . . . . . . . . . 348
Change SCSI Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351
Select Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351
Select Boot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 352
Select Boot Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354
Display Current Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 355
Restore Default Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 356
Multiboot Startup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 356
Exiting System Management Services . . . . . . . . . . . . . . . . . . . . . . . . 356
Chapter 9. Removal and Replacement Procedures. . . . . . . . . . . . . . . . . . . 357
Procedure List . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 358
Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . . . 359
Stopping the System without an HMC Attached . . . . . . . . . . . . . . . . . . . . . 359
Stopping the System with an HMC Attached and AIX Installed . . . . . . . . . . . . . . . 359
Stopping the System with an HMC Attached and Linux Installed. . . . . . . . . . . . . . . 360
Starting the System without an HMC Attached . . . . . . . . . . . . . . . . . . . . . 360
Starting the System with an HMC Attached . . . . . . . . . . . . . . . . . . . . . . 361
286/C4 Drawer-Release Latch and Brackets . . . . . . . . . . . . . . . . . . . . . . 362
Drawer-Release Latch Removal. . . . . . . . . . . . . . . . . . . . . . . . . . 362
Drawer-Release Latch Replacement . . . . . . . . . . . . . . . . . . . . . . . . 363
Drawer-Release Latch Bracket Removal . . . . . . . . . . . . . . . . . . . . . . 363
Drawer-Release Latch Bracket Replacement . . . . . . . . . . . . . . . . . . . . . 364
Placing the 286/C4 into the Service Position . . . . . . . . . . . . . . . . . . . . . . 364
vi Service Guide
Returning the 286/C4 to the Operating Position . . . . . . . . . . . . . . . . . . . . . 365
Covers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 366
Service Access Cover Removal (286/C4) . . . . . . . . . . . . . . . . . . . . . . 366
Service Access Cover Replacement (286/C4). . . . . . . . . . . . . . . . . . . . . 366
Top Cap Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . . 367
Top Cap Replacement (286/E4). . . . . . . . . . . . . . . . . . . . . . . . . . 367
Top Cover Removal (286/E4). . . . . . . . . . . . . . . . . . . . . . . . . . . 368
Top Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . 369
Outer Side Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 370
Outer Side Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . 370
Service Access Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . 371
Service Access Cover Replacement (286/E4). . . . . . . . . . . . . . . . . . . . . 372
Back Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . 372
Back Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 373
Disk Drive Access Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . 373
Disk Drive Access Cover Replacement (286/E4). . . . . . . . . . . . . . . . . . . . 373
Right Side Cover (System Base) Removal (286/E4) . . . . . . . . . . . . . . . . . . 374
Right Side Cover (System Base) Replacement (286/E4). . . . . . . . . . . . . . . . . 375
Bezel Door (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376
Bezel Door Removal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376
Bezel Door Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376
Bezels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 377
Front Bezel Removal (286/C4) . . . . . . . . . . . . . . . . . . . . . . . . . . 377
Front Bezel Replacement (286/C4) . . . . . . . . . . . . . . . . . . . . . . . . 377
Front Bezel Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . 378
Front Bezel Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 378
PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 379
PCI Hot-Plug Manager Access . . . . . . . . . . . . . . . . . . . . . . . . . . 379
Non-Hot-Pluggable PCI Adapter Removal . . . . . . . . . . . . . . . . . . . . . . 381
Adding or Replacing a Non-Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . 383
Hot-Pluggable PCI Adapter Removal . . . . . . . . . . . . . . . . . . . . . . . . 385
Replacing a Hot-Pluggable PCI Adapter. . . . . . . . . . . . . . . . . . . . . . . 388
Installing a Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . . . 392
Fans. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395
Removing a Processor Fan in a Non-Hot-Swap Environment . . . . . . . . . . . . . . . 395
Replacing a Processor Fan in a Non-Hot-Swap Environment . . . . . . . . . . . . . . . 396
Hot-Swap Fan Assembly Removal. . . . . . . . . . . . . . . . . . . . . . . . . 397
286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 398
286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 399
Hot-Swap Fan Assembly Replacement . . . . . . . . . . . . . . . . . . . . . . . 399
Media Drives (Diskette, CD-ROM, Tape, or Disk Drive) . . . . . . . . . . . . . . . . . . 401
Media Removal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401
286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 402
286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 403
Media Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 404
Hot-Plug Disk Drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 405
Deconfiguring (Removing) or Configuring a Disk Drive . . . . . . . . . . . . . . . . . 405
Deconfiguring (Removing). . . . . . . . . . . . . . . . . . . . . . . . . . . . 405
Configuring (Replacing). . . . . . . . . . . . . . . . . . . . . . . . . . . . . 406
Hot-Plug Disk Drive Removal . . . . . . . . . . . . . . . . . . . . . . . . . . 406
Hot-Plug Disk Drive Replacement . . . . . . . . . . . . . . . . . . . . . . . . . 408
PCI Riser Card. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
PCI Riser Card Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
PCI Riser Card Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 413
Disk Drive Backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Disk Drive Backplane Removal . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Contents vii
Disk Drive Backplane Replacement . . . . . . . . . . . . . . . . . . . . . . . . 418
Operator Panel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419
Operator Panel Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419
286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419
286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420
Operator Panel Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 420
System Vital Product Data (VPD) Update Procedure . . . . . . . . . . . . . . . . . . . 421
Processor Card. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423
Processor Card Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423
Adding or Replacing a Processor Card . . . . . . . . . . . . . . . . . . . . . . . 426
Memory DIMMs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 427
Memory DIMM Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 427
Memory DIMM Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 431
Replacing the Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 432
Power Supplies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 436
Non-Hot-Swap Power Supply Removal . . . . . . . . . . . . . . . . . . . . . . . 436
Non-Hot-Swap Power Supply Replacement . . . . . . . . . . . . . . . . . . . . . 438
Hot-Swap Power Supply Removal. . . . . . . . . . . . . . . . . . . . . . . . . 439
Hot-Swap Power Supply Replacement . . . . . . . . . . . . . . . . . . . . . . . 441
Central Electronics Complex (CEC) Backplane . . . . . . . . . . . . . . . . . . . . . 443
CEC Backplane Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 443
CEC Backplane Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . 448
Chapter 10. Parts Information. . . . . . . . . . . . . . . . . . . . . . . . . . . 451
System Parts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 452
System Parts (continued) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 454
286/E4 Cover Set and Brackets. . . . . . . . . . . . . . . . . . . . . . . . . . . 456
System Internal Cables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 459
Power Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 462
Keyboards and Mouse (White) . . . . . . . . . . . . . . . . . . . . . . . . . . . 463
Chapter 11. Fault and Attention LEDs. . . . . . . . . . . . . . . . . . . . . . . . 465
Operator Panel Display. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 465
Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 466
Resetting the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 469
Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . . 471
Product Recycling and Disposal. . . . . . . . . . . . . . . . . . . . . . . . . . . 471
Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 472
Declared Acoustical Noise Emissions. . . . . . . . . . . . . . . . . . . . . . . . . 472
Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 473
Appendix C. Service Processor Setup and Test. . . . . . . . . . . . . . . . . . . . 475
Service Processor Setup Checklist. . . . . . . . . . . . . . . . . . . . . . . . . . 475
Testing the Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476
Testing Call-In . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476
Testing Call-Out . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476
Serial Port Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477
Appendix D. Setting Up the System Rack . . . . . . . . . . . . . . . . . . . . . . 479
Rack Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 479
Model 14T/0 and 14T/4 Rack . . . . . . . . . . . . . . . . . . . . . . . . . . 479
Noise Emission Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 480
System Service Clearances . . . . . . . . . . . . . . . . . . . . . . . . . . . 481
286/C4 Rack Installation Inventory. . . . . . . . . . . . . . . . . . . . . . . . . . 481
viii Service Guide
Read the Rack Safety Instructions. . . . . . . . . . . . . . . . . . . . . . . . . . 484
Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 484
Attaching the Mounting Hardware to the Rack Enclosure . . . . . . . . . . . . . . . . . 485
System Rail Assembly (Type 1). . . . . . . . . . . . . . . . . . . . . . . . . . 490
System Rail Assembly (Type 2). . . . . . . . . . . . . . . . . . . . . . . . . . 493
286/C4 Removal and Installation into a Rack. . . . . . . . . . . . . . . . . . . . . . 496
System Drawer Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 496
Installing the 286/C4 into the Rack Enclosure. . . . . . . . . . . . . . . . . . . . . 497
Installing the Cable-Management Arm . . . . . . . . . . . . . . . . . . . . . . . . 501
Appendix E. Installing a Hardware Management Console (HMC) . . . . . . . . . . . . . 503
Position the HMC and Monitor . . . . . . . . . . . . . . . . . . . . . . . . . . . 503
Connect the Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 504
Connect the 8-Port Adapter Cables . . . . . . . . . . . . . . . . . . . . . . . . . 506
Connect the External Modem . . . . . . . . . . . . . . . . . . . . . . . . . . . 506
Check the Microswitch Setting on the Modem . . . . . . . . . . . . . . . . . . . . 507
Connect the LAN Cable (If Applicable) . . . . . . . . . . . . . . . . . . . . . . . . 508
Plug in the HMC Power Cable . . . . . . . . . . . . . . . . . . . . . . . . . . . 508
Configure the Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 508
Configure Inventory Scout Services . . . . . . . . . . . . . . . . . . . . . . . . . 508
Collect Vital Product Data Information . . . . . . . . . . . . . . . . . . . . . . . 509
Configure the Service Agent . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510
Appendix F. Modem Configurations . . . . . . . . . . . . . . . . . . . . . . . . 513
Sample Modem Configuration Files . . . . . . . . . . . . . . . . . . . . . . . . . 513
Configuration File Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 513
Transfer of a Modem Session . . . . . . . . . . . . . . . . . . . . . . . . . . . 517
Modem Configuration Sample Files . . . . . . . . . . . . . . . . . . . . . . . . . 519
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 535
Contents ix
x Service Guide
Safety Notices
A danger notice indicates the presence of a hazard that has the potential of causing death or serious
personal injury. Danger notices appear on the following pages:
v xii
v 89
v 90
v 357
v 436
A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor
personal injury. Caution notices appear on the following pages:
v xii
v xiii
v 89
v 357
Note: For a translation of these notices, see System Unit Safety Information, order number SA23-2652.
Rack Safety Instructions
v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C.
v Do not install this unit in a rack where the air flow is compromised. Any side, front or back of the unit
used for air flow through the unit must not be in direct contact with the rack.
v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical
loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before
installing or removing this unit.
v Consideration should be given to the connection of the equipment to the supply circuit so that
overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the
correct power connection to the rack, refer to the rating labels located on the equipment in the rack to
determine the total power requirement for the supply circuit.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the
system or the devices that attach to the system. It is the responsibility of the customer to ensure that
the outlet is correctly wired and grounded to prevent an electrical shock.
xi
Electrical Safety
Observe the following safety instructions any time you are connecting or disconnecting devices attached to
the workstation.
In the system you are about to setup or service:
v The ac power interface connector is considered the main power disconnect device.
v This system has redundant power supply capabilities, meaning that it has the ability to have two power
supplies running simultaneously in the same system unit. When instructed to disconnect the power
source, ensure that all power cables have been unplugged.
DANGER
To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before
relocating the system.
D01
CAUTION:
This product is equipped with a three-wire power cable and plug for the user’s safety. Use this
power cable with a properly grounded electrical outlet to avoid electrical shock.
C01
DANGER
To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before
relocating the system.
D01
xii Service Guide
Laser Safety Information
CAUTION:
This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1
laser products.
C30
Laser Compliance
All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class
1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition
1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and
approval information.
CAUTION:
All laser modules are designed so that there is never any human access to laser radiation above a
class 1 level during normal operation, user maintenance, or prescribed service conditions. Data
processing environments can contain equipment transmitting on system links with laser modules
that operate at greater than class 1 power levels. For this reason, never look into the end of an
optical fiber cable or open receptacle. Only trained service personnel should perform the
inspection or repair of optical fiber cable assemblies and receptacles.
C25, C26
Safety Notices xiii
xiv Service Guide
Data Integrity and Verification
These computer systems contain mechanisms designed to reduce the possibility of undetected data
corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages,
system failures, power fluctuations or outages, or component failures must verify the accuracy of operations
performed and data saved or transmitted by the system at or near the time of the outage or failure. In
addition, users must establish procedures to ensure that there is independent data verification before relying
on such data in sensitive or critical operations. Users should periodically check our support websites for
updated information and fixes applicable to the system and related software.
xv
xvi Service Guide
About This Book
This book provides maintenance information that is specific to the 286/C4 and 286/E4, as well as adapters
and attached devices that do not have their own service information. It also contains Maintenance Analysis
Procedures (MAPs) that are not common to other systems. In this book, the 286/C4 and 286/E4 are
hereafter referred to as the system.
MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus
Systems.
This book is used by the service representative to repair system failures. This book assumes that the
service representative has had training on the system.
Highlighting
The following highlighting conventions are used in this book:
Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items
whose names are predefined by the system. Also identifies graphical objects such as buttons,
labels, and icons that the user selects.
Italics Identifies parameters whose actual names or values are to be supplied by the user.
Monospace Identifies examples of specific data values, examples of text similar to what you might see
displayed, examples of portions of program code similar to what you might write as a
programmer, messages from the system, or information you should actually type.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing of this product.
Related Publications
The following publications provide additional information about your system:
v The 286/C4 and 286/E4 Installation Guide, order number 86 A1 40EG, contains information on how to
set up and cable the system, install and remove options, and verify system operation.
v The 286/C4 and 286/E4 User’s Guide, order number 86 A1 41EG, contains information to help users use
the system, use the service aids, and solve minor problems.
v The D20 Installation Guide, order number 86 A1 39EG contains information on how to set up, cable, install
and remove options, and verify the subsystem operations.
v The D10 and D20 Service Guide, order number 86 A1 38EG, contains reference information,
maintenance analysis procedures (MAPs), error codes, removal and replacement procedures, and a
parts catalog.
v The Diagnostic Information for Multiple Bus Systems, order number 86 A1 26HX, contains diagnostic
information, service request numbers (SRNs), and failing function codes (FFCs).
v The Adapters Information for Multiple Bus Systems, order number 86 A1 27HX, contains information about
adapters for your system. This manual is intended to supplement the service information found in the
Diagnostic Information for Multiple Bus Systems.
v The Hardware Management Console Installation and Operations Guide, order number 86 A1 83EF,
contains information on how to set up a hardware management console and how to use it to manage a
system.
v The Site Preparation for Rack Systems, order number 86 A1 30PX, contains information to help you plan
your installation.
xvii
v The System Unit Safety Information, order number 86 X1 11WD, contains translations of safety
information used throughout this book.
xviii Service Guide
Notes:
In this manual, 286/C4 stand for PL420R and 286/E4 stands for PL420T.
The PL420 systems do not have the 4-slotted PCI riser card, but only the 6-slotted one.
The PL420 systems have only the POWER4+ 1.2 and 1.45 GHz processors.
Chapter 1. Reference Information
This chapter provides an overview of the system, including a logical description and a physical overview.
The following additional details pertaining to the system are also provided:
v Memory overview and placement
v General description of the operator panel
v Cabling rules
v System location rules and descriptions
v Power flow
v Data flow
Overview
The 286/C4 and 286/E4 have similar operational capabilities but are packaged differently.
The 286/E4 is a deskside system while the 286/C4 is a 4 EIA-unit drawer designed to be housed in a
19-inch rack. Both models can be configured as a 1-way to 4-way symmetric multiprocessing (SMP)
system; each can have up to 4 hot-plug disk drives and 6 hot-plug PCI adapters. Both models support
redundant hot-plug power supplies, cooling fans, and PCI adapters. Power is provided through ac power
cords. Either model can be set up to use 100-127 V ac or 200-240 V ac power. The two media bays can
be used to accommodate any of the following drives:
v SCSI DVD-RAM drive
v IDE DVD-ROM drive
v IDE CD-ROM drive
v Diskette drive (special mounting carriage needed)
v Tape drive
Note: A CD-ROM is recommended for use in the base configuration of either model.
1
System Features
Bus Architecture
Depending on your system configuration, four or six PCI-X slotted riser cards are available. The slots are
64-bit capable at 133 MHz, 3.3 volts. If your system has four PCI-X slots, use slots 3 and 4 with PCI
adapters that require high performance. Use slots 1 and 2 with PCI adapters that do not require high
performance. On the six-slotted PCI riser cards, all slots are equal. All slots are capable of using the
full-length PCI-X adapters. The PCI-X slots are numbered on the rear of the chassis, from left to right.
When equipped with POWER4+ processors and a six-slotted PCI riser card, the 286/C4 and 286/E4
include six hot-plug PCI-X slots, dual integrated Ultra3 SCSI controllers, dual 10/100 Mbps integrated
Ethernet controllers, and four front-accessible disk bays supporting hot-swappable disks. These disk bays
can accommodate up to 587.2 GB of disk storage using 146.8 GB Ultra3 SCSI disk drives. Two media
bays are used for a CD-ROM, DVD-RAM, DVD-ROM, or another optional media device, such as a tape or
diskette drive. The Converged Service Processor 2 (CSP), including system power control, is also
integrated, along with the native I/O functions such as serial ports, keyboard, and mouse. Also included
are an enhanced I/O subsystem with the implementation of the remote I/O (RIO) interconnect and PCI-X
bus protocols.
The 286/C4 can support up to two high-density D20 I/O drawers to provide additional PCI-X slots and disk
drive bays. A fully configured system with two I/O drawers has 20 PCI-X slots and 4.1 TB of disk space.
Optionally redundant hot-plug cooling fans and power supplies are available.
All PCI-X slots can run either 64-bit or 32-bit adapters. However, a 32-bit adapter operates in a 32-bit
mode and shows no performance advantage while running in a 64-bit slot. The following illustration shows
the PCI adapter slot locations when viewing from the rear of the system.
1 286/C4 Using a 4-Slotted PCI Riser
Card
3 286/E4 Using a 4-Slotted PCI Riser
Card
2 286/C4 Using a 6-Slotted PCI Riser
Card
4 286/E4 Using a 6-Slotted PCI Riser
Card
2 Service Guide
Processors
v One-way and 2-way processor cards with GP 1.0 GHz POWER4 microprocessors with 32 MB of L3
cache per processor card. Only a single 1-way card is allowed per server at a time. A 3-way
configuration is not supported.
v One-way and 2-way processor cards with GQ 1.45 GHz POWER4+ microprocessors with 8 MB of L3
cache per processor card. Only a single 1-way card is allowed per server at a time. A 3-way
configuration is not supported. POWER4 and POWER4+ cards cannot be intermixed.
v A 1-way to a 2-way upgrade is available by feature conversion.
v A POWER4 to POWER4+ upgrade is available by feature conversion and requires the redundant
cooling option, a 6-slot riser card, and a RIO-enabled planar.
v
L3 cache capacity per 1.0 GHz GP processor configuration:
– One-way configuration = 1 L3 chip with a 32 MB memory capacity
– Two-way configuration = 1 L3 chip with a 32 MB memory capacity
– Four-way configuration = 2 L3 chips per processor, each with 32 MB memory (64 MB memory
capacity)
v L3 cache capacity per 1.45 GHz GQ processor configuration:
– One-way configuration = 1 L3 chip with an 8 MB memory capacity
– Two-way configuration = 1 L3 chip with an 8 MB memory capacity
– Four-way configuration = 2 L3 chips per processor, each with 8 MB memory (32 MB memory
capacity)
Memory
v 1 GB to 32 GB ECC (error code correction) DDR (double data rate) SDRAM (synchronous dynamic
random-access memory).
v Memory DIMMs plug into the processor cards (8 DIMM slots per card).
v DIMMs must be populated in quads (four DIMMs). A memory feature consists of a quad. Additional
quads can consist of any memory size.
v A system with a single processor card (1-way or 2-way) can have a maximum of 16 GB of memory.
Media Drives
Typical system configuration consists of two media bays. One of the media bays can be configured with
either IDE CD-ROM or IDE DVD-ROM.
v Media bay 1 can accommodate an IDE CD-ROM or IDE DVD-ROM.
v Media bay 2 can accommodate a SCSI DVD-RAM, diskette drive, or tape drive.
Note: The SCSI DVD-RAM can read CD-ROM installation media.
The following media drives are available for the 286/C4 and 286/E4.
– Diskette drive
– DVD-RAM drive 4.7 GB capacity
– 650 MB IDE 48x CD-ROM drive
– 4.7 GB IDE 16x/48x DVD-ROM drive
– 80/160 GB VXA tape drive
– 8 mm 60/150 GB tape drive
– 4 mm 20/40 GB tape drive
Chapter 1. Reference Information 3
Hot-Pluggable Disk Drives
Four hot-plug disk-drive bays:
v 18.2 GB to 587.2 GB of disk storage
v The following disk-drive sizes and speeds are available:
– 18.2 GB Ultra3 10K RPM 1 inch
– 36.4 GB Ultra3 10K RPM 1 inch
– 73.4 GB Ultra3 10K RPM 1 inch
– 146.8 GB Ultra3 10K RPM 1 inch
– 36.4 GB Ultra3 15K RPM 1 inch
– 73.4 GB Ultra3 15K RPM 1 inch
PCI-X Slots and Integrated Adapters
The following devices are included in the 286/C4 and 286/E4:
v Six hot-plug PCI-X slots, 64-bit, 133 MHz, 3.3 volt on POWER4+ processor models, four slots on
POWER models.
v Integrated ports:
– Two 10/100 Ethernet (IEEE 802.3 compliant).
– Two Ultra3 SCSI (one external Ultra3 SCSI (with VHDCI 4 mini 68-pin port), and one internal Ultra3
SCSI disk drive backplane). VHDCI may require a mini 68-pin connector or FC 2118 mini 68-pin to
68-pin 0.3 meter cable as an additional feature.
– Three serial. Serial port 1 (S1) has two physical connectors, one RJ-48 connector located in front on
the operator panel and a 9-pin D-shell connector located on the rear of the chassis. The use of the
front port disables the rear S1 port.
– One parallel (the parallel port is not accessible when in partitioned mode).
– Keyboard and mouse.
I/O Expansion Drawer
Expansion is provided by adding additional 11D/20 I/O drawers (up to two) for the 286/C4. The D20
provides additional PCI-X slots and disk drive bays for the 286/C4.
v Seven hot-plug PCI-X slots, 64-bit, 133 MHz, 3.3 volt
v Up to 12 hot-pluggable disk drive bays
The optional disk backplane consists of two 6-pack disk bays and requires SCSI cables to connect a
required Ultra3 SCSI PCI adapter or Ultra3 SCSI RAID adapter, in slot 7 to each of the 6-packs, or two
SCSI adapters (one in slot 4 and one in slot 7 of the I/O drawer).
Power Supplies
Up to two power supplies maximum per system; the second power supply is for redundant power.
v AC input type 100-127 V ac or 200-240 V ac
v Single phase
v Frequency = 47 Hz - 63 Hz
Keyboard
v Standard: 101-key enhanced keyboard
v 101/102 or 106-key enhanced keyboard is also available
Mouse
v Three-button
4 Service Guide
Operator Panel
v 32-character LED diagnostics display
v LEDs for power on, attention, SCSI activity, and LAN activity
v Buttons for power on, system reset, and service processor reset switch:
System Reset Button
Dumps AIX (if dump is enabled) and reboots
Service Processor Reset Switch
Resets the service processor (into standby mode). The service processor reset switch is the
pinhole located on the operator panel above the system reset button.
Input/Output Ports
v One 25-pin parallel
v Keyboard
v Mouse
v Two Ultra3 SCSI (one to internal 4-pack disk drive backplane, one to external VHDCI mini 68-pin
connector)
v Integrated Drive Electronics (IDE)
v Two System Power Control Network (SPCN) Connectors
v Two RIO Connectors
v Two 10/100 Ethernet (IEEE 802.3 compliant)
v Three serial ports. Serial port 1 (S1) has two physical connectors, one RJ-48 connector located in front
on the operator panel, and a 9-pin D-shell connector located on the rear of the chassis. The other two
serial ports are 9-pin D-shell connectors located on the rear of the system chassis.
1 Serial Port 1 Connector 3 Serial Port 3 Connector
1a Serial Port 1 Front Connector (RJ-48) 4 Parallel Connector
2 Serial Port 2 Connector
Chapter 1. Reference Information 5
Current usage for the serial port connectors are as follows:
Serial Port Number Location Examples of Applicable Usage
Serial Port 1 (S1
Front)
Operator Panel Service Agent, PDA system management applications (for example,
handheld devices, laptop systems), Service Processor menus, ASCII
terminal for the operating system console, and modems
Serial Port 1 (S1
Rear)
Rear of the System Service Processor menus, Service Agent, PDA system management
applications (interface cable required), ASCII terminal for the
operating system console, and modems
Serial Port 2 (S2) Rear of the System Service Processor menus, HACMP, ASCII terminal for the operating
system console, and modems
Serial Port 3 (S3) Rear of the System HACMP, UPS (uninterruptible power supply), ASCII terminal for the
operating system console, and modems
Notes:
1. Serial port 1 is never used to run HACMP or attach a UPS. If you are configuring your system to run
HACMP with a UPS attached, you must connect the HACMP cable to serial port 2 and the UPS
cable to serial port 3. Do not run UPS connected to serial port 2. If you disconnect HACMP, you
must reset the service processor using the pinhole reset switch before running another application.
The service processor pinhole reset switch is located on the operator panel.
2. A modem can be attached to any of the serial ports.
Security Features
On the 286/C4 and 286/E4, you can set the following types of passwords to limit access to these systems:
v General-access password - set only from the service processor menus. It provides limited access to the
service processor menus and is usually available to all users who are allowed to power on the system.
v Privileged-access password - Set from the service processor menus or from System Management
Services (SMS) utilities. Used by the system administrator or root user and allows access to all service
processor functions.
Hardware Management Console (HMC)
The Hardware Management Console (HMC) is an optional feature that allows you to manage configuration
and operation of partitions in a system, as well as add and remove hardware without interrupting system
operation.
In this book, a system that is managed by the HMC is referred to as the managed system. The HMC uses
its serial connection to the managed system to perform various functions. The HMC’s main functions
include the following:
v Detecting, reporting, and storing changes in hardware conditions
v Acting as a service focal point for service representatives to determine an appropriate service strategy
Service representatives use the Service Focal Point application on the HMC to start and end their service
calls. Service Focal Point provides service representatives with serviceable event information, vital product
data (VPD), and diagnostic information.
The HMC is a closed system. Additional applications cannot be loaded on the HMC. All the tasks needed
to maintain the platform, the underlying operating system, and the HMC application code are available by
using the HMC’s management applications.
6 Service Guide
Powering the System on and Off
For procedures to power off and power on the system for various system configurations and operating
systems, see “Stopping the System without an HMC Attached” on page 359 and “Starting the System
without an HMC Attached” on page 360.
Console Strategy
Note: This section applies to a system that does not have an HMC attached.
The firmware starts a console-selection sequence at system boot time if any of the following is true:
v A console has not yet been selected.
v A previous console-selection sequence timed out.
v A change in the system configuration affects the console (for example, keyboard installed or removed,
mouse installed or removed, graphics adapter installed/removed or moved to another PCI slot).
The console-selection sequence allows you to select (from the appropriate input device) any one of the
available console devices. If no console is selected within approximately 60 seconds, serial port 1 (S1) is
selected as the console and the selection sequence times out.
Attention: If an ASCII terminal is attached to serial port 1 (S1), and there is any interaction with this
terminal:
v After OK displays in the operator panel
AND
v Before the power-on sequence is initiated
the firmware will use this terminal as the console, regardless of the previous console selection.
After a console has been selected, the console-selection sequence is started at boot time only if there is a
change in the system configuration (as previously described), or the contents of the system’s nonvolatile
memory (NVRAM) are lost.
Note: Moving an ASCII terminal from one serial port to another (from S1 to S2) cannot be detected by the
firmware, so it does not constitute a configuration change.
You can also initiate a system console-selection sequence from the SMS menus.
Power-On Self-Test
After power is turned on and before the operating system is installed, the system does a power-on self-test
(POST). This test performs checks to ensure that the hardware is functioning correctly before the
operating system is installed. During the POST, a POST screen displays, and POST indicators appear on
the firmware console (if one is connected). The next section describes the POST indicators and functions
that can be accessed during the POST.
POST Indicators
POST indicators indicate tests that are being performed as the system is preparing to load the operating
system. The POST indicators are words that display on the system console. Each time that the system
starts a different step in the POST, a POST indicator word appears on the console. Each word is an
indicator of the tests that are being performed.
Chapter 1. Reference Information 7
The POST screen displays the following words:
Memory Memory test
Keyboard Initialize the keyboard and mouse. The time period for pressing a key to access the
System Management Services, or to initiate a service mode boot is now open. See “POST
Keys” for more information.
Network Self-test on network adapters
SCSI Adapters are being initialized
Speaker Sounds an audible tone at the end of POST
POST Keys
The POST keys, if pressed after the keyboard POST indicator displays and before the last POST indicator
speaker displays, cause the system to start services or to initiate service mode boots used for configuring
the system and diagnosing problems. The keys are described below:
Note: The program function keys (F1-F12) on a keyboard attached to the CEC drawer are not used and
will be ignored. After the keyboard POST indicator displays, you must use the numeric number
keys.
1 Key
The numeric 1 key, when pressed during POST, starts the System Management Services (SMS) interface.
5 Key
The numeric 5 key, when pressed during POST, initiates a system boot in service mode using the default
service mode boot list.
This mode attempts to boot from the first device of each type found in the list. It does not search for other
bootable devices of that type if the first device is not bootable. Instead, it continues to the next device type
in the list. The firmware supports up to five entries in the boot list.
Note: This is the preferred method of loading standalone diagnostics from CD-ROM.
The default boot sequence is:
1. Diskette (if installed)
2. CD-ROM (if installed)
3. Hard file
4. Tape drive (if installed)
5. Network
a. Token ring
b. Ethernet
6 Key
The numeric 6 key works like the numeric 5 key, except that firmware uses the customized service mode
bootlist that was set up in the AIX service aids or the service processor menus.
Note: This is the preferred method of loading online diagnostics from the boot hard disk.
8 Service Guide
System Unit Locations
286/C4 Front View
1 Operator Panel 4 Media Bay
2 Front Serial Port 5 Hot-Plug Disk Drives
3 CD-ROM Drive (optional)
286/C4 Rear View
1 PCI-X Slot Access 11a RIO-G0 Connector
2 Redundant Power Cable Receptacle 11b RIO-G1 Connector
3 Primary Power Cable Receptacle 12 Serial Connector 2
4 System Power Control Network Connector
SPCN2
13 Serial Connector 3
5 System Power Control Network Connector
SPCN1
14 Serial Connector 1
6 Parallel Connector 15 Mouse Connector
7 Hardware Management Console Connector
HMC2
16 Keyboard Connector
8 Hardware Management Console Connector
HMC1
17 Rack Indicator
9 Ethernet Connector 1 18 External SCSI Connector
10 Ethernet Connector 2
Chapter 1. Reference Information 9
286/E4 Front View
1 Operator Panel 4 Media bay
2 Front Serial Connector 5 Hot-Plug Disk Drives
3 CD-ROM Drive
10 Service Guide
Rear View
1 PCI-X Slot Access 10 Ethernet Connector 2
2 Redundant Power Cable Receptacle 11 RIO Connector Cover
3 Primary Power Cable Receptacle 12 Serial Connector 2
4 System Power Control Network Connector
SPCN2
13 Serial Connector 3
5 System Power Control Network Connector
SPCN1
14 Serial Connector 1
6 Parallel Connector 15 Mouse Connector
7 Hardware Management Console Connector
HMC2
16 Keyboard Connector
8 Hardware Management Console Connector
HMC1
17 Rack Indicator
9 Ethernet Connector 1 18 External SCSI Connector
Chapter 1. Reference Information 11
Power Supply Locations
286/C4
1 Power Supply #1 (Primary) 3 AC Good LED (Green)
2 Power Supply #2 (Redundant) 4 DC Good LED (Green)
5 Fault LED (Amber)
12 Service Guide
286/E4
1 Power Supply #2 (Redundant) (Front
View)
5 AC Good LED (Green)
2 Power Supply #1 (Primary) (Front
View)
6 Redundant Power Supply (Rear View)
3 Fault LED (Amber) 7 Primary Power Supply (Rear View)
4 DC Good LED (Green)
Note: The preceding illustration does not show the LEDs located on the top of the power supplies. For location
references, see the illustration on page 12 showing the power supplies for the 286/C4.
Chapter 1. Reference Information 13
Fan Locations
The following illustration identifies the system cooling fans. The fans have an amber LED located on top of
the housing. A lit amber LED indicates that the fan is not operating correctly.
The power supplies have two groups of three LEDs located on the front and top. The amber LED on the
power supply indicates a problem with the power supply. If a power supply is diagnosed with a cooling
problem, the entire power supply must be replaced.
1 Processor Card Cooling Fan (Primary Fan
#1 )
4 PCI Adapter Cooling Fan (Fan #3)
2 Processor Card Cooling Fan (Redundant
Fan #2)
5 Stacked Power Supply Cooling Fans
3 PCI Adapter Cooling Fan (Fan #4)
Note: The preceding illustration does not show the two cooling fans integrated into each power supply.
14 Service Guide
CEC Backplane Locations
The following illustration of the CEC backplane identifies the primary connectors used in your system.
1 CEC Backplane 17 Serial Port #3 Connector
2 Power Supply #1 (Primary) Connectors 18 Mouse
3 Power Supply #2 (Redundant) Connectors 19 Keyboard
4 SPCN1 Connector 20 Rack Indicator Connector
5 SPCN2 Connector 21 PCI Riser Card Connectors
6 HMC2 Connector 22 Battery Connector
7 Parallel Port Connector 23 For Manufacturing Use Only
8 HMC1 Connector 24 PCI Cooling Fan #3 and #4 Connector
9 Ethernet #1 Connector 25 IDE CD-ROM Signal Connector
10 Ethernet #2 Connector 26 Diskette Drive Signal Connector
11 Connector RIO-G1 Connector 27 Diskette Drive Power Connector
12 Processor Cooling Fan #1 Connector
(Primary Fan)
28 Media Power Connector
13 Processor Cooling Fan #2 Connector
(Redundant Fan)
29 Disk Drive Backplane Power Connector
14 Connector RIO-G0 Connector 30 Processor Card #2 Connector
15 Serial Port #2 Connector 31 For Manufacturing Use Only
16 Serial Port #1 Connector 32 Operator Panel Connector
33 Processor Card #1 Connector
Chapter 1. Reference Information 15
PCI Riser Card Locations
Depending on your system configuration, four-slotted or six-slotted PCI-X riser cards are available. The
slots are 64-bit capable at 133 MHz, 3.3 volts. If your system has four PCI-X slots, use slots 3 and 4 with
PCI adapters that require high performance. Use slots 1 and 2 with PCI adapters that do not require high
performance. On the six-slotted PCi riser cards, all slots are equal. All slots are capable of using the
full-length PCI-X adapters. The PCI-X slots are numbered on the rear of the chassis, from left to right.
The PCI card has two integrated SCSI controllers. One SCSI controller (PCI, 64-bit) is a SYM53C1010
that operates at 66MHz and has dual LVD (low voltage differential) ports. The other is a 53C875, SE-SCSI
controller and is only available on the 6 slotted PCI riser card. The internal port from SCSI controller
SYM53C1010 connects to the DASD board through a 68-pin SCSI cable. The port from the SE-SCSI
controller is available only on the six–slotted PCI riser card, and connects to the media bay through a
68-pin SCSI cable.
All PCI-X slots can run either 64-bit or 32-bit adapters. However, a 32-bit adapter operates in a 32-bit
mode and shows no performance advantage while running in a 64-bit slot.
The following illustrations show the PCI adapter slot locations when viewing from the rear of the system.
1 286/C4 Using a 4-Slotted PCI-X Riser
Card
3 286/E4 Using a 4-Slotted PCI-X Riser
Card
2 286/C4 Using a 6-Slotted PCI-X Riser
Card
4 286/E4 Using a 6-Slotted PCI-X Riser
Card
16 Service Guide
The following illustrations show the connector and slot locations on the four-slotted PCI riser card and
six-slotted PCI riser card, respectively.
Four-Slotted PCI Riser Card
1 PCI Slot 1 7 4 Slotted PCI Riser Card
2 PCI Slot 2 8 ²Push² Labels
3 PCI Slot 3 9 Internal Disk Drive SCSI Connector (for internal
disk drive use only)
4 PCI Slot 4 10 External SCSI Connector
5 Manufacturing Use Only 11 Removal Sling
6 Riser Card to CEC Backplane Standoff
Connectors (Quantity 4) located on the
underside of the PCI Riser Card
12 Riser Card to Disk Drive Backplane Signal
Connector
Six-Slotted PCI Riser Card
1 PCI Slot 1 9 6-Slotted PCI Riser Card
2 PCI Slot 2 10 ²Push² Labels
3 PCI Slot 3 11 Internal Disk Drive SCSI Connector (for internal
disk drive use only)
4 PCI Slot 4 12 Single-Ended SCSI to Media Connector
5 PCI Slot 5 13 External SCSI Connector
6 PCI Slot 6 14 Removal Sling
7 Manufacturing Use Only 15 Riser Card to Disk Drive Backplane Signal
Connector
8 Riser Card to CEC Backplane Standoff
Connectors (Quantity 4) located on the
underside of the PCI Riser Card
Chapter 1. Reference Information 17
Memory DIMM Location
The memory DIMMs used in the 286/C4 and 286/E4 are located on the processor card. Your system
contains either one or two processor cards, each card with 8 memory DIMM slots. Associated with each
memory DIMM slot is an LED, which is located on the edge of the processor card. A lit LED indicates a
problem with a specific memory DIMM. The following rules must be followed when removing or installing
memory DIMMs:
v Memory DIMMs must be installed in quads. However, you do not need to install four new DIMMs in a
quad when you have a memory problem. Swap out each DIMM, starting the system after each change,
until the faulty DIMM has been replaced.
Note: A quad is a group of four memory DIMMs. Each memory DIMM in a quad must be the same type
and size. However, both quads on the same processor card are not required to be the same.
v The system must have at least four memory DIMMs installed (one quad).
v The quad slot combinations are 1, 3, 6, and 8 (or 2, 4, 5, and 7).
Note: 1, 3, 6, and 8 (or 2, 4, 5, and 7) combinations are the only possible slot combinations recognized
by the system.
v On a two-processor-card system, either processor card can be populated with memory.
Note: A processor card containing no memory is still functional. (For example, a system containing two
processor cards has all of its memory installed on processor card 1, and no memory installed on
processor card 2. Processor card 2 has no memory-storage capacity, but functions as a
processor card.)
The following are valid memory configurations for one and two processor cards:
Number of Processor Cards Allowable Memory Configurations
1 Processor Card 1 GB through 16 GB installed in quads of 256 MB DIMMs
to 2 GB DIMMs
2 Processor Cards 1 GB through 32 GB installed in quads of 256 MB DIMMs
to 2 GB DIMMs
Note: The maximum amount of system memory varies by model.
The following processor card illustration shows the memory DIMM slot locations, as well as the LED
location associated with each memory DIMM slot.
1 Memory Module Slot 1 (J2A) 6 Memory Module Slot 6 (J1A)
2 Memory Module Slot 2 (J2B) 7 Memory Module Slot 7 (J0B)
3 Memory Module Slot 3 (J3A) 8 Memory Module Slot 8 (J0A)
4 Memory Module Slot 4 (J3B) 9 Processor Card
5 Memory Module Slot 5 (J1B) 10 Memory Module Fault LEDs
18 Service Guide
Reading the Operator Panel and System LEDs
The following illustration shows the component location on the operator panel.
1 Power-On Button 6 Operator Panel Display
2 Power LED 7 (FS1) Front Serial Connector
(RJ-48 Connector)
3 Attention LED 8 Service Processor Reset Switch
(Pinhole)
4 SCSI Port Activity 9 System Reset Button
5 Ethernet Port Activity
Index Component Name Component Description
1 Power-On Button Turns the system power on and off.
2 Power LED Blinking - When connected to the power source (System is
in standby mode).
Solid - When power button has been pressed.
Note: There is approximately a 30 second transition period
from the time the power button is pressed to when the
power LED goes from blinking to on solid. During the
transition period, you may observe the blinking intervals
speed up.
3 Attention LED Normal State - LED is off.
4 SCSI Port Activity Normal State - LED is on when there is SCSI activity.
5 Ethernet Port Activity Normal State - LED is on when there is Ethernet activity.
6 Operator Panel Display Displays current status of system startup, or diagnostic
information in the event of a hardware problem.
7 Front Serial Connector (FS1) Serial port uses RJ-48 connector. Use to plug in external
devices at the front of the system unit.
8 Service Processor Reset Switch
(Pinhole)
Service Personnel Use Only
9 System Reset Button Resets the system
For more information about other system LEDs, see “Component LEDs” on page 466.
Chapter 1. Reference Information 19
SCSI IDs and Bay Locations
Note: To show the disk drive and disk drive bay locations, the 286/C4 and 286/E4 are shown with their
bezels removed.
Index Bay Location Drive Name SCSI ID
1 Operator Panel
2 D01 IDE CD-ROM (Default)
or IDE DVD-ROM (Optional)
IDE (Non-SCSI)
SCSI ID 1
3 D02
Tape Drive (Optional) SCSI ID 0
SCSI DVD-RAM (Optional) SCSI ID 1
Diskette Drive (Optional) Not Applicable
4 D10 Disk Drive SCSI ID 8
5 D11 Disk Drive SCSI ID 9
6 D12 Disk Drive SCSI ID 10
7 D13 Disk Drive SCSI ID 11
Notes:
1. The SCSI bus IDs are the recommended values and indicate how the IDs are set when the system is
shipped from the factory. Field installations might not comply with these recommendations.
2. Media bay location D02 is designed to house one of the following optional devices:
v Tape Drive
v SCSI DVD-RAM
v Diskette Drive
20 Service Guide
System Logic Flow
When viewing the following system logic flow illustrations, select the diagram that is closest to your system
configuration.
Systems Configured with a GP Processor and Four-Slotted PCI Riser
Card
Chapter 1. Reference Information 21
Systems Configured with a GQ Processor and Six-Slotted PCI Riser
Card
22 Service Guide
Location Codes
This system unit uses physical location codes in conjunction with AIX location codes to provide mapping of
the failing field replaceable units. The location codes are produced by the system unit’s firmware and AIX.
Note: When the system is running under the Linux operating system no ²Linux location codes² exist, but
physical location codes might display in the Linux error logs.
Physical Location Codes
Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical
functions, such as connectors or ports) to their specific locations within the physical structure of the
platform.
Location Code Format
The location code is an alphanumeric string of variable length, consisting of a series of location identifiers,
separated by a dash (-), or slash (/), or a pound sign (#) character. The series is hierarchical; that is, each
location identifier in the string is a physical or logical child of the one preceding it.
v The - (dash) separator character represents a normal structural relationship where the child is a
separate physical package and it plugs into (or is connected to) the parent. For example, P1-C1 is a
processor card (C1) plugged into a planar (P1), or P1-M1 is a memory card (M1) plugged into a planar
(P1).
v The / (slash) separator character separates the base location code of a function from any extended
location information. A group of logical devices can have the same base location code because they are
all on the same physical package, but may require extended location information to describe the
connectors they support. For example, P2/S1 describes the location of the serial port 1 controller and its
connector (S1), which is located on planar P2 (its base location code), but the / indicates that further
devices can be connected to it at the external S1 serial connector. The keyboard controller and its
connector likewise have location code P2/K1, which means they have the same base location code (P2)
as serial port 1, but a different external connector. In contrast, the location code P2-K1 actually points to
the device connected to connector K1; that is, the keyboard. The location code P2/Z1 indicates an
integrated SCSI controller which drives connector Z1, while location codes of P2-Z1-... point to the
actual SCSI bus and devices.
v The # (pound sign) separator character indicates a cable connection between a connector and parent.
The following are examples:
v U0.1-P1-C1 identifies processor card C1 plugged into the CEC backplane P1.
v U0.1-P1-C2-M1 identifies memory card M1 plugged into slot 1 on processor card C2 connected to the
CEC backplane P1.
v U0.1-P1/S1 identifies serial port 1 controller on the CEC backplane P1 or the connector for serial port 1.
v U0.1-P1-K1 identifies a keyboard K1, attached to connector K1 on CEC backplane P1.
v U0.1-P2/Z1 identifies an integrated SCSI port controller on PCI riser card P2 that drives internal SCSI
drive connector Z1.
v U0.1-P2-Z1-A8... points to the actual SCSI bus and devices attached to Z1; in this case, a hot-plug disk
drive in bay location D10 with a SCSI ID of 8.
Chapter 1. Reference Information 23
AIX Location Codes
Note: When the system is running under the Linux operating system, AIX location codes are not available.
The basic formats of the AIX location codes are as follows:
v For non-SCSI devices/drives:
– AB-CD-EF-GH
v For SCSI devices/drives:
– AB-CD-EF-G,H
Non-SCSI Devices/Drives
For planars, cards, and non-SCSI devices, the location code is defined as follows:
AB-CD-EF-GH
| | | |
| | | Device/FRU/Port ID
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
v The AB value identifies a bus type or PCI parent bus as assigned by the firmware.
v The CD value identifies adapter number, the adapter’s devfunc number, or physical location. The
devfunc number is defined as the PCI device number times 8, plus the function number.
v The EF value identifies a connector.
v The GH value identifies a port, address, device, or FRU.
Adapters and cards are identified only with AB-CD.
The possible values for AB are:
00 Processor bus
01 ISA bus
02 EISA bus
03 MCA bus
04 PCI bus used in the case where the PCI bus cannot be identified
05 PCMCIA buses
xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in the range of 0-9, A-H,
J-N, P-Z (O, I, and lowercase are omitted) and are equal to the parent bus’s ’ibm, aix-location’ open firmware
property.
The possible values for CD depend on the adapter/card:
v For pluggable PCI adapters/cards, CD is the device’s devfunc number (PCI device number times 8, plus
the function number). The C and D are characters in the range of 0-9, and A-F (hex numbers). Location
codes therefore uniquely identify multiple adapters on individual PCI cards.
v For pluggable ISA adapters, CD is equal to the order of the ISA cards defined/configured either by SMIT
or the ISA Adapter Configuration Service Aid.
v For integrated ISA adapters, CD is equal to a unique code identifying the ISA adapter. In most cases,
this code is equal to the adapter’s physical location code. In cases where a physical location code is not
available, CD will be FF.
EF is the connector ID. It is used to identify the adapter’s connector to which a resource is attached.
GH is used to identify a port, device, or FRU. For example:
v For ASYNC devices, GH defines the port on the fanout box. The values are 00 to 15.
v For a diskette drive, H identifies either diskette drive 1 or 2. G is always 0.
v For all other devices, GH is equal to 00.
24 Service Guide
For an integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For example, the
location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01.
SCSI Devices/Drives
For SCSI devices, the location code is defined as follows:
AB-CD-EF-G,H
| | | | |
| | | | Logical Unit address of the SCSI Device
| | | Control Unit Address of the SCSI Device
| | Connector ID
| devfunc Number, Adapter Number or Physical Location
Bus Type or PCI Parent Bus
Where AB-CD-EF are the same as non-SCSI devices.
G defines the control unit address of the device. Values of 0 to 15 are valid.
H defines the logical unit address of the device. Values of 0 to 255 are valid.
A bus location code is also generated as ’00-XXXXXXXX’ where XXXXXXXX is equivalent to the node’s
unit address.
Examples of physical location codes displayed by AIX are as follows:
v Processor card C1 plugged into CEC backplane P1:
U0.1-P1-C1
v Processor card C2 attached to CEC backplane P1:
U0.1-P1-C2
v Memory DIMM in slot 2 of processor card C1 attached to CEC backplane P1:
U0.1-P1-C1-M2
Examples of AIX location codes displayed are as follows:
v Integrated PCI adapter:
10-80 Ethernet
10-60 Integrated SCSI Port 1 (internal)
10-88 Integrated SCSI Port 2 (external)
Chapter 1. Reference Information 25
v Pluggable PCI adapters:
Note: The number of PCI-X slots available is dependent on the number of slots your PCI riser card has
available. Your system can be configured with either a four or six slotted PCI riser card.
20-58 to 20-5F Any PCI card in slot 1
20-60 to 20-67 Any PCI card in slot 2
10-68 to 10-6F Any PCI card in slot 3
10-70 to 10-77 Any PCI card in slot 4
10-78 to 10-7F Any PCI card in slot 5
10-80 to 10-87 Any PCI card in slot 6
v Integrated ISA adapters:
01-D1 Diskette adapter
01-R1 Parallel port adapter
01-S1 Serial port 1 adapter
01-S2 Serial port 2 adapter
01-S3 Serial port 3 adapter
01-K1 Keyboard adapter
v Device attached to SCSI controller:
10-60-00-4,0 Device attached to integrated SCSI Port 1
Multiple FRU Callout Instructions
If an eight-digit error code appears in the operator panel display or in Chapter 5, “Error Code to FRU
Index”, on page 179, a location code for a failing part might also be specified. If the location code includes
a blank space followed by a lowercase x followed by a number, this is an error code with multiple FRU
callouts. This error can typically happen with memory DIMMs or processors, and might involve mixed
types of parts. In this case, check the system’s configuration for FRU part numbers to determine the
appropriate set of FRUs.
To determine the FRU part numbers of the electronic assemblies in the entire system, use the service
processor menus. From the General User menu, select Read VPD Image from Last System Boot, then
enter 90 to display detailed Vital Product Data (VPD).
To determine the FRU part numbers of the electronic assemblies in a partition by using the lscfg -vp | pg
command on the AIX command line to display the detailed VPD of all assemblies. The FRU part number
information for processors and memory DIMMs might be located at the bottom of the command output.
26 Service Guide
FRU Identify LEDs
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include, but are not limited to the following:
v Rack identify LED
v CEC drawer identify LED
v I/O drawer identify LED
v RIO port identify LED
v FRU identify LED
– Power subsystem FRUs
– CEC subsystem FRUs
– I/O subsystem FRUs
v I/O adapter identify LED
v DASD identify LED
The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy,
followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to
locate the failing FRU more easily.
Any identify LED in the system can be flashed when the system is in the failed state with power on by
using the service processor LED Control Menu contained in the “System Information Menu” on page 317.
Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task
through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is
outlined in the Diagnostic Information for Multiple Bus Systems.
If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be
flashed by one of following:
v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System
Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the
open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED,
type FRU-LED-MENU on the command line.
v If the system is logically partitioned, the HMC must be attached. For instructions on activating and
deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management
Console Installation and Operations Guide, order number SA38-0590.
Chapter 1. Reference Information 27
Mapping AIX and Physical Location Codes
Note: When the system is running under the Linux operating system, AIX location codes are not available.
Use the following illustration to aid you in the mapping of a location code to its connector on the system
backplane.
28 Service Guide
Use the following illustration to aid you in the mapping of a location code to its DIMM slot on the processor
cards.
Use the following illustration to aid you in the mapping of a location code to its slot on the PCI riser card.
Four-Slotted PCI Riser Card Location Codes
Chapter 1. Reference Information 29
Six-Slotted PCI Riser Card Location Codes
Disk Drive Backplane Location Codes
Use the following illustration to aid you in the mapping of a location code to its slot on the disk drive
backplane.
30 Service Guide
AIX and Physical Location Code Table
Note: When the system is running under the Linux operating system, AIX location codes are not available.
This section covers the AIX and Physical Location Code tables for the 286/C4 and 286/E4, as well as the
11D/20 I/O Drawer.
286/C4 and 286/E4 Location Code Table
Component Name Physical Location Code AIX Location Code
Central Electronics Complex (CEC)
Rack Frame U0
CEC Drawer U0.1
CEC Backplane U0.1-P1
PCI Riser Card U0.1-P2
Disk Drive Backplane U0.1-P3
Processor Cards
Processor Card 1 U0.1-P1-C1
Processor Card 2 U0.1-P1-C2
Memory DIMMs on Processor Card 1
Memory
DIMM 1
U0.1-P1-C1-M1
Memory
DIMM 2
U0.1-P1-C1-M2
Memory
DIMM 3
U0.1-P1-C1-M3
Memory
DIMM 4
U0.1-P1-C1-M4
Memory
DIMM 5
U0.1-P1-C1-M5
Memory
DIMM 6
U0.1-P1-C1-M6
Memory
DIMM 7
U0.1-P1-C1-M7
Memory
DIMM 8
U0.1-P1-C1-M8
Memory DIMMs on Processor Card 2
Memory
DIMM 1
U0.1-P1-C2-M1
Memory
DIMM 2
U0.1-P1-C2-M2
Memory
DIMM 3
U0.1-P1-C2-M3
Memory
DIMM 4
U0.1-P1-C2-M4
Memory
DIMM 5
U0.1-P1-C2-M5
Memory
DIMM 6
U0.1-P1-C2-M6
Chapter 1. Reference Information 31
Component Name Physical Location Code AIX Location Code
Memory
DIMM 7
U0.1-P1-C2-M7
Memory
DIMM 8
U0.1-P1-C2-M8
Integrated Devices
Debug Connector
(Manufacturing Use Only)
Labeled Debug Connector
Parallel Port U0.1-P1/R1 01-R1
ISA Bus U0.1-P1 1G-18
HMC2 Connector U0.1-P1/S5
HMC1 Connector U0.1-P1/S4
SPCN2 Connector U0.1-P1/Q5
SPCN1 Connector U0.1-P1/Q4
Keyboard Connector U0.1-P1/K1 01-K1-00
Keyboard U0.1-P1/K1-K1 01-K1-00-00
Mouse Connector U0.1-P1/O1 01-K1-01
Mouse U0.1-P1/O1-O1 01-K1-01-00
Ethernet Connector 1 U0.1-P1/E1 14-08
Ethernet Connector 2 U0.1-P1/E2 1L-08
RIO-G1 Connector U0.1-P1/Q3
RIO-G0 Connector U0.1-P1/Q2
Rack Indicator USB Connector U0.1-P1/Q1
Serial Connector 1 U0.1-P1/S1 01-S1
Serial Connector S1F
(Located on Operator Panel)
U0.1-P1/S1 01-S1
Serial Connector 2 U0.1-P1/S2 01-S2
Serial Connector 3 U0.1-P1/S3 01-S3
Diskette Port U0.1-P1/D1 01-D1
Diskette U0.1-P1-D1 01-D1-00-00
CD-ROM (IDE) U0.1-P1/Q6-A0 1G-19-00
Base CD-ROM (IDE) Controller U0.1-P1/Q6 1G-19
Pluggable Adapters
PCI Host Bridge 0 U0.1-P2
PCI Host Bridge 1 U0.1-P2
PCI Slot 1 U0.1-P2/I1
PCI Slot 1 Card U0.1-P2-I1 1V-08
PCI Slot 2 U0.1-P2/I2
PCI Slot 2 Card U0.1-P2-I2 1H-08
PCI Host Bridge 2 U0.1-P2
PCI Slot 3 U0.1-P2/I3
PCI Slot 3 Card U0.1-P2-I3 1n-08
PCI Slot 4 U0.1-P2/I4
32 Service Guide
Component Name Physical Location Code AIX Location Code
PCI Slot 4 Card U0.1-P2-I4 1Z-08
PCI Slot 5 (six-slotted PCI riser card
only)
U0.1-P2/I5
PCI Slot 5 Card (six-slotted PCI riser
card only)
U0.1-P2-I5 1D-08
PCI Slot 6 (six-slotted PCI riser card
only)
U0.1-P2/I6
PCI Slot 6 Card (six-slotted PCI riser
card only)
U0.1-P2-I6 11-08
SCSI Devices
Internal SCSI Port (for internal disk
drive)
U0.1-P2/Z1 1S-08
External SCSI Port U0.1-P2/Z2 1S-09
Single-Ended SCSI (six-slotted PCI
riser card only)
U0.1-P2/Z3 1c-08
Media Device #1
(for example, a tape drive)
U0.1-P2-Ix/Zn-A0 Dependent on Ix
Media Device #2
(for example, a DVD drive)
U0.1-P2-Ix/Zn-A1 Dependent on Ix
Hot-swap Disk Drive bay 1 U0.1-P2/Z1-A8 1S-08-00-8,0
Hot-swap Disk Drive bay 2 U0.1-P2/Z1-A9 1S-08-00-9,0
Hot-swap Disk Drive bay 3 U0.1-P2/Z1-AA 1S-08-00-10,0
Hot-swap Disk Drive bay 4 U0.1-P2/Z1-AB 1S-08-00-11,0
Fans
Fan 1 (Primary Processor Fan) U0.1-F1
Fan 2 (Redundant Processor Fan) U0.1-F2
Fan 3 (PCI Fan) U0.1-F3
Fan 4 (PCI Fan) U0.1-F4
Operator Panel
Operator panel U0.1-L1
Power Supply
Power supply 1 (Primary) U0.1-V1
Power supply 2 (Redundant) U0.1-V2
Battery
Battery U0.1-P1-V3
Platform Firmware
Platform Firmware U0.1-P1/Y1
System VPD module
System VPD module U0.1-L1-N1
Chapter 1. Reference Information 33
Component Name Physical Location Code AIX Location Code
Notes:
1. The physical location code for the PCI slots, when empty, uses the P1/Ix notation, where the ’/’ identifies an
integrated device (in this case the empty slot). A PCI device plugged into the slot uses the P1-Ix notation, where
the ’-’ identifies a plugged device.
2. SCSI bus IDs are the recommended values. SCSI IDs shown for media devices indicate how the devices are set
when they are shipped from the factory. Field installations may not comply with these recommendations.
I/O Subsystem Locations
This section covers the AIX and physical location code tables for the 11D/20 I/O drawer.
The tables in this section contain the location codes for I/O subsystems when they are attached to the
processor-subsystem drawer. A separate table is shown for each I/O subsystem. In the tables, the drawer
number for the I/O subsystem is represented by dd. The first time that a system is powered on, the I/O
subsystems are numbered according to the order in which they are configured. For example, if a system is
first installed with eight I/O subsystems connected, the dd values for the subsystems should number from
2 through 9.
If at a later time, an I/O subsystem is removed from the system and another I/O subsystem is substituted,
the substitute is assigned the next higher number available for its value of dd. In the earlier example, this
is 10.
Also, if an I/O backplane is replaced inside an I/O subsystem and power is applied, the system will
configure the I/O subsystem with the new backplane in the order following the other I/O subsystems. Its
value for dd will not be the same as the one it replaced. In the case of the earlier example, the next higher
value for dd is 10, so a drawer number of 10 is assigned to the new I/O backplane.
Note: If the I/O backplane VPD module is moved to the new I/O backplane, the I/O subsystem retains its
original dd value.
If the original I/O subsystem or I/O backplane is reinstalled in the system, the system uses the original dd
value for the I/O subsystem.
The following diagram defines each part of a location code:
Ux.dd-xx-yy
| | | |
| | | |
| | | yy This code is used for the next component (yy, yy, yy, ...)
| | xx This code is used for the next component (Px, Fx, Vx, ...)
| dd This code is the drawer number for the I/O drawer (2 to 63)
Ux This code normally identifies the rack in which a drawer is installed.
The x will always be zero (0) for this system.
Determining the AIX Location Code of a Physical Slot: Every slot of every I/O subsystem has a
unique AIX location code assigned to it. For instance, the PCI adapter in slot 1 of I/O subsystem 1 is
29-08. There is no obvious way to determine the value of the dd number for the I/O subsystem that is
interpreted by the system as the first I/O subsystem. If you have only the AIX location code, you might
determine the associated I/O subsystem dd number by performing the following steps:
1. On the AIX command line type lsdev -Ccadapter |pg
The resulting list gives the logical AIX name of all resources and the corresponding AIX location code.
Look for the appropriate AIX location code, and record the corresponding AIX logical name of the
resource.
34 Service Guide
2. On the AIX command line type lsslot -c pci |pg
This list relates the AIX logical name of all resources to location code. Look for the AIX logical name
that you recorded in the first step and read the location code of the resource. You can then determine
the physical location of the resource by flashing the FRU identify LED (see “FRU Identify LEDs” on
page 27) that corresponds to the location code.
FRU Name Physical Location Code
D20 I/O Drawer U0.dd (dd = any number from 2 through 63)
Rack frame U0
I/O drawer backplane U0.dd-P1
I/O subsystem drawer U0.dd
RIO Bus Adapter
RIO bus adapter U0.dd-P1.1
RIO port 1 upper connector U0.dd-P1.1/Q1
RIO port 1 cable U0.dd-P1.1/Q1#
RIO port 0 lower connector U0.dd-P1.1/Q2
RIO port 0 cable U0.dd-P1.1/Q2#
PCI Bus Controller
PCI bus controller PHB0 U0.dd-P1
PCI to PCI bridge EADS-X U0.dd-P1
PCI Bus Controller PHB2
PCI bus controller PHB2 U0.dd-P1
PCI to PCI bridge EADS-X U0.dd-P1
Pluggable Adapter Slots
PCI slot 1 U0.dd-P1/I1
PCI card in slot 1 U0.dd-P1-I1
PCI slot 2 U0.dd-P1/I2
PCI card in slot 2 U0.dd-P1-I2
PCI slot 3 U0.dd-P1/I3
PCI card in slot 3 U0.dd-P1-I3
PCI slot 4 U0.dd-P1/I4
PCI card in slot 4 U0.dd-P1-I4
PCI slot 5 U0.dd-P1/I5
PCI card in slot 5 U0.dd-P1/I5
PCI slot 6 U0.dd-P1/I6
PCI card in slot 6 U0.dd-P1/I6
PCI slot 7 U0.dd-P1/I7
PCI card in slot 7 U0.dd-P1/I7
Power Supply
Power supply 1 (no internal blower) U0.dd-V1
Power supply 2 (no internal blower) U0.dd-V2
Cooling Blowers
Cooling blower 1 U0.dd-F1
Chapter 1. Reference Information 35
FRU Name Physical Location Code
Cooling blower 2 U0.dd-F2
Cooling blower 3 U0.dd-F3
Cooling blower 4 U0.dd-F4
SPCN Connectors
SPCN connector 1 U0.dd-P1/Q3
SPCN connector 2 U0.dd-P1/Q4
SPCN connector 3 U0.dd-P1/Q5
SPCN connector 4 U0.dd-P1/Q6
Rack Indicator
Rack beacon (4-pin) U0.dd-P1/Q7
Operator Panel
Operator panel U0.dd-L1
Temperature Sensor
Temperature sensor (on operator panel) U0.dd-L1
Disk Drive Backplanes
Disk drive backplane 1 U0.dd-P3
Disk drive backplane 2 U0.dd-P4
Disk Drive Locations
Hot-swap disk drive bay 1 U0.dd-P1-Ix/Zn-A8
Hot-swap disk drive bay 2 U0.dd-P1-Ix/Zn-A9
Hot-swap disk drive bay 3 U0.dd-P1-Ix/Zn-AA
Hot-swap disk drive bay 4 U0.dd-P1-Ix/Zn-AB
Hot-swap disk drive bay 5 U0.dd-P1-Ix/Zn-AC
Hot-swap disk drive bay 6 U0.dd-P1-Ix/Zn-AD
Hot-swap disk drive bay 7 U0.dd-P1-Iy/Zn-A8
Hot-swap disk drive bay 8 U0.dd-P1-Iy/Zn-A9
Hot-swap disk drive bay 9 U0.dd-P1-Iy/Zn-AA
Hot-swap disk drive bay 10 U0.dd-P1-Iy/Zn-AB
Hot-swap disk drive bay 11 U0.dd-P1-Iy/Zn-AC
Hot-swap disk drive bay 12 U0.dd-P1-Iy/Zn-AD
SCSI
SES0 U0.dd-P1-Ix/Zn-AF
SES1 U0.dd-P1-Iy/Zn-AF
36 Service Guide
System Cables
The following diagrams show the system cable connections. The following diagram illustrates the routing of
the internal power cables.
Internal Power Cable Routing Diagram
1 Operator Panel
2 IDE CD-ROM
3 SCSI Media Device
4 CEC Backplane to Operator Panel Signal/Power Cable
5 2-drop Power Cable from CEC Backplane to IDE CD-ROM and other SCSI Media Devices
6 CEC Backplane to Disk Drive Backplane Power Cable
7 Disk Drive Backplane
8 Diskette Drive to CEC Backplane Power Cable
9 PCI Cooling Fans #3 and #4 Power Cable
10 Diskette Drive
11 CEC Backplane
Chapter 1. Reference Information 37
Internal Signal Cable Routing Diagram
The following diagram illustrates the routing of the signal cables.
38 Service Guide
1 Operator Panel
2 SCSI Media Device (50-pin)
3 SCSI Media Device (68-pin)
4 SCSI Media Device (68-pin)
5 CEC Backplane to Operator Panel Signal Cable
6 SCSI Media (68-pin) to 50-pin Converter Cable
7 SCSI Media Device Signal Cable to Integrated Single-Ended SCSI Connector on the PCI Riser Card
8 CEC Backplane Connector to Diskette Drive Signal Cable
9 CEC Backplane to IDE CD-ROM Signal Cable
10 Diskette Drive
11 IDE CD-ROM
12 Disk Drive Backplane
13 PCI Riser Card to Disk Drive Backplane Signal Cable
14 PCI Riser Card to Disk Drive Backplane Signal Cable
15 6-Slotted PCI Riser Card
15a 4-Slotted PCI Riser Card (Used on 4-slotted models only
16 CEC Backplane
17 PCI Riser Card to External SCSI Signal Cable
18 External SCSI Port
19 CEC Backplane to Rack Indicator Signal Cable
20 Rack Indicator Port
21 CEC Backplane to Debug Port Signal Cable
22 Debug Parallel Port
Subsystem Positioning and Cabling
The I/O subsystem drawer can be installed in a standard 19-inch EIA rack in any location near to the
system. The cables that connect the drawers allow some flexibility in drawer placement, but the I/O
drawers should be located as close to the system unit as possible.
The 286/C4 can have a total of two I/O drawers connected. Each drawer is connected to the system unit
using a system power control network (SPCN) cable loop and a remote I/O (RIO) cable loop.
Although the SPCN cables are connected in a loop, only one loop is needed for both of the I/O drawers.
SPCN Cabling
I/O drawer SPCN cables are connected to the system using a loop configuration. The SPCN cables are
connected in a loop so that the system has two paths to each I/O drawer. The 286/C4 can have a total of
two I/O drawers connected in one SPCN loop.
The location codes for the I/O drawers are assigned during the system power-on sequence. Each drawer’s
location is determined by its position in the SPCN loop. The first drawer in the SPCN loop (the drawer
connected to SPCN 0) is given the location U0.2. If there is a second I/O drawer in the SPCN loop, it is
assigned the location U0.3. The I/O drawers are numbered sequentially until the end of the loop is
reached at SPCN 1. For cabling examples, see the following illustrations.
Chapter 1. Reference Information 39
Cabling SPCN Cables to One or Two I/O Subsystems
The following illustration shows the SPCN cabling for either one or two I/O drawer configurations.
One I/O Drawer Attached to 286/C4:
1 286/C4 3 SPCN 2 to SPCN 2 Cable
2 11D/20 4 SPCN 1 to SPCN 1 Cable
Two I/O Drawers Attached to 286/C4:
1 286/C4 4 SPCN 1 to SPCN 1 Cable
2 11D/20 I/O Drawer 5 SPCN 2 to SPCN 2 Cable
3 11D/20 I/O Drawer 6 SPCN 2 to SPCN 1 Cable
40 Service Guide
RIO Cabling
I/O drawers are connected to the system through remote I/O (RIO-G) cable loops. The cable loops are
connected to ports located on the rear of the system. The RIO-G cables are connected in loops so that the
system has two paths to each I/O drawer.
Up to two I/O drawers can be connected to a system using one RIO loop. For cabling examples, see the
following illustrations.
Cabling RIO-G Cables to One or Two I/O Subsystems
The following illustrations show the RIO-G cabling for either one or two I/O drawer configurations.
One I/O Drawer Attached to 286/C4:
1 286/C4 3 RIO-G 0 to RIO-G 0 Cable
2 11D/20 4 RIO-G 1 to RIO-G 1 Cable
Two I/O Drawers Attached to 286/C4:
1 286/C4 4 RIO-G 0 to RIO-G 0 Cable
2 11D/20 I/O Drawer 5 RIO-G 1 to RIO-G 1 Cable
3 11D/20 I/O Drawer 6 RIO-G 1 to RIO-G 0 Cable
Chapter 1. Reference Information 41
Specifications
This section contains system specifications for both 286/C4 and 286/E4.
Note: Refer to the 14T/0 and 14T/4 Installation and Service Guide, order number SA38-0577, for
information on cabling power from the rack to the I/O subsystem.
Dimensions Rack (286/C4) Tower (286/E4)
Height 176 mm 6.9 in. 544.0 mm (21.42 in.)
4 EIA Units
Width 448 mm 17.5 in. 308.0 mm (12.13 in.) 191 mm without
legs
Depth 816 mm 32.13 in. - Includes 145 mm
for cable management arm
789.0 mm (31 in.) - Includes 70 mm
for rear acoustic cover
Weight
Minimum configuration 32.0 kg 70.4 lbs. 36.0 kg 79.2 lbs.
Maximum configuration 47.3 kg 104.0 lbs. 51.0 kg 112.2 lbs.
Electrical
Power source loading (maximum in
kVA)
1.20
Power source loading (typical in kVA) 0.75
Voltage range (V ac) 100-127 Vac or 200-240 Vac (single phase)
Frequency (hertz) 50 / 60
Thermal output (maximum) 2540 Btu/hr
Thermal output (typical) Btu/hr
Power Requirements (minimum load) 350 watts
Power Requirements (maximum load) 670 watts
Power factor - U.S., World Trade,
Japan
0.95
Inrush currentå 75/amps (maximum at <10ms)
25/amps (maximum at 10ms - 150ms)
Note: The above amps are held for the full input range of 180 V/ac to 259
V/ac and 47 to 63 Hz.
Maximum Altitudeæ, ç 2135 m (7000 ft.)
Temperature Requirements Operating
5 to 35°C
(41 to 95°F)
Storage
1 to 60°C
(34 to 140°F)
Humidity Requirements Operating Storage
(Noncondensing) 8 to 80% 5 to 80%
Wet Bulb 27°C (80°F) 27°C (80°F)
286/E4 Noise Emissions4 Operating Idle
LWAd 6.0 bels 5.9 bels
m 42 dBA 41 dBA
286/C4 Noise Emissions4 Operating Idle
LWAd 6.1 bels 6.0 bels
m 44 dBA 43 dBA
42 Service Guide
Dimensions Rack (286/C4) Tower (286/E4)
Install/Air Flow Maintenance of proper service clearance should allow proper airflow.
Service When installing a rack unit, refer to “System Service Clearances” on
page 481 for the rack correct clearances.
1. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle.
2. The upper limit of the dry bulb temperature must be derated 1 degree C per 137 m (450 ft.) above 915 m (3000
ft.).
3. The upper limit of the wet bulb temperature must be derated 1 degree C per 274 m (900 ft. ) above 305 m (1000
ft.).
4. Levels are for a single system installed in a T00 32-EIA rack with the center of the unit approximately 1500 mm
(59 in.) off the floor.
External AC Power Cables
To avoid electrical shock, a power cable with a grounded attachment plug is provided. Use only properly
grounded outlets.
Power cables used in the United States and Canada are listed by Underwriter’s Laboratories (UL) and
certified by the Canadian Standards Association (CSA). These power cords consist of the following:
v Electrical cables, Type SVT or SJT.
v Attachment plugs complying with National Electrical Manufacturers Association (NEMA) 5-15P, that is:
²For 115 V operation, use a UL listed cable set consisting of a minimum 14 AWG, Type SVT or SJT
three-conductor cord a maximum of 15 feet in length and a parallel blade, grounding type attachment plug
rated at 15 A, 125 V.²
²For 230 V operation in the United States use a UL listed cable set consisting of a minimum 18 AWG,
Type SVT or SJT three-conductor cable a maximum of 15 feet in length, and a tandem blade, grounding
type attachment plug rated at 15 A, 250 V.²
v Appliance couplers complying with International Electrotechnical Commission (IEC) Standard 320, Sheet
C13.
Power cables used in other countries consist of the following:
v Electrical cables, Type HD21.
v Attachment plugs approved by the appropriate testing organization for the specific countries where they
are used.
²For units set at 230 V (outside of U.S.): use a cable set consisting of a minimum 18 AWG (1.0 mm2
nominal cross sectional area). Outside of North America , a more common cordage reference is in mm2
and for a HAR (harmonized) cord, you may not find a AWG equivalent.
To find the power cables that are available, refer to Chapter 10, “Parts Information”, on page 451.
Chapter 1. Reference Information 43
Service Inspection Guide
Perform a service inspection on the system when:
v The system is inspected for a maintenance agreement.
v Service is requested and service has not recently been performed.
v An alterations and attachments review is performed.
v Changes have been made to the equipment that may affect the safe operation of the equipment.
v External devices with their own power cables have those cables attached.
If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone
can service the machine.
Note: The owner of the system is responsible to correct any unsafe conditions.
Perform the following checks:
1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the
system.
2. Check the covers for proper fit to the system. They should be in place and secure.
3. Gently rock the system from side to side to determine if it is steady.
4. Set the power button of the system to Off.
5. Remove the covers.
6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as
broken wires, sharp edges, or broken insulation.
7. Check the internal cables for damage.
8. Check for dirt, water, and any other contamination within the system.
9. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet.
10. Check the external power cable for damage.
11. With the external power cable connected to the system, check for 0.1 ohm or less resistance between
the ground lug on the external power cable plug and the metal frame.
12. Perform the following checks on each device that has its own power cables:
a. Check for damage to the power cable.
b. Check for the correctly grounded power cable.
c. With the external power cable connected to the device, check for 0.1 ohm or less resistance
between the ground lug on the external power cable plug and the metal frame of the device.
13. Install the covers.
44 Service Guide
Chapter 2. Diagnostic Overview
The system uses an integrated set of software diagnostic procedures to help isolate failing components
and system maintenance. This book, along with the Diagnostic Information for Multiple Bus Systems, is the
basis of the diagnostic procedures for the system. In particular, Chapter 4, “Checkpoints”, on page 141,
Chapter 5, “Error Code to FRU Index”, on page 179, Chapter 6, “Using the Online and Standalone
Diagnostics”, on page 297, and Chapter 10, “Parts Information”, on page 451, in this book are important for
the trained service representative to understand and use when isolating a failure on the system.
The manufacturer recommends that systems configured with 4 GB of memory or greater have access to a
4-mm or 8-mm tape drive for submission of system dump information if required. This function can be
accomplished through locally attached or network-attached devices, as appropriate.
Maintenance Analysis Procedures (MAPs)
Maintenance Analysis Procedures (MAPs) guide the trained service representative through the system.
These MAPs are the entry point for all isolation and error recovery procedures. The MAPs are consistent
with existing procedures and methods. The system uses a set of integrated procedures, mentioned earlier,
to which the MAPs are the primary entry point.
The MAPS are as follows:
v Entry MAP
v Quick Entry MAP
v MAP 1020: Problem Determination MAP
v MAP 1240: Memory Problem Resolution
v MAP 1321: Quick Entry MAP for Systems with Service Focal Point. This MAP contains information for
systems not running under the AIX operating system.
v MAP 1322: End of Call MAP for Systems with Service Focal Point
v MAP 1520: Power MAP
v MAP 1521: The Base System Drawer will not Power On and No Error Codes are Available
v MAP 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On
v MAP 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x
v MAP 1540: Problem Isolation Procedures
v MAP 1541: JTAG Problem Isolation
v MAP 1542: I/O Problem Isolation
v MAP 1548: Processor Subsystem Problem Isolation
The Entry Map is the starting point for problem determination. The purpose of this MAP is to quickly point
to the appropriate MAP or service reference information either in this book, or in the common book set,
which includes the Diagnostic Information for Multiple Bus Systems and PCI Adapter Placement
Reference.
The Quick Entry MAP is a subset of the Entry MAP and helps to save time for some types of problems.
The Problem Determination MAP provides a structured analysis method to get an error code if one is not
provided by the customer, or if diagnostics cannot be loaded.
The Power MAP deals with isolation of components to diagnose a power problem. Power problems can be
related to powering on and powering off the system, or power failures that occur after power is turned on.
45
The Minimum Configuration MAP is used to locate defective components not found by normal diagnostics
or error-isolation methods. This MAP provides a systematic method of isolation to the failing item.
Power and Attention LEDs
The Power and Attention LEDs provide a means to identify failing components in your system. When a
failing component is detected in your system, the system’s Attention LED is turned on.
Component LEDs
To further help you identify the failing component, many system components have individual LEDs that
indicate a failure when lit. The LEDs are either on the component itself or on the carrier of the component
(memory card, fan, memory module, CPU).
The LEDs are either green or amber in color. A lit green LED indicates that the system or component is
receiving power. A lit amber LED indicates the system or component has a failure.
Resetting the LEDs
To reset the LEDs, do the following:
1. Replace the failing component with the new component.
2. Log in as root user.
3. At the command line, type diag.
4. Select Task Selection.
5. Select Log Repair Action.
6. Select the device that was repaired. (If the device is not listed, select sysplanar0.)
7. Press Enter, then select commit.
System Attention LED
The system attention LED on the operator panel turns on when an entry is made in the service processor
error log, indicating that intervention or service is needed. The error entry is transmitted to the following:
v The system-level error logs
v The operating system error log
v The service action event log (such as loss of surveillance from the HMC to a logical partition) in the
Service Focal Point application
Depending on your system configuration, refer to one of the following topics for more information:
v “System Error Interrogation” on page 54
v “The System is not Attached to an HMC and AIX is Installed” on page 55
v “The System is Attached to an HMC and Linux is Installed” on page 55
Checkpoints
The system uses various types of checkpoints, error codes, and SRNs, which are referred to throughout
this book (primarily in Chapter 4, “Checkpoints”, on page 141, Chapter 5, “Error Code to FRU Index”, on
page 179, Chapter 6, “Using the Online and Standalone Diagnostics”, on page 297, and Chapter 10, “Parts
Information”, on page 451). These codes can appear in the service processor boot progress log, the
operating system error log, and the operator panel display. Understanding the definition and relationships
of these codes is important to the service personnel who are installing or maintaining the system.
Codes that can appear in the service processor boot progress log, the operating system error log, the
virtual terminal, and the operator panel display are as follows:
Checkpoints Checkpoints display in the operator panel from the time ac power is connected to the
46 Service Guide
system until the operating system login prompt is displayed after a successful operating
system boot. These checkpoints have the following forms:
8xxx 8xxx checkpoints are displayed from the time ac power is connected to the system
until the OK prompt is displayed by the service processor on the operator panel
display. (Several 9xxx checkpoints are displayed just before the OK prompt
displays.)
9xxx 9xxx checkpoints are displayed by the service processor after the power-on
sequence is initiated. A system processor takes control when 91FF displays on the
operator panel display.
Note: Certain checkpoints may remain in the display for long periods of time. A
spinning cursor is visible in the upper-right corner of the display during
these periods to indicate that system activity is continuing.
Exxx Exxx checkpoints indicate that a system processor is in control and is initializing
the system resources. Control is being passed to the operating system when E105
displays on the operator panel display. Location code information may also display
on the operator panel during this time.
0xxx
and
2xxx 0xxx and 2xxx codes are AIX progress codes and configuration codes. AIX
progress codes will not appear when Linux is the operating system. Location
codes may also be shown on the operator panel display during this time.
Error Codes If a fault is detected, an 8-digit error code is displayed in the operator panel display. A
location code may be displayed at the same time on the second line.
SRNs Service request numbers, in the form xxx-xxx, xxx-xxxx, xxxx-xxx, or xxxx-xxxx may also
be displayed on the operator panel display and be noted in the AIX error log. SRNs will
not appear in the Linux system error log when Linux is the operating system.
Checkpoints can become error codes if the system fails to advance past the point at which the code is
presented. For a list of the checkpoints, see Chapter 4, “Checkpoints”, on page 141. Each entry provides a
description of the event and the recommended action if the system fails to advance.
SRNs are listed in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769.
FRU Isolation
For a list of error codes and recommended actions for each code, see Chapter 5, “Error Code to FRU
Index”, on page 179. These actions can refer to Chapter 10, “Parts Information”, on page 451, Chapter 3,
“Maintenance Analysis Procedures (MAPs)”, on page 57, or provide informational message and directions.
If a replacement part is indicated, direct reference is made to the part name. The respective AIX and
physical location codes are listed for each occurrence as required. For a list of locations codes, see
“Location Codes” on page 23.
Note: When the system is running under the Linux operating system, AIX location codes are not available.
To look up part numbers and view component diagrams, see Chapter 10, “Parts Information”, on page 451.
The beginning of that chapter provides a parts index with the predominant field replaceable units (FRUs)
listed by name. The remainder of the chapter provides illustrations of the various assemblies and
components that make up the system.
Chapter 2. Diagnostic Overview 47
FRU Identify LEDs
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include, but are not limited to the following:
v Rack identify LED
v CEC drawer identify LED
v I/O drawer identify LED
v RIO port identify LED
v FRU identify LED
– Power subsystem FRUs
– CEC subsystem FRUs
– I/O subsystem FRUs
v I/O adapter identify LED
v DASD identify LED
The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy,
followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to
locate the failing FRU more easily.
Any identify LED in the system can be flashed when the system is in the failed state with power on by
using the service processor LED Control Menu contained in the “System Information Menu” on page 317.
Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task
through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is
outlined in the Diagnostic Information for Multiple Bus Systems.
If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be
flashed by one of following:
v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System
Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the
open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED,
type FRU-LED-MENU on the command line.
v If the system is logically partitioned, the HMC must be attached. For instructions on activating and
deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management
Console Installation and Operations Guide, order number SA38-0590.
Service Focal Point
The Service Focal Point application is used to help the service representative diagnose and repair
problems on partitioned systems. Service representatives use the HMC as the starting point for all service
issues. The HMC groups various system management issues at one control point, allowing service
representatives to use the Service Focal Point application to determine an appropriate service strategy.
Traditional service strategies become more complicated in a partitioned environment. Each partition runs
on its own, unaware that other partitions exist on the same system. If one partition reports an error for a
shared resource, such as a managed system power supply, other active partitions report the same error.
The Service Focal Point application enables service representatives to avoid long lists of repetitive
call-home information by recognizing that these errors repeat, and by filtering them into one error code.
Errors that require service are reported to the HMC as serviceable events. Because the HMC stores these
serviceable events for 90 days and then discards them, it is important to have the partition and HMC date
and time set correctly. For instance, if the date on a partition’s software is set 90 days behind the HMC’s
set time, the serviceable events reported from this partition are immediately discarded. For more
48 Service Guide
information about setting the HMC’s date and time, see the Hardware Management Console Installation
and Operations Guide, order number SA23-1251. To set the partition’s date and time, see the
documentation provided with the operating system that is running on that partition.
Getting Started
When you are setting up Service Focal Point, keep the following in mind:
v If the time configured on a partition is 90 days older than time configured on the HMC, serviceable
events cannot be reported.
v Verify that the HMC host names are defined. For more information about using fully qualified and short
host names, see the Hardware Management Console Installation and Operations Guide, order number
SA23-1251.
v If you need to add or change a partition name, see the Hardware Management Console Installation and
Operations Guide.
Testing Error Reporting
To ensure that Service Focal Point is configured properly, generate a test error by doing the following:
1. In the partition, run diagnostics to test the managed system’s operator panel.
2. When the diagnostics window asks you if you see 0000 on the managed system’s operator panel,
select NO. This action generates an error.
3. In the SRN window, press Enter to proceed.
4. When the system asks you if you want the error sent to Service Focal Point, select YES.
5. Type F3 to exit diagnostics.
6. Wait for one minute while the managed system sends the error to Service Focal Point.
7. Check the Serviceable Event window to ensure that the error was sent to Service Focal Point and that
Service Focal Point reported the error. For more information about working with serviceable events,
see “Working With Serviceable Events” on page 51.
Service Focal Point Settings
The Service Focal Point Settings task in the HMC Contents area allows you to configure your Service
Focal Point application.
Note: The Advanced Operator, Operator, and Viewer roles have read-only access to the following tasks.
Automatic Call-Home Feature
You can configure the HMC to automatically call an appropriate service center when it identifies a
serviceable event.
To enable or disable the call-home feature, you must be a member of one of the following roles:
v System Administrator
v Service Representative
To enable or disable the call-home feature for one or more systems, do the following:
Note: It is strongly recommended that you not disable the call-home feature. When you disable the
call-home feature, serviceable events are not automatically reported to your service representative.
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, click Service Focal Point Settings.
4. The Service Focal Point Settings window opens. Select the CEC Call Home tab on the top of the
window.
Chapter 2. Diagnostic Overview 49
5. Click on the managed system you want to enable or disable.
6. Click Enable to enable call-home for the selected system, or click Disable to disable call-home for the
selected system.
7. Click OK.
Setting Up Surveillance
Service Focal Point surveillance generates serviceable events when it detects communication problems
between the HMC and its managed systems.
You can configure how you want the HMC to survey the following:
v The number of disconnected minutes considered an outage
v The number of connected minutes you want the HMC to consider a recovery
v The number of minutes between outages considered a new incident
To set up surveillance, you must be a member of one of the following roles:
v System Administrator
v Service Representative
To set up surveillance, do the following:
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, select Service Focal Point Settings.
4. The Service Focal Point Settings window opens. Select the Surveillance Setup tab on the top of the
window.
5. In the first field, select the number of minutes you want the HMC to wait before sending a
disconnection error message.
6. In the second field, select the amount of connection time that the HMC is considered to be recovered.
This amount is expressed in minutes.
7. In the third field, select the number of minutes between outages that you want the HMC to wait before
sending a new incident report.
8. Select one or more managed systems from the table in the lower part of the window and then click
Enable or Disable. Surveillance is then either enabled or disabled for the selected managed systems.
Enabling Surveillance Notifications
You can enable or disable surveillance-error notification from this HMC to connected managed systems.
Enabling this notification causes errors to be passed to the Service Agent application for notification.
Note: You must further configure Service Agent to handle notifications sent by Service Focal Point. For
more information about Service Agent, refer to the Hardware Management Console Installation and
Operations Guide, order number SA38-0590.
To set up surveillance, you must be a member of one of the following roles:
v System Administrator
v Service Representative
To set up surveillance-error notification, do the following:
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, select Service Focal Point Settings.
4. The Service Focal Point Settings window opens. Select the Surveillance Notification tab on the top
of the window.
50 Service Guide
5. Select one or more managed systems from the list and then click Enable or Disable. Surveillance
notification is then either enabled or disabled for the selected managed systems.
Working With Serviceable Events
You can view, add, or update serviceable event information, including error details.
Viewing Serviceable Events
To view serviceable events, you must be a member of one of the following roles:
v System Administrator
v Service Representative
v Advanced Operator
v Operator
v Viewer
To view a serviceable event, do the following:
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, click Select Serviceable Event.
4. Designate the set of serviceable events you want to view. When you are finished, click OK.
5. The Serviceable Event Overview window opens, and the entries displayed are ordered by time stamp.
Each line in the Serviceable Event Overview window corresponds to one error within a serviceable
event. On this window, designate the set of serviceable events you want to view by specifying your
search criteria (such as event status or error class).
Note: Only events that match all of the criteria that you specify are shown.
6. When you are finished, click OK.
When you select a line in the Serviceable Event Overview window, all lines in the same serviceable event
are selected. To open the Serviceable Event Details window for the selected event, select the event and
click Event Details.
Viewing Serviceable Event Details
To view serviceable event details, do the following:
1. Perform the steps in “Viewing Serviceable Events”.
2. The Serviceable Event Details window opens, showing extended serviceable event information,
including the following:
v Status
v Earliest original time stamp of any managed object
v AIX error log. (The Linux system error log does not place entries into Service Focal Point.)
v Should this error ever get called home?
v Error was called home
v Pointer to extended error-data collection on the HMC
The window’s lower table displays all of the errors associated with the selected serviceable event. The
information is shown in the following sequence:
v Failing device system name
v Failing device machine type/model/serial
v Error class
v Descriptive error text
Viewing Serviceable Event Error Details: To view serviceable event error details, do the following:
Chapter 2. Diagnostic Overview 51
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Select an error in the lower table, and click Error Details.
Viewing Service Processor Error Details: To view service processor error details, do the following:
1. Perform the steps in “Viewing Serviceable Event Error Details” on page 51.
2. If the serviceable event error details you are viewing are for a service processor-class error, the lower
table on the resulting window contains service processor errors. Select a service processor error from
the lower table, and click Service Processor Error Details to see further details.
Saving and Managing Extended Error Data
To save extended error (EE) data, do the following:
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Click Save EE Data. To save extended error data for only one error associated with the serviceable
event (rather than for the entire serviceable event), select the error from the lower table and then click
Error Details. In the next menu, click Manage EE Data.
Viewing and Adding Serviceable Event Comments
To add comments to a serviceable event, you must be a member of the Service Representative role.
To add comments to a serviceable event, do the following:
Note: You cannot edit or delete previous comments.
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Select the error to which you want to add comments to and click Comments.... If you want to close
the event and add comments, click Close Event from this window. The Serviceable Event Comments
window opens.
3. Type your name and add comments as appropriate. You can also review previous comments, but you
cannot edit this information.
4. If you clicked Comments on the Serviceable Event Details window, clicking OK commits your entry
and returns you to the Serviceable Event Details window.
If you clicked Close Event on the Serviceable Event Details window, clicking OK commits all changes and
opens the Update FRU Information window. For more information about updating field replaceable unit
information, see “Updating Field Replaceable Unit (FRU) Information”.
Closing a Serviceable Event
To close a serviceable event, do the following:
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Select the error to which you want to add comments and click Comments....
3. Click Close Event from this window. The Serviceable Event Comments window opens.
4. Click OK to commit your comments. The Update FRU Information window displays. See “Updating
Field Replaceable Unit (FRU) Information” for information on completing this window. Click OK on the
Update FRU Information window to close the serviceable event.
Note: You must close a serviceable event after it has been serviced to ensure that if a similar error is
reported later, it is called home. If an old problem remains open, the new similar problem is
reported as a duplicate. Duplicate errors are neither reported nor called home to a service
center. Close a serviceable event when the partition that reports the error is active. Closing the
event causes the new status of the serviceable event to be properly sent to the partition.
Updating Field Replaceable Unit (FRU) Information
This task allows you to update the FRU information you changed or modified as a result of this serviceable
event. From this panel, you can also activate and deactivate LEDs and search for other serviceable events
that contain the same FRU entries.
52 Service Guide
To update FRU information, do the following:
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Click FRU Information... . The Update FRU Information window opens.
The lower table shows any parts that you have replaced or added during your current update session
but that have not been committed to the serviceable event. The changes from the lower table are
committed by clicking OK or Apply.
From this window, you can also activate and deactivate LEDs and search for other serviceable events that
contain the same FRU entries.
Replacing an Existing FRU
To replace a part already listed for this serviceable event, do the following:
1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 52.
2. In the upper table, double-click the part you want to replace.
3. If the FRU has a new part number, type it in the New FRU Part Number field.
4. Click Replace FRU. The Update FRU Information window displays the FRU replacement information in
the lower table. Click OK or Apply to commit the changes to the serviceable event.
Adding a New FRU
You can add a part to the serviceable event that was not listed in the upper table of the Update FRU
Information window. To add a new FRU for this serviceable event, do the following:
1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 52.
2. Click Add New FRU to List.
3. Type the FRU’s location code and its part number in the appropriate fields.
4. Click Add to List. The Update FRU Information window opens and displays the newly added FRU in
the lower table.
5. Click OK or Apply to commit these changes to the serviceable event.
Note: After you click OK or Apply, you cannot change this information. If you clicked the Close Event
button in the Serviceable Event Details window, then clicking OK also completes the close
dialog and changes the status of the serviceable event to Closed.
Viewing Serviceable Event Partition Information
You can view partition information associated with this serviceable event. This information includes each
affected partition’s state and resource use.
1. Perform the steps in “Viewing Serviceable Event Details” on page 51.
2. Click Partition Info....
Hardware Service Functions
Activating and Deactivating FRU LEDs
This task allows you to activate or deactivate a managed system’s system attention LED or any FRU LED.
FRU LEDs are helpful in determining which FRUs need servicing.
To activate or deactivate a managed system’s System Attention LED, do the following:
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, select Hardware Service Functions. The LED Management window opens.
4. In the LED Management window, select one or more managed systems from the table.
5. Select either Activate LED or Deactivate LED. The associated System Attention LED is then either
turned on or off.
To activate or deactivate a FRU associated with a particular managed system, do the following:
Chapter 2. Diagnostic Overview 53
1. In the Navigation area, click the Service Applications icon.
2. In the Navigation area, double-click the Service Focal Point icon.
3. In the Contents area, click Hardware Service Functions. The LED Management window opens.
4. In the LED Management window, select one managed system from the table.
5. Click the List FRUs... button. The list of FRU slot indexes and their respective current LED states
display.
6. Select one or more FRU slot indexes.
7. Click either the Activate LED or the Deactivate LED button.
The associated FRU LEDs are now either enabled (blinking) or off.
System Error Interrogation
The following sections explain how to interrogate a system. To identify which section you should use to
interrogate your system, answer the following:
v Is your system attached to an HMC?
v If your system is not attached to an HMC, what operating system is installed ?
Depending on your response, select one of the following sections and follow the instructions given in that
section to interrogate your system.
54 Service Guide
The System is Attached to an HMC
On systems attached to an HMC, when the system attention LED comes on, open the Service Action
Event Log in the Service Focal Point application on the HMC. Check the open service events using the
procedure described in ²Working with Serviceable Events² in the Hardware Management Console
Installation and Operations Guide.
Note: For information on these Service Focal Point settings, see ²Setting Up Surveillance and Enabling
Surveillance Notifications² in the Hardware Management Console Installation and Operations
Guide.
Events requiring customer intervention are marked Call Home Candidate? NO. For all of these events,
examine the description in the serviceable event error details. If actions are listed in the description,
perform those actions. If the error indicates a loss of surveillance between the HMC and a partition, check
the status of the partition, the network, and the cabling between the HMC and the partition. If a problem is
still indicated, call service support.
If the service action event is labeled Call Home Candidate? YES, service support is required to correct the
faulty condition. If the system is configured to automatically call home on error, the request for service will
be placed and the service representative will be dispatched.
If the system is not enabled for call home capability, the customer should call service support to report the
error condition.
Whether a customer intervention error or an service support error is detected, the system attention LED
can be reset by following the procedures described in “Resetting the System Attention LED” on page 56.
The preferred method to reset the attention LED is by following the HMC Attached Systems procedures.
To reset the attention LED, use either the HMC Attached Systems procedure or the Non-HMC Attached
Systems procedure.
The System is not Attached to an HMC and AIX is Installed
When the system attention LED comes on, examine the AIX error log to see if user intervention is
required. If the system is powered off, examine the service processor error log. If the AIX error log
indicates service support is required, or there are entries in the service processor error log, go to “Entry
MAP” on page 57 with the information noted from the error log.
The System is Attached to an HMC and Linux is Installed
When the system attention light comes on, perform the following:
v If you performed “Step 1321-34” on page 81 in “MAP 1321: Quick Entry MAP for Systems with Service
Focal Point” on page 76, return to “Step 1321-34” on page 81 to process any other error codes or
symptoms.
v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service
Focal Point” on page 76, go to step “Step 1321-1” on page 76 to process any other error codes or
symptoms.
The system attention LED can be reset by following one of the Resetting the System Attention LED
procedures.
Chapter 2. Diagnostic Overview 55
Resetting the System Attention LED
Choose one of the following procedures to reset your attention LED.
Systems Attached To an HMC
To reset the system attention LED on HMC-attached systems, do the following:
1. On the HMC graphical interface, click Service Applications.
2. Double-click Service Focal Point.
3. In the Contents area of the screen, select Hardware Service Functions. The LED Management
window opens.
4. In the LED Management window, select one or more managed systems from the table.
5. Select Deactivate LED. The associated system attention LED is turned off.
For more information about the virtual operator panel on the HMC, see the Hardware Management
Console Installation and Operations Guide .
Systems not Attached to an HMC with AIX Installed
As a user with root authority, type diag on the AIX command line, and do the following:
1. Select Task Selection.
2. On the Task Selection Menu, select Identify and Attention Indicators.
3. When the list of LEDs displays, use the cursor to highlight Set System Attention Indicator to
Normal.
4. Press Enter, and then press F7 to commit. This action turns off the LED.
If the system is powered off, access the service processor menus. From the service processor main menu,
do the following:
1. Select System Information Menu.
2. Select LED Control Menu.
3. Select Clear System Attention Indicator. This action turns off the LED.
Systems not Attached to an HMC with Linux Installed
The system must be powered off to access the service processor menus. From the service processor
main menu, do the following:
1. Select System Information Menu.
2. Select LED Control Menu.
3. Select Clear System Attention Indicator. This action turns off the LED.
56 Service Guide
Chapter 3. Maintenance Analysis Procedures (MAPs)
This chapter contains Maintenance Analysis Procedures (MAPs) for the 286/C4 and 286/E4.
Entry MAP
When possible, run online diagnostics in service mode. Online diagnostics perform additional functions
compared to standalone diagnostics. This ensures that the error state of the system that was captured in
NVRAM is available for your use in fixing the problem. The AIX error log and SMIT are only available
when diagnostics are run from the hard drive. Always check Service Focal Point for any open service
events in Service Action Event log.
Attention: AIX diagnostics from the hard drive is not available when Linux is the operating system.
If this system has an Hardware Management Console Installation and Operations Guide (HMC) with
Service Focal Point, go to “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on
page 76. Otherwise, use the following table to help determine your next step.
Notes:
1. Licensed programs frequently rely on system information stored on the VPD module on the operator
panel assembly. If the MAPs indicate that the operator panel assembly should be replaced, refer to
“Operator Panel Replacement” on page 420. If the old VPD module has to be replaced, call technical
support for recovery instructions. If recovery is not possible, notify the system owner that new keys for
licensed programs may be required.
2. If a network adapter is replaced, the network administrator must be notified so that the client IP
addresses used by the system can be changed. In addition, the operating system configuration of the
network controller may need to be changed in order to enable system startup. Also check to ensure
that any client or system that addresses this system is updated.
3. If more than eight digits are displayed in the operator panel, use only the first eight digits to find the
error in the tables. The digits that display beyond the first eight digits are location codes that can assist
you in diagnosing the problem. See “Location Codes” on page 23.
4. If you are not able to isolate the problem, try loading standalone diagnostics from the CD-ROM or a
NIM server.
Note: The various codes that might display on the HMC are all listed as error codes by Service Focal
Point (SFP). To assist you in identifying the types of error data in this guide, use the following table.
Number of Digits in Error Code Error code Used in this Service Guide
Any Contains # (pound sign) Menu Goal
Any Contains - (hyphen) SRN
5 Does not contain # or - SRN
6 Does not contain # or - Error Code
8 Does not contain # or - Error Code, Service Reference Code
(SRC)
57
Symptom Starting Point
You have a problem that does not prevent the system
from booting and the operator panel is functional and the
rack indicator LED operates as expected.
Go to the Fast Path MAP in the Diagnostic Information for
Multiple Bus Systems.
You do not have a symptom. Go to MAP 0020 in the Diagnostic Information for Multiple
Bus Systems
You have an SRN. Go to the Fast Path MAP in the Diagnostic Information for
Multiple Bus Systems.
The system stops and a 3-digit number is displayed in the
operator panel display.
Record SRN 101-xxx, where xxx is the 3-digit number
displayed in the operator panel display, then go to the
Fast Path MAP in the Diagnostic Information for Multiple
Bus Systems.
The system stops and a 4-digit number beginning with 0
or 2 is displayed in the operator panel display.
Go to the Fast Path MAP in the Diagnostic Information for
Multiple Bus Systems.
All other symptoms. Go to “Quick Entry MAP”.
Quick Entry MAP
The following is a Quick Entry MAP table of contents.
Table 1.
Problem Description Page No.
Service actions 59
The rack indicator LED does not operate as expected. 59
The system attention LED on the operator panel is on. 59
OK does not appear in the operator panel display before pressing the power-on button.
Other symptoms appear in the operator panel display or LEDs before the power-on
button is pressed.
59
8-digit error codes 60
System stops with an 8-digit number displayed 60
System stops with a 4-digit number displayed that does not begin with 0 or 2. 60
System stops with a 3-digit number displayed (see below for 888 sequence). 60
Hardware Management Console (HMC) Problem 60
There appears to be a display problem (distortion, blurring, etc.) 60
Power and cooling problems 61
888 sequence in operator panel display 61
Other symptoms or problems 61
Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP
1540: Problem Isolation Procedures” on page 108 unless you were directed to MAP 1540 by the error
code. Otherwise, call for support if the actions for an error code do not resolve the problem.
If you replace FRUs or perform an action, and the problem is corrected, go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
Note: If the only service actions you perform are on the HMC, do not use MAP 0410 to do the repair
checkout.
58 Service Guide
Symptom Action
Service Actions
You have an open service event in the service action
event log.
Go to “Service Focal Point” on page 48.
You have parts to exchange or a corrective action to
perform.
1. Go to Chapter 9, “Removal and Replacement
Procedures”, on page 357.
2. Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
You need to verify that a part exchange or corrective
action corrected the problem.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
You need to verify correct system operation. Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
The rack indicator LED does not operate as expected.
Symptom: The rack indicator LED does not turn on, but a drawer identify LED is on.
Action:
1. Make sure the rack indicator LED is properly mounted to the rack.
2. Make sure that the rack identify LED is properly cabled to the bus bar on the rack and to the drawer identify LED
connector.
3. Replace the following parts one at a time:
v rack LED to bus bar cable.
v LED bus bar to drawer cable.
v LED bus bar.
v Call support.
The system attention LED on the operator panel is on.
The system attention LED on the operator panel is on. Go to “System Attention LED” on page 46 and perform
the listed actions.
OK does not appear in the operator panel display before pressing the power-on button. Other symptoms
appear in the operator panel display or LEDs before the power-on button is pressed.
Symptom: A bouncing or scrolling ball remains on the operator panel display, or the operator panel display is filled
with dashes or blocks.
Action: Verify the operator panel cable from the operator panel to the CEC backplane is connected and properly
seated at both ends.
If an ASCII terminal is available, connect it to the system through serial port 1.
1. If the service processor menu is displayed:
a. Replace the operator panel assembly, location U0.1-L1. Refer to “Operator Panel” on page 419.
b. Replace the CEC backplane, location U0.1-P1. (See notes on page 57).
2. If the service processor menu is not displayed, replace the CEC backplane. (See notes on page 57).
If an ASCII terminal is not available, replace the following one at a time.
1. Operator panel assembly, location U0.1-L1. Refer to “Operator Panel” on page 419.
2. CEC backplane, location U0.1-P1. (See notes on page 57).
You have a blank display on the operator panel. Other
LEDs on the operator panel appear to behave normally.
1. Replace the operator panel assembly, Location:
U0.1-L1. Refer to “Operator Panel” on page 419.
2. Replace the CEC backplane, Location: U0.1-P1. (See
notes on page 57).
You have a blank display on the operator panel. Other
LEDs on the operator panel are off.
Go to “MAP 1520: Power” on page 89.
Chapter 3. Maintenance Analysis Procedures (MAPs) 59
Symptom Action
8-Digit Error Codes
You have an 8-digit error code displayed Look up the error code in the table in “Checkpoint and
Error Code Index” on page 181.
System stops with an 8-digit number displayed
The system stops with an 8-digit error code displayed
when booting.
Look up the error code in the table in “Checkpoint and
Error Code Index” on page 181.
System stops with a 4-digit number displayed
The system stops with a 4-digit number displayed that
does not begin with 0 or 2.
Go to “Checkpoint and Error Code Index” on page 181.
If you do not find the error code there, go to the Fast
Path MAP in the Diagnostic Information for Multiple Bus
Systems.
System stops with a 3-digit number displayed (see below for 888 sequence)
The system stops with a 3-digit error code. Add 101- to the left of the three digits to create an SRN.
Go to the Diagnostic Information for Multiple Bus
Systems.
If there is a location code displayed under the 3-digit error
code, then look at the location to see if it matches the
failing component that the SRN pointed to. If they do not
match, perform the action from Diagnostic Information for
Multiple Bus Systems first. If the problem still exists, then
replace the failing component from that location code.
Hardware Management Console (HMC) Problem
Symptom: Hardware Management Console (HMC) cannot be used to manage a managed system, or the connection
to the managed system is failing.
Action: If the managed system is operating normally (no error codes or other symptoms), the HMC might have a
problem, or the connection to the managed system might be damaged or incorrectly cabled. Do the following:
1. Check the connections (serial cable) between the HMC and the managed system. Correct any cabling errors if
found. If another serial cable is available, connect it in place of the existing cable and refresh the HMC graphical
user interface. You may have to wait up to 30 seconds for the managed system to reconnect.
2. Verify that any connected HMC is connected to the managed system by checking the Management Environment
of the HMC.
Note: The managed system must have power connected and the system running, or waiting for a power-on
instruction (the OK prompt is in the operator panel.) If the managed system does not appear in the Navigation
area of the HMC Management Environment, then the HMC or the connection to the managed system might be
failing.
3. Go to the ²Entry MAP² in the Hardware Management Console Maintenance Guide, order number SA23-1273.
4. If you cannot fix the problem using the HMC tests in the Hardware Management Console Maintenance Guide,
there might be a problem with the CEC backplane; replace the CEC backplane.
Symptom: Hardware management console (HMC) cannot call out using the attached modem and the customer’s
telephone line.
Action: If the managed system is operating normally (no error codes or other symptoms), the HMC might have a
problem, or the connection to the modem and telephone line may have a problem. Do the following:
1. Check the connections between the HMC and the modem and telephone line. Correct any cabling errors if found.
2. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273.
There is a Display Problem (Distortion, Blurring, Etc.
60 Service Guide
Symptom Action
Symptom: All display problems.
Action:
1. If you are using the HMC, go to the Hardware Management Console Maintenance Guide, order number
SA23-1273.
2. If you are using a graphics display:
a. Go to the problem determination procedures for the display.
b. If you do not find a problem:
v replace the display adapter, location U0.1-P2-In or U0.dd-P1-In.
v replace the backplane in which the card is plugged.
3. If you are using an ASCII terminal:
a. Make sure that the ASCII terminal is connected to S1.
b. If problems persist, go to the problem determination procedures for the terminal.
c. If you do not find a problem, replace the CEC backplane, location U0.1-P1. (See notes on page 57).
Power and Cooling Problems
The system will not power on and no error codes are
available.
Go to “MAP 1520: Power” on page 89.
The power LEDs on the operator panel and the power
supply do not come on or stay on.
1. Check the service processor error log.
2. Go to “MAP 1520: Power” on page 89.
The power LEDs on the operator panel and the power
supply come on and stay on, but the system does not
power on.
1. Check the service processor error log.
2. Go to “MAP 1520: Power” on page 89.
A rack or a rack-mounted unit will not power on. 1. Check the service processor error log.
2. Go to “MAP 1520: Power” on page 89.
The cooling fan(s) do not come on, or come on but do
not stay on.
1. Check the service processor error log.
2. Go to “MAP 1520: Power” on page 89.
The system attention LED on the operator panel is on
and there is no error code displayed.
1. Check the service processor error log.
2. Go to “MAP 1520: Power” on page 89.
888 Sequence in Operator Panel Display
An 888 sequence is displayed in the operator panel
display.
Go to the Fast Path MAP in the Diagnostic Information for
Multiple Bus Systems.
Other Symptoms or Problems
The operator panel displays OK and the fans are off. The service processor is ready. The system is waiting for
power-on. Boot the system. If the boot is unsuccessful,
and the system returns to the OK, go to “MAP 1020:
Problem Determination” on page 64.
The operator panel displays STBY. The service processor is ready. The system was shut
down by the operating system and is still powered on.
This condition can be requested by a privileged system
user with no faults. See the service processor error log
for possible operating system fault indications.
All of the system POST indicators are displayed on the
system console, the system pauses and then restarts.
The term POST indicators refers to the device
mnemonics (the words memory, keyboard, network, scsi,
and speaker) that appear on the system console during
the power-on self-test (POST).
Go to “Boot Problems/Concerns” on page 173.
Chapter 3. Maintenance Analysis Procedures (MAPs) 61
Symptom Action
The system stops and all of the POST indicators are
displayed on the system console. The term POST
indicators refers to the device mnemonics (the words
memory, keyboard, network, scsi, and speaker) that
appear on the system console during the power-on
self-test (POST).
Go to “Boot Problems/Concerns” on page 173.
The system stops and the message STARTING
SOFTWARE PLEASE WAIT... is displayed on the
firmware console.
Go to Chapter 4, “Checkpoints”, on page 141.
Symptom: The system does not respond to the password being entered or the system login prompt is displayed
when booting in service mode.
Action:
1. If the password is being entered from the hardware management console (HMC), go to the Hardware
Management Console Maintenance Guide.
2. If the password is being entered from a keyboard attached to the system, then the keyboard or its controller may
be faulty. In this case, replace these parts in the following order:
a. Keyboard
b. CEC backplane, location U0.1-P1. (See notes on page 57).
3. If the password is being entered from an ASCII terminal, then use the problem determination procedures for the
ASCII terminal. Make sure the ASCII terminal is connected to S1.
If the problem persists, replace the CEC backplane, location U0.1-P1. (See notes on page 57).
If the problem is fixed, go to ²MAP 0410: Repair Checkout², in the Diagnostic Information for Multiple Bus Systems.
If the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
Symptom: No codes are displayed on the operator panel within a few seconds of turning on the system. The
operator panel is blank before the system is powered on.
Action: Reseat the operator panel cable. If the problem is not resolved, replace in the following order:
1. Operator panel assembly, location U0.1-L1. Swap the VPD module from the old operator panel to the new
operator panel.
2. CEC backplane, location U0.1-P1. (See notes on 57).
If the problem is fixed, go to ²MAP 0410: Repair Checkout², in the Diagnostic Information for Multiple Bus Systems.
If the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
Symptom: The SMS configuration list or boot sequence selection menu shows more SCSI devices attached to a
controller/adapter than are actually attached.
Action: A device may be set to use the same SCSI bus ID as the control adapter. Note the ID being used by the
controller/adapter (this can be checked and/or changed through an SMS utility), and verify that no device attached to
the controller is set to use that ID.
If settings do not appear to be in conflict:
1. Go to “MAP 1542: I/O Problem Isolation” on page 111.
2. Replace the SCSI cable.
3. Replace the device.
4. Replace the SCSI adapter
Note: In a ²twin-tailed² configuration where there is more than one initiator device (normally another system)
attached to the SCSI bus, it may be necessary to use SMS utilities to change the ID of the SCSI controller or
adapter.
62 Service Guide
Symptom Action
Symptom: The System Management Services menu is displayed.
Action: If you are loading diagnostics from the CD-ROM, you may not have pressed the correct key when you were
trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of
this step.
If you are sure you pressed the correct key, the device or media you are attempting to boot from may be faulty.
1. If you are using the hardware management console (HMC) or an ASCII terminal:
v Select View Error Log.
v If an error is logged, check the time stamp.
v If the error was logged during the current boot attempt, record it.
v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action.
v If no recent error is logged in the error log, go to step 3.
2. If you are using a keyboard and a graphic display attached to the system:
v Select View Error Log.
v If an error is logged, check the time stamp.
v If the error was logged during the current boot attempt, record it.
v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179
v If no recent error is logged in the error log, go to step 3.
3. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot
succeeds, replace the original boot device (for removable media devices, try the media first).
4. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
You suspect a cable problem. Go to Devices, and Cable Information for Multiple Bus
Systems.
All other problems. Go to “MAP 1020: Problem Determination” on page 64.
Chapter 3. Maintenance Analysis Procedures (MAPs) 63
MAP 1020: Problem Determination
Purpose of This MAP
Use this MAP to get an error code if you were not provided one by the customer or you are unable to load
diagnostics. If you are able to load the diagnostics, go to MAP 0020 in the Diagnostic Information for
Multiple Bus Systems.
The service processor may have recorded one or more symptoms in its error log. Examine this error log
before proceeding (see “System Information Menu” on page 317). The system may have been set up by
using the HMC. Check the Service Action Event (SAE) log in the Service Focal Point. The SAE log may
have recorded one or more symptoms in the Service Focal Point. To avoid unnecessary replacement of
the same FRU for the same problem, check the SAE log for evidence of prior service activity on the same
subsystem.
The service processor may have been set by the user to monitor system operations and to attempt
recoveries. You can disable these actions while you diagnose and service the system. If the system was
set up according to the recommendations of the 286/C4 and 286/E4 User’s Guide, all the settings of the
service processor (except language) were saved by using the SAVE/RESTORE HARDWARE
MAINTENANCE POLICIES service aid. You can use the same service aid to restore the settings at the
conclusion of your service action.
In case the service processor settings were not saved by the user, if you disable them, make notes of their
current settings for restoration before you leave.
In addition to the parameters in the following table, you can disconnect the modem to prevent incoming
signals that could cause the system to power on.
Following are the service processor settings. The service processor menus are described in Chapter 7,
“Using the Service Processor”, on page 305.
Surveillance From the service processor Setup Menu, go to the
Surveillance Setup Menu and disable surveillance.
Unattended Start From the service processor System Power Control Menu,
disable unattended start mode.
Reboot Policy From the System Power Control Menu, go to the
Reboot/Restart Policy Setup Menu and set:
1. Number of reboot attempts to 0 (zero)
2. Use OS-Defined restart policy to No
3. Enable supplemental restart policy to No.
Call-Out From the Call-In/Call-Out Setup Menu, go to the Serial
Port Selection Menu and disable call-out on both serial
ports.
64 Service Guide
Another feature that could disrupt a service action by powering the system on is an AIX function called
Timed Power-On (not available when Linux is the operating system). For more information about timed
power-on, see “System Power-On Methods” on page 330.
Be prepared to record code numbers and use those numbers in the course of analyzing a problem. Go to
“Step 1020-1”.
Step 1020-1
The following steps analyze a failure to load the diagnostic programs.
Note: Be prepared to answer questions regarding the operator panel display and to perform certain
actions based on displayed POST indicators. Be observant of these conditions.
1. Power off the system. Refer to “Stopping the System without an HMC Attached” on page 359.
2. Select slow boot mode (select disable fast boot) on the system power control menu from the service
processor main menu.
3. Power on the system. Refer to “Starting the System without an HMC Attached” on page 360.
4. Insert the diagnostic CD-ROM into the CD-ROM drive.
5. When the keyboard indicator is displayed (the word keyboard), press the 5 key on the system console.
6. Enter a password, if requested.
7. Wait until the diagnostics are loaded or the system appears to stop. If you receive an error code or if
the system stops before diagnostics are loaded, find your symptom in the following table, then follow
the instructions given in the Action column.
If no fault is identified, continue to the next step.
8. Run the standalone diagnostics on the entire system. Find your symptom in the following table, then
follow the instructions given in the Action column.
If no fault is identified, call service support for assistance.
Chapter 3. Maintenance Analysis Procedures (MAPs) 65
Symptom Action
The system stopped and a code is displayed on the
operator panel.
Go to the “Quick Entry MAP” on page 58.
The rack identify LED does not operate properly. Go to the “Entry MAP” on page 57.
The system stops with a prompt to enter a password. Enter the password. You cannot continue until a correct
password has been entered. When you have entered a
valid password, go to the beginning of this table and wait
for one of the other conditions to occur.
The diagnostic operating instructions are displayed. Go to ²MAP 0020: Problem Determination Procedure² in
the Diagnostic Information for Multiple Bus Systems.
Symptom:The system login prompt is displayed.
Action: If you are loading the diagnostics from a CD-ROM, you may not have pressed the correct key or you may
not have pressed the key soon enough when you were trying to indicate a service mode IPL of the diagnostic
programs. If this is the case, start again at the beginning of this step.
Note: Perform the system shutdown procedure before turning off the system.
If you are sure you pressed the correct key in a timely manner, go to “Step 1020-2” on page 67.
If you are loading diagnostics from a NIM server, check for the following:
v The bootlist on the client may be incorrect.
v Cstate on the NIM server may be incorrect.
v There may be network problems preventing you from connecting to the NIM server.
Verify the settings and the status of the network. If you continue to have problems refer to “Boot Problems/Concerns”
on page 173 and follow the steps for network boot problems.
The system does not respond when the password is
entered.
Go to “Step 1020-2” on page 67.
Symptom: The system stopped. A POST indicator is displayed on the system console and an eight-digit error code
is not displayed.
Action: If the POST indicator represents:
1. Memory, go to “MAP 1240: Memory Problem Resolution” on page 68.
2. Keyboard
a. Replace the keyboard.
b. Replace the CEC backplane, location U0.1-P1. (See notes on page 57).
c. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
3. Network, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
4. SCSI, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
5. Speaker
a. Replace the operator panel, location U0.1-L1. Refer to “Operator Panel” on page 419.
b. Replace the CEC backplane, location: U0.1-P1. (See notes on page 57).
c. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
66 Service Guide
Symptom Action
Symptom: The System Management Services menu is displayed.
Action: If you are loading diagnostics from the CD-ROM, you may not have pressed the correct key when you were
trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of
this step.
If you are sure you pressed the correct key, the device or media you are attempting to boot from may be faulty.
1. If you are using the hardware management console (HMC) or an ASCII terminal:
v Select View Error Log.
v If an error is logged, check the time stamp.
v If the error was logged during the current boot attempt, record it.
v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action.
v If no recent error is logged in the error log, go to step 3.
2. If you are using a keyboard and a graphic display attached to the system:
v Select View Error Log.
v If an error is logged, check the time stamp.
v If the error was logged during the current boot attempt, record it.
v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action.
v If no recent error is logged in the error log, go to step 3.
3. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot
succeeds, replace the original boot device (for removable media devices, try the media first).
4. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
All other symptoms. If you were directed here from the Entry MAP, go to “MAP
1548: Processor Subsystem Problem Isolation” on
page 123. Otherwise, find the symptom in the “Quick
Entry MAP” on page 58.
Step 1020-2
There is a problem with the keyboard.
Find the type of keyboard you are using in the following table; then follow the instructions given in the
Action column.
Keyboard Type Action
Type 101 keyboard (U.S.). Identified by the size of the
Enter key. The Enter key is in only one horizontal row of
keys.
v Replace the type 101 keyboard.
v Replace the CEC backplane, location U0.1-P1. (See
notes on page 57.)
Type 102 keyboard (W.T.). Identified by the size of the
Enter key. The Enter key extends into two horizontal
rows.
v Replace the type 102 keyboard.
v Replace the CEC backplane, location U0.1-P1. (See
notes on page 57.)
Type 106 keyboard. (Identified by the Japanese
characters.)
v Replace the type 106 keyboard.
v Replace the CEC backplane, location U0.1-P1. (See
notes on page 57.)
ASCII terminal keyboard Go to the documentation for this type of ASCII terminal
and continue with problem determination.
If the problem was not corrected by one of the previous actions and you were not already directed to go to
MAP 1548, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
Chapter 3. Maintenance Analysis Procedures (MAPs) 67
MAP 1240: Memory Problem Resolution
Note: The firmware or error code that sent you here could be one of the following:
v Service Processor Checkpoints 910B, 910C, 910E or 95xx
v Firmware Checkpoints E134 or E701
v System Power Control Network Error Code 1011840C
v Firmware Error Codes 25Cyy001, 25Cyy002, 25B00001 or 25B00004
v Service Processor Error Codes 406x001x, 406x010x, 406x013x, 406x0CA6 or 450x30xx
v Common Firmware Error Codes B1xx466x
v Problem Determination Error Code M0MEM002
These checkpoints and error codes are referred to as a memory code in this MAP.
Purpose of This MAP
This MAP is used to troubleshoot a problem when the system stops with a memory checkpoint or memory
error code on the operator panel display or console.
Notes:
1. If the symptom changes while you are using this MAP, check for loose cards, cables, and obvious
problems. If you do not find a problem, go to “MAP 1548: Processor Subsystem Problem Isolation” on
page 123.
2. The service processor may have recorded one or more symptoms in its error log. It is a good idea to
examine that error log before proceeding (see Service Processor System Information Menu).
3. The service processor may have been set by the user to monitor service operations and to attempt
recoveries. You might want to disable these actions while you diagnose and service the system. If you
disable them, make note of their current settings so that you can restore them before you leave. The
following settings may be of interest to you.
Surveillance From the Service Processor Setup Menu, go to the
Surveillance Setup menu and disable surveillance.
Unattended Start From the Service Processor System Power Control Menu,
disable unattended start mode.
Reboot Policy From the System Power Control Menu, go to the
Reboot/Restart Policy Setup Menu and set:
1. Number of reboot attempts to 0 (zero)
2. Use OS-Defined restart policy to No.
3. Enable supplemental restart policy to No.
Call Out From the Call-In/Call-Out Setup Menu, go to the Serial
Port Selection Menu and disable call-out on both serial
ports.
General Memory Information
Be sure to unplug the power cables before removing or installing the processor cards or memory DIMMs
to avoid damage to them.
It is acceptable for the memory DIMMs to partially populate each processor card in a multi-processor card
system. For example, 4 memory DIMMs can be placed onto processor card 1, and another 4 memory
DIMMs can be placed onto processor card 2. The system uses both sets of memory DIMMs
indiscriminately.
68 Service Guide
Memory DIMMs, however, must be installed in matched (size and speed) quads. Refer to “Memory
DIMMs” on page 427 for labeling of the processor card and instructions on memory DIMM removal and
installation. Do not, however, replace the covers as directed while troubleshooting this problem. Memory
DIMM quads can only be installed in one of the following two memory DIMM slot combinations:
v 1, 3, 6 and 8
v 2, 4, 5 and 7
There is no requirement that memory DIMM slots 1, 3, 6 and 8 be populated before memory DIMM slots
2, 4, 5 and 7.
Attention: Go to the service processor menus. In the System Information Menu, select Memory
Configuration/ Deconfiguration. Verify on the first screen that memory repeat gard is disabled. If it is
enabled, disable it for the duration of this MAP. When exiting this MAP to go to MAP 0410 in the
Diagnostic Information for Multiple Bus Systems; also return the setting to enabled if necessary.
Step 1240-1
1. Record the error code that sent you to this map.
2. Go to the service processor menus and disable memory repeat gard if it is not already disabled.
3. Turn off the power. Remove the power cord.
4. Remove and reinstall the processor cards.
5. Reattach the power cord. Wait for OK to display in the operator panel.
6. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
7. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded earlier in this step still appear?
No Reseating the processor card(s) has corrected the problem.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Yes If there is only 1 processor card installed, tag it as ²suspect bad² and go to “Step 1240-6” on
page 71. If there are 2 processor cards installed, go to “Step 1240-2”.
Step 1240-2
1. Turn off the power. Remove the power cord.
2. Remove processor card 2.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” still appear?
No Go to “Step 1240-3” on page 70.
Yes Go to “Step 1240-6” on page 71.
Chapter 3. Maintenance Analysis Procedures (MAPs) 69
Step 1240-3
1. Turn off the power. Remove the power cord.
2. Remove processor card 1.
3. Reinstall processor card 2 in slot 1.
4. Reattach the power cord. Wait for OK to display in the operator panel.
5. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
6. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
No Go to “Step 1240-4”.
Yes Go to “Step 1240-6” on page 71.
Step 1240-4
1. Turn off the power. Remove the power cord.
2. Install processor card 1 in slot 2.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Have any DIMMs been deconfigured or does the error code recorded in “Step 1240-1” on page 69
still appear?
No Reseating the processor cards has corrected the problem.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Yes Go to “Step 1240-5” on page 71.
70 Service Guide
Step 1240-5
One of the FRUs remaining in the system unit is defective.
1. Turn off the power. Remove the power cord.
2. Exchange the following FRUs one at a time in the order listed:
v CEC backplane
v Processor cards
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
No Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Yes Reinstall the original FRU.
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all FRUs have been exchanged, go to “MAP 1548: Processor
Subsystem Problem Isolation” on page 123.
Step 1240-6
1. Turn off the power. Remove the power cord.
2. Reseat the DIMMs. Reinstall the processor card.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
NO Reseating the DIMMs has corrected the problem.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES If there is only one quad of DIMMs on the card, go to “Step 1240-10” on page 73.
If there are two quads of DIMMs on the card, go to “Step 1240-7” on page 72.
Chapter 3. Maintenance Analysis Procedures (MAPs) 71
Step 1240-7
1. Turn off the power. Remove the power cord.
2. Remove one quad of DIMMs. Record the positions of the DIMMs as they are removed. When
instructed to reinstall the DIMMs, install them back into the same slots from which they were removed.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
NO Go to “Step 1240-8”.
YES Go to “Step 1240-10” on page 73.
Step 1240-8
1. Turn off the power. Remove the power cord.
2. Move the quad of DIMMs located on the card to the other quad’s slots.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
NO Go to “Step 1240-9”.
YES Replace the processor card.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1240-9
The failing DIMM(s) is in the quad of DIMMs that was removed in “Step 1240-7”. Do the following:
1. Turn off the power. Remove the power cord.
2. Remove the quad of DIMMs that is on the processor card and reinstall the quad of DIMMs that had
been removed. The DIMMs can be reinstalled in either quad.
Go to “Step 1240-11” on page 73.
72 Service Guide
Step 1240-10
1. Turn off the power. Remove the power cord.
2. Move the quad of DIMMs that is on the processor card to the other quad’s location.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
NO Replace the processor card.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES One or more of the DIMMs on the card is defective. Go to “Step 1240-11”.
Step 1240-11
Attention: If only one quad of DIMMs is on the processor card, you must have at least one replacement
DIMM of the same size and speed as the DIMMs that are on the card.
If there were two quad of DIMMs on the processor card, but they are not the same size, your must have at
least one replacement DIMM of the same size and speed as the DIMMs in the quad that is failing.
If there were two quad of DIMMs on the processor card, and they are the same size and speed, DIMMs
from the good quad can be used when isolating the failing DIMM in the suspect quad.
1. Turn off the power. Remove the power cord.
2. Remove one of the DIMMs on the processor card and install the replacement DIMM or another known
good DIMM.
3. Reattach the power cord. Wait for OK to display in the operator panel.
4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error
code before it reaches the SMS menus.)
5. Power off the system using the white button. After the OK displays in the operator panel, look at the
memory configuration/deconfiguration menu.
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still
appear?
NO The DIMM that was just removed was the failing DIMM; replace it.
Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Go to “Step 1240-12” on page 74.
Chapter 3. Maintenance Analysis Procedures (MAPs) 73
Step 1240-12
Have all the DIMMs in the quad located on the processor card been replaced?
NO Go to “Step 1240-11” on page 73 to test the next DIMM of the quad on the processor card.
YES Go to “Step 1240-13”.
Step 1240-13
Did the processor originally have only one quad of DIMMs at the start of this map?
NO Call service support.
YES Replace the processor card. This ends the procedure. Enable memory repeat gard if necessary.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
74 Service Guide
MAP 1320: Service Focal Point Procedures
These procedures define the steps to be taken when servicing a machine equipped with Service Focal
Point (SFP).
Note: The various codes that might display on the HMC are all listed as error codes by Service Focal
Point (SFP). To assist you in identifying the types of error data in this guide, use the following table.
Number of Digits in Error Code Error code Service Guide Name
Any Contains # (pound sign) Menu Goal
Any Contains - (hyphen) SRN
5 Does not contain # or - SRN
6 Does not contain # or - Error Code
8 Does not contain # or - Error Code, Service Reference Code
(SRC)
For steps to be taken when beginning service on a machine with SFP, see the “MAP 1321: Quick Entry
MAP for Systems with Service Focal Point” on page 76.
For steps to be taken after service is complete on a machine with SFP, see the “MAP 1322: End of Call
MAP for Systems with Service Focal Point” on page 84.
Chapter 3. Maintenance Analysis Procedures (MAPs) 75
MAP 1321: Quick Entry MAP for Systems with Service Focal Point
Step 1321-1
For future reference, record the error code and the location code (that brought you to this MAP) of the
FRU to be replaced. Record any other symptoms if an error code is not available.
Does the system have an HMC attached?
No Go to “Step 1321-22” on page 78.
Yes Go to “Step 1321-2”.
Step 1321-2
At the HMC console, start Service Focal Point (SFP) and examine the service action event log for any
open service action events. For SFP operating instructions, see “Service Focal Point” on page 48.
Does the HMC appear to operate correctly?
No Go to “Quick Entry MAP” on page 58, look up the HMC symptom, and follow the directions there.
Yes Go to “Step 1321-3”.
Step 1321-3
Are there any service action events that are open?
No Go to “Step 1321-40” on page 82.
Yes Go to “Step 1321-4”.
Step 1321-4
Record the list of open service action events.
Step 1321-5
From the list of serviceable events recorded in “Step 1321-4”, open the details of each service action
event and examine the FRU list and error code(s) for each.
Step 1321-6
Are there any serviceable actions events with error codes of the form #xxxxxx in the list of error
codes found in “Step 1321-5”?
No Go to “Step 1321-7”.
Yes Go to “Step 1321-18” on page 78.
Step 1321-7
Are there any serviceable events with FRU location codes reported in “Step 1321-5”?
No Go to “Step 1321-14” on page 77.
Yes Go to “Step 1321-8”.
Step 1321-8
From the list of open service action events with FRUs reported, examine the details of each service action
event and record the error code.
76 Service Guide
Step 1321-9
Examine the list of error codes you obtained from “Step 1321-8” on page 76 and look for any of the form
4xxB xxxx, A0D-34x, or A1D-34x.
Step 1321-10
Are there any error codes from step“Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or
A1D-34x?
No Go to “Step 1321-13”.
Yes Go to “Step 1321-11”.
Step 1321-11
Do any of the error codes from “Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or
A1D-34x have the same first two characters on the left as the first two characters on the left of the
error code recorded in “Step 1321-1” on page 76 that sent you to this MAP?
No Go to “Step 1321-13”.
Yes Record the error codes from “Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or
A1D-34x that have the same first two characters on the left as the first two characters on the left
of the error code recorded in “Step 1321-1” on page 76 that sent you to this MAP. Also, record the
service action event associated with each error code. Go to “Step 1321-12”.
Step 1321-12
From the list of open service action events with error codes of the form 4xxB xxxx, A0D-34x, or A1D-34x
recorded in “Step 1321-11”, choose the service action event with the earliest time stamp (the one that
occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” with this new
code.
When future steps refer to the error code recorded in “Step 1321-1” on page 76, use this new error code
and its location code(s) instead of the original error code that sent you to this MAP.
Step 1321-13
From the list of open service action events with FRUs reported, choose the service action event with the
earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to
“Step 1321-14” with this new error code. When future steps refer to the error code recorded in “Step
1321-1” on page 76, use this new error code and its location code(s) instead of the original error code that
sent you to this MAP.
Step 1321-14
At the HMC console, open Service Focal Point (SFP) and examine the service action event log for any
closed service action events. For SFP operating instructions, refer to “Service Focal Point” on page 48.
Step 1321-15
Are there any service action events that are closed?
No Go to the table in “Entry MAP” on page 57.
Yes Go to “Step 1321-16” on page 78.
Chapter 3. Maintenance Analysis Procedures (MAPs) 77
Step 1321-16
Examine the details of the ten most-recently closed service action events.
Are there any error codes or symptoms in the ten latest closed service action events that are the
same as the error code or symptom recorded in “Step 1321-1” on page 76?
No Go to the table in “Entry MAP” on page 57.
Yes Go to “Step 1321-17”.
Step 1321-17
Examine the FRUs of the same error codes or symptoms.
Are there any FRUs in the same error codes or symptoms of the closed service action events left
to be replaced?
No Go to the table in “Entry MAP” on page 57.
Yes Replace the next FRU of same error codes or symptoms of the closed service action events. Go
to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems.
Step 1321-18
From the list of serviceable events with error code(s) of the form #xxxxxx, examine the details of each
serviceable event to display the description field of the error code(s) of the form #xxxxxx.
Step 1321-19
Double click on the description field to view the complete text.
Step 1321-20
Perform any actions that may be listed in the descriptions of all error codes of the form #xxxxxx before
proceeding.
Step 1321-21
Did you find a problem?
No Go to “Step 1321-7” on page 76.
Yes Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems.
Step 1321-22
Is Linux installed on all of the system’s partitions?
No Go to “Step 1321-2” on page 76.
Yes Look for and record any additional error information or software messages on the operator panel
or the console (serial TTY, virtual terminal, or HMC console). Go to “Step 1321-23” on page 79.
78 Service Guide
Step 1321-23
Is Linux functional in the partition with the problem?
No Go to “Step 1321-34” on page 81.
Yes Go to “Step 1321-24”.
Attention: “Step 1321-24” through “Step 1321-40” on page 82 must be performed on a partition running
the Linux operating system.
Step 1321-24
If your system is functional, do the following:
Examine the Linux system log by logging in to the system as the root user and entering the following
command:
cat /var/log/messages |grep RTAS |more
Linux run-time error messages are logged in the messages file under /var/log. The following is an
example of the Linux system error log.
Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan begin --------
Aug 27 18:13:41 rasler kernel: RTAS: Location Code: U0.1-P1-C1
Aug 27 18:13:41 rasler kernel: RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL
Aug 27 18:13:41 rasler kernel: RTAS: initiator: UNKNOWN target: UNKNOWN
Aug 27 18:13:41 rasler kernel: RTAS: Status: predictive new
Aug 27 18:13:41 rasler kernel: RTAS: Date/Time: 20020827 18134000
Aug 27 18:13:41 rasler kernel: RTAS: CPU Failure
Aug 27 18:13:41 rasler kernel: RTAS: CPU id: 0
Aug 27 18:13:41 rasler kernel: RTAS: Failing element: 0x0000
Aug 27 18:13:41 rasler kernel: RTAS: A reboot of the system may correct the problem
Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan end ----------
Step 1321-25
If present, record any additional error information recorded in the Linux system log. If the system is
configured with more than one logical partition with Linux installed, repeat “Step 1321-24” and “Step
1321-25” for all logical partitions that have Linux installed.
Chapter 3. Maintenance Analysis Procedures (MAPs) 79
Step 1321-26
Examine the Linux boot (IPL) log by logging in to the system as the root user and enter the following
command:
cat /var/log/boot.msg |grep RTAS |more
Linux boot (IPL) error messages are logged into the boot.msg file under /var/log. The following is an
example of the Linux boot error log.
RTAS daemon started
RTAS: -------- event-scan begin --------
RTAS: Location Code: U0.1-F3
RTAS: WARNING: (FULLY RECOVERED) type: SENSOR
RTAS: initiator: UNKNOWN target: UNKNOWN
RTAS: Status: bypassed new
RTAS: Date/Time: 20020830 14404000
RTAS: Environment and Power Warning
RTAS: EPOW Sensor Value: 0x00000001
RTAS: EPOW caused by fan failure
RTAS: -------- event-scan end ----------
Step 1321-27
If present, record any additional error information reported in the Linux boot (IPL) log. If the system is
configured with more than one logical partition with Linux installed, repeat “Step 1321-26” and “Step
1321-27” for all logical partitions that have LINUX installed.
Step 1321-28
Examine the Linux extended error data log.
The following is an example of the Linux extended error log.
<3>RTAS daemon started
<3>RTAS: -------- event-scan begin --------
<3>RTAS: Location Code: U0.1-P1-C2
<4>RTAS: Log Debug: 04
4b2726fb04a00011702c0014000000000000000000000000f1800001001801d3ffffffff010000000000000042343138
20202020383030343236464238454134303030303 030303030303030
<4>RTAS: Log Debug: D2
5046413405020d0a000001000271400100000033434d502044415441000001000000000000010000f180000153595
320444154410000000000000000200216271501050920021627150105092002063715010509535243204441544170
2c001400000000000000020018820201d382000000000000000000000000000000000000000000000000000000000
0000000000000000000000000000000000000000000000000000000000000000280048ea400000000000000000000
000000000000000000004350542044415441702cff08000000001c000000702cf0080000000080000000702cf10070
2cf200702c000400000800702c01040bf2002e702c02040c1fffbf702c0300702c1000702c11040bf2002e702c12040
c1fffbf702c1300702ca000702ca108000000000000a03c702ca208000000000000effc702cb000702cb10800000000
0000a03c702cb208000000000000effc702cc000702cc108000000000000a03c702cc208000000000000effc702c3
000702c31080000000000000003702c3208000000000000007b702c8000702c81080000000020e27a39702c820800
000000fffeffff702cd000702cd1080000000010004010702cd208000000007777f3ffffffffffffffffffffffffffffffffffffff
ffffffffffffffffffffffffffff
<3>RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL
<3>RTAS: initiator: UNKNOWN target: UNKNOWN
<3>RTAS: Status: unrecoverable new
<3>RTAS: Date/Time: 20020905 15372200
<3>RTAS: CPU Failure
<3>RTAS: Internal error (not cache)
<3>RTAS: CPU id: 0
<3>RTAS: Failing element: 0x0000
<3>RTAS: -------- event-scan end ---------
80 Service Guide
Step 1321-29
If present, record any additional error information in the Linux extended data error log.
Note: The line(s) in the Linux extended error log that begin with ²<4>RTAS: Log Debug: 04² contain the
error code listed in the next 8 hex characters. In the example above, ²4b27 26fb² is an error code.
If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-28”
on page 80 and “Step 1321-29” for all logical partitions that have Linux installed.
Step 1321-30
Use the lscfg command to list all resources assigned to all partitions. Record the adapter and the partition
for each resource.
Step 1321-31
Compare the list of found resources and partition assignments to the customer’s known configuration to
determine if any device(s) or adapter(s) are missing. Record any missing adapters or devices.
If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-30”
and “Step 1321-31” for all logical partitions that have Linux installed.
Step 1321-32
Go to “Step 1321-33”.
Step 1321-33
Go to “Step 1321-34”.
Step 1321-34
Were any error codes, checkpoints, or symptoms recorded in Steps 1321-1, 1321-22, 1321-25,
1321-27, 1321-29, or 1321-31?
No Go to “Step 1321-35”.
Yes Go to the “Quick Entry MAP” on page 58 with each recorded error code or symptom. Perform the
indicated actions one at a time for each error code until the problem has been corrected. If all
recorded error codes have been processed and the problem has not been corrected, go to “Step
1321-35”.
Step 1321-35
If no additional error information is available and the problem has not been corrected, shut down the
system.
If an HMC is not attached, you need a laptop (and cable, part number 62H4857) capable of connecting to
serial port 1 on the processor subsystem. The Linux login prompt cannot be seen on a laptop connected
to serial port 1.
1. If there is no HMC, attach the laptop and cable to serial port 1 on the processor subsystem.
2. When OK displays in the operator panel, press a key on the virtual terminal on the laptop. The service
processor menus are available on the attached laptop or on the HMC.
3. Examine the service processor (SP) error log. Record all error codes and messages written to the SP
error log. Go to “Step 1321-36” on page 82.
4. If the service processor menus are not available, go to “Step 1321-37” on page 82.
Chapter 3. Maintenance Analysis Procedures (MAPs) 81
Step 1321-36
Boot the system to the system management services (SMS) menus. Record all error codes and messages
written to the SMS error log.
Step 1321-37
Were any error codes recorded in steps 1321-35 or 1321-36
No Go to “Step 1321-41” on page 83.
Yes Go to the “Quick Entry MAP” on page 58 with each error code or symptom you have recorded.
Perform the indicated actions, one at a time, until the problem has been corrected. If all recorded
error codes have been processed and the problem is has not been corrected, go to “Step
1321-41” on page 83.
Step 1321-38
Does the system appear to stop or hang before reaching the login prompt or did you record any
missing resources in “Step 1321-31” on page 81?
No Go to “Step 1321-39”.
Yes There may be a problem with an I/O device. Go to “MAP 1542: I/O Problem Isolation” on
page 111. In MAP 1542, when instructed to boot the system, boot a full system partition.
Step 1321-39
Boot the AIX standalone diagnostics and run diagnostics on all resources. Be sure to boot a full system
partition. Ensure that AIX diagnostics was run on all known resources. You may need to select each
resource individually and run diagnostics on each resource one at a time. Check to make sure that AIX
diagnostics was run on all known resources. You may need to select each resource individually and run
diagnostics on each resource one at a time.
Did AIX standalone diagnostics find a problem?
No You may have a software problem, call service support.
Yes Go to the “Entry MAP” on page 57 with each error code or SRN you have recorded. Until the
problem has been corrected, perform the indicated actions, one at a time, for each SRN or error
code not already processed in “Step 1321-37”. If all recorded SRNs or error codes have been
processed and the problem has not been corrected, call service support.
Step 1321-40
Does the system have Linux installed on one or more partitions?
No Go to “Step 1321-14” on page 77.
Yes Go to “Step 1321-22” on page 78.
82 Service Guide
Step 1321-41
Have you replaced all parts whose location codes were recorded in Steps 1321-25, 1321-27,1321-29,
or 1321-31?
No Replace, one at a time, all parts whose location code was recorded in Steps 1321-25, 1321-27,
1321-29, and 1321-31 that have not been replaced. Power off the system before replacing a part.
After replacing the part, power on the system to check if the problem has been corrected. Go to
“Step 1321-42” when the problem has been corrected, or all parts in the location codes list have
been replaced.
Yes Go to “Step 1321-38” on page 82.
Step 1321-42
Was the problem corrected in Step 1321-41?
No Go to “Step 1321-38” on page 82.
Yes Return the system to its original configuration. This ends the procedure. Go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
Chapter 3. Maintenance Analysis Procedures (MAPs) 83
MAP 1322: End of Call MAP for Systems with Service Focal Point
Step 1322-1
For future reference, record the SRC or symptom and the location code of the FRU you replaced.
Step 1322-2
For service focal point operating instructions, refer to “Service Focal Point” on page 48. At the HMC
console, open Service Focal Point and examine the service action event log for any open service action
events.
Step 1322-3
Are there any service action events that are open?
No If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems
Attached To an HMC” on page 56. This completes the repair, return the system to the customer.
Yes Go to “Step 1322-4”.
Step 1322-4
Record the list of open service action events.
Step 1322-5
From the list of serviceable events recorded in “Step 1322-4”, perform “Step 1322-6” through “Step
1322-33” on page 87 for each open service action event.
Step 1322-6
Determine the error class of the serviceable event. Record for future use.
Step 1322-7
Examine the details of the open service action event.
Is the error code associated with this service action event the same as recorded in ²Step 1322-1²?
No Go to “Step 1322-8”.
Yes Go to “Step 1322-11” on page 85.
Step 1322-8
Examine the FRU list of the service action event.
Are there any FRUs listed for the service action event?
No Go to “Step 1321-11” on page 77.
Yes Go to “Step 1321-9” on page 77.
84 Service Guide
Step 1322-9
Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU
list of the error code recorded in “Step 1322-1” on page 84?
No Go to “Step 1322-10”.
Yes Go to “Step 1322-11”.
Step 1322-10
The FRU list is different.
Is the FRU you replaced and recorded in “Step 1322-1” on page 84 in the list of FRUs for this
service action event?
No Go to “Step 1322-33” on page 87.
Note: There are service action events that will remain open when you leave this MAP. Further
service actions may be required to complete the repair.
Yes Go to “Step 1322-11”.
Step 1322-11
Examine the details of this service action event, and record the partition(s) involved in this service action
event for use in a later step.
Step 1322-12
Is the error code associated with this service action event of the form A11-xxx or A01-xxx?
No Go to “Step 1322-17” on page 86.
Yes Go to “Step 1322-13”.
Step 1322-13
Have you begun a list of “Axx” partitions from prior service action events that you processed in
this MAP?
No Go to “Step 1322-14”.
Yes Go to “Step 1322-15”.
Step 1322-14
Begin a new list of “Axx” partitions by copying the list of partitions obtained in “Step 1322-11”. Go to “Step
1322-16” on page 86.
Step 1322-15
Add the partition list obtained in “Step 1322-11” to the existing list of “Axx” partitions obtained from
processing previous service action events in this MAP.
Chapter 3. Maintenance Analysis Procedures (MAPs) 85
Step 1322-16
Remove all entries in the list of all partition(s) you recorded in “Step 1322-11” on page 85. If you are
referred to the list of partition(s) obtained in “Step 1322-11” on page 85 in future steps, the list is empty.
Go to “Step 1322-17”.
Step 1322-17
Select and highlight the service action event from the “Error Associated With This Serviceable Event”
window.
Step 1322-18
Select the Close Event button.
Step 1322-19
Add comments for the serviceable event. Include any unique additional information. Select the OK button.
Step 1322-20
The following steps will add or update FRU information.
Step 1322-21
Did you replace, add, or modify a FRU of the open service action event?
No Go to “Step 1322-23”.
Yes Go to “Step 1322-22”.
Step 1322-22
From the FRU list, select a FRU that you need to update. Double-click on the FRU and update the FRU
information. Go to “Step 1322-24”.
Step 1322-23
Select the No FRU Replaced for this Serviceable Event option.
Step 1322-24
Select OK to close the service action event.
Step 1322-25
Is the list of all partition(s) you recorded in “Step 1322-11” on page 85 empty?
No Go to “Step 1322-26”.
Yes Go to “Step 1322-33” on page 87.
Step 1322-26
Does the list of all partition(s) you recorded in “Step 1322-11” on page 85 contain more than one
entry?
No Go to “Step 1322-33” on page 87.
Yes Go to “Step 1322-27” on page 87.
86 Service Guide
Step 1322-27
Is the error class recorded in “Step 1322-6” on page 84 AIX?
No Go to “Step 1322-33”.
Yes Go to “Step 1322-28”.
Step 1322-28
Perform the following steps for each entry in the list of all partition(s) you recorded in “Step 1322-11” on
page 85, except the partition you were using to debug the original problem.
Step 1322-29
From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX
command prompt.
Step 1322-30
1. When the diagnostic operating instructions are displayed, press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If the resource associated with your
repair action is not displayed on the Resource List, select sysplanar0.
5. Click commit after you have made your selection.
Step 1322-31
Exit from diagnostics in this partition and return to the AIX prompt.
Step 1322-32
Have all the partitions in the list of all partition(s) you recorded in “Step 1322-11” on page 85 been
processed?
No Go to “Step 1322-28” to process the next partition in the list you recorded in “Step 1322-11” on
page 85.
Yes Go to “Step 1322-33”.
Step 1322-33
Have all the serviceable events recorded in “Step 1322-4” on page 84 been processed?
No Go “Step 1322-5” on page 84 and process the next service action event in the list of serviceable
events recorded in the list of serviceable events recorded in “Step 1322-4” on page 84.
Yes Go to “Step 1322-34” on page 88.
Chapter 3. Maintenance Analysis Procedures (MAPs) 87
Step 1322-34
While processing all service action events, were you directed to “Step 1322-14” on page 85?
No If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems
Attached To an HMC” on page 56. This completes the repair, return the system to the customer.
Note: If during the processing of the list of open service action events, some service actions
events were left open, further service actions may be required to complete the repair.
Yes Go to “Step 1322-35”.
Step 1322-35
Perform the following steps for each entry in the list of “Axx” partitions you began recording in “Step
1322-14” on page 85, except the partition you were using to debug the original problem.
Step 1322-36
From the HMC virtual terminal window of a partition in the list of “Axx” partitions, type diag at the AIX
command prompt.
Step 1322-37
1. When the diagnostic operating instructions are displayed, press Enter.
2. Select the Task Selection option.
Note: If the terminal type is not defined, you are prompted to define it before you can proceed.
3. Select the Log Repair option.
4. Select the resource associated with the repair action. If the repair action was reseating a cable or
adapter, select the resource associated with that repair action. If the resource associated with your
repair action is not displayed on the Resource List, select sysplanar0.
5. Click commit after you have made your selection.
Step 1322-38
Exit from diagnostics in this partition and return to the AIX prompt.
Step 1322-39
Have all the partitions in the list of “Axx” partitions(s) you began recording in “Step 1322-14” on
page 85 been processed?
No Go to “Step 1322-35” to process the next partition in the list you recorded in “Step 1322-14” on
page 85.
Yes If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems
Attached To an HMC” on page 56. This completes the repair, return the system to the customer.
Note: If during the processing of the list of open service action events, some service actions
events were left open, further service actions may be required to complete the repair.
88 Service Guide
MAP 1520: Power
Notes:
1. This is not a start-of-call MAP. Use this Power MAP only if you have been directed here from another
MAP, an SRN in the Diagnostic Information for Multiple Bus Systems, or an error code.
2. This procedure is used to locate power problems in system units. If a problem is detected, this
procedure helps you isolate the problem to a failing unit.
3. The 286/C4 and 286/E4 can accommodate two power supplies installed with separate power
receptacles on the rear of the power supply. There are three LEDs located on the front and top of the
power supply:
v AC Good LED (green)
v DC Good LED (green)
v Power Supply Fault Indicator LED (amber)
The LEDs indicate the condition of the power supply. Refer to “Component LEDs” on page 466 for
additional information.
4. The condition of two power supplies being present, but only one working, can be determined by the
service processor and a warning level EPOW.
5. Observe the following safety notices during service procedures.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for the system unit
and all attached devices are unplugged.
When adding or removing any additional devices to or from the system, ensure that the power
cables for those devices are unplugged before the signal cables are connected. If possible,
disconnect all power cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communications lines.
D05
CAUTION:
This product is equipped with a three-wire power cable and plug for the user’s safety. Use this
power cable with a properly grounded electrical outlet to avoid electrical shock.
C01
Chapter 3. Maintenance Analysis Procedures (MAPs) 89
DANGER
To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before
relocating the system.
D01
FRU Identify LEDs
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include, but are not limited to the following:
v Rack identify LED
v CEC drawer identify LED
v I/O drawer identify LED
v RIO port identify LED
v FRU identify LED
– Power subsystem FRUs
– CEC subsystem FRUs
– I/O subsystem FRUs
v I/O adapter identify LED
v DASD identify LED
The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy,
followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to
locate the failing FRU more easily.
Any identify LED in the system can be flashed when the system is in the failed state with power on by
using the service processor LED Control Menu contained in the “System Information Menu” on page 317.
Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task
through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is
outlined in the Diagnostic Information for Multiple Bus Systems.
If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be
flashed by one of following:
v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System
Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the
open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED,
type FRU-LED-MENU on the command line.
v If the system is logically partitioned, the HMC must be attached. For instructions on activating and
deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management
Console Installation and Operations Guide, order number SA38-0590.
Symptom Action
The base system drawer will not power on and no error
codes are available.
Go to “MAP 1521: Base System Drawer With Power
Failure and No Error Codes” on page 91.
The I/O drawer does not power on and the power LED is
not lit.
Go to “Map 1522: Cannot Power On the I/O Drawer,
Power Present LED Does Not Come On” on page 98.
Error code 1011 1B0x in the SPCN error codes table
directed you here.
Go to “Map 1523: The SPCN Error Codes Table Directed
You Here and the Error Code is 1011 1B0x” on page 105.
90 Service Guide
MAP 1521: Base System Drawer With Power Failure and No Error
Codes
Step 1521-1
For 286/C4, do the following:
1. Place the drawer into the service position as described in “Placing the 286/C4 into the Service
Position” on page 364.
2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366.
For 286/E4, do the following:
1. Place the deskside unit into a comfortable position for you to work on.
2. Remove the service access cover as described in “Service Access Cover Removal (286/E4)” on
page 371.
You may have been directed to this MAP for one or more of the following reasons:
v The ac power supply power LEDs are not on. The power LEDs on the operator panel and the power
LEDs on the rear of the system unit are not blinking. The operator panel display is blank. Do the
following:
– 286/C4, go to “Step 1521-2” on page 92.
– 286/E4, go to “Step 1521-5” on page 93.
v The ac power supply power LEDs are on. If equipped with dc power, the dc power supply power LEDs
are blinking. Power LEDs, for either ac or dc power supplies, on the operator panel and the rear of the
system unit are not blinking, and the operator panel display is blank. Go to “Step 1521-7” on page 94.
v The ac power supply power LEDs are on. If equipped with dc power, the dc power LEDs are blinking.
Power LEDs located on the operator panel and the rear of the system unit are blinking, and OK is
displayed on the operator panel.
When the power button, located on the operator panel, is pressed:
– If equipped with dc power, the dc power LEDs, located on the power supplies, do not come on.
– The power LED (for either ac or dc powered systems), located on the operator panel, does not come
on.
– None of the cooling fans start to turn.
Go to “Step 1521-7” on page 94.
v The ac power supply power LEDs are on. If equipped with dc power, the dc power supply power LEDs
are blinking. The power LED, located on the operator panel, is blinking and OK is displayed on the
operator panel.
When the power button on the operator panel is pressed:
– If equipped with dc power, the dc power LEDs on the power supplies come on.
– The power LED (for either ac or dc powered systems), on the operator panel, comes on and all the
fans start to turn. However, the dc power LEDs located on the power supplies, and the power LED
located on the operator panel and the cooling fans do not stay on.
Go to “Step 1521-7” on page 94.
v An SRN referenced in the Diagnostic Information for Multiple Bus Systems is listed in MAP 1521 in the
²Actions/Descriptions² column. Go to “Step 1521-11” on page 96.
Chapter 3. Maintenance Analysis Procedures (MAPs) 91
Step 1521-2
Is the green power light on the power distribution panel on?
NO Go to “Step 1521-3”.
YES Go to “Step 1521-5” on page 93.
Step 1521-3
Do the following:
1. Unplug the main power cable from the wall outlet.
2. If you are servicing a drawer unit that is mounted in a rack, unplug the main power cable from the
power distribution panel.
3. Check that the power cable has continuity.
4. Check that the wall outlet has been wired correctly with the correct voltage.
Did you find a problem?
NO Go to “Step 1521-4”.
YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1521-4
If you are servicing a drawer unit that is mounted in a rack, do the following:
1. Unplug the the system unit power cables from the power distribution panel.
2. Plug in the main power cable into the power distribution panel.
3. Plug in the main power cable from the wall outlet.
Is the green power light on the power distribution panel on?
NO Replace the rack-mounted power distribution panel.
YES Go to “Step 1521-5” on page 93.
92 Service Guide
Step 1521-5
Turn off the power and do the following:
For 286/C4:
1. Unplug the the system unit power cables from the power distribution panel.
2. Unplug the power cables from the system unit.
3. Check that the power cables have continuity.
4. Check that the power distribution panel outlets have the correct voltage.
For 286/E4:
1. Unplug the power cables from the wall outlet.
2. Unplug the power cables from the system unit.
3. Check that the power cables have continuity.
4. Check that the wall outlet has been wired correctly with the correct voltage.
Did you find a problem?
NO Go to “Step 1521-6”.
YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1521-6
1. One of the following may be defective:
v Power supplies
v CEC backplane
v Memory DIMMs
v Processor cards
2. To test each FRU, exchange the FRUs that have not already been exchanged in the following order.
v Primary power supply
v Redundant power supply (if installed)
v CEC backplane
v Memory DIMMs
v Processor card(s)
3. Turn off the power.
4. Unplug the power cables from the system unit.
5. Exchange one of the FRUs in the list.
6. Connect the power cables to the system unit.
Do the ac power LEDs on the power supplies come on within 30 seconds after applying ac power?
NO Reinstall the original FRU.
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Chapter 3. Maintenance Analysis Procedures (MAPs) 93
Step 1521-7
1. Unplug the power cables from the system unit.
2. Exchange the operator panel assembly.
3. Plug the power cables into the system unit and wait for the OK prompt located on the operator panel
display.
4. Turn on the power.
Does the power LED on the operator panel come on and stay on?
NO Reinstall the original operator panel assembly, go to “Step 1521-8”.
YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1521-8
1. Turn off the power.
2. Unplug the power cables from the system unit.
3. Record the slot numbers of all the adapters. Label and record the location of any cables attached to
the adapters. Disconnect any cables attached to the adapters and remove all the adapters
4. Remove all but one quad of memory DIMMs from the first processor card.
5. Remove processor card #2 (if installed).
6. Unplug the signal and power cables from the DASD backplane.
7. Unplug the disk drives from the DASD backplane.
8. Unplug the signal and power cables from any devices in the media bays.
9. Remove all the fans.
10. Plug the power cables into the system unit.
11. Turn on the power.
Does the power LED on the operator panel come on and stay on?
NO Go to “Step 1521-9”.
YES Go to “Step 1521-10” on page 95.
Step 1521-9
Note: Either the processor card, PCI riser card, CEC backplane, or the power supplies may be defective.
To test each FRU, exchange the FRUs that have not already been exchanged in the following order:
1. Primary power supply
2. Redundant power supply (if installed)
3. CEC backplane.
4. Processor card
5. Memory
6. PCI riser card
94 Service Guide
Do the following as you exchange each of the above FRUs in the order listed:
1. Turn off the power.
2. Unplug the power cable from the system unit.
3. Exchange one of the FRUs in the list.
4. Connect the power cable to the system unit.
5. Turn on the power.
Does the power LED on the operator panel come on and stay on?
NO Reinstall the original FRU.
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1521-10
One of the parts that was removed or unplugged is causing the problem. Install or connect the parts in the
following order:
1. Memory DIMMs on processor card #1
2. Processor card #2 (if removed)
3. DASD backplane
4. Disk drives
5. Media bay devices
6. PCI adapters, lowest slot to highest slot
7. Fans
Turn on the power after each part is installed or connected. If the system does not power on or the power
LED located on the operator panel does not stay on, the most recently installed or connected part is
causing the failure.
Do the following as you exchange each of the above FRUs in the order listed.
1. Turn off the power.
2. Unplug the power cable from the system unit.
3. Install or connect one of the parts in the list.
4. Plug the power cable into the system unit.
5. Turn on the power.
Does the power LED on the operator panel come on and stay on?
NO Replace the last part installed.
v If memory DIMMs on the first processor card were the last parts installed, exchange the
memory DIMMs one at a time to isolate the failing module. If all memory DIMMs have been
exchanged and the problem is not resolved, replace the processor card.
v If the second processor card was just installed, exchange the memory DIMMs one at a time to
isolate a possible failing module. If all modules have been exchanged and the problem is not,
resolved replace the processor card.
Repeat these steps until all the parts have been installed. Go to “Step 1521-11” on page 96.
YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Chapter 3. Maintenance Analysis Procedures (MAPs) 95
Step 1521-11
Does the system contain two power supplies?
NO Go to “Step 1521-12”.
YES Go to “Step 1521-14” on page 97.
Step 1521-12
Shut down the system and remove all power cables from the rear of the system unit.
Exchange the following FRUs in the order listed:
1. Power supply
2. CEC backplane
3. PCI riser card
Restart the system and perform error log analysis.
Do you get an SRN indicating a voltage sensor is out of range?
NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
YES Reinstall the original FRU.
Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been
exchanged.
If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-13”.
If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
Step 1521-13
Check that the wall outlet is properly wired and is providing the correct voltage.
Did you find a problem?
NO Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
96 Service Guide
Step 1521-14
Because the 286/C4 and 286/E4 accommodate redundant power supplies, it is not necessary to power off
the system when replacing a power supply.
The power supplies are symmetrical, so replacement starts with the unit closest to the system wall of the
system drawer (power supply 1). For instructions on replacing a power supply, refer to “Power Supplies”
on page 436.
Notes:
1. Always service first the power supply whose green LED, located on the front of the power supplies, is
out.
2. Before removing a power supply, be sure the redundant power supply is operational by observing the
green LED. The green LED must be on solid.
Replace the following FRUs in order:
1. Power supply #1 (primary)
2. Power supply #2 (redundant)
Perform error log analysis.
Do you receive an SRN indicating a voltage sensor out of range?
NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
YES Reinstall the original FRU.
Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been
exchanged.
If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-15”.
If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
Step 1521-15
The problem is within the system unit hardware or with the line voltage or wiring.
Shut down the system and remove the power cable from the system unit. Exchange the following FRUs in
the order listed.
1. PCI riser card
2. CEC backplane
Restart the system and perform error log analysis.
Do you get an SRN indicating a voltage sensor is out of range?
NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
YES Reinstall the original FRU.
Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been
exchanged.
If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-13” on
page 96.
If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
Chapter 3. Maintenance Analysis Procedures (MAPs) 97
Map 1522: Cannot Power On the I/O Drawer, Power Present LED Does
Not Come On
Step 1522-1
Attempt to power on the I/O drawer (see “Starting the System without an HMC Attached” on page 360)
Does the I/O drawer power on?
NO Go to “Step 1522-2”.
YES This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1522-2
Examine the power supply LEDs at the rear of the I/O drawer.
Are the green AC Good LEDs of both power supplies of the I/O drawer on solid?
NO Go to “Step 1522-8” on page 99.
YES Go to “Step 1522-3”.
Step 1522-3
Is the amber fault LED of any power supply of the I/O drawer on solid?
NO Go to “Step 1522-4”.
YES Go to “Step 1522-21” on page 102.
Step 1522-4
Is the amber fault LED of any power supply of the I/O drawer blinking?
NO Go to “Step 1522-5”.
YES Go to “Step 1522-23” on page 103.
Step 1522-5
Is the green DC good LED of any power supply of the I/O drawer off?
NO Go to “Step 1522- 25” on page 103.
YES Go to “Step 1522-6”.
Step 1522-6
Are the green DC good LEDs of both power supplies of the I/O drawer off?
NO Go to “Step 1522-7” on page 99.
YES Go to “Step 1522- 25” on page 103.
98 Service Guide
Step 1522-7
Replace the items in the following list one at a time. Turn the power off before removing the item, and turn
the power back on after replacing the item to determine if the drawer will power on.
1. Replace the power supply with the green DC good LED off.
2. Replace the other power supply with the one just removed from above.
3. Replace the I/O drawer backplane.
Does the I/O drawer power on?
NO Have all the items in the list above been replaced?
NO Go to beginning of this step and replace the next item.
YES Call service support.
YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1522-8
Is this machine connected to dc power?
NO Go to “Step 1522-12” on page 100.
YES Go to “Step 1522-9”.
Step 1522-9
Are all circuit breakers reset at the rack power distribution panel?
NO Reset the breaker switch by first turning off, then turning on.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-10”.
Step 1522-10
Are all the dc power cables seated firmly and connected correctly from the rack power distribution
panel to the rack drawers?
NO Fix the connection.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-11” on page 100.
Chapter 3. Maintenance Analysis Procedures (MAPs) 99
Step 1522-11
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following:
1. Disconnect the dc power cable from each power supply of the I/O drawer.
2. Use a multimeter to measure the dc voltage at the drawer end of each dc power cable.
Is the dc voltage from -40 V dc to -60 V dc?
NO There is no power to the drawer. Refer to the rack documentation to isolate the dc power problem.
The 14T/0 and 14T/4 Installation and Service Guide is appropriate for most systems.
This ends the procedure.
YES Go to “Step 1522-17” on page 101.
Step 1522-12
Are all circuit breakers reset at the rack power distribution unit(s)?
NO Reset the breaker switch.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-13”.
Step 1522-13
Are all the ac power cables seated firmly and connected correctly from the rack power distribution
unit(s) to the rack drawers?
NO Fix the connection.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-14”.
Step 1522-14
Follow the ac power cable(s) from the I/O drawer to the appropriate power distribution unit.
Is the power distribution unit power LED on?
NO Go to “Step 1522-18” on page 102.
YES Go to “Step 1522-15” on page 101.
100 Service Guide
Step 1522-15
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following:
1. Disconnect the ac power cable(s) from the power distribution unit that supplies power to the I/O
drawer.
2. Use a multimeter to measure the ac voltage at the power distribution unit socket(s) where the ac
power cable(s), just removed, was (were) attached.
Is the ac voltage from 200 V ac to 240 V ac?
NO Replace the power distribution unit
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-16”.
Step 1522-16
Reconnect the ac power cable(s) to the power distribution unit.
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following:
1. Disconnect the ac power cable from each power supply of the I/O drawer.
2. Use a multimeter to measure the ac voltage at the drawer end of each ac power cable.
Is the ac voltage from 200 V ac to 240 V ac?
NO Replace the ac power cable(s).
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-17”.
Step 1522-17
Replace the following in the I/O drawer, one at a time, disconnecting the ac power cable (dc power cable
for a dc-powered system) to each power supply before replacement and reconnecting the ac power cable
(dc power cable for a dc-powered system) to each supply after replacement:
v I/O power supply 1.
v I/O power supply 2.
v I/O drawer backplane
Does the operator panel show the OK prompt?
NO Contact your next level of support for assistance.
YES This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Chapter 3. Maintenance Analysis Procedures (MAPs) 101
Step 1522-18
Is the ac power cable from the rack power distribution unit to the customer’s ac power outlet
connected and seated correctly at both ends?
NO Connect the ac power cable correctly at both ends.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-19”.
Step 1522-19
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following:
1. Disconnect the mainline ac power cable from the power distribution unit that supplies power to the I/O
drawer.
2. Use a multimeter to measure the ac voltage at the power distribution unit end of the mainline ac power
cable.
Is the ac voltage from 200 V ac to 240 V ac?
NO Go to “Step 1522-20”.
YES Replace the power distribution unit.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1522-20
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following:
1. Disconnect the mainline ac power cable from the customer’s ac power outlet.
2. Use a multimeter to measure the ac voltage at the customer’s ac power outlet.
Is the ac voltage from 200 V ac to 240 V ac?
NO Inform the customer that the ac voltage at the power outlet is not correct.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Replace the mainline ac power cable.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1522-21
Are the amber fault LEDs of both power supplies of the I/O drawer on solid?
NO Go to “Step 1522-22” on page 103.
YES Go to “Step 1522- 25” on page 103.
102 Service Guide
Step 1522-22
Replace the items in the following list one at a time. Turn the power off before removing the item, and turn
the power back on after replacing the item to determine if the drawer will power on.
1. Replace the power supply with the amber fault LED on solid.
2. Replace the other power supply with the one just removed.
3. Replace the I/O drawer backplane.
Does the I/O drawer power on?
NO Have all the items in the list above been replaced?
NO Go to “Step 1522-22” and replace the next item.
YES Call service support.
YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1522-23
Are the amber fault LEDs of both power supplies of the I/O drawer blinking?
NO Go to “Step 1522-24”.
YES Go to “Step 1522- 25”.
Step 1522-24
Perform the action items in the following list one at a time. Turn the power off before removing the item,
and turn the power back on after replacing the item to determine if the drawer will power on.
1. Check all SPCN external cabling.
2. Replace the power supply with the amber fault LED blinking.
3. Replace the other power supply with the one just removed.
4. Replace the I/O drawer backplane.
Does the I/O drawer power on?
NO Have all the items in the list above been replaced?
NO Go to “Step 1522-24”, and replace the next item.
YES Call service support.
YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1522- 25
Disconnect the ac power cable (dc power cable for a dc-powered system) to each power supply of the I/O
drawer and remove the following components from the I/O drawer, one at a time, in the sequence listed:
1. RIO adapter card
2. PCI adapter card(s)
Reconnect the ac power cables (dc power cable for a dc-powered system) to each power supply of the I/O
drawer and attempt to power on the I/O drawer.
Does the I/O drawer power-on?
NO Go to “Step 1522-28” on page 104.
YES Go to “Step 1522-26” on page 104.
Chapter 3. Maintenance Analysis Procedures (MAPs) 103
Step 1522-26
Reinsert the following components in the I/O drawer, one card at a time, disconnecting the ac power cable
(dc power cable in a dc-powered system) to each power supply before reinsertion and reconnecting the ac
power cable (dc power cable for a dc-powered system) to each power supply after reinsertion:
1. RIO adapter card
2. PCI adapter card(s)
Does the I/O drawer power-on?
NO Replace the defective component just reinserted.
Reinsert any remaining components.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1522-27”.
Step 1522-27
Have all components in step “Step 1522-26” been reinserted?
NO Go to “Step 1522-26” to reinsert the next component.
YES The symptom has changed. Call your next level of support for assistance.
This ends the procedure.
Step 1522-28
Replace each I/O drawer, one at a time, disconnecting the power cable to each power supply before
replacement and reconnecting the power cable to each power supply after replacement:
Does the I/O drawer power on?
NO Go to “Step 1522-29”.
YES Reinsert the components removed in “Step 1522- 25” on page 103.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1522-29
Have all components in the previous step been replaced?
NO Go to “Step 1522-28” and replace the next component.
YES Call your next level of support for assistance.
This ends the procedure.
104 Service Guide
Map 1523: The SPCN Error Codes Table Directed You Here and the
Error Code is 1011 1B0x
A base system power supply is reporting a load fault.
Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this
procedure.
Perform the following steps to isolate the problem.
Step 1523-1
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system
drawer.
3. At the top of the base system drawer, remove the following cards, in the sequence listed:
a. Processor card(s)
b. RIO adapter card(s)
c. PCI adapter card(s)
d. DASD device
e. Media bay device(s)
4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the base system drawer.
5. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Go to “Step 1523-7” on page 107.
YES Go to “Step 1523-2”.
Step 1523-2
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system
drawer.
3. Remove base system power supply 2.
4. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power
supply 1.
5. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Go to “Step 1523-3” on page 106.
YES Go to “Step 1523-4” on page 106
Chapter 3. Maintenance Analysis Procedures (MAPs) 105
Step 1523-3
1. Turn off the power.
2. Reinstall base system power supply 2 that was removed in “Step 1523-2” on page 105.
3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power
supply 2.
4. Turn on the power.
Is the error code 1011 1B0x displayed?
NO The symptom has changed.
This ends the procedure.
YES Replace base system power supply 2, location: U0.1-V2.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Step 1523-4
1. Turn off the power.
2. Reinstall base system power supply 2 that was removed in “Step 1523-2” on page 105.
3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power
supply 2.
4. Disconnect the ac cable (dc power cable in a dc-powered system) to base system power supply 1.
5. Remove base system power supply 1.
6. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Replace base system power supply 1, location: U0.1-V1.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
YES Go to “Step 1523-5”.
Step 1523-5
1. Turn off the power.
2. Reinstall base system power supply 2 that was removed in “Step 1523-4”.
3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power
supply 2.
4. Remove base system fan 3. Disregard a base system fan error code if it occurs during this step.
5. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Replace base system fan 3, location: U0.1-F3.This ends the procedure. Go to ²MAP 0410:
Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Go to “Step 1523-6” on page 107.
106 Service Guide
Step 1523-6
1. Turn off the power.
2. Reinstall base system fan 3 that was removed in “Step 1523-5” on page 106.
3. Remove base system fan 4. Disregard a base system fan error code if it occurs during this step.
4. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Replace base system fan 4, location: U0.1-F4. This ends the procedure. Go to ²MAP 0410:
Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Replace the base system backplane, location: U0.1-P1. This ends the procedure. Go to ²MAP
0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1523-7
1. Turn off the power.
2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system
drawer.
3. Reinstall one of the cards that was removed in “Step 1523-1” on page 105, in the sequence listed:
a. Processor cards
b. RIO adapter card(s)
c. PCI adapter card(s)
d. DASD device(s)
e. Media bay device(s)
4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the base system drawer.
5. Turn on the power.
Is the error code 1011 1B0x displayed?
NO Go to “Step 1523-8”.
YES The last card you installed is defective. Replace the last card you installed. This ends the
procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus
Systems.
Step 1523-8
Have all the cards that were removed in “Step 1523-7” been reinstalled?
NO Go to “Step 1523-7” and reinstall the next card in the list.
YES Replace the base system backplane, location: U0.1-P1.
This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Chapter 3. Maintenance Analysis Procedures (MAPs) 107
MAP 1540: Problem Isolation Procedures
These procedures are used to locate problems in the processor subsystem, I/O subsystem(s) or rack. If a
problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in
the following table; then follow the instructions given in the Action column.
FRU Identify LEDs
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include, but are not limited to the following:
v Rack identify LED
v CEC drawer identify LED
v I/O drawer identify LED
v RIO port identify LED
v FRU identify LED
– Power subsystem FRUs
– CEC subsystem FRUs
– I/O subsystem FRUs
v I/O adapter identify LED
v DASD identify LED
The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy,
followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to
locate the failing FRU more easily.
Any identify LED in the system can be flashed when the system is in the failed state with power on by
using the service processor LED Control Menu contained in the “System Information Menu” on page 317.
Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task
through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is
outlined in the Diagnostic Information for Multiple Bus Systems.
If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be
flashed by one of following:
v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System
Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the
open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED,
type FRU-LED-MENU on the command line.
v If the system is logically partitioned, the HMC must be attached. For instructions on activating and
deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management
Console Installation and Operations Guide, order number SA38-0590.
108 Service Guide
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include but are not limited to:
Problem Isolation Procedures
Symptom/Reference Code/Checkpoint Action
You were sent here by the Diagnostic Information for
Multiple Bus Systems book.
Go to “MAP 1542: I/O Problem Isolation” on page 111.
406x 00B7 Go to “MAP 1548: Processor Subsystem Problem
Isolation” on page 123.
406x 0EB1, 406x 0EB2, 406x 0EB3, 406x 0EB4,
406x 0EB5, 406x 0EB6, 406x 0EB7, 406x 0EB8,
406x 0EB9, 406x 0EBA, 406x 0EBB, 406x 0EBC
Go to “MAP 1549: Attention Problem Isolation” on
page 137.
406x 0EB0 Go to “MAP 1541: JTAG Problem Isolation” on page 110.
1011 8400, 1011 8401, 1011 8409, 1011 840A,
1011 840B, 406x 0CA5, B1xx 4643, B1xx 4644,
B1xx 4645, B1xx 4646, B1xx 4648
Go to “MAP 1548: Processor Subsystem Problem
Isolation” on page 123.
E101, E102, E10A, E10B, E111, E120, E121, E122,
E130, E131, E132, E133, E134, E135, E138, E139,
E13A, E149, E14C, E191, E19A, E19B, E19D, E1A0,
E1A1, E1A2, E1A3, E1A4, E1A5, E1A6, E1A7, E1A8,
E1A9, E1AA, E1AB, E1AC, E1AD, E1AE, E1AF, E1B1,
E1C4, E1C5, E1C6, E1D0, E1D3, E1D4, E1DB, E1F0,
E1F1, E1F4, E1F5, E1F6, E1F8, E1F9, E1FA, E1FB,
E1FD, E1FE, E5E0, E5E2, E5E3, E5E4, E5E5, E5E6,
E5E7, E5E8, E5E9, E5EE, E5EF, B1xx 4601, B1xx 4602,
B1xx 4650, B1xx 469E, B1xx 469F B1xx 4660, B1xx
4661, B1xx 4662
Go to “MAP 1548: Processor Subsystem Problem
Isolation” on page 123.
E500, E501, E502, E503, E504, E505, E506, E507,
E508, E509, E50A, E50B, E50C, E50D, E50E, E50F,
E51F, E520, E521, E522, E523, E524, E525, E526,
E527, E52F, E530, E531, E532, E533, E534, E535,
E536, E537, E538, E539, E53A, E53B, E53C, E540,
E541, E560, E56F, E57F, E580, E581, E582, E583,
E584, E585, E586, E587, E588, E589, E58A, E58B,
E58C, E58D, E58E, E58F E590, E591, E592, E593,
E594, E595, E59E, E708, E70C, E70F, E7FF, E852,
E861, E871
Go to “MAP 1542: I/O Problem Isolation” on page 111.
B1xx 4634 Go to “MAP 154C: SPCN Cable Problem Isolation” on
page 139.
Chapter 3. Maintenance Analysis Procedures (MAPs) 109
MAP 1541: JTAG Problem Isolation
Step 1541-1
Record the error code and location code(s) that sent you to this MAP.
Step 1541-2
Turn off the power.
Step 1541-3
Replace the following, if present, one at a time and in the order listed:
v First location code item recorded, if any, “Step 1541-1”.
v Second location code item recorded, if any, in “Step 1541-1”.
v Third location code item recorded, if any, in “Step 1541-1”.
v Fourth location code item recorded, if any, in “Step 1541-1”.
v Replace the CEC backplane at U0.1-P1.
v Processor card at location U0.1-P1-C1 if not recorded in “Step 1541-1”.
v All memory DIMMS on processor card 1 at location U0.1-P1-C1-Mn (n equals 1 to 8), if not recorded in
“Step 1541-1”.
v Processor card at location U0.1-P1-C2 if not recorded in “Step 1541-1”.
v All memory DIMMS on processor card 2 at location U0.1-P1-C2-Mn (n equals 1 to 8), if not recorded in
“Step 1541-1”.
v PCI riser card at U0.1-P2.
Step 1541-4
Turn on the power.
Step 1541-5
Did the system stop with the same error code as recorded in “Step 1541-1”?
Yes Go to “Step 1541-6”.
No The part just replaced was defective. This ends the procedure.
Return the system to its original configuration.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1541-6
Have all the cards listed in “Step 1541-3” been replaced?
Yes Go to “Step 1541-7”.
No Go to “Step 1541-2”.
Step 1541-7
Turn off the power. Call for support.
110 Service Guide
MAP 1542: I/O Problem Isolation
This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure,
diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured
system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is
not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU.
This I/O problem-determination MAP isolates I/O card and I/O subsystem failures. When I/O problem
isolation is complete, all cables and cards exhibiting a failure will have been replaced or reseated.
Notes:
1. This MAP assumes that a CD-ROM drive is installed in the base system and connected to an adapter
in the base system, and that a Diagnostics CD-ROM is available.
2. If a general-access password or privileged-access password is installed, you are prompted to enter the
password before the diagnostic CD-ROM can load.
3. The term POST indicators refers to the device mnemonics (words memory, keyboard, network,
SCSI, and speaker) that appear on the system console during the power-on self-test (POST).
4. The service processor may have recorded one or more symptoms in its error log. It is a good idea to
examine that error log before proceeding (see “System Information Menu” on page 317).
5. The service processor may have been set by the user to monitor system operations and to attempt
recoveries. You can disable these actions while you diagnose and service the system. If you disable
them, make notes of their current settings so that you can restore them easily. The following table lists
the settings:
Surveillance From the service processor Setup Menu, go to the
Surveillance Setup Menu and disable surveillance.
(Operating System Surveillance is disabled in LPAR
mode.)
Unattended Start From the service processor System Power Control Menu,
disable unattended start mode.
Reboot Policy From the System Power Control Menu, go to the
Reboot/Restart Policy Setup Menu and set:
a. Number of reboot attempts to 0 (zero)
b. Use OS-Defined restart policy to No
c. Enable supplemental restart policy to No.
Call Out From the Call-In/Call-Out Setup Menu, go to the Serial
Port Selection Menu and disable call-out on both serial
ports.
The steps in this procedure will attempt to slow boot the system into service mode diagnostics. The
following procedure describes booting a nonpartitioned system with an attached 3151 console. The
procedure for booting a partitioned system with an attached HMC is slightly different and is described in
“Standalone Diagnostics Operations” on page 301. “Standalone Diagnostics Operation with an HMC
Attached to the System” describes the procedure for booting online disgnostics (6 key between the
keyboard and speaker indicators) and the following procedure requires booting standalone diagnostics (5
key between the keyboard and speaker indicators). Therefore, on a partitioned system, make sure the
partition you selected to boot in the following procedure has the CD-ROM allocated to it, as described in
Chapter 6, “Using the Online and Standalone Diagnostics”, on page 297.
Chapter 3. Maintenance Analysis Procedures (MAPs) 111
Step 1542-1
1. Ensure that the diagnostics and the operating system are shut down.
2. Turn off the power.
3. Select slow boot mode (select Disable Fast Boot) on the System Power Control menu from the
service processor main menu.
4. Turn on the power.
5. Insert the diagnostic CD-ROM into the CD-ROM drive.
Does the CD-ROM appear to operate correctly?
NO Go to “Boot Problems/Concerns” on page 173.
YES Go to “Step 1542-2”.
Step 1542-2
1. When the keyboard indicator is displayed (the word keyboard), press the 5 key on the firmware
console.
2. Enter the appropriate password when you are prompted to do so.
Is the Please define system console screen displayed?
NO Go to “Step 1542-3”.
YES Go to “Step 1542-4” on page 113.
Step 1542-3
The system is unable to boot standalone diagnostics.
Check the service processor error log and the operator panel for additional error codes resulting from the
slow boot in “Step 1542-1”. Did the slow boot generate a different error code from the one that originally
sent you to MAP 1542?
NO It appears you have a processor subsystem problem. Call for support. This ends the procedure.
YES Restore fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the
service processor main menu. Go to “Checkpoint and Error Code Index” on page 181 and follow
the actions for the new error code.
112 Service Guide
Step 1542-4
The system stopped with the Please define system console prompt appearing on the system console.
Standalone diagnostics can be booted. Perform the following:
1. Follow the instructions on the screen to select the system console.
2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter.
3. If the terminal type has not been defined, you must use the Initialize Terminal option of the FUNCTION
SELECTION menu to initialize the AIX operating system environment before you can continue with the
diagnostics. This is a separate and different operation from selecting the firmware console display.
4. Select Advanced Diagnostic Routines.
5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run
diagnostics on all resources.
Did running diagnostics produce a different symptom?
NO Go to substep 6.
YES Go to “Entry MAP” on page 57. Use the new symptom.
6. Record any devices missing from the list of all adapters and devices. Continue with this MAP. When
you have fixed the problem, use this record to verify that all devices appear when you run System
Verification.
Are there any devices missing from the list of all adapters and devices?
NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original
configuration. Be sure to select fast-boot mode (select Enable Fast Boot) on the System Power
Control menu from the service processor main menu. Go to ²Map 0410: Repair Checkout² in the
Diagnostic Information for Multiple Bus Systems.
YES The boot attempts that follow will attempt to isolate any remaining I/O drawer problems with
missing devices. Ignore any codes that may display on the operator panel unless stated otherwise.
Go to “Step 1542-5”.
Step 1542-5
Examine the base system RIO port 0 of RIO connector card 1 (U0.1-P1-H1/Q1).
Are there any I/O subsystems attached to the base system?
NO Go to “Step 1542-18” on page 119.
YES Go to “Step 1542-6”.
Step 1542-6
There may be missing devices associated with one or more I/O subsystems.
Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that
the system is cabled correctly. Refer to “Subsystem Positioning and Cabling” on page 39 for valid
configurations. Record the current cabling configuration and then continue with the following steps:
1. Turn off the power.
2. At the base system, disconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2).
3. At the other end of the RIO-G cable referred to in substep 2, disconnect the I/O subsystem port
connector 0 (U0.x-P1/Q2). The RIO cable that was connected to RIO-G port 0 in the base system
should now be loose and can be removed. Record the location of this I/O subsystem.
Chapter 3. Maintenance Analysis Procedures (MAPs) 113
4. Examine the connection at the I/O port connector 1 (U0.x-P1/Qu) of the I/O subsytem recorded in
substep 3 on page 113. If the RIO cable attached to the I/O port connector 1 of the I/O subsytem
connects to the I/O port connector 0 of another I/O subsystem, record that the I/O subsystem
identified in substep 3 on page 113 is ²looped one² and record the location of the next I/O subsystem
that is connected to I/O port connector 1 of the ²looped one² I/O subsystem, then go to substep 9.
5. If the RIO cable attached to the I/O port connector 1 (U0.x-P1.1/Q2) of the I/O subsytem recorded in
substep 3 on page 113 connects to RIO-G port 1 (U0.1-P1/Q3) of the base system, record that the
I/O subsystem is ²not looped².
6. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) and reconnect it
to RIO-G port 0 (U0.1-P1/Q2).
7. At the I/O subsystem recorded in substep 3 on page 113, disconnect the I/O port connector 1
(U0.x-P1/Qu) and reconnect to the I/O port connector 0 (U0.x-P1/Q2).
8. Verify that a single RIO cable connects base system RIO-G port 0 (U0.1-P1/Q2) to the I/O subsystem
recorded in substep 4 port connector 0 (U0.x-P1/Q2). Go to substep 13.
9. Record that the next I/O subsystem. This will be the I/O subsystem that is connected to I/O port
connector 1 of the ²looped one² I/O subsystem.
10. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) and reconnect it
to RIO-G port 0 (U0.1-P1/Q2).
11. At the ²looped two² I/O subsystem recorded in substep 14, disconnect the I/O port connector 1
(U0.x-P1/Qu) and reconnect to the I/O port connector 0 (U0.x-P1/Q2) of the ²looped one² I/O
subsystem identified in substep 4.
12. Verify that a single RIO cable connects base system RIO-G port 0 (U0.1-P1/Q2) to the I/O subsystem
recorded in substep 3 on page 113 port connector 0 (U0.x-P1/Qu). Go to substep 13.
13. If an I/O subsystem was ²looped two², verify that the ²looped one² I/O subsystem I/O port connector
1 (U0.x-P1/Qu) is connected to the ²looped two² I/O subsystem I/O port connector 0 (U0.x-P1/Q2).
14. At this point, a chain of one or two I/O subsystems is attached to RIO-G port 0 (U0.1-P1/Q2) of the
base system.
15. Turn on the power to boot standalone diagnostics from CD.
16. If the Please define the System Console screen is displayed, follow directions to select the system
console.
17. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, if necessary).
18. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-7” on page 115.
YES The RIO cable that was removed in substep 3 on page 113 is defective. Replace the RIO cable.
If two I/O subsystems are chained to RIO-G port 0 (U0.1-P1/Q2) of the base system, connect the
new RIO cable from the ²looped two² I/O subsystem I/O port connector 1 (U0.x-P1/Q1) to base
system RIO-G port 1 (U0.1-P1/Q1).
If the I/O subsystem was ²not looped² connect the new RIO cable from the I/O subsystem
recorded in substep 4 I/O port connector 1 (P1/Q1) to base system RIO-G port 1 (U0.1-P1/Q3).
Reconfigure the system to its original condition. Go to “Map 0410: Repair Checkout” in the
Diagnostic Information for Multiple Bus Systems.
114 Service Guide
Step 1542-7
The 286/C4 and 286/E4 server may have up to two I/O subsystems attached to its RIO ports. The
following steps reduce the system I/O to only the I/O of the base system.
1. Turn off the system power.
2. At the base system, disconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2). Record all
connections.
3. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3). Record all
connections.
4. Disconnect the power cables from the I/O subsystem(s) that were attached to the base system. All I/O
subsystems should now be physically disconnected from the base system.
Go to “Step 1542-8”.
Step 1542-8
The base system is running from the integrated I/O or from I/O attached to an adapter installed into one of
the integrated PCI slots.
1. Turn on the power to boot standalone diagnostics from CD.
2. If the Please define the System Console screen is displayed, follow directions to select the system
console.
3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
4. Check that all attached devices and adapters are listed.
If the Please define the System Console screen did not display or all attached devices and adapters are
not listed, the problem is in the base system.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-18” on page 119.
YES Go to “Step 1542-17” on page 119.
Step 1542-9
For any I/O subsystem attached to the RIO-G ports ot the base system:
If the I/O subsystem was recorded as ²looped one², are there any adapters in slots 1, 2, 3, 4, 5, 6,
or 7 (location codes U0.x-P1-I1 through I7)?
NO Go to “Step 1542-10”.
YES Go to “Step 1542-11” on page 116.
Step 1542-10
For any I/O subsystem attached to the RIO-G ports of the base:
If the I/O subsystem was recorded as ²looped two², are there any adapters in slots 1, 2, 3, 4, 5, 6,
or 7 (location codes U0.x-P1-I1 through I7)?
NO Go to “Step 1542-15” on page 118.
YES Go to “Step 1542-12” on page 116.
Chapter 3. Maintenance Analysis Procedures (MAPs) 115
Step 1542-11
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 1, 2, 3, 4, 5, 6, and 7 (location codes Ux.y-P1-I1 through I7) from the
I/O drawer.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays, enter 0 to select this console, press the 0 key on the TTY terminal’s
keyboard.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-10” on page 115.
YES Go to “Step 1542-13” on page 117.
Step 1542-12
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 1, 2, 3, 4, 5, 6, and 7 (location codes Ux.y-P1-I1 through I7) from the
I/O drawer.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s
keyboard.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary)
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-15” on page 118.
YES Go to “Step 1542-13” on page 117.
116 Service Guide
Step 1542-13
If the Please define the System Console screen does display and all attached devices and adapters are
listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the
I/O subystem.
1. Turn off the power.
2. Reinstall one adapter or device that was removed. Use the original adapter cards in their original slots
when reinstalling adapters.
3. Turn on the power to boot standalone diagnostics from CD.
4. If the Please define the System Console screen is displayed, follow directions to select the system
console.
5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
6. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-14”.
YES Reinstall the next adapter or device and return to the beginning of this step. Repeat this process
until an adapter or device causes the Please define the System Console screen to not display or
all attached devices and adapters to not be listed.
After installing all of the adapters and the Please define the System Console screen does display
and all attached devices and adapters are listed, return the system to its original configuration. Go
to “Map 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems.
Step 1542-14
Replace the adapter you just installed with a new adapter and retry the boot to standalone diagnostics
from CD.
1. If the Please define the System Console screen is displayed, follow directions to select the system
console.
2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
3. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-25” on page 122.
YES The adapter was defective. Go to “Step 1542-16” on page 118.
Chapter 3. Maintenance Analysis Procedures (MAPs) 117
Step 1542-15
1. Turn off the power.
2. Disconnect the I/O subsystem power cables.
3. Replace the following parts, one at a time, in the sequence listed:
a. I/O subsystem backplane of any ²looped one² I/O subsystem attached to the base system.
b. I/O subsystem backplane of any ²looped two² I/O subsystem attached to the base system.
4. Reconnect the I/O subsystem power cables.
5. Turn on the power.
6. Boot standalone diagnostics from CD.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769 if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Replace the next part in the list and return to the beginning of this step. Repeat this process until
a part causes the Please define the System Console screen to be displayed and all attached
devices and adapters to be listed. If you have replaced all the items listed above and the Please
define the System Console screen does not display or all attached devices and adapters are not
listed, check any external devices and cabling. If you do not find a problem, contact your next level
of support for assistance.
YES Go to “Step 1542-16”.
Step 1542-16
1. Turn off the power.
2. The item just replaced fixed the problem.
3. If a display adapter with keyboard and mouse were used, reinstall the display adapter card, keyboard,
and mouse.
4. Reconnect the diskette drive cable to the diskette drive connector on the base system.
5. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable.
6. Plug in all adapters that were previously removed but not reinstalled.
7. Reconnect the I/O subsystem power cables that were previously disconnected.
Reconfigure the system to its original condition. Go to “Map 0410: Repair Checkout” in the Diagnostic
Information for Multiple Bus Systems.
118 Service Guide
Step 1542-17
1. Turn off the power.
2. At the base system, reconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2) recorded in “Step
1542-7” on page 115.
3. At the base system, reconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) recorded in “Step
1542-7” on page 115.
4. Reconnect the power cables to the I/O subsystems that were found attached to the base system
RIO-G ports mentioned in substeps 2 and 3 above. All I/O subsystems that were attached to the base
system RIO-G port 0 (U0.1-P1/Q2) and RIO-G port 1 (U0.1-P1/Q3) should now be physically
reconnected to the base system.
5. Make sure the I/O subsystem(s) are cabled correctly as shown in “Subsystem Positioning and Cabling”
on page 39.
6. Turn on the power to boot standalone diagnostics from CD.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-9” on page 115 to isolate I/O problems attached to the base system RIO-G
connectors.
YES The symptom has changed. No items were replaced. Go to “Step 1542-16” on page 118.
Step 1542-18
Are there any adapters in slots 1, 2, 3, or 4 (location codes U0.1-P1-I1 through I4) in the base
system?
NO Go to “Step 1542-19”.
YES Go to “Step 1542-21” on page 120.
Step 1542-19
Replace the CEC backplane, location U0.1-P1
Step 1542-20
1. Boot standalone diagnostics from CD.
2. If the Please define the System Console screen is displayed, follow directions to select the system
console.
3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
4. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-24” on page 121.
YES Go to “Step 1542-16” on page 118.
Chapter 3. Maintenance Analysis Procedures (MAPs) 119
Step 1542-21
1. If it is not already off, turn off the power.
2. Remove all adapters from slots 1, 2, 3, and 4 (location codes U0.1-P1-I1 through I4) in the base
system that are not attached to the boot device.
3. Label and record the location of any cables attached to the adapters.
4. Record the slot number of the adapters.
5. Turn on the power to boot standalone diagnostics from CD.
6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s
keyboard.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-19” on page 119.
YES Go to “Step 1542-22”.
Step 1542-22
If the Please define the System Console screen does display and all attached devices are adapters are
listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the
I/O subsystem.
1. Turn off the power.
2. Reinstall the adapter or device that was removed. Use the original adapter cards in their original slots
when reinstalling adapters.
3. Turn on the power to boot standalone diagnostics from CD.
4. If the Please define the System Console screen is displayed, follow directions to select the system
console.
5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
6. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-23” on page 121.
YES Reinstall the next adapter or device and return to the beginning of this step. Continue repeating
this process until an adapter or device causes the Please define the System Console screen to
not display or all attached devices and adapters to not be listed.
After installing all of the adapters and the Please define the System Console screen displays and
all attached devices and adapters are listed, go to “Step 1542-16” on page 118.
120 Service Guide
Step 1542-23
Replace the adapter you just installed with a new adapter and retry the boot to standalone diagnostics
from CD.
1. If the Please define the System Console screen is displayed, follow directions to select the system
console.
2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
3. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Go to “Step 1542-19” on page 119.
YES Go to “Step 1542-16” on page 118.
Step 1542-24
1. Turn off the power.
2. Disconnect the base system power cables.
3. Replace the following parts, one at a time, in the sequence listed:
a. Internal SCSI cable.
b. DASD backplane.
c. Media backplane.
4. Reconnect the base system power cables.
5. Turn on the power.
6. Boot standalone diagnostics from CD.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Replace the next part in the list and return to the beginning of this step. Repeat this process until
a part causes the Please define the System Console screen to be displayed and all attached
devices and adapters to be listed. If you have replaced all the items listed above and the Please
define the System Console screen does not display or all attached devices and adapters are not
listed, check any external devices and cabling. If you do not find a problem, contact your next level
of support for assistance.
YES Go to “Step 1542-16” on page 118.
Chapter 3. Maintenance Analysis Procedures (MAPs) 121
Step 1542-25
1. Turn off the power.
2. Disconnect the I/O subsystem power cables.
3. Replace the following parts, one at a time, in the sequence listed:
a. Attached external device cable.
b. Attached external device(s), one at a time, if more than one.
c. I/O subsystem backplane (U0.x-P1).
4. Reconnect the I/O subsystem power cables.
5. Turn on the power.
6. Boot standalone diagnostics from CD.
7. If the Please define the System Console screen is displayed, follow directions to select the system
console.
8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary).
9. Check that all attached devices and adapters are listed.
Did the Please define the System Console screen display and are all attached devices and adapters
listed?
NO Replace the next part in the list and return to the beginning of this step. Repeat this process until
a part causes the Please define the System Console screen to be displayed and all attached
devices and adapters to be listed. If you have replaced all the items listed above and the Please
define the System Console screen does not display or all attached devices and adapters are not
listed, check any external devices and cabling. If you do not find a problem, contact your next level
of support for assistance.
YES Go to “Step 1542-16” on page 118.
122 Service Guide
MAP 1548: Processor Subsystem Problem Isolation
Notes:
1. Be sure to unplug the power cords before removing or installing processor cards, the PCI riser card, or
the CEC backplane to avoid damage to it.
2. This MAP assumes that a CD-ROM drive is installed and connected to the integrated EIDE adapter,
and a diagnostic CD-ROM disc is available.
3. If a power-on password or privileged-access password is installed, you are prompted to enter the
password before the diagnostic CD-ROM can load.
4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test
(POST).
5. The support leg located on the bottom of the media support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated cuts off power to some of the
components and connectors located on the CEC backplane. This support shelf must be correctly
seated and secured in position after any new FRU is added to the system and you perform a system
boot.
6. The service processor might have recorded one or more symptoms in its error log.
v If you arrived here after performing a slow boot, look for a possible new error that occurred during
the slow boot. If there is a new error, and its actions call for a FRU replacement, perform those
actions. If this does not resolve the problem, go to “Step 1548-1” on page 124.
v If an additional slow boot has not been performed, or if the slow boot did not yield a new error code,
look at the error that occurred just before the original error. Perform the actions associated with that
error. If the problem is not resolved, go to “Step 1548-1” on page 124.
v If a slow boot results in the same error code, and there are no error codes before the original error
code, go to “Step 1548-1” on page 124.
Before proceeding, it is a good idea to examine that error log (see “System Information Menu” on
page 317).
7. The service processor might have been set by the user to monitor system operations and to attempt
recoveries. You might want to disable these actions while you diagnose and service the system. If you
disable them, make notes of their current settings so that you can restore them before you leave. The
following settings may be of interest to you.
Surveillance From the Service Processor Setup Menu, go to the
Surveillance Setup Menu and disable surveillance.
Unattended Start Mode From the Service Processor System Power Control Menu,
disable unattended start mode.
Reboot Policy From the System Power Control Menu, go to the
Reboot/Restart Policy Setup Menu and set:
a. Number of reboot attempts to 0 (zero)
b. Use OS-Defined restart policy to No
c. Enable supplemental restart policy to No.
Call Out From the Call-In/Call-Out Setup Menu, go to the Serial
Port Selection Menu and disable call-out on both serial
ports.
8. Verify that the system has not been set to boot to the SMS menus or to the open firmware prompt.
From the System Power Control Menu, select Boot Mode Menu. Verify that all four options are set to
disabled.
Chapter 3. Maintenance Analysis Procedures (MAPs) 123
Purpose of this MAP
This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure,
diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured
system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is
not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU.
Step 1548-1
1. Ensure that the diagnostics and the operating system are shut down.
2. Turn on the power.
3. Insert the diagnostic CD-ROM into the CD-ROM drive.
Note: If you cannot insert the diagnostic CD-ROM, go to “Step 1548-2”.
4. When the keyboard indicator is displayed:
v On an ASCII terminal or hardware management console (HMC), the word keyboard, press the
number 5 key.
v On a graphical display, the word keyboard, press the number 5 key on the directly attached
keyboard.
5. If you are prompted to do so, enter the appropriate password.
Is the ²Please define the System Console² screen displayed?
NO Go to “Step 1548-2”.
YES Go to “Step 1548-17” on page 133.
Step 1548-2
1. Turn off the power.
2. If you have not already done so, configure the service processor with the instructions in note 7 on
page 123. Then return here and continue.
3. Exit the service processor menus and remove the power cords.
4. Disconnect all external cables (parallel, serial port 1, serial port 2, serial port 3, keyboard, mouse,
Ethernet, and so on).
5. Remove the service access cover (286/E4) or place the drawer (286/C4) into the service position and
remove the service access cover.
6. Record the slot numbers of the PCI adapters. Label and record the locations of any cables attached
to the adapters. Disconnect any cables attached to the adapters and remove all the adapters.
7. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay
2.
8. Remove the media drives as described in “Media Removal” on page 401.
9. Remove the media support shelf by loosening the thumbscrew and grasping the back edge of the
shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the media support shelf is used to deactivate a
microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off
all power to the CEC backplane. This support shelf must be correctly seated and secured in
position after any new FRU is added to the system.
124 Service Guide
10. Remove processor card #2 (if installed). If processor card #2 is removed, ensure processor card #1 is
installed and contains at least one quad of memory DIMMs.
The processor card that is left in the system must contain at least one quad of memory DIMMs. For
example, a system configuration having two processor cards; processor card #2 contains all of the
memory DIMMs, processor card #1 contains no memory DIMMs. Remove processor card #1 that
contains no memory DIMMs, then remove all but one quad of memory DIMMs from processor card
#2. Place processor card #2 into processor slot #1.
11. Record the slot numbers of the memory DIMMs on processor card #1. Remove all memory DIMMs
except for one quad from processor card #1.
Notes:
a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the
tabs.
b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Memory DIMM
Location” on page 18 for complete information on memory DIMMs.
12. Disconnect the cables attached to the PCI riser card.
13. Disconnect the IDE cable from the IDE connector on the CEC backplane.
14. If your system is equipped with a diskette drive, disconnect the diskette drive cable from the diskette
drive connector on the CEC backplane.
15. Disconnect the signal and power connectors from the disk drive backplane.
16. Disconnect the disk drives from the disk drive backplane connectors.
17. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
18. Turn on the power.
Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the
operator panel?
NO Go to “Step 1548-6” on page 126.
YES Go to “Step 1548-3”.
Step 1548-3
Were any more memory DIMMs removed from processor card #1?
NO Go to “Step 1548-5” on page 126.
YES Go to “Step 1548-4”.
Step 1548-4
1. Turn off the power, and remove the power cords.
2. Replug the memory DIMMs that were removed from processor #1 in “Step 1548-2” on page 124 in
their original locations.
Notes:
a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the
tabs.
b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Memory DIMM
Location” on page 18 for complete information on memory DIMMs.
3. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
Chapter 3. Maintenance Analysis Procedures (MAPs) 125
Does the system stop with code E1F2, E1F3, STBY, 20EE000B or 4BA00840 displayed in the
operator panel?
NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power,
remove the power cords, and exchange the memory DIMMs in that quad, one at a time with new
or previously removed memory DIMMs.
Repeat this step until the defective memory DIMM is identified, or all memory DIMMs have been
exchanged.
If your symptom did not change and all the memory DIMMs have been exchanged, call your
service support person for assistance.
If the symptom changed, check for loose cards and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow instructions for the new symptom.
YES Go to “Step 1548-5”.
Step 1548-5
Was processor card #2 removed from the system?
NO Go to “Step 1548-8” on page 127.
YES Go to “Step 1548-7” on page 127.
Step 1548-6
One of the FRUs remaining in the system unit is defective.
Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed
as the original memory DIMM.
1. Turn off the power, remove the power cords, and exchange the following FRUs in the order listed:
a. Memory DIMMs. Exchange one at a time with new or previously removed memory DIMMs.
b. Processor card #1
c. CEC backplane (See the notes on page 57.)
d. PCI riser card
e. Power supplies.
2. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
3. Turn on the power.
Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the
operator panel?
NO Reinstall the original FRU.
Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been
exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
126 Service Guide
Step 1548-7
No failure was detected with this configuration.
1. Turn off the power and remove the power cords.
2. Reinstall processor card #2.
3. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the
operator panel?
NO One of the FRUs remaining in the system unit is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. Memory DIMMs, if present on processor card #2, exchange one at a time with new or
previously removed memory DIMMs.
2. Processor card #2 (last one installed)
3. CEC backplane (see notes on page 57)
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call your service support
person for assistance.
If the symptom changed, check for loose cards and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
YES Go to “Step 1548-8”.
Step 1548-8
1. Turn off the power.
2. Reconnect the system console.
Notes:
a. If an HMC has been defined as the firmware console, attach the HMC cable to the HMC1
connector on the rear of the system unit.
b. If an ASCII terminal has been defined as the system console, attach the ASCII terminal cable to
the S1 connector on the rear of the system unit.
c. If a display attached to a display adapter has been defined as the system console, install the
display adapter and connect the display to the adapter. Plug the keyboard into the keyboard
connector on the rear of the system unit.
3. Turn on the power.
4. If the ASCII terminal or graphics display (including display adapter) is connected differently than it was
before, the Console Selection screen appears and requires that a new console be selected.
5. When the word keyboard is displayed, press the number 1 key on the directly attached keyboard, an
ASCII terminal or HMC. This activates the system management services (SMS).
6. If you are prompted to do so, enter the appropriate password.
Chapter 3. Maintenance Analysis Procedures (MAPs) 127
Is the SMS screen displayed?
NO One of the FRUs remaining in the system unit is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. If you are using the hardware management console (HMC) go to the ²Entry MAP² in the
Hardware Management Console Maintenance Guide, order number SA23-1273. If you cannot
fix the problem using the HMC tests, there may be a problem with the CEC backplane. If you
do not find a problem, do the following:
v Replace the CEC backplane, location U0.1-P1. (See notes on page 57.)
2. If you are using a graphics display, go to the problem determination procedures for the display.
If you do not find a problem, do the following:
v Replace the display adapter.
v Replace the backplane in which the graphics adapter is plugged.
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
YES Go to “Step 1548-9”.
Step 1548-9
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Plug the IDE cable into the IDE connector on the CEC backplane.
4. Connect the signal and power connectors to the IDE CD-ROM drive.
5. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
6. Turn on the power.
7. Insert the diagnostic CD-ROM into the CD-ROM drive.
8. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard
or on an ASCII terminal keyboard.
9. If you are prompted to do so, enter the appropriate password .
Is the Please define the System Console screen displayed?
NO One of the FRUs remaining in the system unit is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. IDE cable
2. CD-ROM drive
3. CEC backplane (see notes on page 57)
4. Processor card
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
YES Go to “Step 1548-10” on page 129.
128 Service Guide
Step 1548-10
The system is working correctly with this configuration. One of the SCSI devices that you disconnected
may be defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Plug the SCSI cable into the SCSI connector on the CEC backplane.
4. Connect the signal and power connectors to one of the SCSI devices (for example: CD-ROM drive,
tape drive, disk drive, and so on). Do not connect the signal and power connectors to the disk drive
backplane at this time.
5. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
6. Turn on the power.
7. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard
or on an ASCII terminal keyboard.
8. If you are prompted to do so, enter the appropriate password.
Is the Please define the System Console screen displayed?
NO One of the FRUs remaining in the system unit is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. SCSI cable
2. Last SCSI device connected (for example: disk drive, tape drive)
3. CEC backplane (See notes on page 57.)
4. Processor card
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the return to “Quick Entry MAP” on page 58 and follow the instructions for
the new symptom.
YES Repeat this step, adding one SCSI device at a time, until all the SCSI devices that were attached
to the integrated SCSI adapter, except the disk drive backplane, are connected and tested.
Go to “Step 1548-11”.
Step 1548-11
The system is working correctly with this configuration. The disk drive backplane may be defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Connect the signal and power connectors to the disk drive backplane.
4. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
5. Turn on the power.
6. If the Console Selection screen is displayed, choose the system console.
7. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard
or on an ASCII terminal keyboard.
8. If you are prompted to do so, enter the appropriate password.
Chapter 3. Maintenance Analysis Procedures (MAPs) 129
Is the Please define the System Console screen displayed?
NO One of the FRUs remaining in the system unit is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. SCSI cable
2. Disk drive backplane
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
YES Go to “Step 1548-12”.
Step 1548-12
The system is working correctly with this configuration. One of the disk drives that you removed from the
disk drive may be defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Install a disk drive in the disk drive backplane.
4. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
5. Turn on the power.
6. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard
or on an ASCII terminal keyboard.
7. If you are prompted to do so, enter the appropriate password.
Is the Please define the System Console screen displayed?
No Exchange the FRUs that have not been exchanged, in the following order:
1. Last disk drive installed
2. Disk drive backplane
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
Yes Repeat this step with the second disk drive (if it was installed).
If the system has a diskette drive, go to “Step 1548-13” on page 131.
Go to “Step 1548-14” on page 131.
130 Service Guide
Step 1548-13
The system is working correctly with this configuration. The diskette drive may be defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Plug the diskette drive cable into the diskette drive connector on the CEC backplane.
4. Connect the signal and power connectors to the diskette drive.
5. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
6. Turn on the power.
7. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or
on an ASCII terminal keyboard.
8. If you are prompted to do so, enter the appropriate password.
Is the Please define the System Console screen displayed?
NO One of the FRUs remaining in the system is defective.
Exchange the FRUs that have not been exchanged, in the following order:
1. Diskette drive
2. Diskette drive cable
3. CEC backplane (see notes on page 57)
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem return, go to the “Quick Entry MAP” on page 58 and follow the instructions for the
new symptom.
YES Go to “Step 1548-14”.
Step 1548-14
The system is working correctly with this configuration. One of the devices that you disconnected from the
CEC backplane may be defective.
1. Turn off the power and remove the power cords.
2. Attach a CEC backplane device (for example: parallel, serial port 1, serial port 2, serial port 3,
keyboard, mouse, Ethernet, Ultra-2 SCSI) that had been removed.
3. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
5. If the Console Selection screen is displayed, choose the system console.
6. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or
on an ASCII terminal keyboard.
7. If you are prompted to do so, enter the appropriate password.
Chapter 3. Maintenance Analysis Procedures (MAPs) 131
Is the Please define the System Console screen displayed?
NO The last device or cable that you attached is defective.
To test each FRU, exchange the FRUs in the following order:
1. Device and cable (last one attached)
2. CEC backplane (see notes on page 57).
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Repeat this step until all of the devices are attached.
Go to “Step 1548-15”.
Step 1548-15
The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may
be defective.
1. Turn off the power and remove the power cords.
2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU.
3. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
5. If the Console Selection screen is displayed, choose the system console.
6. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or
on an ASCII terminal keyboard.
7. If you are prompted to do so, enter the appropriate password.
Is the Please define the System Console screen displayed?
NO Go to “Step 1548-16” on page 133.
YES Repeat this step until all of the FRUs (adapters) are installed.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
132 Service Guide
Step 1548-16
The last FRU installed or one of its attached devices is probably defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn off the power and remove the power cords.
3. Starting with the last installed adapter, disconnect one attached device and cable.
4. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
5. Turn on the power.
6. If the Console Selection screen is displayed, choose the system console.
7. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or
on an ASCII terminal keyboard.
8. Enter the appropriate password if you are prompted to do so.
Is the Please define the System Console screen displayed?
NO Repeat this step until the defective device or cable is identified or all devices and cables have
been disconnected.
If all the devices and cables have been removed, then one of the FRUs remaining in the system
unit is defective.
To test each FRU, exchange the FRUs in the following order:
1. Adapter (last one installed)
2. PCI riser card
3. CEC backplane (see notes on page 57)
If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not
find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new
symptom.
YES The last device or cable that you disconnected is defective.
Exchange the defective device or cable.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1548-17
1. Follow the instructions on the screen to select the system console.
2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter.
3. Select Advanced Diagnostics Routines.
4. If the terminal type has not been defined, you must use the Initialize Terminal option on the
FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue
with the diagnostics. This is a separate operation from selecting the console display.
5. If the NEW RESOURCE screen displays, select an option from the bottom of the screen.
Note: Adapters or devices that require supplemental media are not shown in the new resource list. If
the system has adapters or devices that require supplemental media, select option 1.
6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter.
7. Select All Resources (if you were sent here from “Step 1548-21” on page 134, select the
adapter/device that you loaded from the supplemental media).
Did you get an SRN?
NO Go to “Step 1548-19” on page 134.
YES Go to “Step 1548-18” on page 134.
Chapter 3. Maintenance Analysis Procedures (MAPs) 133
Step 1548-18
Look at the FRU part numbers associated with the SRN.
Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)?
NO Exchange the FRU with the highest failure percentage that has not been changed.
Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics
run with no trouble found. Run diagnostics after each FRU is exchanged.
If the CEC backplane or a network adapter is removed, see notes on page 57.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES If the symptom did not change and all the FRUs have been exchanged, call service support for
assistance.
Step 1548-19
Does the system have adapters or devices that require supplemental media?
NO Go to “Step 1548-20”.
YES Go to “Step 1548-21”.
Step 1548-20
Consult the PCI adapter configuration documentation for your operating system to verify that all installed
adapters are configured correctly.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.
Step 1548-21
1. Select Task Selection.
2. Select Process Supplemental Media and follow the onscreen instructions to process the media.
Supplemental media must be loaded and processed one at a time.
Did the system return to the TASKS SELECTION SCREEN after the supplemental media was
processed?
NO Go to “Step 1548-22” on page 135.
YES Press F3 to return to the FUNCTION SELECTION screen.
Go to “Step 1548-17” on page 133, substep 4.
134 Service Guide
Step 1548-22
The adapter or device is probably defective.
If the supplemental media is for an adapter, replace the FRUs in the following order:
1. Adapter
2. CEC backplane (see notes on page 57)
If the supplemental media is for a device, replace the FRUs in the following order:
1. Device and any associated cables
2. The adapter to which the device is attached
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a
problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
Step 1548-23
1. Ensure that the diagnostics and the operating system are shut down.
2. Turn off the power.
3. If you have not already done so, configure the service processor with the instructions in note 7 on
page 123 and then return here and continue.
4. Exit the service processor menus and remove the power cords.
5. Remove the service access cover (286/E4), or place the drawer (286/C4) into the service position and
remove the service access cover.
6. Record the slot numbers of the PCI adapters. Label and record the locations of any cables attached to
the adapters. Disconnect any cables attached to the adapters and remove all the adapters.
7. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
8. Turn on the power.
Does the system stop with the same error code displayed on the operator panel that directed you
to this MAP step?
NO Go to “Step 1548-25” on page 136.
YES Go to “Step 1548-24”.
Step 1548-24
One of the FRUs remaining in the system unit is defective.
1. Turn off the power, remove the power cords and exchange the CEC backplane. See the notes on page
57.
2. Plug in the power cable and wait for the OK prompt to display on the operator panel display.
3. Turn on the power.
Chapter 3. Maintenance Analysis Procedures (MAPs) 135
Does the system stop with the same error code displayed on the operator panel that directed you
to this MAP step?
NO Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Reinstall the original FRU.
If the symptom did not change and all the FRUs have been exchanged, return to “Step 1548-2” on
page 124.
Step 1548-25
The system is working correctly with this configuration. One of the FRUs (adapters) that you removed is
probably defective.
1. Turn off the power and remove the power cable from the power outlet.
2. Install a FRU (adapter) and connect any cables and devices that were attached to it.
3. Plug in the power cable and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
5. If the Console Selection screen is displayed, choose the firmware console.
6. Enter the appropriate password if you are prompted to do so.
Does the system stop with the same error code displayed on the operator panel that directed you
to this MAP step?
NO Repeat this step until all of the FRUs (adapters) are installed, then go to ²MAP 0410: Repair
Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Go to “Step 1548-26”.
Step 1548-26
The last FRU installed or one of its attached devices is probably defective.
1. Turn off the power and remove the power cables from the power outlet.
2. Starting with the last installed adapter, disconnect one attached device and cable.
3. Plug in the power cords and wait for the OK prompt to display on the operator panel display.
4. Turn on the power.
5. If the Console Selection screen is displayed, choose the firmware console.
6. Enter the appropriate password if you are prompted to do so.
Does the system stop with the same error code displayed on the operator panel that directed you
to this MAP step?
NO The last device or cable that you disconnected is defective.
Exchange the defective device or cable.
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
YES Repeat this step until the defective device or cable is identified or all of the devices and cables
have been disconnected. If all of the devices and cables have been removed, then one of the
FRUs remaining in the system unit is defective.
To test each FRU, exchange the FRUs in the following order:
1. Adapter (last one installed)
2. CEC backplane (see notes on page 57)
Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems.
If the symptom did not change and all the FRUs have been exchanged, return to “Step 1548-2” on
page 124.
136 Service Guide
MAP 1549: Attention Problem Isolation
Step 1549-1
Record the error code and the location code(s) that sent you to this MAP.
Step 1549-2
Turn off the power.
Step 1549-3
Examine the following table and locate the error code that sent you to this MAP to determine your next
step.
Error Code Action
406x 0EB1 Go to “Step 1549-4”.
406x 0EB2 Go to “Step 1549-4”.
406x 0EB3 Go to “Step 1549-4”.
406x 0EB4 Go to “Step 1549-7” on page 138.
406x 0EB5 Go to “Step 1549-7” on page 138.
406x 0EB6 Go to “Step 1549-7” on page 138.
406x 0EB7 Go to “Step 1549-7” on page 138.
406x 0EB8 Go to “Step 1549-7” on page 138.
406x 0EB9 Go to “Step 1549-7” on page 138.
406x 0EBA Go to “Step 1549-7” on page 138.
406x 0EBB Go to “Step 1549-7” on page 138.
406x 0EBC Go to “Step 1549-7” on page 138.
Step 1549-4
Replace the following cards if present, one at a time:
v First location code item recorded, if any, in “Step 1549-1”.
v Second location code item recorded, if any, in “Step 1549-1”.
v Third location code item recorded, if any, in “Step 1549-1”.
v Processor card 1 at U0.1-P1-C1, if not recorded in “Step 1549-1”.
v Processor card 2 at U0.1-P1-C2, if not recorded in “Step 1549-1”.
Attention: Before replacing the system backplane, call support.
v CEC backplane at U0.1-P1, if not recorded in “Step 1549-1”.
Step 1549-5
Turn on the power.
Chapter 3. Maintenance Analysis Procedures (MAPs) 137
Step 1549-6
Did the system stop with the same error code as recorded in “Step 1549-1” on page 137?
No The card just replaced was defective. This ends the procedure. Return the system to its original
configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus
Systems.
Yes If all of the cards listed in “Step 1549-4” on page 137 have been replaced, go to “Step 1549-10”,
otherwise continue with “Step 1549-4” on page 137 and replacing the next card on the list.
Step 1549-7
Replace, one at a time the following cards (if present):
v First location code item recorded, if any, in “Step 1549-1” on page 137.
v Second location code item recorded, if any, in “Step 1549-1” on page 137.
v Third location code item recorded, if any, in “Step 1549-1” on page 137.
Attention: Before replacing the system backplane, call support.
v CEC backplane at U0.1-P1, if not recorded in “Step 1549-1” on page 137.
Step 1549-8
Turn on the power.
Step 1549-9
Did the system stop with the same error code as recorded in “Step 1549-1” on page 137?
No The card just replaced was defective. This ends the procedure. Return the system to its original
configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus
Systems.
Yes If all of the cards listed in “Step 1549-7” have been replaced, go to “Step 1549-10”, otherwise
continue with “Step 1549-7” and replacing the next card on the list.
Step 1549-10
Call support.
138 Service Guide
MAP 154C: SPCN Cable Problem Isolation
Step 154C-1
Record the error code, location code(s), and word 13 that sent you to this MAP.
Step 154C-2
Turn off the power.
Step 154C-3
Locate the word 13 value that you recorded in Step 154C-1 in the following table and perform the
indicated action.
Word 13 Value Action
A780 xxxx Replace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer.
A781 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
2. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer.
A782 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
3. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer.
A783 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
3. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
4. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer.
A784 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
3. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
4. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q4.
5. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer.
A785 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
2. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
3. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
4. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer.
Chapter 3. Maintenance Analysis Procedures (MAPs) 139
Word 13 Value Action
A786 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
2. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
3. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer.
A787 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an
attached I/O drawer connector U0.dd-P1/Q5.
2. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer.
A78F xxxx Replace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer.
Step 154C-4
Turn on the power.
Step 154C-5
Did the system stop with the same error code as recorded in “Step 154C-1” on page 139?
No The SPCN cable just replaced was defective. This ends the procedure. Return the system to its
original configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for
Multiple Bus Systems.
Yes Call service support.
140 Service Guide
Chapter 4. Checkpoints
Checkpoints display on the media subsystem operator panel and the virtual terminal while the system is
powering on and going through the initial program load (IPL). This chapter explains the IPL flow of the
system and provides a table that lists checkpoints that you might see in the operator panel and in a virtual
terminal window on the HMC.
IPL Flow with an HMC Attached to the System
The IPL process starts when ac power is connected to the system. The IPL process has the following
phases:
v Phase 1: Service Processor Initialization
Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the
processor subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx
codes may also be displayed. Service processor menus are available at the end of this phase by
pressing any key on the console keyboard.
v Phase 2: Hardware Initialization by the Service Processor
Phase 2 starts when system power-on is initiated by pressing the power on button on the processor
subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in
this phase, indicates the transition to phase 3 is taking place.
v Phase 3: System Firmware initialization
On a full system partition, at phase 3, a system processor takes control and continues initializing
partition resources. During this phase, checkpoints in the form of Exxx are displayed. E105, the last
checkpoint in this phase, indicates that control is being passed to the operating system boot program.
On a partitioned system, there is a global system-wide initialization phase 3 during which a system
processor continues the initialization process. Checkpoints in this phase are of the form Exxx. This
global phase 3 ends with a LPAR... on the operator panel. As a logical partition begins a partition
initialization phase 3, one of the system processors assigned to that partition continues initialization of
resource assigned to that partition. Checkpoints in this phase are also of the form Exxx.
This partition phase 3 ends with an E105 displayed on the partition’s virtual operator panel on the HMC,
indicating control has been passed to that logical partition’s operating system boot program. For both
the global and partition phase 3, location codes may also be displayed on the physical operator panel
and the partition’s virtual terminal.
v Phase 4: Operating System Boot
When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This
phase ends when the operating system login prompt displays on the operating system console.
141
IPL Flow Without an HMC Attached to the System
The IPL process starts when ac power is connected to the system. The IPL process has the following
phases:
v Phase 1: Service Processor Initialization
Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the media
subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx codes may
also be displayed. Service processor menus are available at the end of this phase by striking any key
on the console keyboard.
v Phase 2: Hardware Initialization by the Service Processor
Phase 2 starts when system power-on is initiated by pressing the power on button on the processor
subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in
this phase, indicates the transition to phase 3 is taking place.
v Phase 3: System Firmware initialization
At phase 3, a system processor takes over control and continues initializing partition resources. During
this phase, checkpoints in the form Exxx are displayed. E105, the last checkpoint in this phase,
indicates that control is being passed to the operating system boot program.
v Phase 4: Operating System Boot
When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This
phase ends when the operating system login prompt displays on the operating system console.
The following table outlines the IPL phases from power-on to the operating system login prompt, matching
the phases to corresponding operator-panel checkpoints for a typical partition boot. Also listed are the
approximate boot times for each phase, along with their dependencies for each phase.
Note: The following table lists the major checkpoints only.
Phase # Checkpoints on Operator
Panel or HMC
Time in Phase
(minutes)
Major Contributors to Time in Phase
Phase 1 8xxx => OK Approx. 1
Phase 2 9xxx => 91FF 3 to 6 Number of I/O drawers
Phase 3 Exxx => E105 1 to 3 Number of bootable adapters
Phase 4 0xxx or 2xxx => blank 2 to 30+ Number of SSA drives
Number of SCSI drives
Number of Async sessions
Number of processors
Number of adapters
Amount of memory
Checkpoints enable users and service personnel to know what the server is doing, with some detail, as it
initializes. These checkpoints are not intended to be error indicators, but in some cases a server could
hang at one of the checkpoints without displaying an 8-character error code. It is for these hang
conditions, only, that any action should be taken with respect to checkpoints. The most appropriate action
is included with each checkpoint.
142 Service Guide
Before taking actions listed with a checkpoint, check for additional symptoms in the service processor error
log. See the “System Information Menu” on page 317 for information on how to access the service
processor error log.
Notes:
1. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 for any of the following
checkpoint hang conditions:
v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF that is not listed in the checkpoint
tables.
v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF that is listed in the checkpoint
tables but does not contain a repair action or FRU listing.
v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF, in which all of the FRUs listed
have been replaced or service actions have been performed and the problem has not been
corrected.
2. Go to “MAP 1542: I/O Problem Isolation” on page 111 for any of the following checkpoint hang
conditions:
v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above) that is not
listed in the checkpoint tables.
v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above) that is listed
in the checkpoint tables but does not contain a repair action or FRU listing.
v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above), in which all
of the FRUs listed have been replaced or service actions have been performed and the problem has
not been corrected.
Checkpoints listed in the following tables are in the format 8xxx, 9xxx, Axxx, Bxxx, or Exxx, where x is any
hexadecimal digit from 1-9 or A-F. If your system hangs with a checkpoint displayed that begins with
anything other than 8, 9, A, B, or E, go to the Diagnostic Information for Multiple Bus Systems.
Use the following table to determine where to find the checkpoint.
Symptom Action
You have a code that is of the form 8xxx, 9xxx, Axxx, or
Bxxx.
Go to “Service Processor Checkpoints”.
You have a code of the form Exxx. Go to “Firmware Checkpoints” on page 150.
You have a four-character code that is not of the form
8xxx, 9xxx, Axxx, Bxxx, or Exxx.
Refer to the Diagnostic Information for Multiple Bus
Systems manual.
Service Processor Checkpoints
Service processor checkpoints are in the range 8xxx to Bxxx. The message OK indicates successful
service processor testing and initialization. Firmware checkpoints are listed in “Firmware Checkpoints” on
page 150.
Note: A spinning (or twirling) slash displays in the upper-right corner of the operator panel display while
some checkpoints are being displayed. If the slash stops spinning, a hang condition is indicated.
Checkpoint Description Action/
Possible Failing FRU
8000 Test of the service processor DRAM See note 1 on page 149.
8004 Verify base code checksum See note 1 on page 149.
8008 Verify base code mirror checksum See note 1 on page 149.
Chapter 4. Checkpoints 143
Checkpoint Description Action/
Possible Failing FRU
8009 Set NVRAM configuration in TITAN See note 1 on page 149.
800F Start base code See note 1 on page 149.
8010 Start supervisor in base code See note 1 on page 149.
8012 Initialize hardware interrupts See note 1 on page 149.
8014 Allocate and initialize area in DRAM See note 1 on page 149.
8016 Initialize debugger and build VPD See note 1 on page 149.
8018 Initialize service processor interfaces See note 1 on page 149.
801A Initialize external FLASH load See note 1 on page 149.
801B Initialize and test ISA hub chip See note 1 on page 149.
801C Initialize and test SUPER I/O chip See note 1 on page 149.
801F Initialize diskette, verify VDASD and start
operational loader
See note 1 on page 149.
809x Related to recovery mode system firmware
update from diskette
See note 3 on page 149.
8092 Diskette initialization during recovery mode
system firmware update
See note 3 on page 149.
8098 Firmware image being read during recovery
mode system firmware update
See note 3 on page 149.
809C Firmware image being written during
recovery mode system firmware update
See note 3 on page 149.
809F Diskette error during recovery mode system
firmware update
See note 3 on page 149.
8100 Start operational loader See note 1 on page 149.
81xx Load code into DRAM from VDASD (where
xx = number of loads)
See note 1 on page 149.
81FE Load service processor operating system See note 1 on page 149.
81FF Start service processor operating system See note 1 on page 149.
9000 Primary service processor task initializing See note 1 on page 149.
9009 System processor IPL has started See note 1 on page 149.
9010 Primary service processor thread waiting
for response
See note 1 on page 149.
9020 Service processor state set to pre-standby See note 1 on page 149.
9021 Service processor state set to standby See note 1 on page 149.
9022 Service processor state set to IPL See note 1 on page 149.
9023 Service processor state set to runtime See note 1 on page 149.
9024 Service processor state set to terminate See note 1 on page 149.
9025 Service processor state set to DPO See note 1 on page 149.
9030 Waiting for secondary thread
acknowledgement
See note 1 on page 149.
9033 I/O base initialization See note 1 on page 149.
9034 I2C initialization See note 1 on page 149.
9035 ISA initialization See note 1 on page 149.
9036 Mailbox initialization See note1 on page 149.
144 Service Guide
Checkpoint Description Action/
Possible Failing FRU
9037 SLLA initialization (VPD) See note 1 on page 149.
9038 NVRAM initialization See note 1 on page 149.
9039 Serial port initialization See note 1 on page 149.
903A Serial I/O initialization See note 1 on page 149.
903B SPCN initialization See note 1 on page 149.
9051 End of reset/reload operation: service
processor dump has completed
See note 1 on page 149.
9055 Start menus thread See note 1 on page 149.
9060 Checking/configuring modem on serial port
1
See note 4 on page 149.
9061 Checking/configuring modem on serial port
2
See note 4 on page 149.
906A Create mailbox processing thread See note 1 on page 149.
906B Create menu interface thread See note 1 on page 149.
906C Create general thread See note 1 on page 149.
906D Create SPCN thread See note 1 on page 149.
9070 Create thread to control LEDs on I/O
subsystems
See note 1 on page 149.
9071 Initialize thread to control LEDs on I/O
subsystems
See note 1 on page 149.
9080 Start VPD collection program See note 1 on page 149.
9081 Start SC VPD data collection See note 1 on page 149.
9082 I2C/SPCN VPD data collection See note 1 on page 149.
9083 End SC VPD data collection See note 1 on page 149.
9084 End I2C/SPCN VPD data collection See note 1 on page 149.
90FD Waiting for IBIST to complete See note 1 on page 149.
9101 Setup load source See note 1 on page 149.
9102 Open load source See note 1 on page 149.
9103 Verify open status of load source See note 1 on page 149.
9104 Waiting for SPCN VPD collection to be
completed
See note 1 on page 149.
9105 Load IPL LIDs See note 1 on page 149.
9106 Initialize MOPs See note 1 on page 149.
9107 System flush See note 1 on page 149.
9108 Processor scan interface BATs and LBIST See note 1 on page 149.
9109 ABIST See note 1 on page 149.
910A Non-processor scan interface BATs and
LBIST
See note 1 on page 149.
910B Wire test See note 1 on page 149.
910C Initialize main storage (system memory) See note 1 on page 149.
910D Configure processors See note 1 on page 149.
910E Configure and initialize I/O See note 1 on page 149.
Chapter 4. Checkpoints 145
Checkpoint Description Action/
Possible Failing FRU
910F Test and initialize main storage (system
memory)
See note 1 on page 149.
9110 Free IPL LIDs See note 1 on page 149.
9111 Load run-time LIDS See note 1 on page 149.
9112 Load dump LIDs See note 1 on page 149.
9113 Enable attention See note 1 on page 149.
9114 Enable function 22 See note 1 on page 149.
9115 Load system IPL program See note 1 on page 149.
9116 Load system data areas (NACA/PACA) See note 1 on page 149.
9117 Start system IPL program See note 1 on page 149.
9119 Waiting for VPD collection to complete See note 1 on page 149.
9184 Firmware image being loaded into service
processor’s control storage during firmware
update.
1. Remove, then reapply power to the
system. If the service processor enters
recovery mode, (indicated by A1FD
0000 in the display), attempt to update
the firmware again in recovery mode. If
the service processor comes up to the
OK prompt, try again to update the
firmware using the service processor
menus.
2. Call service support.
91C4 Flash update pending, either a firmware
update or hypervisor dump.
Note: This checkpoint may be displayed
during the firmware update process. The
checkpoint may be displayed for a long
period of time during the process. If
process does not complete, the system
may be hung.
See note 1 on page 149.
91FF Control being handed to system processor
from service processor
See note 1 on page 149.
9200 Scan interface BATs See note 1 on page 149.
9300 Logic BIST See note 1 on page 149.
9301 Scanned ABIST See note 1 on page 149.
9302 Dedicated ABIST See note 1 on page 149.
9303 Dedicated ABIST array fuse repair
calculation
See note 1 on page 149.
9380 Built-in-self-test (BIST) See note 2 on page 149.
9400 Service processor is requesting system
flush
See note 1 on page 149.
9410 Service processor is issuing request to start
instruction execution
See note 1 on page 149.
9411 Service processor is issuing request to stop
instruction execution
See note 1 on page 149.
9420 Service processor is issuing request to start
system clocks
See note 1 on page 149.
9421 Service processor is issuing request to stop
system clocks
See note 1 on page 149.
146 Service Guide
Checkpoint Description Action/
Possible Failing FRU
94B0 Wire test faulty driver status refcode See note 2 on page 149.
94B1 Wire test shorted net status refcode See note 2 on page 149.
94B2 Wire test elastic interface test See note 2 on page 149.
94BB Wire test starting See note 2 on page 149.
9501 IPL diagnostic initialization See note 2 on page 149.
9502 IPL diagnostic L3 cache march test See note 2 on page 149.
9503 IPL diagnostic L3 connections test See note 2 on page 149.
9504 IPL diagnostic L2 cache march test See note 2 on page 149.
9505 IPL diagnostic RIO wrap test See note 2 on page 149.
9506 IPL diagnostic cleanup See note 2 on page 149.
9507 IPL diagnostic test See note 2 on page 149.
9508 IPL diagnostic main storage march test See note 2 on page 149.
9509 IPL diagnostic main storage connections
test
See note 2 on page 149.
950A IPL diagnostic elastic interface slack test See note 2 on page 149.
950B IPL diagnostic fast initialization to zeros See note 2 on page 149.
96C2 IPL MOPs processor configuration See note 2 on page 149.
96C3 IPL MOPs main storage size See note 2 on page 149.
96C4 IPL MOPs main storage configuration See note 2 on page 149.
96C6 IPL MOPs I/O configuration See note 2 on page 149.
96C7 IPL MOPs: enable chip machine checks See note 2 on page 149.
96E1 Initialize run-time PRD objects and memory See note 2 on page 149.
96E2 Run-time initialization: enable attention
handling
See note 2 on page 149.
99FD Service processor receiving firmware
update module
See note 2 on page 149.
Attention: This checkpoint may stay in
the operator panel for some period of time
during a firmware update or hypervisor
dump with no indication of activity. Do not
assume that the service processor is hung
unless this checkpoint has been in the
operator panel for at least 15 minutes. with
no indication of activity.
Attention: Do not remove power from the
system, or reset the service processor,
during a firmware update or hypervisor
dump.
Chapter 4. Checkpoints 147
Checkpoint Description Action/
Possible Failing FRU
99FF Service processor writing firmware update
module
See note 2 on page 149.
Attention: This checkpoint may stay in
the operator panel for some period of time
during a firmware update or hypervisor
dump with no indication of activity. Do not
assume that the service processor is hung
unless this checkpoint has been in the
operator panel for at least 15 minutes. with
no indication of activity.
Attention: Do not remove power from the
system, or reset the service processor,
during a firmware update or hypervisor
dump.
A800 Start service processor dump process See note 1 on page 149.
A801 Start dump to NVRAM See note 1 on page 149.
A802 Start dump to debug port See note 1 on page 149.
A803 NVRAM not usable See note 1 on page 149.
A804 NVRAM dump done See note 1 on page 149.
A805 Start dump to flash See note 1 on page 149.
A806 Flash dump area done See note 1 on page 149.
A807 Flash area not usable See note 1 on page 149.
A808 Flash error log to debug port See note 1 on page 149.
A809 Flash erase start See note 1 on page 149.
A80A Flash erase end See note 1 on page 149.
A80B Reserved; not used See note 1 on page 149.
A80C Reserved; not used See note 1 on page 149.
A80D Reserved; not used See note 1 on page 149.
A80E Reserved; not used See note 1 on page 149.
A80F Service processor dump done See note 1 on page 149.
A810-A8FF Description: Scan log dump in progress
Action/Possible Failing FRU Informational message. (See “Scan Log Dump Policy” on
page 312.)
The last two characters of the checkpoints will change as the scan log dump progresses.
If the last two characters do not change after 5 minutes, the service processor is hung
and must be reset. Follow the procedure outlined in Note 1 on page 149 following this
table.
When the scan log dump is complete, depending on how the reboot policy is set, the
system will either:
v Go to the standby state (and the service processor menus will be available), indicated
by OK or STBY in the operator panel,
OR
v Attempt to reboot.
B0F5 Delayed power-off sequence has begun See note 1 on page 149.
148 Service Guide
Checkpoint Description Action/
Possible Failing FRU
B0FA Delayed power-off acknowledged See note 1.
B0FF Power-off requested program executing See note 1.
Notes:
1. If the system fails to progress after this checkpoint is put in the display, do the following:
a. Activate the service processor pinhole reset switch on the system’s operator panel.
b. If the hang repeats, check with service support to see if there is a firmware update that fixes the
problem.
c. If not, or the update does not fix the problem, replace the CEC backplane, location: U0.1-P1.
d. If this procedure does not fix the problem, call service support.
2. If the system fails to progress after this checkpoint is put in the operator display, do the following:
a. Activate the service processor pinhole reset switch on the system’s operator panel.
b. Reboot the system in slow mode. (Slow/fast IPL is set using a service processor menu.) If the
hang repeats, check with service support to see if there is a system firmware update that fixes this
problem.
c. If not, or the update does not fix the problem, call service support.
3. Recovery mode flash update has failed. Do the following:
a. Activate the service processor pinhole reset switch on the system’s operator panel and try the
recovery mode flash update again.
b. If the system hangs again, make a new set of firmware update diskettes, using new, blank
diskettes. Retry the flash update with the new diskettes.
c. If the same hang occurs with the new diskettes, replace the following FRUs one at a time in this
order:
1) Diskette drive
2) CEC backplane, location: U0.1-P1.
d. If this procedure does not fix the problem, call service support.
4. The service processor was unable to configure the modem(s) attached to the serial port(s) on the CEC
backplane (a problem with the modem attached to the HMC is not indicated). Do the following:
a. Check the modem, modem cable, and serial port for loose or unconnected cables. Reset the
service processor using the pinhole reset switch on the system’s operator panel.
b. If the hang repeats, check with service support to see if there is a firmware update that fixes the
problem.
c. If there is not, or the update does not fix the problem, replace the following FRUs, one at a time, in
the following order:
1) Modem cable
2) Modem
d. If this procedure does not fix the problem, call service support.
Chapter 4. Checkpoints 149
Firmware Checkpoints
Firmware uses checkpoints (progress codes) in the range of Exxx to EFFF. These checkpoints occur
during system startup and can be useful in diagnosing certain problems. Service processor checkpoints
are listed in “Service Processor Checkpoints” on page 143.
If you have a checkpoint with no location code associated with it, see “Determining Location Code” on
page 180. If a checkpoint has a location code associated with a FRU replacement, see “Mapping AIX and
Physical Location Codes” on page 28 for physical location information.
If you replace FRUs and the problem is still not corrected, go to “MAP 1548: Processor Subsystem
Problem Isolation” on page 123 unless otherwise indicated in the tables.
Note: If you receive a four-digit code in the range of E1xx to EFFF that is not listed in the following table,
go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
Checkpoint Description Action/
Possible Failing FRU
E101 Create RTAS node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E102 Load/Init RTAS Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E105 Transfer control to operating system
(normal boot).
See “Boot Problems/Concerns” on
page 173.
E10A Load RTAS device tree Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E10B Set RTAS device properties Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E111 GOOD CRC - jump to composite image Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E120 Initialize I/O and early memory block Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E121 Initialize service processor Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E122 RTAS sensor setup complete Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E130 Build device tree Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E131 Build device tree Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E132 Build device tree Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E133 Build device tree Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E134 Create memory node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E138 Create options node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E139 Create node aliases and system aliases Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E13A Create packages node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
150 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E140 Operating system load See “Boot Problems/Concerns” on
page 173.
E149 Create boot mgr node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E14C Create terminal-emulator node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E14D Loading boot image See “Boot Problems/Concerns” on
page 173.
E150 Create host (primary) PCI controller node If a location code is specified, replace the
adapter at that location code.
If no location code is specified, go to “MAP
1540: Problem Isolation Procedures” on
page 108.
E151 Probing primary PCI bus If a location code is specified, replace the
adapter at that location code.
If no location code is specified, go to “MAP
1540: Problem Isolation Procedures” on
page 108.
E152 Probing for adapter FCODE, evaluate if
present
Follow the repair action listed for
checkpoint E151.
E153 End adapter FCODE, probe/evaluate Follow the repair action listed for
checkpoint E151.
E154 Create PCI bridge node Follow the repair action listed for
checkpoint E151.
E155 Probing PCI bridge secondary bus Follow the repair action listed for
checkpoint E151.
E156 Create PCI Ethernet node If a location code is specified, replace the
adapter at that location code. If no location
code is specified, go to “MAP 1540:
Problem Isolation Procedures” on page 108.
E158 System firmware waiting for a virtual
terminal to be opened on an HMC for SMS
menus to be displayed.
Open a virtual terminal window on the
HMC.
E15B Transferring control to operating system
(service mode boot)
See “Boot Problems/Concerns” on
page 173.
E15F Adapter VPD probe Follow the repair action listed for
checkpoint E151.
E170 Start of PCI Bus Probe Follow the repair action listed for
checkpoint E151.
E172 First pass of PCI device probe Follow the repair action listed for
checkpoint E151.
Chapter 4. Checkpoints 151
Checkpoint Description Action/
Possible Failing FRU
E174 Description: Establishing host connection
Action/Possible Failing FRU: If the system is not connected to an active network, or if
the target server is inaccessible (which can result from incorrect IP parameters being
specified), the system still attempts to boot and because time-out durations are
necessarily long to accommodate retries, the system may appear to hang.
Restart the system and get to the SMS utilities menu. Under Select Boot Options,
verify:
v Is the intended boot device correctly specified in the boot list?
v Are the IP parameters correct?
v Look at the server configuration for this client.
v Attempt to ping the target server using the ping utility in SMS.
E175 BootP request Follow the repair actions for checkpoint
E174.
E176 TFTP file transfer Follow the repair actions for checkpoint
E174.
E177 Transfer failure due to TFTP error condition See “Boot Problems/Concerns” on
page 173.
E17B Processor clock speed measurement Call service support.
E183 Service processor POST 1. Replace the CEC backplane, location:
U0.1-P1.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E18D A system reset request has been sent to
the service processor.
1. Check for system firmware updates.
2. Call service support.
E191 Create ISA reserved node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E19A NVRAM auto-boot? variable not found -
assume FALSE
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E19B NVRAM menu? variable not found -
assume FALSE
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E19D Create NVRAM node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E19E Real-time clock (RTC) initialization Refer to error code 28030xxx in
“Checkpoint and Error Code Index” on
page 181.
E1A0 User requested boot to SMS menus via
keyboard entry
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A1 User requested boot to open firmware
prompt via keyboard entry
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A2 User requested boot using default service
mode boot list via keyboard entry
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A3 User requested boot using customized
service mode boot list via keyboard entry
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A4 User requested boot to SMS menus via
HMC or CSP command
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
152 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E1A5 User requested boot to open firmware
prompt via HMC or CSP command
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A6 User requested boot using default service
mode boot list via HMC or CSP command
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A7 User requested boot using customized
service mode boot list via HMC or CSP
command
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A8 System booting to open firmware prompt Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1A9 Description: System booting to SMS menus
Action/Possible Failing FRU:
1. If a graphics adapter is driving the console display, connect a laptop (and cable PN
62H4857) to serial port 1 on the processor subsystem. If the SMS menus appear on
the laptop you may have a problem with the display. Go to the “Quick Entry MAP” on
page 58 with the display problem symptom.
2. If the SMS menus do not appear on the laptop in step 1 or a graphics adapter is not
driving the console display, go to “MAP 1540: Problem Isolation Procedures” on
page 108.
E1AA System NVRAM settings during boot Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1AB System booting using default service mode
boot list
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1AC System booting using customized service
mode boot list
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1AD System booting to operating system Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1AE System booted to multiboot menu via
NVRAM settings
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1AF System booted to utilities menu via NVRAM
settings
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1B1 Create serial node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1B6 Probe for (ISA) keyboard Replace the CEC backplane,
Location: U0.1-P1.
E1BD Probe for (ISA) mouse Replace the CEC backplane,
Location: U0.1-P1.
E1C4 Build ISA timer chip node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1C5 Create ISA interrupt controller (pic) node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1C6 Create DMA node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1D0 Create PCI SCSI node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1D3 Create SCSI block device node (SD) Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1D4 Create SCSI byte device node (ST) Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 153
Checkpoint Description Action/
Possible Failing FRU
E1DB Create floppy controller (FDC) node Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1DC Description: Dynamic console selection.
Action/Possibly Failing FRU
1. If a console is attached, but nothing is displayed on it, go to ²All display problems² in
the Entry MAP.
2. If selection screens can be seen on the terminals and the appropriate key on the
input device is pressed within 60 seconds but there is no response to the keystroke:
a. If selecting the console with a keyboard attached to the system, replace the
keyboard, then replace the CEC backplane, Location: U0.1-P1.
b. If selecting the console with an ASCII terminal, suspect the terminal. Use the
problem determination procedures for the terminal.
Note: Terminal setting should be:
v 9600 baud
v no parity
v 8 data bits
v 1 stop bit
E1F0 Start O.B.E. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1F1 Begin selftest sequence on boot devices. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1F2 Power-on password prompt The prompt should be visible on the
firmware console (maybe a virtual terminal
on the HMC).
If a console is attached but nothing is
displayed on it, go to the “Quick Entry
MAP” on page 58 with the symptom ²All
display problems.²
E1F3 Privileged-access password prompt The prompt should be visible on the
firmware console (maybe a virtual terminal
on the HMC).
If a console is attached but nothing is
displayed on it, go to the “Quick Entry
MAP” on page 58 with the symptom ²All
display problems.²
E1F4 End self-test sequence on boot devices;
begin SMS.
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1F5 Build boot device list Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1F6 Determine boot device sequence Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1F7 No boot image located Go to “Boot Problems/Concerns” on
page 173.
E1F8 Building boot device list for SCSI adapters.
(The location of the SCSI adapter being
scanned is also displayed.)
1. If a location code is available, follow the
Repair actions listed for error code
21A0 00XX.
2. If no location code is available, go to
“MAP 1540: Problem Isolation
Procedures” on page 108.
154 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E1F9 Building boot device list for fibre-channel
adapters. (The location of the SAN adapter
being scanned is also displayed.)
1. If a location code is available, follow the
Repair actions listed for error code
21A0 00XX.
2. If no location code is available, go to
“MAP 1540: Problem Isolation
Procedures” on page 108.
E1FA Building boot device list for SCSI adapters.
(The device ID and the device LUN being
scanned are also displayed.)
Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E1FB Scan SCSI bus for attached devices 1. If a location code is available, follow the
Repair actions listed for error code
21A0 00XX.
2. If no location code is available, go to
“MAP 1540: Problem Isolation
Procedures” on page 108.
E1FD The operator panel alternates between the
code E1FD and another Exxx code, where
Exxx is the point at which the error
occurred.
1. If a location code is available, follow the
actions listed for error code 21A0 00xx.
2. If no location code is available, go to
“MAP 1540: Problem Isolation
Procedures” on page 108.
E1FE Building boot device list for fibre-channel
adapters. (The WWPN of the SAN adapter
being scanned is also displayed.)
1. If a location code is available, follow the
Repair actions listed for error code
21A0 00XX.
2. If no location code is available, go to
“MAP 1540: Problem Isolation
Procedures” on page 108.
E1FF Building boot device list for fibre-channel
adapters. (The WWPN of the SAN adapter
being scanned is also displayed.)
Go to “MAP 1542: I/O Problem Isolation” on
page 111.
E20F System firmware has exited to open
firmware prompt
1. Replace the CEC backplane,
Location: U0.1-P1.
2. Call service support.
E440 Validate NVRAM, initialize partitions as
needed
1. Check for system firmware updates.
2. Replace the CEC backplane,
Location: U0.1-P1.
3. Call service support.
E441 Generate /options node NVRAM
configuration variable properties
1. Check for system firmware updates.
2. Replace the CEC backplane,
Location: U0.1-P1.
3. Call service support.
E442 Validate NVRAM partitions 1. Check for system firmware updates.
2. Replace the CEC backplane,
Location: U0.1-P1.
3. Call service support.
E443 Generate NVRAM configuration variable
dictionary words
Suspect a system firmware problem if
problem persists. If a system firmware
update is available, install it.
E500 Begin I/O drawer configuration 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 155
Checkpoint Description Action/
Possible Failing FRU
E501 Initialize I/O data structures 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E502 Set up I/O data structure pointers 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E503 Initialize location code data structures 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E504 Initialize memory map data structures 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E505 Enable the time base for all RIO hubs 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E506 Reset time base of processors 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E507 RIO hub assignment 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E508 Begin initializing RIO hubs 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E509 Begin configuring EADS 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E50A Check for missing RIO interface chips 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E50B Save RIO hub information 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E50C Copy all of the memory ranges 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E50D Copy all of the I/O data 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
156 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E50E Copy all of the RIO hub information 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E50F Clean up the error registers 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E51F End of I/O configuration 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E520 Set up the RIO architecture registers 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E521 Set up interrupts for the current RIO hub 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E522 Set up miscellaneous registers for the
current RIO hub
1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E523 Initialize data structures and enable all RIO
hub ports
1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E524 Determine I/O configuration of current RIO
hub
1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E525 Set up the ETE of the current RIO hub 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 157
Checkpoint Description Action/
Possible Failing FRU
E526 Reassign the I/O subsystems to the closest
RIO hub
1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E527 Enable the memory MAP and I/O MAP for
the current RIO hub port
1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E52F End of configuration of the RIO hub 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E528 Begin search for the next available I/O
drawer from a current RIO port
1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the CEC backplane, U0.1-P1.
4. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
E529 Enable the control port and driver of the
current RIO hub
1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the CEC backplane, U0.1-P1.
4. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
158 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E52A Search for the next available I/O drawer
from a current RIO port
1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the CEC backplane, U0.1-P1.
4. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
E52B The RIO cable connected to the RIO port
has been detected
1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the CEC backplane, U0.1-P1.
4. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
E530 Checking RIO hub ports on current RIO
hub
1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E531 Get RIO hub node ID 1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E532 RIO cable detected; enable the RIO hub
port
1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 159
Checkpoint Description Action/
Possible Failing FRU
E533 Set up the RIO hub routing table 1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E534 Get the node ID of the node on the other
end of the RIO cable
1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E535 Valid node ID detected at other end of the
RIO cable
1. This error code may indicate that the
RIO-G cables are not properly
connected to the I/O subsystems.
Check the RIO cabling.
2. Replace the RIO-G cable specified by
the location code.
3. Replace the RIO riser card that the
failing drawer is attached to.
4. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
5. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E536 I/O subsystem detected at other end of the
RIO cable
1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E537 Configuring the I/O subsystem 1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E538 Detected the RIO hub on the other end of
the RIO cable
1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
160 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E539 Enabling the RIO hub ports 1. Check for system firmware updates.
2. Replace the RIO riser card specified by
the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E53A Set up the RIO port speed of the exit port
of the I/O subsystem connected to the RIO
hub port
1. Replace the RIO-G cable specified by
the location code.
2. Replace the RIO riser card that the
failing drawer is attached to.
3. Replace the I/O subsystem planar that
the RIO-G cable in step 1 is attached
to.
4. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E53B End of looking for next node port 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E53C End of looking for next RIO hub port 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E53D Disable the anynode mode Replace the I/O subsytem backplane that
the RIO cable specified by the location
code is attached to.
E540 Begin initialization of I/O drawer 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E541 Initialize the routing table 1. Replace the I/O subsystem planar
specified by the location code.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E544 Enable the RIO ports Replace the I/O subsystem planar specified
by the location code.
E545 Set up the device routing address for this
new node ID
Replace the I/O subsystem planar specified
by the location code.
E546 Set up the RIO interface chip new node ID Replace the I/O subsystem planar specified
by the location code.
E547 Set up the RIO interface chip routing table Replace the I/O subsystem planar specified
by the location code.
E548 Disable any node mode Replace the I/O subsystem planar specified
by the location code.
E549 Adjust RIO hub table Replace the I/O subsystem planar specified
by the location code.
Chapter 4. Checkpoints 161
Checkpoint Description Action/
Possible Failing FRU
E54A Configure the I/O drawer 1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
E54B Update the I/O drawer routing table of RIO
ports
1. Check the RIO cable specified by the
location code. If the location code
specifies an I/O subsystem or its RIO
interface, check the RIO cables going
into and out of the I/O subsystem.
2. Replace the RIO cables specified by
the location code. If the location code
specifies an I/O subsystem or its RIO
interface, replace the RIO cables going
into and out of the I/O subsystem.
3. Replace the I/O subsystem backplane
specified by the location code, or the
I/O subsystem backplane that the RIO
cables in step 1 are attached to.
E550 Begin to initialize RIO interface chip Replace the device specified by the
location code.
E551 Set up the RIO speed Replace the device specified by the
location code.
E552 Set up RIO interface chip registers Replace the device specified by the
location code.
E553 Set up the RIO interface chip base address Replace the device specified by the
location code.
E554 Set up more RIO interface chip registers Replace the device specified by the
location code.
E555 Set up the RIO interface chip RIO link Replace the device specified by the
location code.
E556 Set up and initialize RIO interface chips
under current RIO interface chip
Replace the device specified by the
location code.
E55F End of I/O drawer initialization Replace the device specified by the
location code.
E560 Begin initializing the I/O drawer 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
162 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E561 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E562 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E563 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E564 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E565 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
Chapter 4. Checkpoints 163
Checkpoint Description Action/
Possible Failing FRU
E566 Enable device arbitration on the RIO
interface chip
If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E567 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E568 Set up the external interrupt vector register If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E569 Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E56A Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
164 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E56B Set up RIO interface chip registers If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E56C Set up the RIO interface chip PCI bus If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E56D Identify devices on current bus If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E56E Begin initializing the service processor
control chip
If the location code associated with this
checkpoint is either U0.1-P1 or U0.1-P2, do
the following:
1. Replace the PCI riser card in the CEC
drawer, location: U0.1-P2.
2. Replace the CEC backplane: location
U0.1-P1.
Otherwise, replace the device specified by
the location code.
E56F End initializing the service processor control
chip
1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E570 Initialize the PCI-ISA bridge chip Replace the device specified by the
location code.
E571 Initialize the integrated Ethernet (if present) Replace the device specified by the
location code.
E572 Initialize the integrated SCSI Replace the device specified by the
location code.
E57F End of RIO interface chip initialization 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 165
Checkpoint Description Action/
Possible Failing FRU
E580 Begin initializing the EADS module. 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E581 Read the PCI bus configuration header. 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E582 EADS has been detected. 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E583 Running BIST on the current EADS 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E584 Checking function of current EADS 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E585 Valid function of current EADS detected 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E586 Set up EADS function 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E587 Set up EADS BIST 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E588 Set up EADS function register 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
166 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E589 Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58A Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58B Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58C Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58D Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58E Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E58F Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E590 Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E591 Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 4. Checkpoints 167
Checkpoint Description Action/
Possible Failing FRU
E592 Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E593 Set up EADS function registers 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E594 Begin probing slots 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E595 Detected valid adapter in slot 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E596 Read the EADS revision ID and class code. 1. Replace the RIO cable specified by the
location code.
2. Replace the I/O subsystem planar that
the RIO cable in step 1 is attached to.
E59E End of walking bus of current EADS 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E5E0 Initialize system processor array 1. Check for system firmware updates.
2. Replace the I/O subsystem backplane
specified by the location code.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E5E1 Reserved
E5E2 Initialization of the hypervisor, phase 1 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E3 Not in quick-restart: load and set up open
firmware code in memory
1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
168 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E5E4 Activate slave processors 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E5 Initialize the hypervisor, phase 2 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E6 Set L2 and processor status 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E7 Save I/O hub information for RTAS and
open firmware
1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E8 Set up registers 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5E9 Reset the quick restart path 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5EE Hang state in main control code 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5EF End of low-level firmware 1. Check for system firmware updates.
2. Go to “MAP 1548: Processor
Subsystem Problem Isolation” on
page 123.
E5FE Get the system’s licensed memory quantity
from the service processor
1. Remove, then reapply AC power to the
system using the UEPO switch.
2. Verify that the CUoD capacity card is
installed and properly seated.
3. Call service support.
E600 SSA PCI adapter open firmware has run
successfully.
Replace the adapter.
E601 SSA PCI adapter BIST has started but
failed to complete after 4 seconds.
Replace the adapter.
E602 SSA PCI open firmware has started. Replace the adapter.
E603 SSA PCI adapter BIST has completed with
an error.
Replace the adapter.
E604 SSA PCI adapter BIST and subsequent
POSTs have completed successfully.
Replace the adapter.
Chapter 4. Checkpoints 169
Checkpoint Description Action/
Possible Failing FRU
E605 SSA PCI adapter BIST has completed
successfully but subsequent POSTS have
failed.
Replace the adapter.
E60E SSA PCI open firmware about to exit (no
stack corruption).
Replace the adapter.
E60F SSA PCI open firmware has run
unsuccessfully.
Replace the adapter.
E6FF SSA PCI open firmware about to exit (with
stack corruption).
Replace the adapter.
E700 Create system firmware VPD System firmware may be corrupted.
Reinstall system firmware.
E701 Create memory VPD. 1. Check for system firmware updates.
2. Call for support.
E702 Create processor card VPD. 1. Check for system firmware updates.
2. Call for support.
E703 Create operator panel VPD. 1. Check for system firmware updates.
2. Call for support.
E704 Create CEC VPD. 1. Check for system firmware updates.
2. Call for support.
E705 Create VPD. 1. Check for system firmware updates.
2. Call for support.
E706 Create VPD. 1. Check for system firmware updates.
2. Call for support.
E707 Create system VPD. 1. Check for system firmware updates.
2. Call for support.
E708 Create VPD. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E709 Create VPD via SPCN. 1. Check for system firmware updates.
2. Call service support.
E70A Create service processor VPD. 1. Check for system firmware updates.
2. Call for support.
E70B Create L3 module VPD 1. Check for system firmware updates.
2. Call service support.
E70C Create I/O backplane VPD (I2C problem) Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E70D Create VPD 1. Check for system firmware updates.
2. Call service support.
E70F Collect CEC VPD Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E7EE Delaying while waiting for power subsystem
VPD collection to be completed;
informational only
1. Check for system firmware updates.
2. Call service support.
170 Service Guide
Checkpoint Description Action/
Possible Failing FRU
E7FF Successful end of VPD creation Informational message. If the system hangs
here, go to “MAP 1540: Problem Isolation
Procedures” on page 108.
E800 RTAS initialization problem 1. Check for system firmware updates.
2. Call service support.
E841 Initializing RTAS/open firmware device tree
interface
1. Check for system firmware updates.
2. Call service support.
E842 Initializing RTAS/service processor interface 1. Check for system firmware updates.
2. Replace the CEC backplane
Location: U0.1-P1.
3. Call service support.
E843 Initializing RTAS/sensor interface 1. Check for system firmware updates.
2. Call service support.
E844 Initializing RTAS/time-of-day clock interface 1. Check for system firmware updates.
2. Replace the CEC backplane
Location: U0.1-P1.
3. Call service support.
E845 Initializing interface/sensor access 1. Check for system firmware updates.
2. Call service support.
E846 Initializing interface/sensor access 1. Check for system firmware updates.
2. Call service support.
E84F Initializing RTAS/operator panel interface 1. Check for system firmware updates.
2. Call service support.
E850 Initializing RTAS/cache interface 1. Check for system firmware updates.
2. Call service support.
E851 Initializing RTAS/hardware access 1. Check for system firmware updates.
2. Call service support.
E852 Initializing RTAS/PCI bus interface 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
3. Call service support.
E853 Start cable ID request process 1. Check the SPCN and RIO cabling to
the I/O subystems.
2. Remove, then reapply power to the
system.
3. Go to “MAP 1542: I/O Problem
Isolation” on page 111.
4. Call service support.
Chapter 4. Checkpoints 171
Checkpoint Description Action/
Possible Failing FRU
E854 Description: Start collecting cable IDs
Action:
1. Check the SPCN and RIO cabling to the I/O subystems.
2. Remove, then reapply power to the system.
3. Go to “MAP 1542: I/O Problem Isolation” on page 111.
4. Call service support.
E855 Description: Start setting RIO link speed
Action:
1. Check the SPCN and RIO cabling to the I/O subystems.
2. Remove, then reapply power to the system.
3. Go to “MAP 1542: I/O Problem Isolation” on page 111.
4. Call service support.
E860 Initializing RTAS/power subsystem interface 1. Check for system firmware updates.
2. Call service support.
E861 Initializing RTAS/PCI bus interface 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
3. Call service support.
E870 Initializing RTAS/logs interface 1. Check for system firmware updates.
2. Call service support.
E871 Initializing RTAS/EEH handling interface 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
3. Call service support.
E872 Initializing RTAS/error inject interface 1. Check for system firmware updates.
2. Call service support.
E873 Initializing RTAS/error handling interface 1. Check for system firmware updates.
2. Call service support.
E880 Initializing RTAS/debug interface 1. Check for system firmware updates.
2. Call service support.
E881 Initializing RTAS/utility interface 1. Check for system firmware updates.
2. Call service support.
E882 Initializing RTAS/softpatch register interface 1. Check for system firmware updates.
2. Call service support.
E890 Initializing RTAS/hot-plug interface 1. Check for system firmware updates.
2. Call service support.
E8C1 Initializing interface/client interface access 1. Check for system firmware updates.
2. Call service support.
E8D1 Initializing interface/memory access 1. Check for system firmware updates.
2. Call service support.
EAA1 Probe PCI-PCI bridge bus Follow the repair actions for checkpoint
E151.
172 Service Guide
Boot Problems/Concerns
Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended
period of time while the boot image is retrieved from the device. This is particularly true for tape and
network boot attempts. If booting from CD-ROM or tape, watch for activity on the drive’s LED indicator. A
blinking LED indicates that the loading of either the boot image or additional information required by the
operating system being booted is still in progress. If the checkpoint is displayed for an extended period of
time and the drive LED is not indicating any activity, there might be a problem loading the boot image from
the device.
Note: For network boot attempts, if the system is not connected to an active network or if the target
server is inaccessible (this can also result from incorrect IP parameters being supplied), the system
will still attempt to boot. Because time-out durations are necessarily long to accommodate retries,
the system may appear to be hung.
This procedure assumes that a diagnostic CD-ROM is available and that an IDE CD-ROM or IDE
DVD-RAM drive is connected to the internal IDE connector. If your CD-ROM or DVD-RAM drive is SCSI
note the following:
v If your system is configured with a four slotted riser card, the CD-ROM or DVD RAM drive must be
connected to a SCSI adapter.
v If your system is configured with a four slotted riser card, for peek performance load the SCSI adapter
into either PCI-X slot 3 or PCI-X slot 4 on the PCI riser card. See “PCI Riser Card Locations” on
page 16 for riser card description and connector locations.
v If your system is configured with a six slotted riser card the CD-ROM or DVD-RAM can be connected to
the integratted SE-SCSI connector located on the PCI riser card. See “PCI Riser Card Locations” on
page 16 for riser card description and connector locations.
Booting the diagnostics image from CD-ROM or a NIM server is referred to as running standalone
diagnostics.
Step 1
Without an HMC Attached to the System: Restart the system and access the firmware SMS main
menu. Select View Error Log. Look for any errors with time stamps that are close to the boot during
which the error was received that sent you here. Perform the actions associated with these error code(s) if
present.
If the problem is resolved, go to ²MAP 410: Repair Checkout² in the Diagnostic Information for Multiple
Bus Systems. If the problem is not resolved, restart the system and access the firmware SMS menu.
Select Select Boot Options.
1. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list:
a. Remove all removable media from devices in the boot list from which you do not want to boot.
b. If attempting to boot from the network, go to “Step 2” on page 174.
c. If attempting to boot from a disk drive or CD-ROM, go to “Step 3” on page 175.
Chapter 4. Checkpoints 173
2. If the intended boot device is not correctly identified in the boot sequence, add it to the boot sequence
using the SMS menus. If the device can be added to the boot sequence, reboot the system, then go to
MAP 0410: Repair Checkout in the Diagnostic Information for Multiple Bus Systems. If the intended
boot device cannot be added to the boot list, go to “Step 3” on page 175.
With an HMC Attached to the System:
Attention: If the system is running partitions from partition standby (LPAR), the following procedure
addresses the problem in which one partition will not boot while other partitions boot successfully and run
the operating system successfully.
This procedure also addresses the problem in which a system does not boot to a full system partition. All
of the system’s resources are available to the diagnostics; the same general procedure applies, with the
exception of moving resources between partitions.
Attention: It is the customer’s responsibility to move devices between partitions. If a device must be
moved to another partition to run standalone diagnostics, contact the customer or system administrator. (If
a SCSI CD-ROM or DVD-RAM must be moved to another partition, all SCSI devices connected to that
SCSI adapter must be moved because moves are done at the slot level, not at the device level. In
addition, if a CD-ROM or DVD-RAM drive is connected to the internal IDE adapter, and the CD-ROM or
DVD-RAM drive must be moved to another partition, all of the integrated I/O devices will move with it.)
Both partitions must be rebooted if a device is moved between partitions.
Step 2
This procedure assumes that a CD-ROM or DVD-RAM drive is connected to the system, or that the
diagnostics can be run from a NIM (Network Installation Management) server.
1. Look at the service action event-error log in the Service Focal Point on the HMC (see “Service Focal
Point” on page 48 for instructions on accessing the service action event log). If there are open entries
that affect devices in the boot path of the partition, perform the actions to correct those errors. If there
are open entries that indicate problems with I/O cabling, perform the actions to correct those problems.
Try to reboot the partition.
If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated,
then reactivated before attempting to reboot.
If a 20EE xxxx error code is reported, the partition is already deactivated and in the error state. Reboot
by activating the partition. If the system is running in full system partition, the power must be cycled
before the system can be rebooted. If the reboot is still not successful, continue to step substep 2.
2. Boot to the SMS main menu as follows:
v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition
and select Boot to SMS, then activate the partition.
v If you are rebooting the system in full system partition, start the service processor menus, select
option 2, System Power Control Menu, then select option 7, Boot Mode Menu, then select option
1, Boot to SMS menu. Enter 98 to return to the system power control menu, then select option 4,
Power-on System. Enter y to confirm.
At the SMS main menu, select Select Boot Devices.
a. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list:
1) Remove all removable media from devices in the boot list from which you do not want to boot.
2) If attempting to boot from a network, go to step “Step 2”.
If attempting to boot from a disk drive or CD-ROM, go to step “Step 3” on page 175.
174 Service Guide
b. If the intended boot device is not correctly listed in the boot sequence, add it to the boot sequence
using the SMS menus. If the device can be added to the boot sequence, reboot the partition. If the
intended boot device cannot be added to the boot list, ask the customer or system administrator to
verify that the device you are trying to boot from is assigned to the correct partition. If it is, go to
step “Step 3”.
If attempting to boot from the network:
1. Verify that IP parameters are correct.
2. Attempt to ²ping² the target server using the SMS Ping utility. If the ping is not successful, have the
network administrator verify the server configuration for this client.
3. Check with the network administrator to ensure that the network is up.
4. Check the network cabling to the adapter.
5. Turn the power off, then on and retry the boot operation.
Step 3
Try to boot and run standalone diagnostics against the system, particularly against the intended boot
device.
Standalone diagnostics can be run from CD-ROM or a NIM server. To boot standalone diagnostics, follow
the detailed procedures in “Considerations for Running Standalone Diagnostics from CD-ROM” on
page 301 or “Running Standalone Diagnostics from a Network Installation Management (NIM) Server with
an HMC Attached to the System” on page 302.
Attention: Standalone diagnostics may be run on systems configured for either full system partition or
partition standby. When attempting to boot diagnostics on a partition from partition standby, the device
from which you are booting standalone diagnostics must be made available to the partition that is not able
to boot, if it is not already in that partition. Contact the customer or system administrator if a device must
be moved between partitions in order to boot standalone diagnostics. If devices are moved between
partitions, both partitions must be rebooted. For more information, see “Standalone Diagnostics
Operations” on page 301 or “Running the Standalone Diagnostics with No HMC Attached” on page 302.
If diagnostics do not boot successfully, and a SCSI boot failure (cannot boot from a SCSI attached device)
is also occurring, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123.
If diagnostics do not boot successfully, and a SCSI boot failure (cannot boot from a SCSI attached device)
is not occurring:
1. Check IDE cabling to boot device.
2. Check device configuration jumpers.
If no problem is found with the cabling or the jumpers, go to “Step 4” on page 176.
If diagnostics boot successfully and ²No Trouble Found² was the result when diagnostics were run against
the intended boot device, go to substep 4. If the diagnostics boot successfully, but the intended boot
device was not present in the output of the ²Display Configuration and Resource List², which is run from
the Task Selelction Menu:
1. If you booted standalone diagnostics from IDE CD-ROM, follow these steps. After each action, do the
following:
Restart the system. Stop at the SMS menus and select Select Boot Options. If the device is present
in the boot list, and it was present previously, go to MAP 0410: Repair Checkout in the Diagnostic
Information for Multiple Bus Systems.
Chapter 4. Checkpoints 175
If the device still is not present, continue with the actions shown in the following list.
a. Check the SCSI cables
b. Remove all hot-swap disk drives except the intended boot device if its a hot-swap drive. If the boot
device is present in the boot list when the system is booted to the SMS menus, add the hot-swap
disk drives back in, one at a time, until the failing device is isolated.
c. Disconnect all other internal SCSI devices. If the boot device is present in the boot list when the
system is booted to the SMS menus, reconnect the internal SCSI devices, one at a time, until the
failing device or cable is isolated.
d. Replace the SCSI cables.
e. Replace the PCI riser card.
f. Replace the SCSI backplane.
g. Replace the intended boot device.
h. Replace the CEC backplane.
2. Go to the Task Selection Menu and select Display Configuration and Resource List. If the intended
boot device is not listed, go to ²MAP 0290: Missing Resource Problem Resolution² in Diagnostic
Information for Multiple Bus Systems.
3. If an SRN, not an 8-digit error code, is reported, go to Diagnostic Information for Multiple Bus Systems.
4. Choose one of the following, then go to step 5.
v If the diagnostics are successful, and no other devices have been disconnected, it may be
necessary to perform an operating system-specific recovery process, or reinstall the operating
system.
v If the diagnostics are successful, and devices have been removed, reinstall them one at a time.
After each device is reinstalled, reboot the system. Continue this procedure until the failing device is
isolated. Replace the failing device.
5. If you replaced the indicated FRUs and the problem is not corrected, or the above descriptions did not
address your particular situation, go to “MAP 1548: Processor Subsystem Problem Isolation” on
page 123.
If the problem has been corrected, go to ²MAP 0410: Repair Checkout² in Diagnostic Information for
Multiple Bus Systems.
Step 4
It is possible that another installed adapter is causing the problem. Do the following:
1. Remove all installed adapters except the one the CD-ROM drive is attached to and the one used for
the console.
2. Try to boot the standalone diagnostics again.
3. If unable to load standalone diagnostics, go to “Step 5” on page 177.
4. If standalone diagnostics load, reinstall adapters (and attached devices as applicable) one at a time
and retry the boot operation until the problem recurs. Then replace the adapter or device that caused
the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus
Systems.
176 Service Guide
Step 5
The CD-ROM drive, IDE cable, graphics adapter (if installed), or the system board is most likely defective.
A TTY terminal attached to the serial port also can be used to determine if the graphics adapter is causing
the problem. This is done by removing the graphics adapter, attaching a TTY to the serial port, and
retrying standalone diagnostics. If the standalone diagnostics load, replace the graphics adapter.
1. Replace the CD-ROM drive.
2. Replace the IDE cable.
3. Replace the system board.
4. If you replaced the indicated FRUs and the problem is still not corrected, or the above descriptions did
not address your particular situation, go to “MAP 1548: Processor Subsystem Problem Isolation” on
page 123.
5. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems.
Chapter 4. Checkpoints 177
178 Service Guide
Chapter 5. Error Code to FRU Index
Attention: Start your activity in this chapter by going to “Checkpoint and Error Code Index” on page 181.
In this section, you may be asked to perform additional isolation procedures before looking up the actions
for a 4-character checkpoint or 8-character error code.
The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed
first. Use this index to help you decide which FRUs to replace when servicing the system.
v If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548:
Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
v If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the
Diagnostic Information for Multiple Bus Systems.
Note: Do not shut down the system if the FRU was replaced by hot swap.
Performing Slow Boot
On this system, the error code displayed in the operator panel may not indicate the root cause of the
problem. To fully analyze all of the available information, perform the following steps before doing a
hardware repair or replacement:
1. Record the 8-character error code (and location code if present) in the operator panel.
2. Power off the system using the white button. Refer to “Stopping the System without an HMC Attached”
on page 359.
3. Select slow boot mode (select disable fast boot) on the system power control menu from the service
processor main menu.
4. Do a slow-mode boot to the SMS menus. This boot can be specified using the System Power Control
Menu on the service processor main menu. Refer to “System Power Control Menu” on page 314. (A
fast-mode boot skips much of the built-in diagnostic testing.) A slow-mode boot may yield a new
8-character error code on the operator panel and new errors in the service processor error log. Power
on the system. Refer to “Starting the System without an HMC Attached” on page 360.
v If a new error code is reported, use it in subsequent steps to continue problem analysis.
v If the system hangs with the same error code, continue with the next step in the Checkpoints and
Error Codes Index.
v If no 8-character error code is displayed in the operator panel, use the original error code and
continue with the next step in the Checkpoints and Error Codes Index.
5. Power off the system.
6. At this point, the service processor menus should be available to examine the service processor error
log.
Four-Character Checkpoints
Four-character progress codes and checkpoints in the form 8xxx, 9xxx, Axxx, Bxxx, and Exxx are listed in
“Firmware Checkpoints” on page 150. If you have a four-character code that begins with any character
other than 8, 9, A, B, or E, refer to the Diagnostic Information for Multiple Bus Systems.
Replacing the Operator Panel
Licensed programs frequently rely on system information that is stored in the VPD module on the operator
panel. If the MAPs indicate that the operator panel should be replaced, the system VPD must be written
into the VPD module on the new operator panel. See “System Vital Product Data (VPD) Update
Procedure” on page 421.
179
Replacing the Network Adapter
If a network adapter is replaced, the network administrator must be notified so that the client IP addresses
used by the system can be updated. In addition, the operating system configuration of the network
controller may need to be changed in order to enable system startup. Also check to ensure that any client
or server that accesses this system is updated.
Determining Location Code
If you have an error code in the form 2xxx xxxx or Exxx, and no location code associated with it, the
location code may possibly be found in three places, depending on the state of the system:
v For systems with AIX installed; if the system will boot into AIX, the location code will be reported when
error log analysis is run under AIX diagnostics.
v If the system will boot into the system management services (SMS) menus, the SMS error log will
contain the location code associated with the error code. See “View Error Log” on page 347.
v If the system halted when the error occurred, the error code and location can be found in the LCD
progress indicator log, which can be accessed using the service processor menus. See “System
Information Menu” on page 317, then select Read Progress Indicators From Last System Boot. Find
the error code in the list that directed you to this note, and the location code should be a part of the
entry in the log.
FRU Identify LEDs
This system is configured with an arrangement of LEDs that help identify various components of the
system. These include, but are not limited to the following:
v Rack identify LED
v CEC drawer identify LED
v I/O drawer identify LED
v RIO port identify LED
v FRU identify LED
– Power subsystem FRUs
– CEC subsystem FRUs
– I/O subsystem FRUs
v I/O adapter identify LED
v DASD identify LED
The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy,
followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to
locate the failing FRU more easily.
Any identify LED in the system can be flashed when the system is in the failed state with power on by
using the service processor LED Control Menu contained in the “System Information Menu” on page 317.
Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task
through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is
outlined in the Diagnostic Information for Multiple Bus Systems.
180 Service Guide
If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be
flashed by one of following:
v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System
Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the
open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED,
type FRU-LED-MENU on the command line.
v If the system is logically partitioned, the HMC must be attached. For instructions on activating and
deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management
Console Installation and Operations Guide, order number SA38-0590.
Checkpoint and Error Code Index
The following table is an index to the checkpoints or error codes that you may receive to help diagnose a
failure. Locate the code you received, and follow the instructions to determine your next step.
Checkpoint or Error
Code
What You Should Do
0000 xxxx Operator Panel Error Codes
Go to “Operator Panel Error Codes” on page 182 and follow the instructions in the Operator
Panel Error Code table.
1xxx xxxx System Power Control Network (SPCN) Error Codes
Go to “SPCN Error Codes” on page 183 and follow the instructions in the SPCN Error Code
table.
2xxx xxxx Firmware Error Codes
Go to “Firmware Error Codes” on page 192 and follow the instructions in the Firmware Error
Code table.
4xxx xxxx Service Processor Error Codes
1. Perform slow boot, if not already performed earlier. See “Performing Slow Boot” on
page 179.
2. If the error codes are 450x xxxx, and if their actions call out memory DIMMs, go to “MAP
1240: Memory Problem Resolution” on page 68.
3. Go to “Service Processor Error Codes” on page 211 and follow the instructions in the
Service Processor Error Codes table.
8xxx
9xxx
Axxx
Bxxx
Service Processor Checkpoints
Go to “Service Processor Checkpoints” on page 143 for more information on these
checkpoints.
A1FD 000x The system has entered recovery mode. Go to “Recovery Mode” on page 335.
B006 xxxx
B1xx xxxx
Common Firmware Error Codes
Go to “Common Firmware Error Codes” on page 286 and follow the instructions in the
Common Firmware Error Codes table.
Exxx Firmware Checkpoints
Go to “Firmware Checkpoints” on page 150 for more information on these checkpoints.
Mxxx xxxx Problem Determination-Generated Error Codes
Go to “Problem Determination Generated Error Codes” on page 295 and follow the
instructions in the Problem Determination Generated Error Codes table.
Chapter 5. Error Code to FRU Index 181
Operator Panel Error Codes
Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP
1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
Table 2. Operator Panel Error Codes.
Error Code Description Action /
Possible Failing FRU
0000 1111 At power on, failure of power-good signal from
power supply.
1. CEC backplane
Location: U0.1-P1
2. CEC backplane power supply
Locations:
U0.1-V1
U0.1-V2
0000 1112 At power off, failure of power-good signal from
power supply.
1. CEC backplane
Location: U0.1-P1
2. CEC backplane power supply
Locations:
U0.1-V1
U0.1-V2
0000 1113 While power on, failure of power-good signal
from power supply.
1. CEC backplane
Location: U0.1-P1
2. CEC backplane power supply
Locations:
U0.1-V1
U0.1-V2
0000 1114 Operator panel and service processor have been
reset after main power disturbance.
Information message
0000 2222 Operator panel not communicating with service
processor
1. If system cannot be powered off, reset the
service processor (via the pinhole reset or by
disconnecting then reconnecting power).
2. CEC backplane
Location: U0.1-P1
0000 3333 SPCN communications error CEC backplane
Location: U0.1-P1
0000 3334 I2C repeater error CEC backplane
Location: UO.1-P1
0000 BBBB TOD battery malfunction TOD battery
Location: U0.1-P1-V3
182 Service Guide
SPCN Error Codes
Attention: If a power supply or fan is hot-swapped, wait five minutes, then recheck the operating system
error log. If the system has to be shut down to replace a power supply or fan, reboot the system, wait five
minutes after the operating system login prompt appears, then recheck the operating system error log. If
the detailed information for the most recent error with the same identifier indicates that the original problem
has not been fixed by performing the first action in the action column, continue with the next action in the
action column.
Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP
1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
Table 3. SPCN Error Codes for the CEC (system unit)
Error Code Description Action/
Possible Failing FRU
1011 00AC AC loss or brownout detected Informational
1011 00EF Room emergency power off detected Informational
1011 1510 Power supply 1 power source failure 1. Check input power to the power supply
2. System power supply 1
Location: U0.1-V1
3. CEC backplane
Location: U0.1-P1
1011 1511 Power supply 1 fault 1. Verify that the media support shelf is
installed and properly seated.
2. System power supply 1
Location: U0.1-V1
3. CEC backplane
Location: U0.1-P1
1011 1512 Power supply 1 non-power fault 1. System power supply 1
Location: U0.1-V1
2. CEC backplane
Location: U0.1-P1
1011 1513 Power supply 1 domain fault 1. Verify that the media support shelf is
installed and properly seated.
2. System power supply 1
Location: U0.1-V1
3. CEC backplane
Location: U0.1-P1
1011 1520 Power supply 2 power source failure 1. Check input power to power supply 2
2. System power supply 2
Location: U0.1-V2
3. CEC backplane
Location: U0.1-P1
1011 1521 Power supply 2 fault 1. System power supply 2
Location: U0.1-V2
2. CEC backplane
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 183
Table 3. SPCN Error Codes for the CEC (system unit) (continued)
Error Code Description Action/
Possible Failing FRU
1011 1522 Power supply 2 non-power fault 1. System power supply 2
Location: U0.1-V2
2. CEC backplane
Location: U0.1-P1
1011 1523 Power supply 2 domain fault 1. System power supply 2
Location: U0.1-V2
2. CEC backplane
Location: U0.1-P1
1011 1B0A +12V/-12V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1B0B 1.8V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1B0C 3.3V/1.5V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1B0D 2.5V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1B0E +5V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1B0F 1.5V/1.3V load fault 1. Replace power supply 1
Location: U0.1-V1
2. If power supply 2 is present, use removed
power supply 1 to replace power supply 2
3. CEC backplane
Location: U0.1-P1
1011 1F01 TMS log limit reached Call service support.
184 Service Guide
Table 3. SPCN Error Codes for the CEC (system unit) (continued)
Error Code Description Action/
Possible Failing FRU
1011 2600 Pgood fault CEC backplane
Location: U0.1-P1
1011 2603 Pgood fault CEC backplane
Location: U0.1-P1
1011 2612 Optical converter fault CEC backplane
Location: U0.1-P1
1011 3100 I2C controller fault CEC backplane
Location: U0.1-P1
1011 3105 I2C port 5 fault CEC backplane
Location: U0.1-P1
1011 3114 PCI riser card fault 1. PCI riser card
Location: U0.1-P2
2. CEC backplane
Location: U0.1-P1
1011 3116 I2C VPD interface CEC backplane
Location: U0.1-P1
1011 3118 SPCN VPD and NVS interface CEC backplane
Location: U0.1-P1
1011 7610 System processor fan 1 failure 1. Processor fan 1
Location: U0.1-F1
2. CEC backplane
Location: U0.1-P1
1011 7611 Processor card cooling fan missing from primary
location
1. Processor fan 1
Location: U0.1-F1
2. CEC backplane
Location: U0.1-P1
1011 7620 System processor fan 2 failure 1. Processor fan 2
Location: U0.1-F2
2. CEC backplane
Location: U0.1-P1
1011 7621 System processor fan 2 missing 1. Processor fan 2
Location: U0.1-F2
2. CEC backplane
Location: U0.1-P1
1011 7630 Slow system PCI fan 3 failure 1. PCI fan 3
Location: U0.1-F3
2. CEC backplane
Location: U0.1-P1
1011 7631 System PCI fan 3 missing 1. PCI fan 3
Location: U0.1-F3
2. CEC backplane
Location: U0.1-P1
1011 7640 Slow system PCI fan 3 failure 1. PCI fan 3
Location: U0.1-F3
2. CEC backplane
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 185
Table 3. SPCN Error Codes for the CEC (system unit) (continued)
Error Code Description Action/
Possible Failing FRU
1011 7641 System PCI fan 3 missing 1. PCI fan 3
Location: U0.1-F3
2. CEC backplane
Location: U0.1-P1
1011 7650 Slow system PCI fan 4 failure 1. PCI fan 4
Location: U0.1-F4
2. CEC backplane
Location: U0.1-P1
1011 7651 System PCI fan 4 missing 1. PCI fan 4
Location: U0.1-F4
2. CEC backplane
Location: U0.1-P1
1011 7660 System PCI fan 4 failure 1. PCI fan 4
Location: U0.1-F4
2. CEC backplane
Location: U0.1-P1
1011 7661 System PCI fan 4 missing 1. PCI fan 4
Location: U0.1-F4
2. CEC backplane
Location: U0.1-P1
1011 8400 VPD configuration Ignore all location codes.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
1011 8401 VPD access Ignore all location codes.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
1011 8402 VPD access-concentrator CEC backplane, U0.1-P1
1011 8403 VPD level mismatch 1. System VPD module, U0.1-L1-N1
2. CEC backplane, U0.1-P1
1011 8404 PU 2 mismatch 1. Processor card 2,
Location: U0.1-P1-C2
2. CEC backplane
Location: U0.1-P1
186 Service Guide
Table 3. SPCN Error Codes for the CEC (system unit) (continued)
Error Code Description Action/
Possible Failing FRU
1011 8409 No valid processor card VPD found 1. Verify that the processor card(s) is of the
right type for your system and is properly
installed.
2. If only one processor card is installed,
replace it. If there are two processor cards
installed, remove the second processor card.
If the problem is resolved, replace that
processor card. If it does not resolve the
problem, swap the first processor card with
the second, just removed, processor card. If
this resolves the problem, replace the
processor card that was originally in the first
slot. If it does not resolve the problem, go to
“MAP 1540: Problem Isolation Procedures”
on page 108.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
1011 840A VPD +5 stuck high Ignore all location codes.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108 and run minimum configuration.
1011 840B VPD +5 stuck low Ignore all location codes.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108 and run minimum configuration.
1011 840C Memory DIMM misplugged. Inspect system memory for proper installation of
memory DIMMs.
1011 840D SPCN configuration 1. Check cabling
2. Set the I/O type. See Power Control Network
Utilities Menu on page 321.
3. CEC backplane
Location: U0.1-P1
1011 840E SPCN default configuration loaded 1. Check cabling
2. Set the I/O type. See Power Control Network
Utilities Menu on page 321.
3. CEC backplane
Location: U0.1-P1
1011 840F I/O type not recognized 1. Check cabling
2. Set the I/O type. See Power Control Network
Utilities Menu on page 321.
3. CEC backplane
Location: U0.1-P1
1011 8413 Invalid processor 1 VPD Processor card
Location: U0.1-P1-C1
1011 8414 Invalid processor 2 VPD Processor card
Location: U0.1-P1-C2
1011 8423 No processor 1 VPD Processor card
Location: U0.1-P1-C1
1011 8424 No processor 2 VPD Processor card
Location: U0.1-P1-C2
Chapter 5. Error Code to FRU Index 187
Table 3. SPCN Error Codes for the CEC (system unit) (continued)
Error Code Description Action/
Possible Failing FRU
1011 9280 SPCN interface failure 1. Check the SPCN cables going to the CEC
drawer.
2. Check for system firmware updates. Apply
the update if it is available.
3. Replace the SPCN cables going to the CEC
drawer, locations: U0.1-P1/Q4#,
U0.1-P1/Q5#
4. Replace the system backplane, location:
U0.1-P1.
1011 9281 SPCN interface failure 1. Check the SPCN cables going to the CEC
drawer.
2. Check for system firmware updates. Apply
the update if it is available.
3. Replace the SPCN cables going to the CEC
drawer, locations: U0.1-P1/Q4#,
U0.1-P1/Q5#
4. Replace the system backplane, location:
U0.1-P1.
1011 9282 SPCN interface failure 1. Check the SPCN cables going to the CEC
drawer.
2. Check for system firmware updates. Apply
the update if it is available.
3. Replace the SPCN cables going to the CEC
drawer, locations: U0.1-P1/Q4#,
U0.1-P1/Q5#
4. Replace the system backplane, location:
U0.1-P1.
Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not
corrected, go to “MAP 1542: I/O Problem Isolation” on page 111 unless you were already directed to any
MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the
problem.
Attention: If the error code that brought you to this chapter originated in the operating system error log,
please be aware that an error may require an additional 15 minutes to be propagated to the operating
system error log.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
The “nn” in “1nn1 xxxx”, where nn is between 2 and 3F hexadecimal, inclusive, is replaced by the I/O
subsystem drawer number (in hexadecimal) in the error code that is reported by the system. The I/O
drawer number in the location code of a failing FRU will be converted to decimal (which is shown as U0.dd
in the following table). The “dd” value ranges from 02 to 63, inclusive. For example, the error code 11F1
1510 will have location codes U0.31-V1 and U0.31-P1 listed.
188 Service Guide
Table 4. SPCN Error Codes for I/O Subsystems
Error Code Description Action/
Possible Failing FRU
1nn1 00AC AC loss or brownout detected Informational message.
1nn1 00EF Remote emergency power off occurred Informational message.
1nn1 1510 Power supply 1 source failure 1. Check the input voltage to the power supply.
2. I/O subsystem power supply 1, U0.dd-V1.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1511 Power supply 1 fault 1. I/O subsystem power supply 1, U0.dd-V1.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1512 Power supply 1 non-power fault 1. I/O subsystem power supply 1, U0.dd-V1.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1513 Power supply 1 domain fault 1. I/O subsystem power supply 1, U0.dd-V1.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1517 Power supply 1 thermal fault 1. Check for obstructions to the air flow around
the I/O subsystem.
2. I/O subsystem power supply 1, U0.dd-V1.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1520 Power supply 2 source failure 1. Check the input voltage to the power supply.
2. I/O subsystem power supply 2, U0.dd-V2.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1521 Power supply 2 fault 1. I/O subsystem power supply 2, U0.dd-V2.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1522 Power supply 2 non-power fault 1. I/O subsystem power supply 2, U0.dd-V2.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1523 Power supply 2 domain fault 1. I/O subsystem power supply 2, U0.dd-V2.
2. I/O subsystem backplane, U0.dd-P1.
1nn1 1527 Power supply 2 thermal fault 1. Check for obstructions to the air flow around
the I/O subsystem.
2. I/O subsystem power supply 2, U0.dd-V2.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1B01 +12 V load fault 1. Replace power supply 1, U0.dd-V1.
2. If power supply 2 is present, use power
supply 1 that was removed to replace power
supply 2.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1B02 +5 V load fault 1. Replace power supply 1, U0.dd-V1.
2. If power supply 2 is present, use power
supply 1 that was removed to replace power
supply 2.
3. I/O subsystem backplane, U0.dd-P1.
Chapter 5. Error Code to FRU Index 189
Table 4. SPCN Error Codes for I/O Subsystems (continued)
Error Code Description Action/
Possible Failing FRU
1nn1 1B03 +3.3 V load fault 1. Replace power supply 1, U0.dd-V1.
2. If power supply 2 is present, use power
supply 1 that was removed to replace power
supply 2.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 1B04 -12 V load fault 1. Replace power supply 1, U0.dd-V1.
2. If power supply 2 is present, use power
supply 1 that was removed to replace power
supply 2.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 2600 Pgood fault I/O subsystem backplane, U0.dd-P1.
1nn1 2603 Power domain fault I/O subsystem backplane, U0.dd-P1.
1nn1 2605 Pgood 1.8 V fault I/O subsystem backplane, U0.dd-P1.
1nn1 2606 Pgood 2.5 V fault I/O subsystem backplane, U0.dd-P1.
1nn1 3100 I2C controller fault I/O subsystem backplane, U0.dd-P1.
1nn1 3101 I2C bus fault I/O subsystem backplane, U0.dd-P1.
1nn1 3102 I2C interface fault I/O subsystem backplane, U0.dd-P1.
1nn1 3103 I2C interface fault I/O subsystem backplane, U0.dd-P1.
1nn1 3104 I2C interface fault I/O subsystem backplane, U0.dd-P1.
1nn1 3110 SES module fault on SCSI backplane 1 Replace the SCSI backplane, location: U0.dd-P3
1nn1 3111 SES module fault on SCSI backplane 2 Replace the SCSI backplane, location: U0.dd-P4
1nn1 3113 PCI bridge chip fault I/O subsystem backplane, U0.dd-P1.
1nn1 3114 I2C fault I/O subsystem backplane, U0.dd-P1.
1nn1 3115 Operator panel fault I/O subsystem backplane, U0.dd-P1.
1nn1 3118 SPCN VPD fault I/O subsystem backplane, U0.dd-P1.
1nn1 3121 I2C VPD fault I/O subsystem backplane, U0.dd-P1.
1nn1 7610 Blower 1 fault Blower 1, U0.dd-F1.
1nn1 7611 Blower 1 missing Blower 1, U0.dd-F1.
1nn1 7620 Blower 2 fault Blower 2, U0.dd-F2.
1nn1 7621 Blower 2 missing Blower 2, U0.dd-F2.
1nn1 7630 Blower 3 fault Blower 3, U0.dd-F3.
1nn1 7631 Blower 3 missing Blower 3, U0.dd-F3.
1nn1 7640 Blower 4 fault Blower 4, U0.dd-F4.
1nn1 7641 Blower 4 missing Blower 4, U0.dd-F4.
1nn1 8400 Configuration ID conflict 1. Check the RIO and the SPCN cabling.
2. Set the I/O type using the service processor
menus.
3. I/O subsystem backplane, U0.dd-P1.
190 Service Guide
Table 4. SPCN Error Codes for I/O Subsystems (continued)
Error Code Description Action/
Possible Failing FRU
1nn1 840D I/O type mismatch 1. Check the RIO and the SPCN cabling.
2. Set the I/O type using the service processor
menus.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 840E I/O type mismatch 1. Check the RIO and the SPCN cabling.
2. Set the I/O type using the service processor
menus.
3. I/O subsystem backplane, U0.dd-P1.
1nn1 8A00 The power controlling the CEC changed Informational message
1nn1 C62E Polling error I/O subsystem backplane, U0.dd-P1.
Chapter 5. Error Code to FRU Index 191
Firmware Error Codes
Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP
1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
Table 5. Firmware Error Codes.
Error Code Description Action/
Possible Failing FRU
20A8 0000 Insufficient information to boot. Verify the IP address.
20A8 0001 The client IP address is already in use by some
other network device.
Change IP address.
20A8 0002 Cannot get gateway IP address. Refer to 150 using code E174.
20A8 0003 Cannot get system hardware address. Refer to 150 using code E174.
20A8 0004 Bootup failed. Refer to 150 using code E175.
20A8 0005 File transmission (TFTP) failed. Check network connection, try again.
20A8 0006 Boot image too large Contact your next level of support for
assistance.
20D0 0001 I/O subsystem has changed locations This is an informational message.
The I/O subsystem at the location code
associated with this message was found at
another location code during a prior boot of the
system.
If an I/O subsystem has not been moved, do the
following:
1. Boot the system and match the I/O
subsystem location codes reported by the
operating system to the subsystems
attached to the CEC. Check the SPCN and
RIO cabling to the I/O subsystems. If an
cabling problem is found, correct it.
2. If no cabling problem is found, call your next
level of support.
20D0 000F Self-test failed on a device, no error and/or
location code information available.
Refer to “Mapping AIX and Physical Location
Codes” on page 28 to see if a location code is
available for the device the error was reported
against.
1. Replace device indicated by the location
code (if available).
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108
20D0 0010 Self-test failed on a device, cannot locate
package
Contact your service support representative for
assistance.
20D0 0011 Firmware RTAS attempt to allocate memory
failed.
Contact your next level of support for
assistance.
20D0 0800 Firmware update failure Retry the flash update.
20D0 0801 System firmware update failure Retry the firmware update.
20D0 0997 Operating system terminated with error string Call service support.
192 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20D0 0998 Operating system terminated with error string Call service support.
20D0 0999 Operating system terminated with error string Call service support.
20D0 0A01 Error initializing RTAS NVRAM 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A02 Error initializing RTAS debugging token 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A03 Error initializing interface/service processor
access
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A04 Error initializing interface/operator panel access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A05 Error initializing hardware access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A06 Error initializing interface/PCI access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A07 Error initializing interface/operating system
access
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A08 Error initializing interface/client_interface access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A09 Error initializing interface/NUMA access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A0A Error initializing interface/interrupt access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
Chapter 5. Error Code to FRU Index 193
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20D0 0A0B Error initializing interface/OFDT access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A0C Error initializing interface/sensor access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A0D Error initializing interface/sensors/led_table
access
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A0E Error initializing interface/timebase access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A0F Error initializing interface/time-of-day access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A10 Error initializing interface/cache access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A11 Error initializing interface/power access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A12 Error initializing interface/hot-plug access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A13 Error initializing interface/log access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A14 Error initializing interface/EEH access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A15 Error initializing interface/error injection access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
194 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20D0 0A16 Error initializing interface/handling access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A17 Error initializing interface/utility access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 0A18 Error initializing register softpatch access 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D0 9001 Subsystem information not found in VPD 1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9002 Subsystem machine type and model information
not found in VPD
1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9003 Subsystem serial number information not found
in VPD
1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9004 Subsystem machine type and model field is
empty in VPD
1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9005 Subsystem serial number field is empty in VPD 1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9006 Duplicate subsystem serial number found in
VPD
1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9007 Invalid subsystem machine type and model
information in VPD
1. Power off, then power on the system.
2. If the error persists, call service support.
20D0 9008 Invalid subsystem serial number found in VPD 1. Power off, then power on the system.
2. If the error persists, call service support.
20D1 0001 Failed to send state-change message to service
processor
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0002 Failed to disable heartbeat alarm 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0003 Failed to send boot status 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
Chapter 5. Error Code to FRU Index 195
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20D1 0004 Failed to locate service processor device tree
node
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0005 Failed to reboot system 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0006 Failed to reboot a partition 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0007 Failed to set PCI read/write permissions array 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0008 Failed to set PCI read/write permissions array
for slot
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0009 Description: Missing or invalid subsystem serial number
Action:
1. Check for system firmware updates. Apply update if available.
2. If a location code is shown with this error code, replace the part at that location.
3. Contact your next level of support for assistance.
Note: The location code identifies the failing subsystem.
20D1 000A Failed to send boot failed message to service
processor
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 000B Failed to reset PCI read/write permissions array 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 000C Failed to send LMB-TABLE-INFO mailbox repair 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 000D Function to get time-of-day failed. 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
196 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20D1 000E Function to get time-of-day failed. 1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 000F Hypervisor call to reset PHB or to check for
drawer existence failed
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0010 Stack underflow from fiber channel adapter 1. Check for microcode updates for the fiber
channel adapter. Apply an update if
available.
2. Check for system firmware updates. Apply
an update if available.
3. Contact your next level of support for
assistance.
20D1 0011 PHB recovery was not successful. Open
firmware received failure in phb-recovery-buff
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
20D1 0012 Attention: One of the I/O subsystems has
been disconnected from the system.
This is indicates that an I/O subsystem that was
previously in the configuration was not found
during the current boot. If an I/O subsystem has
been configuration has not changed, do the
following:
1. If not already booted, boot the system and
match the I/O subsystem location codes
reported by the operating system to the
subsystems attached to the CEC. Check the
SPCN and RIO cabling to the I/O
subsystems. If an cabling problem is found,
correct it.
2. Check for system firmware updates. Apply
the update if available.
3. Contact your next level of support for
assistance.
20E0 0001 Privileged-access password entry error. The password has been entered incorrectly.
Retry installing the password.
20E0 0004 Battery drained or needs replacement 1. Replace battery.
Location: U0.1-P1-V3
2. If problem persists, replace the CEC
backplane
Location: U0.1-P1
20E0 0009 Invalid password entered - system locked The password has been entered incorrectly 3
times.
Turn off, then turn on the system unit, then enter
the password correctly.
Chapter 5. Error Code to FRU Index 197
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20EE 0003 IP parameter requires 3 dots ².² Enter valid IP parameter.
Example: 000.000.000.000
20EE 0004 Invalid IP parameter Enter valid IP parameter.
Example: 000.000.000.000
20EE 0005 Invalid IP parameter (>255) Enter valid IP parameter.
Example: 255.192.002.000
20EE 0007 Keyboard not found 1. Plug in keyboard.
2. Replace the CEC backplane.
Location: U0.1-P1
20EE 0008 No configurable adapters were found in the
system by the remote IPL menu in the SMS
utilities.
This warning occurs when the remote IPL menu
in the SMS utilities cannot locate any LAN
adapters that are supported by the remote IPL
function. If a supported device is installed:
1. Replace the adapter.
2. Go to “MAP 1542: I/O Problem Isolation” on
page 111.
20EE 0009 Unable to communicate with the service
processor.
Replace the CEC backplane
Location: U0.1-P1.
20EE 000B The system was not able to find an operating
system boot record on the devices listed in the
boot list.
This can be caused by the wrong devices being
in the boot list, a problem that prevents the
system from communicating with the device that
contains the boot record, or the boot record
being corrupted.
Go to “Boot Problems/Concerns” on page 173.
20EE 000C Description: Pointer to the operating system not found in non-volatile storage.
Action: Values normally found in non-volatile storage that point to the location of an operating
system were not found. This can happen for two reasons, either your installed operating system
does not support storing the values or some event occurred that caused the system to lose
non-volatile storage information (drainage or replacement of the battery). If you are running AIX,
this information can be reconstructed by running the bootlist command specifying the device that
the operating system is installed on. Refer to your AIX documentation for the exact syntax and
usage of the bootlist command.
To boot the operating system so that the above-mentioned values can be reconstructed, power off
the system and power it on again. This should cause the system to look for the operating system
in the devices contained in the custom boot list or in the default boot list, depending on the
condition of the system. If this is not successful, use the System Management Services Menus to
modify the boot sequence (also known as boot list) to include devices that are known to contain a
copy of the operating system. For example, select a hard disk known to have a copy of the
operating system as the first and only device in the boot sequence (boot list) and attempt to boot
again. If the problem persists, replace the CEC backplane.
Location: U0.1-P1.
20EE 000E Unable to execute the open firmware method. 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
198 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
20EE 000F Unable to find the package node. 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
20EE 0010 Unable to execute the root VPD method. 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
20EE 0011 Unable to execute the VPD method. 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
20EE 0012 Unable to execute the mem-dimm-VPD method 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
20EE 0013 Unable to allocate memory for VPD. 1. Reboot the system.
2. Reflash the system with the latest level of
system firmware.
3. Call service support.
20EE 0014 Unable to allocate memory for location codes. The system firmware may be corrupted.
20EE 0100 Fcode resident on an I/O adapter is not
supported on this system
Informational message. The adapter’s Fcode
driver is not supported on this system. Additional
information may be available from service
support.
This error code only affects boot-time operation,
not the adapter’s functionality under the
operating system.
20FC 0020 Failed to retrieve power, packaging, and/or
cooling VPD via SPCN for the corresponding
subsystem. Service processor mailbox failed to
get VPD.
1. Check the RIO cabling.
2. Check for system firmware updates.
3. Replace the part specified by the location
code.
20FC 0021 Failed to retrieve power, packaging, and/or
cooling VPD via SPCN for the corresponding
subsystem. Service processor mailbox failed to
get power, packaging, and cooling device VPD.
1. Check the RIO cabling.
2. Check for system firmware updates.
3. Replace the part specified by the location
code.
20FC 0030 Corrupted power, packaging, and cooling device
VPD format in the SPCN configuration table.
1. Check for system firmware updates.
2. Replace the part specified by the location
code.
20FF 0001 Problem with VPD 1. Replace the part indicated by the location
code.
2. If no location code is specified, call service
support.
Chapter 5. Error Code to FRU Index 199
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2100 0xxx SCSI controller fault 1. Before replacing any system components,
refer to the notes in error code 21A00001.
2. Replace the SCSI device.
2101 0010 The IDE device remained busy longer than
time-out period allows.
1. Check for system firmware updates.
2. Call service support
21A0 0001 Description: SCSI DASD - test unit ready failed - hardware error
Action:
Note:
1. Before replacing any system components:
a. Ensure that the controller and each device on the SCSI bus is assigned a unique SCSI ID.
b. Ensure SCSI bus is properly terminated.
c. Ensure SCSI signal and power cables are securely connected and not damaged.
2. The location code information is required to identify the ID of SCSI device failures as well as to
indicate the location of the controller to which the device is attached. Check the system error
logs to determine the location code information associated with the error code.
a. Replace the SCSI device.
b. Replace the SCSI cable.
c. Replace the SCSI controller.
21A0 0002 SCSI DASD - test unit ready failed - sense data
available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media (removable media
devices).
2. Replace the SCSI device.
21A0 0003 SCSI DASD - send diagnostic failed - sense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21A0 0004 SCSI DASD - send diagnostic failed - DevOfl
cmd
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21E0 0001 SCSI tape - test unit ready failed - hardware
error
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the SCSI device.
2. Replace the SCSI cable.
3. Replace the SCSI controller.
21E0 0002 SCSI tape - test unit ready failed - sense data
available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media.
2. Replace the SCSI device.
21E0 0003 SCSI tape - send diagnostic failed - sense data
available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21E0 0004 SCSI tape - send diagnostic failed - DevOfl cmd Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
200 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
21ED 0001 SCSI changer - test unit ready failed - hardware
error
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the SCSI device.
2. Replace the SCSI cable.
3. Replace the SCSI controller.
21ED 0002 SCSI changer - test unit ready failed - sense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media.
2. Replace the SCSI device.
21ED 0003 SCSI changer - send diagnostic failed - sense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21ED 0004 SCSI changer - send diagnostic failed - DevOfl
cmd
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21EE 0001 Undetermined SCSI device type - test unit ready
failed - hardware error
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the SCSI device.
2. Replace the SCSI cable.
3. If the missing SCSI devices are connected
to the same backplane, replace the SCSI
backplane.
4. Replace the SCSI controller.
21EE 0002 Undetermined SCSI device type - test unit ready
failed - sense data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media (removable media
devices).
2. Replace the SCSI device.
21EE 0003 Undetermined SCSI device type - send
diagnostic failed - sense data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21EE 0004 Undetermined SCSI device type - send
diagnostic failed - DevOfl cmd
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21F0 0001 SCSI CD-ROM - test unit ready failed -
hardware error
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the SCSI device.
2. Replace the SCSI cable.
3. Replace the SCSI controller.
21F0 0002 SCSI CD-ROM - test unit ready failed - sense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media.
2. Replace the SCSI device.
Chapter 5. Error Code to FRU Index 201
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
21F0 0003 SCSI CD-ROM - send diagnostic failed - sense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21F0 0004 SCSI CD-ROM - send diagnostic failed - DevOfl
cmd
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21F2 0001 SCSI read/write optical - test unit ready failed -
hardware error
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the SCSI device.
2. Replace the SCSI cable.
3. Replace the SCSI controller.
21F2 0002 SCSI read/write optical - test unit ready failed -
sense data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
1. Replace the media.
2. Replace the SCSI device.
21F2 0003 SCSI read/write optical - send diagnostic failedsense
data available
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
21F2 0004 SCSI read/write optical - send diagnostic failed-
DevOfl cmd
Before replacing any system components, refer
to the notes in error code 21A0 0001.
Replace the SCSI device.
2200 0001 PCI Ethernet BNC/RJ-45 or PCI Ethernet
AUI/RJ-45 adapter
internal wrap test failure
Replace the adapter.
See “Checkpoint and Error Code Index” on
page 181 for location code information related to
this error. Also see “Replacing the Network
Adapter” on page 180.
2200 1001 10/100 Mbps Ethernet PCI Adapter
internal wrap test failure
Replace the adapter.
See “Checkpoint and Error Code Index” on
page 181 for location code information related to
this error. Also see “Replacing the Network
Adapter” on page 180.
2200 1002 10/100 Mbps Ethernet PCI adapter failure Replace the adapter.
See “Checkpoint and Error Code Index” on
page 181 for location code information related to
this error. Also see “Replacing the Network
Adapter” on page 180.
2201 0001 PCI Auto LANstreamer token ring adapter failed
to complete hardware initialization.
Replace the adapter.
See “Checkpoint and Error Code Index” on
page 181 for location code information related to
this error. Also see “Replacing the Network
Adapter” on page 180.
2201 1001 PCI token ring adapter failed to complete
hardware initialization.
Replace the adapter.
See “Checkpoint and Error Code Index” on
page 181 for location code information related to
this error. Also see “Replacing the Network
Adapter” on page 180.
202 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
25A0 0001 Cache L2 controller failure 1. Replace the processor card.
Location: U0.1-P1-C1
2. Replace the CEC backplane assembly.
Location: U0.1-P1.
25A1 0001 Cache L2 SRAM failure 1. Replace the processor card.
Location: U0.1-P1-C1
25A8 0xxx Description: NVRAM problems
Action: Errors reported against NVRAM can be caused by low battery voltage and (more rarely)
power outages that occur during normal system usage. With the exception of the 25A80000 error,
these errors are warnings that the NVRAM data content had to be re-established and do not
require any FRU replacement unless the error is persistent. When one of these errors occurs, any
system customization (for example, boot device list) information has been lost, the system may
need to be reconfigured.
If the error is persistent, replace CEC backplane
Location: U0.1-P1
25A8 0000 Initialization failed, device test failed Refer to Action under error code 25A80xxx.
25A8 0100 NVRAM data validation check failed. Turn off, then turn on system unit and retry the
operation before replacing any system
component.
Refer to Action under error code 25A80xxx.
25A8 0201 Unable to expand target partition - saving
configuration variable.
Refer to Action under error code 25A80xxx.
25A8 0202 Unable to expand target partition - writing error
log entry.
Refer to Action under error code 25A80xxx.
25A8 0203 Unable to expand target partition - writing VPD
data.
Refer to Action under error code 25A80xxx.
25A8 0210 Setenv/$Setenv parameter error - name
contains a null character.
Refer to Action under error code 25A80xxx.
25A8 0211 Setenv/$Setenv parameter error - value contains
a null character.
Refer to Action under error code 25A80xxx.
25A8 0998 Description: NVRAMRC script evaluation error - command line execution error
Action: Execution of a command line within the nvram configuration variable “nvramrc” (script)
resulted in a “throw” being executed. This script can be modified by the system firmware SMS
utilities, the operating system, PCI adapter ROM code or utility, or an operator (via the open
firmware script editing command nvedit). It may not be possible to resolve the problem without a
detailed analysis of the nvram script, the current system configuration, and device tree contents.
1. The problem can be caused by a SCSI adapter, whose SCSI bus ID has been changed from
the default setting, no longer appearing in the system. This can be caused either by removing
a SCSI adapter, or a problem with a SCSI adapter.
a. Select the SCSI ID utility from the SMS menu(s).
1) Verify the list of SCSI controllers/adapters. If the list is not correct, suspect a problem
with the adapter(s) installed but not listed.
2) Select the option to Save the configuration information.
3) Restart the system.
b. If the problem persists, boot the operating system and verify the SCSI bus IDs of any
installed/available SCSI controllers (change as necessary), and restart the system.
2. Contact support support for further assistance.
Chapter 5. Error Code to FRU Index 203
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
25A8 0999 NVRAMRC script evaluation error - stack
unbalanced on completion
This is a firmware debug environment error.
There is no user action or FRU replacement for
this error.
25A8 0A00 Unable to retrieve NVRAM LPAR table
information
Refer to the actions column for error code 25A8
0xxx.
Memory Error Notes
Use the location code obtained from the SMS Error Log utility (described in “Error Log Utility Procedure” on
page 285) to identify which memory DIMM (or memory DIMM quad) the error is reported against.
Memory error codes are in the form 25Cyyxxx where yy indicates memory DIMM present detect-bits (PD-bits) and
xxx indicates the last 3 digits of the error code.
See “Memory DIMM Present Detect Bits (PD-Bits)” on page 285 for information on how to interpret PD bits.
25B0 0001 No memory DIMMs detected. Go to “MAP 1240: Memory Problem Resolution”
on page 68.
25B0 0004 Multiple memory DIMMs failed a memory test. Go to “MAP 1240: Memory Problem Resolution”
on page 68.
25Cy y001 DIMM is not supported Replace unsupported DIMM(s).
Note: Memory DIMMs must be replaced with a
supported type of memory DIMM. If an
unsupported type of memory DIMM is replaced
with the same unsupported type, the error
persists.
See the ²Memory Error Notes² on page 204 for
more information.
25Cy y002 Memory DIMM fails memory test. Go to “MAP 1240: Memory Problem Resolution”
on page 68.
2602 0001 Description: Invalid PCI adapter vendor ID
Action:
Attention: Before moving the adapter to another slot, verify with the customer or system
administrator that the slot that the adapter is being moved to is available to the partition in which
the adapter presently resides.
1. Move adapter to another slot (behind a different PCI bus).
2. Check for available firmware updates for adapter. Apply update if available.
3. Run AIX diagnostics on adapter. Replace if indicated.
4. Replace adapter.
5. Check for system firmware updates. Apply if update available.
6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is
in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was
originally installed, location: U0.dd-P1.
204 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2602 0002 Description: Invalid PCI adapter device ID
Action:
Attention: Before moving the adapter to another slot, verify with the customer or system
administrator that the slot that the adapter is being moved to is available to the partition in which
the adapter presently resides.
1. Move adapter to another slot (behind a different PCI bus).
2. Check for available firmware updates for adapter. Apply update if available.
3. Run AIX diagnostics on adapter. Replace if indicated.
4. Replace adapter.
5. Check for system firmware updates. Apply if update available.
6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is
in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was
originally installed, location: U0.dd-P1.
2602 0007 Description: PCI device I/O or memory space request could not be satisfied
Action:
1. If the location code identifies a slot:
a. Reseat the adapter and reboot the system.
b. Replace the adapter.
c. Check for system firmware update. Apply update if available.
2. If the location code identifies an I/O planar:
a. Check for system firmware updates. Apply if available.
b. Replace the I/O planar.
3. Call service support.
2602 0008 Description: PCI device Fcode evaluation error
Action:
1. If the location code identifies a slot:
a. Check for adapter firmware updates. Apply if available.
b. Replace the adapter.
c. Check for system firmware updates. Apply if available.
2. If the location code identifies an I/O planar:
a. Check for system firmware updates. Apply if available.
b. Replace the I/O planar.
3. Call service support.
Chapter 5. Error Code to FRU Index 205
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2602 0010 Description: PCI probe error, bridge in freeze state
Action: If the location code identifies a slot, do the following:
1. Check for system firmware updates. Apply if available.
2. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports
may require null modems or special cabling configurations to avoid connecting driver outputs
together. This may create a PCI slot power problem and force the adapter to be deconfigured.
3. Is the system running Linux?
YES Go to The System is Running Linux on page 206.
NO Check the PCI Adapter Placement Reference to see if the adapter is hot-pluggable. If
it is not hot-pluggable, go to The System is Running Linux on page 206 because the
system must be powered off to continue. If it is hot-pluggable, continue.
4. Use the hot plug service aid to reseat the card specified by the location code. See “Replacing
a Hot-Pluggable PCI Adapter” on page 388. If reseating the adapter fixes the problem, go to
MAP 410: ²Repair Checkout² in the Diagnostic Information for Multiple Bus Systems manual. If
the problem is not resolved, continue.
Attention: Before moving the adapter to another slot, verify with the customer or system
administrator that the slot that the adapter is being moved to is available to the partition in
which the adapter presently resides.
5. Is there an empty slot available into which the adapter could be plugged?
YES Use the hot plug task to move the adapter card to another slot behind a different PCI
bridge. The “System Logic Flow” on page 21 identifies the PCI bridges and the
associated slots. See “Hot-Pluggable PCI Adapter Removal” on page 385 when
removing the adapter, and “Installing a Hot-Pluggable PCI Adapter” on page 392 when
installing the card in the new slot. If the card is successfully reconfigured in the new
slot (behind a different PCI bridge), the slot in which the card was originally plugged is
bad. Replace the PCI riser card or I/O planar in which the card was installed, location:
U0.1-P2 or U0.dd-P1. If the adapter does not reconfigure successfully in the new slot,
replace the adapter.
NO Replace the adapter. If the problem is not resolved, replace the PCI riser card or I/O
planar in which the card was installed, location: U0.1-P2 or U0.dd-P1.
6. Go to “MAP 1542: I/O Problem Isolation” on page 111.
If the location code does not identify a slot or PCI adapter, or no location code is specified, go to
“MAP 1542: I/O Problem Isolation” on page 111.
The System is Running Linux
206 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
1. Power off the system. See “Non-Hot-Pluggable PCI Adapter Removal” on page 381 and “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 383 for instructions on reseating the card. If reseating the adapter fixes
the problem, go to ²MAP 410, Repair Checkout² in the Diagnostic Information for Multiple Bus Systems manual.
Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the
slot that the adapter is being moved to is in the partition in which the adapter presently resides.
2. Is there an empty slot available into which the adapter could be plugged?
YES Move the adapter to another slot behind a different PCI bridge. The “System Logic Flow” on page 21
identifies the PCI bridges and the associated slots. See “Non-Hot-Pluggable PCI Adapter Removal” on
page 381 when removing the adapter, and “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on
page 383 when installing the card in the new slot. If the card is successfully reconfigured in the new slot
(behind a different PCI bridge), the slot in which the card was originally plugged is bad. Replace the PCI
riser card or I/O planar in which the card was installed, location: U0.1-P2 or U0.dd-P1. If the adapter
does not reconfigure successfully in the new slot, replace the adapter.
NO Replace the adapter. If the problem is not resolved, replace the PCI riser card or I/O planar in which the
card was installed, location: U0.1-P2 or U0.dd-P1.
3. Go to “MAP 1542: I/O Problem Isolation” on page 111.
If the location code does not identify a slot or PCI adapter, or no location code is specified, go to “MAP 1542: I/O
Problem Isolation” on page 111.
2602 0011 Description: PCI probe error, bridge is not usable
Action: If the location code identifies a slot:
1. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports
may require null modems or special cabling configurations to avoid connecting driver outputs
together. This may create a PCI slot power problem and force the adapter to be deconfigured.
Attention: Before moving the adapter to another slot, verify with the customer or system
administrator that the slot that the adapter is being moved to is available to the partition in
which the adapter presently resides.
2. Move adapter to another slot (behind a different PCI bus).
3. Check for available firmware updates for adapter. Apply update if available.
4. Replace the adapter.
5. Check for system firmware updates. Apply update if available.
6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is
in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was
originally installed, location: U0.dd-P1.
Chapter 5. Error Code to FRU Index 207
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2602 0012 Description: PCI device run-time error, bridge in freeze state
Action: If the location code identifies a slot:
1. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports
may require null modems or special cabling configurations to avoid connecting driver outputs
together. This may create a PCI slot power problem and force the adapter to be deconfigured.
Attention: Before moving the adapter to another slot, verify with the customer or system
administrator that the slot that the adapter is being moved to is available to the partition in
which the adapter presently resides.
2. Move adapter to another slot (behind a different PCI bus).
3. Check for available firmware updates for adapter. Apply update if available.
4. Replace the adapter.
5. Check for system firmware updates. Apply if update available.
6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is
in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was
originally installed, location: U0.dd-P1.
2680 0Axy MX-PCI bridge BIST failure The problem is associated with bridge ²y² in I/O
drawer ²x.²
Replace the I/O backplane in I/O drawer ²x.²
Location U0.x-P1
See “Determining Location Code” on page 180
or location code information related to this error.
2680 0Cxx Machine check occurred 1. Replace the part identified by the location
code associated with this error code.
2. If the location code associated with this error
code identifies an adapter, and replacing the
adapter does not correct the problem,
replace the PCI riser card
Location: U0.1-P2 or U0.dd-P1
2680 0Dxx Machine check occurred; unable to isolate to a
single device
Go to “MAP 1540: Problem Isolation
Procedures” on page 108
27A0 0001 Error obtaining external RIO cable IDs 1. Check the SPCN and RIO cabling going to
the I/O subsystem specified by the location
code.
2. Check for system firmware updates.
3. Replace the I/O subsystem planar specified
by the location code.
208 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2803 0xxx Description: Real-time-clock (RTC) errors
Action:
1. Errors reported against the real-time clock can be caused by low battery voltage and (more
rarely) power outages that occur during normal system usage. These errors are warnings that
the real-time clock data content needs to be re-established and do not require any FRU
replacement unless the error is persistent. When one of these errors occurs, the time and date
information has been lost.
To set or restore the time and date, use the operating system facility.
2. If the error is persistent, replace the battery.
Location: U0.1-P1-V3
3. If the error is persistent after the battery is replaced, replace the CEC backplane.
Location: U0.1-P1
2803 0001 RTC not updating - RTC initialization required 1. Set time and date.
2. Refer to Action under error code 2803 0xxx.
2803 0002 Bad time/date values 1. Set time and date.
2. Refer to Action under error code 2803 0xxx.
2803 0003 RTC not updating - not correctable Replace CEC backplane
Location: U0.1-P1
2803 0004 RTC operating mode parameters (eg. data
mode) changed
1. Set time and date.
2. Refer to Action under error code 28030001.
2803 0005 RTC battery error 1. Replace the battery.
Note: Password, time, and date need to be
set.
2. Refer to Action under error code 2803 0001.
2803 0010 The real-time-clock cannot be protected in the
LPAR environment
1. Check for system firmware updates. Apply
update if available.
2. Contact your next level of support for
assistance.
2803 F003 Real-time-clock not updating - not correctable
(detected by hypervisor)
Replace the CEC backplane
Location: U0.1-P1
2900 0002 Keyboard/mouse controller failed self-test Replace CEC backplane
Location: U0.1-P1
29A0 0003 Keyboard not present/detected 1. Keyboard
2. Replace CEC backplane
Location: U0.1-P1
29A0 0004 Keyboard stuck key detected 1. Keyboard
2. Replace CEC backplane
Location: U0.1-P1
29B0 0004 Mouse not present/detected 1. Mouse
2. Replace CEC backplane
Location: U0.1-P1
2B20 0011 Processor is manually disabled Use the service processor menus to re-enable
the processor and reboot the system
Chapter 5. Error Code to FRU Index 209
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2B20 0022 Processor is deconfigured by the system The processor has previously failed BIST or
POST. Replace the processor card indicated by
the location code associated with the error code
in the error log.
2B20 0031 Processor is failing POST. Replace the processor card indicated by the
location code associated with the error code in
the error log.
2B20 0042 Unknown processor card Remove the unknown/unsupported processor
card.
2B20 8880 Unrecoverable internal hardware error Replace the FRU indicated in the service
processor error log entry. If the system has been
rebooted, run AIX diagnostics and look for an
operating system error log entry in which the
description says cache or cache controller error.
2B20 8881 Software error Call service support.
2B20 8882 Software error Call service support.
2B20 8883 Software error Call service support.
2B20 8884 Unrecoverable processor subsystem error Replace the FRU indicated in the service
processor error log entry. If the system has been
rebooted, run AIX diagnostics and look for an
operating system error log entry in which the
description says cache or cache controller error.
Replace the FRUs listed in this entry.
2B20 8885 Unrecoverable internal hardware error Run AIX diagnostics. Look for an operating
system error log entry in which the description
says I/O bus time out, access, or other error or
I/O bridge/device internal error. Replace the
FRUs listed in this entry.
2B20 8888 Operating system termination request received Informational message.
2B2x 00EE Description: An exception error has been reported on a global processor
Action: The type of interrupt that caused the exception is specified by the x as follows:
0: Unknown interrupt
1: System reset interrupt (SRI)
2. Machine check interrupt (MCI)
3. Data storage interrupt (DSI)
4. Instruction storage interrupt (ISI)
5. External interrupt (EXI)
6. Alignment interrupt (ALI)
7. Program interrupt (SRI)
8. Floating unavailable interrupt (FUI)
1. Check for system firmware updates. Apply if available.
2. Call service support.
2BA0 0000 Service processor POST failure Replace CEC backplane
Location: U0.1-P1
2BA0 0012 Service processor reports self test failure Replace CEC backplane
Location: U0.1-P1
210 Service Guide
Table 5. Firmware Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
2BA0 0013 Service processor reports bad NVRAM CRC Refer to the actions for error code 25A8 0xxx.
2BA0 0017 Service processor reports bad or low battery. 1. Replace the battery.
2. Refer to the actions for error code 25A8
0xxx.
2BA0 0021 Open firmware mailbox call to activate attention
LED failed
Call service support.
2BA0 0041 Service processor VPD is corrupted. Replace CEC backplane
Location: U0.1-P1
2BA0 0071 VPD data is corrupted for processor card 1. Replace the processor card
Location: U0.1-P1-C1
2BA0 0073 VPD data is corrupted for processor card 2. Replace the processor card
Location: U0.1-P1-C2
2BA0 0080 Service processor hardware reset request failure 1. Check for system firmware updates. Apply if
available.
2. Call service support.
Service Processor Error Codes
Attention: Follow the procedure defined in the “Checkpoint and Error Code Index” on page 181. If you
replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor
Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
Table 6. Service Processor Error Codes.
Error Code Description Action/
Possible Failing FRU
4064 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4064 0002 CEC backplane fault CEC backplane
Location: U0.1-P1
4064 0003 Scan interface basic assurance tests detected
processor fault
Processor card
Location: U0.1-P1-C1
4064 0004 Processor fault Processor card
Location: U0.1-P1-C2
4064 000E Processor capacity card Operator panel
Location: U0.1-L1
4064 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4064 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4064 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4064 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
Chapter 5. Error Code to FRU Index 211
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4064 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
4064 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4064 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4064 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4064 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4064 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4064 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
4064 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
4064 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4064 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4064 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4064 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
4064 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4064 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4064 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
4064 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4064 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 00D1 Informational message Informational message
4064 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4064 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
212 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4064 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
4064 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4064 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4064 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4064 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
4064 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
4064 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4064 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4064 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4064 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4064 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4064 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4064 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
Chapter 5. Error Code to FRU Index 213
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4064 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
40640 CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4064 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4064 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
4064 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
4064 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4064 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4064 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
4064 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4064 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
214 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4064 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4064 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4064 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4064 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
4065 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4065 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
4065 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
4065 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
4065 000E Processor capacity card Operator Panel
Location: U0.1-L1
4065 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4065 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4065 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4065 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
4065 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
4065 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4065 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4065 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4065 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4065 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4065 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
Chapter 5. Error Code to FRU Index 215
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4065 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
4065 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4065 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4065 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4065 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
4065 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4065 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4065 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
4065 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4065 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 00D1 Informational message Informational message
4065 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4065 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
4065 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
4065 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4065 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4065 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4065 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
216 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4065 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
4065 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4065 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4065 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4065 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4065 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4065 0C11 Proc 1 configuration error: VPD mismatch for
processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4065 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
4065 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4065 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4065 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4065 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
Chapter 5. Error Code to FRU Index 217
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4065 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
4065 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4065 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4065 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
4065 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4065 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4065 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4065 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4065 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
218 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4066 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4066 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
4066 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
4066 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
4066 000E Processor capacity card Operator Panel
Location: U0.1-L1
4066 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4066 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4066 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4066 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
4066 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
4066 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4066 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4066 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4066 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4066 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4066 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
4066 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
4066 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4066 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4066 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4066 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
Chapter 5. Error Code to FRU Index 219
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4066 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4066 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4066 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
4066 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4066 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 00D1 Informational message Informational message
4066 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4066 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
4066 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
4066 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4066 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4066 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4066 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
4066 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
4066 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
220 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4066 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4066 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4066 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4066 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4066 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4066 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
4066 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4066 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4066 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4066 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
4066 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
4066 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4066 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4066 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
Chapter 5. Error Code to FRU Index 221
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4066 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4066 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4066 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4066 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4066 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
4067 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4067 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
4067 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
4067 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
222 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4067 000E Processor capacity card Operator Panel
Location: U0.1-L1
4067 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4067 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4067 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4067 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
4067 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
4067 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4067 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4067 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4067 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4067 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4067 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
4067 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
4067 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4067 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4067 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4067 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
4067 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4067 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4067 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
Chapter 5. Error Code to FRU Index 223
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4067 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4067 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 00D1 Informational message Informational message
4067 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4067 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
4067 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
4067 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4067 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4067 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4067 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
4067 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
4067 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4067 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4067 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4067 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
224 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4067 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4067 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4067 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
4067 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4067 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4067 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4067 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
4067 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
4067 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4067 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4067 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
4067 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4067 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
Chapter 5. Error Code to FRU Index 225
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4067 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4067 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4067 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4067 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
4068 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4068 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
4068 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
4068 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
4068 000E Processor capacity card Operator Panel
Location: U0.1-L1
4068 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4068 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4068 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4068 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
4068 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
226 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4068 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4068 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4068 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4068 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4068 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4068 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
4068 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
4068 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4068 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4068 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4068 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
4068 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4068 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4068 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
4068 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4068 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 00D1 Informational message Informational message
4068 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4068 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
4068 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
Chapter 5. Error Code to FRU Index 227
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4068 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4068 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4068 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4068 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
4068 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
4068 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4068 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4068 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4068 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4068 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4068 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4068 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
4068 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
228 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4068 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4068 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4068 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
4068 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
4068 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4068 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4068 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
4068 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4068 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4068 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
Chapter 5. Error Code to FRU Index 229
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4068 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4068 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4068 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
4069 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
4069 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
4069 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
4069 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
4069 000E Processor capacity card Operator Panel
Location: U0.1-L1
4069 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
4069 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
4069 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
4069 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
4069 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
4069 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
4069 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
4069 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
4069 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
4069 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
4069 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
4069 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
230 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4069 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
4069 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
4069 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
4069 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
4069 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4069 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
4069 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
4069 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
4069 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 00D1 Informational message Informational message
4069 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
4069 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
4069 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
4069 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
4069 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4069 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
4069 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
4069 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
Chapter 5. Error Code to FRU Index 231
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4069 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4069 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4069 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
4069 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
4069 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4069 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
4069 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
4069 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4069 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
4069 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
4069 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
4069 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
232 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4069 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
4069 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
4069 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
4069 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
4069 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
4069 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
4069 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
4069 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406A 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406A 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 233
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406A 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406A 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
406A 000E Processor capacity card Operator Panel
Location: U0.1-L1
406A 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
406A 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406A 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406A 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406A 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
406A 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406A 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406A 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
406A 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406A 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406A 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406A 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406A 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406A 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406A 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
406A 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406A 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
234 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406A 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406A 00B3 MCM/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
406A 00B4 MCM/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406A 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 00D1 Informational message Informational message
406A 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406A 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406A 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
406A 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406A 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406A 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406A 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406A 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406A 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406A 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
Chapter 5. Error Code to FRU Index 235
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406A 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406A 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406A 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406A 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406A 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406A 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
406A 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406A 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
406A 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406A 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406A 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406A 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406A 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
406A 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406A 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
236 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406A 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406A 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406A 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
406A 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406B 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406B 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
406B 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406B 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
406B 000E Processor capacity card Operator Panel
Location: U0.1-L1
406B 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
Chapter 5. Error Code to FRU Index 237
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406B 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406B 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406B 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406B 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
406B 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406B 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406B 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
406B 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406B 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406B 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406B 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406B 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406B 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406B 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
406B 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406B 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406B 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406B 00B3 MCM/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
406B 00B4 MCM/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406B 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
238 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406B 00D1 Informational message Informational message
406B 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406B 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406B 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
406B 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406B 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406B 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406B 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406B 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406B 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406B 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406B 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406B 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406B 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406B 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
Chapter 5. Error Code to FRU Index 239
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406B 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406B 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
406B 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406B 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
406B 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406B 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406B 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406B 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406B 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
406B 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406B 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
240 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406B 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406B 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406B 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
406B 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406C 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406C 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
406C 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406C 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
406C 000E Processor capacity card Operator Panel
Location: U0.1-L1
406C 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
406C 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406C 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406C 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406C 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
406C 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406C 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406C 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
Chapter 5. Error Code to FRU Index 241
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406C 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406C 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406C 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406C 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406C 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406C 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406C 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
406C 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406C 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406C 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406C 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
406C 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406C 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 00D1 Informational message Informational message
406C 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406C 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406C 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
406C 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406C 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
242 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406C 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406C 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406C 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406C 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406C 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406C 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406C 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406C 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406C 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406C 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406C 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
406C 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406C 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
Chapter 5. Error Code to FRU Index 243
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406C 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406C 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406C 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406C 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406C 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
406C 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406C 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406C 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406C 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
244 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406C 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406D 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406D 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
406D 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406D 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
406D 000E Processor capacity card Operator Panel
Location: U0.1-L1
406D 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
406D 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406D 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406D 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406D 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
406D 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406D 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406D 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
406D 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406D 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406D 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406D 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406D 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406D 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406D 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
Chapter 5. Error Code to FRU Index 245
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406D 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406D 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406D 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406D 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
406D 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406D 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 00D1 Informational message Informational message
406D 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406D 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406D 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
406D 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406D 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406D 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406D 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406D 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406D 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
246 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406D 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406D 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406D 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406D 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406D 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406D 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406D 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
406D 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406D 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
406D 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406D 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406D 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406D 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406D 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
Chapter 5. Error Code to FRU Index 247
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406D 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406D 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406D 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406D 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
406D 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406E 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406E 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
406E 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406E 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
248 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406E 000E Processor capacity card Operator Panel
Location: U0.1-L1
406E 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
406E 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406E 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406E 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406E 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
406E 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406E 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406E 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
406E 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406E 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406E 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406E 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406E 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406E 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406E 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
406E 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406E 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406E 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406E 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
Chapter 5. Error Code to FRU Index 249
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406E 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406E 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 00D1 Informational message Informational message
406E 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406E 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406E 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
406E 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406E 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406E 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406E 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406E 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406E 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406E 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406E 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406E 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
250 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406E 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406E 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406E 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406E 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
406E 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406E 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
406E 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406E 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406E 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406E 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406E 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
406E 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406E 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
Chapter 5. Error Code to FRU Index 251
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406E 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406E 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406E 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
406E 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
406F 0001 Scan interface basic assurance tests detected a
CEC backplane fault
CEC backplane
Location: U0.1-P1
406F 0002 CEC backplane fault 1. Check JTAG cable connections
2. CEC backplane
Location: U0.1-P1
406F 0003 Scan interface basic assurance tests detected
processor fault
1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C1
406F 0004 Processor fault 1. Check JTAG cable connections.
2. Processor card
Location: U0.1-P1-C2
406F 000E Processor capacity card Operator Panel
Location: U0.1-L1
406F 0010 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M8
406F 0011 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M6
406F 0012 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M1
406F 0013 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M3
406F 0014 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M7
252 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406F 0015 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M5
406F 0016 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M2
406F 0017 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C1-M4
406F 0018 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M8
406F 0019 Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M6
406F 001A Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M1
406F 001B Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M3
406F 001C Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M7
406F 001D Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M5
406F 001E Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M2
406F 001F Memory DIMM fault Memory DIMM
Location: U0.1-P1-C2-M4
406F 00B0 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406F 00B1 Bus controller failure 1. Check for system firmware updates
2. PCI riser card
Location: U0.1-P2
406F 00B3 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 1
Location: U0.1-P1-C1
406F 00B4 Processor/VPD problem detected 1. Check for system firmware updates
2. Processor card 2
Location: U0.1-P1-C2
406F 00B7 Insufficient hardware resources were detected to
continue IPL of the system after processor
subsystem testing and initialization was
completed
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 00D1 Informational message Informational message
406F 0100 Memory extender 0 on processor card 1 Processor card,
Location: U0.1-P1-C1
406F 0101 Memory extender 2 on processor card 1 Processor card,
Location: U0.1-P1-C1
406F 0130 Memory extender 0 on processor card 2 Processor card,
Location: U0.1-P1-C2
Chapter 5. Error Code to FRU Index 253
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406F 0131 Memory extender 2 on processor card 2 Processor card,
Location: U0.1-P1-C2
406F 0A03 Reboot message on VPD channel 3 processor
card 1. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406F 0A04 Reboot message on VPD channel 4 processor
card 2. Reboot is necessary for run-time array
gard to take effect.
Reboot the system in slow mode. Run
diagnostics, then go to ²MAP 0235: System
Array Self-Repair Problem Resolution² in the
Diagnostic Information for Multiple Bus Systems.
406F 0B00 Problem with internal RIO interface (I/O slot 0,
I/O port 0)
CEC backplane, U0.1-P1
406F 0B01 Problem with internal RIO interface (I/O slot 0,
I/O port 1)
CEC backplane, U0.1-P1
406F 0B02 Problem with internal RIO interface (I/O slot 0,
I/O port 2)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406F 0B03 Problem with internal RIO interface (I/O slot 0,
I/O port 3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406F 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1.
2. Call service support.
406F 0BF0 Problem with RIO interface (I/O slot 0, I/O port
3)
1. Check for system firmware updates.
2. Check the cabling between the CEC and the
I/O subsystems.
3. CEC backplane, U0.1-P1
4. Call service support.
406F 0C10 Processor 0 configuration error: VPD mismatch
for processor card 0. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406F 0C11 Processor 1 configuration error: VPD mismatch
for processor card 1. (Service processor code
problem.)
1. Check for system firmware updates.
2. Call service support.
406F 0CA5 Invalid system configuration 1. Check that the processor cards and memory
DIMMs are properly installed and configured.
2. If there are any FRUs listed in the service
processor error log entry, replace them one
at a time.
3. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
406F 0CA6 No usable memory left in the system. The
system must be rebooted.
Go to “MAP 1240: Memory Problem Resolution”
on page 68.
254 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406F 0CA9 Incorrect/incompatible memory DIMMs are
installed.
Check that the memory DIMMs are properly
configured, and are the right type for the system.
406F 0CAA System memory exceeds the limit for this
system type and model.
Remove the memory DIMMs until the memory
does not exceed the limit for this system type
and model.
406F 0EA0 MOPS service processor code error. 1. Check for system firmware updates.
2. Call service support.
406F 0EA1 Operating system code on main processor 1. Check for operating system code updates.
2. If there are other symptoms, go (one
symptom at a time) to the “Quick Entry MAP”
on page 58 and follow the instructions.
3. Call service support.
406F 0EA2 PRD code on service processor error. 1. Check for system firmware updates.
2. Call service support.
406F 0EA4 The service processor detected a problem, but
was unable to isolate the source.
Contact next level of support.
406F 0EAF Service processor code and system type are
unknown
1. Check for system firmware updates.
2. Call service support.
406F 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on
page 110.
406F 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
406F 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
Chapter 5. Error Code to FRU Index 255
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
406F 0EE1 Service processor firmware: invalid MRU to FRU
mapping
1. Check for system firmware updates.
2. Call service support.
406F 0EE2 Service processor firmware fault 1. Check for system firmware updates.
2. Call service support.
406F 0EE5 Firmware error, no callout available 1. Check for system firmware updates.
2. Call service support.
4503 26F6 Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 26F7 Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 26F8 Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
256 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 26FA Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 26FB Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 26FD Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
Chapter 5. Error Code to FRU Index 257
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 26FE Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 26FF Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4503 301A Description: 128 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
258 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 301B Description: 1 GB DIMM fault
Action:This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 301C Description: 256 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 301D Description: 512 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
Chapter 5. Error Code to FRU Index 259
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 301E Description: 1 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 302E Description: 2 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 302F Description: 256 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
260 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 303B Description: 1 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 303C Description: 1 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 303D Description: 2 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
Chapter 5. Error Code to FRU Index 261
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 309A Description: 128 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 309B Description: 256 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 309C Description: 256 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
262 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 309D Description: 512 MB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 309E Description: 1 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4503 309F Description: 1 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
Chapter 5. Error Code to FRU Index 263
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4503 30AA Description: 2 GB DIMM fault
Action: This error code indicates that the memory repeat gard function has deconfigured a quad of
memory. Do the following:
1. Go to the memory configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured,
reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the
white power button and look again at the status of the DIMMs. If all of the processors are still
configured in the processor configuration/deconfiguration menu, the problem has been resolved.
If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the
problem has been resolved.
If any of the DIMMs have a status of deconfigured by the system, either before or after the boot
to the SMS menus, go to step 2.
2. Replace the quad of memory containing the DIMM specified by the location code. The entire
quad of memory must be replaced.
4504 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4504 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4504 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4504 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4504 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4504 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4504 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4504 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4504 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
264 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4504 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4504 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4505 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4505 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4505 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4505 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4505 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4505 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4505 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4505 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
Chapter 5. Error Code to FRU Index 265
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4505 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4505 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4506 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4506 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4506 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4506 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4506 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4506 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4506 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4506 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
266 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4506 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4506 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4507 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4507 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4507 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4507 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4507 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4507 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4507 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4507 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
Chapter 5. Error Code to FRU Index 267
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4507 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4507 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4508 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4508 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4508 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4508 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4508 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4508 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4508 26FF 1-way processor card fault Replace the processor card specified by the
location code.
268 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4508 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4508 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4509 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4509 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4509 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4509 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4509 26FD 1-way processor card fault Replace the processor card specified by the
location code.
Chapter 5. Error Code to FRU Index 269
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4509 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4509 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4509 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4509 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450A 26F7 2-way processor card fault Replace the processor card specified by the
location code.
450A 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450A 26FA 2-way processor card fault Replace the processor card specified by the
location code.
270 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450A 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450A 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450A 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450A 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450A 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450A 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450B 26F7 2-way processor card fault Replace the processor card specified by the
location code.
Chapter 5. Error Code to FRU Index 271
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450B 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450B 26FA 2-way processor card fault Replace the processor card specified by the
location code.
450B 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450B 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450B 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450B 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450B 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450B 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
272 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450C 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450C 26F7 2-way processor card fault Replace the processor card specified by the
location code.
450C 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450C 26FA 2-way processor card fault Replace the processor card specified by the
location code.
450C 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450C 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450C 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450C 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450C 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
Chapter 5. Error Code to FRU Index 273
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450C 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450C 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450D 26F7 2-way processor card fault Replace the processor card specified by the
location code.
450D 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450D 26FA 2-way processor card fault Replace the processor card specified by the
location code.
450D 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450D 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450D 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450D 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450D 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
274 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450D 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450D 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450E 26F7 2-way processor card fault Replace the processor card specified by the
location code.
450E 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450E 26FA 2-way processor card fault Replace the processor card specified by the
location code.
450E 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450E 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450E 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450E 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450E 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
Chapter 5. Error Code to FRU Index 275
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450E 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450E 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 26F6 1-way processor card fault Replace the processor card specified by the
location code.
450F 26F7 2-way processor card fault Replace the processor card specified by the
location code.
450F 26F8 1-way processor card fault Replace the processor card specified by the
location code.
450F 26FA 2-way processor card fault Replace the processor card specified by the
location code.
450F 26FB 2-way processor card fault Replace the processor card specified by the
location code.
450F 26FD 1-way processor card fault Replace the processor card specified by the
location code.
450F 26FE 2-way processor card fault Replace the processor card specified by the
location code.
450F 26FF 1-way processor card fault Replace the processor card specified by the
location code.
450F 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
276 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
450F 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
450F 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution”
on page 68.
4604 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4604 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4605 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4605 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4606 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4606 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4607 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4607 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4608 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4608 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4609 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
4609 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460A 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
460A 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460B 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 277
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
460B 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460C 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
460C 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460D 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
460D 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460E 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
460E 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
460F 287A CEC backplane Replace the CEC backplane,
Location: U0.1-P1
460F 28C4 CEC backplane fault Replace the CEC backplane,
Location: U0.1-P1
4B23 26F6 Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power
down the system using the white power button and look again at the status of the processors. If
all of the processors are still configured in the processor configuration/deconfiguration menu,
the problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B23 26F7 Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
278 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B23 26F8 Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B23 26FA Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B23 26FB Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
Chapter 5. Error Code to FRU Index 279
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B23 26FD Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B23 26FE Description: 2-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B23 26FF Description: 1-way processor card fault
Action: This error code indicates that the processor repeat gard function has deconfigured a
processor. Do the following:
1. Go to the processor configuration/deconfiguration menu, located under the system information
menu in the service processor menus.
Look at the status of the processors. If any of the processors have a status of manually
deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off
the system using the white power button and look again at the status of the processors. If all of
the processors are still configured in the processor configuration/deconfiguration menu, the
problem has been resolved.
If any of the processors have a status of deconfigured by the system, either before or after the
boot to the SMS menus, go to step 2.
2. Replace the processor card (on which the deconfigured processor resides) specified by the
location code.
4B24 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B24 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B24 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B24 26FA 2-way processor card fault Replace the processor card specified by the
location code.
280 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B24 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B24 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B24 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B24 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B25 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B25 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B25 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B25 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B25 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B25 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B25 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B25 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B26 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B26 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B26 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B26 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B26 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B26 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B26 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B26 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B27 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B27 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B27 26F8 1-way processor card fault Replace the processor card specified by the
location code.
Chapter 5. Error Code to FRU Index 281
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B27 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B27 26FB Description: 2-way processor card fault
Action:
1. If the system has one 2-way processor card or two 2-way processor cards, look at word 13 in
the service processor error log entry detail data. If word 13 is not 7xxxD023 or 7xxxD0E1,
replace the processor card specified by the location code. If word 13 is 7xxxD023 or 7xxxD0E1,
look for an earlier entry in the service processor error log of the form 4067xxxx, 4507xxxx,
4607xxxx, or 4B2726FB that has not already been corrected by a parts replacement. If you find
an earlier entry, replace the FRU identified by that error, not the latest occurrence of 4B2726FB.
2. If this does not resolve the problem, call service support.
4B27 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B27 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B27 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B28 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B28 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B28 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B28 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B28 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B28 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B28 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B28 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B29 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B29 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B29 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B29 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B29 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B29 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B29 26FE 2-way processor card fault Replace the processor card specified by the
location code.
282 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B29 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2A 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2A 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2A 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B2A 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B2A 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B2A 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2A 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2A 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2B 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2B 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2B 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B2B 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B2B 26FB Description: 2-way processor card fault
Action:
1. Replace the processor card at location U0.1-P1-C2.
2. If the problem is not resolved, remove the processor card at U0.1-P1-C1 and replace it with the
processor card that was removed in the previous step.
3. If the problem is still not resolved, replace the CEC backplane, at location U0.1-P1.
4B2B 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2B 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2B 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2C 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2C 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2C 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B2C 26FA 2-way processor card fault Replace the processor card specified by the
location code.
Chapter 5. Error Code to FRU Index 283
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B2C 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B2C 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2C 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2C 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2D 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2D 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2D 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B2D 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B2D 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B2D 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2D 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2D 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2E 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2E 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2E 26F8 1-way processor card fault Replace the processor card specified by the
location code.
4B2E 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B2E 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B2E 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2E 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2E 26FF 1-way processor card fault Replace the processor card specified by the
location code.
4B2F 26F6 1-way processor card fault Replace the processor card specified by the
location code.
4B2F 26F7 2-way processor card fault Replace the processor card specified by the
location code.
4B2F 26F8 1-way processor card fault Replace the processor card specified by the
location code.
284 Service Guide
Table 6. Service Processor Error Codes. (continued)
Error Code Description Action/
Possible Failing FRU
4B2F 26FA 2-way processor card fault Replace the processor card specified by the
location code.
4B2F 26FB 2-way processor card fault Replace the processor card specified by the
location code.
4B2F 26FD 1-way processor card fault Replace the processor card specified by the
location code.
4B2F 26FE 2-way processor card fault Replace the processor card specified by the
location code.
4B2F 26FF 1-way processor card fault Replace the processor card specified by the
location code.
Memory DIMM Present Detect Bits (PD-Bits)
The following table expands on the Memory Error Notes found on page 204, where yy is the PD value in
the table below and xxx are the last three digits of the error code. Use these values to identify the type of
memory that generated the error.
If you replace FRUs or perform an action, and the problem is still not corrected, go to “MAP 1548:
Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the Memory Error
Notes found on page 204.
PD value Size
28 512 MB
38 128 MB
68 256 MB
78 1024 MB
88 2048 MB
Note: Memory DIMMs must be installed in quads.
Error Log Utility Procedure
When the System Management Services menu appears, check the error log for any errors. Use the
location code obtained from the SMS error log utility to identify the component the error is reported
against. To obtain the location code from the error log utility do the following:
1. From the SMS main menu, choose View Error Log.
2. If an error is logged, check the time stamp.
3. If the error was logged during the current boot attempt, record it.
4. Look up the error in the Chapter 5, “Error Code to FRU Index”, on page 179 and perform the listed
action.
5. If no recent error is logged in the error log, go to “MAP 1548: Processor Subsystem Problem Isolation”
on page 123.
Chapter 5. Error Code to FRU Index 285
System Firmware Update Messages
Error Code Description Action/
Possible Failing FRU
A1FD 0000 Recovery mode entered Information message
A1FD 0001 Service processor flash recovery Insert Diskette #1
A1FD 0002 Service processor flash recovery Insert Diskette #2
A1FD 0003 Service processor flash recovery Insert Diskette #3
A1FD 0004 Service processor flash recovery Insert Diskette #4
Scan Dump Messages
Error Code Description Action/ Possible Failing FRU
A100 3000 Successful completion of scan dump Informational message; may or may
not be seen during a scan log dump.
A1FF 3000 Scan dump has been disabled Informational message; may or may
not be seen during a scan log dump.
Common Firmware Error Codes
Attention: Follow the procedure defined in “Checkpoint and Error Code Index” on page 181. If you
replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor
Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic
Information for Multiple Bus Systems.
Error Code Description Action/
Possible Failing FRU
B006 1403 Missing service processor resource 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B006 1404 Service processor firmware fault 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B006 1405 Service processor firmware fault 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B006 1406 Service processor program error 1. Update system firmware
2. Check for operating system updates
3. CEC backplane
Location: U0.1-P1
B006 1407 Service processor resource allocation problem 1. Update system firmware
2. Check for operating system updates
3. CEC backplane
Location: U0.1-P1
286 Service Guide
Error Code Description Action/
Possible Failing FRU
B006 1408 Service processor program error 1. Update system firmware
2. Check for operating system updates
3. CEC backplane
Location: U0.1-P1
B006 1409 Service processor program error 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B10F 1000 Service processor DRAM failure CEC backplane
Location: U0.1-P1
B10F 1002 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1005 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1007 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1008 Service processor failure CEC backplane
Location: U0.1-P1
B10F 100E Service processor boot flash corrupted CEC backplane
Location: U0.1-P1
B10F 100F Service processor base flash corrupted CEC backplane
Location: U0.1-P1
B10F 1300 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1370 Service processor flash update failure 1. Retry flash update
2. CEC backplane
Location: U0.1-P1
B10F 1380 Service processor program failure 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B10F 1381 Service processor flash error 1. Update system firmware
2. CEC backplane
Location: U0.1-P1
B10F 1384 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1387 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1400 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1401 Service processor failure CEC backplane
Location: U0.1-P1
B10F 1670 Service processor flash update failure 1. Retry flash update
2. CEC backplane
Location: U0.1-P1
B10F 1672 Service processor firmware update invalid Call service support
B10F 1675 Service processor failure CEC backplane
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 287
Error Code Description Action/
Possible Failing FRU
B10F 1681 Service processor firmware update invalid Call service support
B10F 1682 Service processor firmware update invalid Call service support
B10F 1683 Service processor firmware update invalid Call service support
B1xx 0013 A firmware update has failed because the
update image is invalid. The update was stopped
before any changes were made to the currently
installed firmware.
Install a valid firmware image for the specific
type and model.
B1xx 4600 Service processor failure CEC backplane
Location: U0.1-P1
B1xx 4601 System processor failure Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4602 Service processor firmware failure 1. Do a slow boot (see “Performing Slow Boot”
on page 179), then look at the service
processor error log. Look for the first new
error after the B1xx 4602. Go to the
“Checkpoint and Error Code Index” on
page 181 with the new error code and
perform the action indicated. If there are no
new error codes, go to “MAP 1540: Problem
Isolation Procedures” on page 108.
2. Call service support.
B1xx 4603 Service processor firmware corrupted 1. Reload system firmware
2. CEC backplane
Location: U0.1-P1
B1xx 4606 Firmware failure 1. Check for system firmware updates
2. Operator panel
Location: U0.1-L1
3. CEC backplane
Location: U0.1-P1
B1xx 4608 Service processor early termination 1. Reset the system by disconnecting and
reconnecting system power
2. Reload system firmware
3. CEC backplane
Location: U0.1-P1
B1xx 460A TOD clock has been reset 1. Set TOD clock
2. CEC backplane
Location: U0.1-P1
B1xx 460B TOD battery failure TOD battery
Location: U0.1-P1-V3
B1xx 4611 Service processor failure 1. Processor card
Location: U0.1-P1-C1
2. CEC backplane
Location: U0.1-P1
288 Service Guide
Error Code Description Action/
Possible Failing FRU
B1xx 4620 Modem configuration failure 1. Check modem configuration files
2. Check for system firmware update
3. Modem
4. CEC backplane
Location: U0.1-P1
B1xx 4621 Modem disconnect failure 1. Check modem configuration files
2. Check for system firmware update
3. Modem
4. CEC backplane
Location: U0.1-P1
B1xx 4622 Service processor failure 1. Operator panel
Location: U0.1-L1
2. Check for system firmware update
3. CEC backplane
Location: U0.1-P1
B1xx 4633 System power control network failure 1. Check for system firmware update
2. CEC backplane
Location: U0.1-P1
B1xx 4634 SPCN loop fault 1. Check SPCN cabling
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108.
B1xx 4643 I2C card/flex cable detection error Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4644 Processor VPD system fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4645 I2C (Non-processor) VPD System Fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4646 General VPD fault Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4647 Write TM field in system VPD Replace the operator panel, location: U0.1-L1.
Do not swap the VPD module from the old
operator panel onto the new operator panel;
leave the new, unprogrammed VPD module on
the new operator panel. Call service support for
instructions.
B1xx 4648 CPU Backplane VPD or Power Source Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4649 Power supply VPD failure CEC power supply
Location: U0.1-V1, U0.1-V2
B1xx 4650 A system processor failure has been detected by
the service processor. All system processors
have been reset in an attempt to recover
operations.
Information only. If the problem persists, check
the error logs for other fault indications.
If no other fault indications are found, go to
“MAP 1540: Problem Isolation Procedures” on
page 108.
B1xx 4651 CPU VPD fault Processor card
Location: U0.1-P1-C1, U0.1-P1-C2
Chapter 5. Error Code to FRU Index 289
Error Code Description Action/
Possible Failing FRU
B1xx 4660 Memory subsystem failure Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4661 Memory card VPD failure Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4662 Memory DIMM VPD failure Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
B1xx 4670 Processor drawer cooling 1. Check ambient temperature
2. Check air supply to fans
3. Check fans for proper installation and
operation
4. Check for any unsupported cards (that may
be too hot)
5. CEC backplane
Location: U0.1-P1
B1xx 4671 Critical temperature in CEC drawer 1. Check ambient temperature.
2. Check air supply to fans.
3. Check fans for proper installation and
operation.
4. Check for unsupported cards (which may be
too hot)
5. CEC planar, U0.1-P1
B1xx 4672 Critical temperature in I/O subsystem 1. Check ambient temperature.
2. Check air supply to fans.
3. Check fans for proper installation and
operation.
4. Check for unsupported cards (which may be
too hot)
5. I/O subsystem planar, U0.dd-P1
B1xx 4681 JTAG scan interface 1. JTAG cabling
2. CEC backplane
Location: U0.1-P1
3. Processor card
Location: U0.1-P1-C1
B1xx 4682 Problem with system VPD on operator panel. 1. Replace the operator panel, location:
U0.1-L1. Do not swap the old VPD module
onto the new operator panel. Call service
support for instructions on programming the
new VPD module.
2. Call service support.
290 Service Guide
Error Code Description Action/
Possible Failing FRU
B1xx 4690 Description: Operating system surveillance time out (AIX to service processor interface failure).
Action:
1. This error code indicates that the operating system terminated early (which usually implies an
operating system crash). This error code may appear in the service processor error log by itself.
However, in the AIX error log, another error points to the cause of the operating system crash.
Use the other error as the starting point for your service action.
If the system will not boot to AIX, power off and look at the service processor error log. Look for
error codes (with time stamps that correspond to the boot that was just attempted) that indicate
hardware failures. If these error codes are found, record them, find the error code listed in the
Chapter 5, “Error Code to FRU Index”, on page 179, and perform the indicated actions.
2. The other possibility is that the operating system was not found during a prior boot attempt. To
determine if this occurred, do the following:
a. Look at the AIX error log entry containing B1xx4690. This is a ²SCAN_ERROR_CHRP²
error with an identifier of BFEC0425. Look at the AIX error log entry containing B1xx4690.
This is a ²SCAN_ERROR_CHRP² error with an identifier of BFEC0425.
b. In the detail data, find the string ²B1xx4690² (If present, it will be at byte 60 of the detail
data.)
c. Go forward 8 bytes after the ²B1² to byte 68, and look at bytes 68 and 69. If the values of
bytes 68 and 69 are A2B0, this indicates that the firmware was unable to find a bootable
device in the boot list that is set in the SMS menus. If the system is up, the boot list
problem has been corrected and the B1xx 4690 can be treated as an informational
message with no actions required.
3. Call service support.
B1xx 4691 Description: System firmware to service processor interface failure. (System firmware surveillance
time out)
Action:
1. Check for system firmware updates.
2. Go to the service processor main menu and select System Information Menu. Then select
Read Progress Indicators From Last System Boot. Begin your repair action with the error
code or checkpoint immediately preceding B1xx 4691: If a location code displays with the error
code or checkpoint, replace the part at that location.
Note: If the location code is U0.1-P1/E1 or U0.1-P1/E2, change the PCI riser card (U0.1-P2)
first, then change the CEC backplane (U0.1-P1). If no location code is specified, and you have
an 8-character error code, go to the “Checkpoint and Error Code Index” on page 181. If no
location code is specified, and you have a 4-character checkpoint (for example, Exxx), go to
“Firmware Checkpoints” on page 150.
3. If your checkpoint is 91FF, perform the following:
a. Replace the RIO cable attached to RIO-G port 0 (U0.1-P1/Qx) of the base system.
b. Replace the RIO adapter card of the I/O subsystem attached to the other end of the RIO
cable attached to RIO-G port 0 (U0.1-P1/Qx) of the base system.
c. Replace the CEC backplane, location U0.1-P1.
4. If the problem is not resolved, call service support.
B1xx 4692 Operator panel task interface failure 1. Check for system firmware updates.
2. Call service support.
B1xx 4693 Service processor firmware error 1. Check for system firmware updates.
2. Call service support.
B1xx 4694 System processor firmware failure 1. Check for system firmware updates.
2. CEC backplane
Location: U0.1-P1
Chapter 5. Error Code to FRU Index 291
Error Code Description Action/
Possible Failing FRU
B1xx 4695 System power control network to service
processor interface failure
1. Check for system firmware updates.
2. CEC backplane
Location: U0.1-P1
B1xx 4696 System firmware interface failure 1. Check for system firmware updates.
2. Call service support.
B1xx 4698 Firmware problem 1. Check for system firmware updates.
2. Call service support.
B1xx 4699 Service processor firmware:
This is usually an indication of a problem in the communication path between the HMC and the
service processor. It may only be an informational message.
If the managed system is down, go to the service processor error log and find the error log entry
containing B1xx 4699. Look at the first two bytes of word 13 of the detailed entry information.
If the managed system is running, look at the AIX error log entry containing B1xx 4699. This is a
²SCAN_ERROR_CHRP² error with an identifier of BFE4C025. In the detail data, find the string
²B1xx 4699². (If present, it will be at byte 60 of the detail data.) Go forward 8 bytes after the ²B1²
to byte 68 and look at bytes 68 and 69.
Perform the following actions based on the following values of bytes 68 and 69 from the AIX error
log entry, or on the first two bytes of word 13 from the service processor error log entry:
2306 No processor card is detected in slot one (U0.1-P1-C1); a processor card is required in
the first slot for the system to boot.
Actions:
1. If a processor card is not plugged into slot one (U0.1-P1-C1), plug one in.
2. If a processor card is plugged into slot one (U0.1-P1-C1), reseat it. If reseating the
processor card does not fix the problem, replace it.
A205: Machine type and model fields are not valid in the VPD module. Obtain an operator panel.
Do not swap the old VPD module onto the new operator panel. Call service support for
instructions on how to write the machine type and model into the new VPD module.
A20B: Error requesting trace buffer for service processor.
Actions:
1. Reset the service processor, if possible.
2. Check for system firmware updates. Apply the updates if they are available.
A218: Unknown return code detected.
Actions:
Check for system firmware updates. Apply the updates if they are available.
A21A: Error allocating an internal service processor memory space.
Actions:
1. Reset the service processor, if possible.
2. Check for system firmware updates. Apply the updates if they are available.
A800: HMC/service processor initialization failure.
Actions:
1. Check for system firmware updates.
2. Replace the service processor, location: U0.1-P1.
A801: HMC wrap failure.
Actions:
Replace the service processor, location: U0.1-P1.
292 Service Guide
Error Code Description Action/
Possible Failing FRU
B1xx 4699
(continued)
(Continued from the previous page.)
A806: Loss of the surveillance heartbeat between the HMC and the service processor.
Actions:
1. Make sure that the HMC is booted and operational.
2. Check the serial cables that go from the HMC to the service processor, location:
U0.1-P1.
If there are no other error codes or indications of a problem, the A806 (loss of surveillance
heartbeat) was a temporary condition and has been resolved; the B1xx 4699 code is then
an informational message only.
If the problem persists:
1. Check the serial cables connecting the HMC to the CEC backplane, location: U0.1-P1.
2. Run diagnostics on the serial port on the HMC.
3. Run diagnostics on the serial ports on the service processor.
Values of A009 and A719 of bytes 68 and 69 in the AIX error log entry or the first two bytes of word
13 in the service processor error log entry are also informational entries:
A009: The system received a power-off request at run time from the HMC.
A719: Primary power failed; the system switched to battery backup power.
For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following:
1. Check for system firmware updates.
2. Reset the service processor by activating the pinhole reset switch on the operator panel.
3. Call service support.
B1xx 469A System firmware interface 1. Check for system firmware updates.
2. Call service support.
B1xx 469B Firmware to I/O interface 1. Check for system firmware updates.
2. CEC backplane
Location: U0.1-P1
B1xx 469C Firmware to I/O interface 1. Check for system firmware updates.
2. CEC backplane
Location: U0.1-P1
B1xx 469E Service processor firmware failure 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108, CEC minimum
configuration
3. CEC backplane
Location: U0.1-P1
B1xx 469F Service processor failure 1. Check for system firmware updates.
2. Go to “MAP 1540: Problem Isolation
Procedures” on page 108, CEC minimum
configuration
3. CEC backplane
Location: U0.1-P1
B1xx 8FF0 Informational message No action required.
B1FD 0013 A firmware update has failed because the
update image is invalid. The update was stopped
before any changes were made to the currently
installed firmware.
Reset the service processor (by activating the
pinhole reset switch or by disconnecting, then
reconnecting power). Install a valid firmware
update image for the specific system type and
model.
Chapter 5. Error Code to FRU Index 293
Error Code Description Action/
Possible Failing FRU
B1FD 0015 Too many firmware update diskettes inserted (or
wrong or faulty update image)
1. Ensure the firmware update image is correct
for the specific system type and model.
2. Replace the update diskettes with a new set
of firmware update diskettes.
B1FD 0016 Diskette read error 1. The firmware diskette may be faulty. Try
another diskette.
2. Replace the diskette drive.
3. Replace the CEC backplane
Location: U0.1-P1
B1FD 001A Service processor recovery mode update fault Reset service processor (by activating pinhole
reset switch or by disconnecting, then
reconnecting power)
B1FD 001E Service processor recovery mode update fault Wrong firmware update diskette (Insert correct
diskette)
B1FD 001F Bad firmware update diskette Replace diskette
Scan Log Dump Progress Codes
A scan dump is the collection of chip data that the service processor gathers after a system malfunction,
such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and
SCOM contents. For more information on scan dumps, refer to “Scan Log Dump Policy” on page 312.
The following table shows the error code produced by a scan dump.
Error Code Description Action/Possible Failing FRU
A1FF 3000 Scan log dump request rejected This is an informational message. The service processor
firmware was not able to perform a scan log dump even
though one was requested.
D130 xxxx Scan log dump in progress Informational message only. As the dump progresses,
the xxx characters in the display changes. When
completed, the system may reboot, depending on the
reboot policy settings.
294 Service Guide
Problem Determination Generated Error Codes
Table 7. Problem Determination Generated Error Codes
Error Code Description Action/
Possible Failing FRU
M0BT 0000 The system hung during speaker POST. This
error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
M0BT 0001 The system hung during “Starting Software...”.
This error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
M0KB D000 The system hung during keyboard POST. This
error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
1. Keyboard
2. CEC backplane
Location: U0.1-P1
M0KB D001 The system did not respond to a keyboard entry.
This error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Type 101 keyboard
1. Keyboard
2. CEC backplane
Location: U0.1-P1
M0KB D002 The system did not respond to a keyboard entry.
This error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Type 102 keyboard
1. Keyboard
2. CEC backplane
Location: U0.1-P1
M0KB D003 The system did not respond to a keyboard entry.
This error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
1. Keyboard
2. CEC backplane
Location: U0.1-P1
M0ME M002 The system hung during memory POST. This
error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
M0NE T000 The system hung during network POST. This
error code is generated by the customer
performing “Problem Determination” in the
286/C4 and 286/E4 User’s Guide.
Go to “MAP 1540: Problem Isolation Procedures”
on page 108.
M0PS 0000 Power failure. This error code is generated by
the customer performing “Problem
Determination” in the 286/C4 and 286/E4 User’s
Guide.
Go to “MAP 1520: Power” on page 89.
M0SC SI00 Unable to load diagnostics. This error code is
generated by the customer performing “Problem
Determination” in the 286/C4 and 286/E4 User’s
Guide.
Go to “Boot Problems/Concerns” on page 173.
M0SC SI01 Unable to load diagnostics. This error code is
generated by the customer performing “Problem
Determination” in the 286/C4 and 286/E4 User’s
Guide.
Go to “Boot Problems/Concerns” on page 173.
Chapter 5. Error Code to FRU Index 295
296 Service Guide
Chapter 6. Using the Online and Standalone Diagnostics
The AIX diagnostics consist of online diagnostics and standalone diagnostics.
Attention: To run online diagnostics on a partition, the AIX operating system must be installed on that
partition. If the AIX operating system is not installed, use the standalone diagnostic procedures.
Online diagnostics, when they are installed, reside with AIX in the file system. They can be booted:
v In single user mode (referred to as service mode)
v To run in maintenance mode (referred to as maintenance mode)
v To run concurrently with other applications (referred to as concurrent mode)
Standalone diagnostics must be booted before they can be run. If booted, the diagnostics have no access
to the AIX error log or the AIX configuration data.
Does the system have AIX diagnostics preinstalled?
Yes If there is an HMC attached to the system, go to “Operating Considerations for Online and
Standalone Diagnostics”.
If an HMC is not attached to the system, go to “Running the Online Diagnostics with No HMC
Attached” on page 304.
No If there is an HMC attached to the system, go to “Standalone Diagnostics Operations” on
page 301.
If an HMC is not attached to the system, go to “Running the Standalone Diagnostics with No HMC
Attached” on page 302.
If standalone diagnostics will be run from a NIM server, go to “Running Standalone Diagnostics
from a Network Installation Management (NIM) Server with an HMC Attached to the System” on
page 302. If an HMC is not attached to the system, follow the directions for a full system partition.
Operating Considerations for Online and Standalone Diagnostics
Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional
functions compared to standalone diagnostics. Running online diagnostics in service mode ensures
that the error state of the system that has been captured in NVRAM is available for your use in
analyzing the problem. The AIX error log and certain SMIT functions are available only when
diagnostics are run from the disk drive.
Consider the following items before using the diagnostics:
v Standalone diagnostics can run on systems configured for either a full (or single) system partition or a
multiple partitioned system. When running in a multiple partitioned system, the device from which you
are booting standalone diagnostics must be made available to the partition dedicated to running
standalone diagnostics. This might require moving the device from the partition that currently contains
the boot device (for example, the CD-ROM or network adapter connected to the NIM server that has a
standalone diagnostic image) to the partition used to run standalone diagnostics. Unless the system
supports dynamic logical partitioning, reboot both partitions if you move a device. For more information,
see “Standalone Diagnostics Operations” on page 301.
v When diagnostics are installed, the device support for some devices might not get installed. If this is the
case, that device does not display in the diagnostic test list when running disk-based diagnostics.
v When running diagnostics in a partitioned system, diagnostics will work only with the resources that
were assigned to that partition. You must run diagnostics in the partition containing the resource that
you want to test.
297
Identifying the Terminal Type to the Diagnostics
When you run diagnostics, you must identify which type of terminal you are using. If the terminal type is
not known when the FUNCTION SELECTION menu is displayed, the diagnostics do not allow you to
continue until a terminal is selected from the DEFINE TERMINAL option menu.
Undefined Terminal Types
If you specify an undefined terminal type from the DEFINE TERMINAL option menu, the menu prompts the
user to enter a valid terminal type. The menu redisplays until either a valid type is entered or you exit the
DEFINE TERMINAL option. For a system attached to an HMC use VT320 emulation. If the AIX operating
system is being used, from the command line, type: export term=vt320
Resetting the Terminal
If you enter a terminal type that is valid (according to the DEFINE TERMINAL option menu) but is not the
correct type for the Hardware Management Console (HMC) virtual terminal window being used, you may
be unable to read the screen, use the function keys, or use the Enter key. To pass these difficulties, press
Ctrl-C to reset the terminal. The screen display that results from this reset depends on the mode in which
the system is being run:
v Online Normal or Maintenance Mode - The command prompt displays.
v Standalone Mode or Online Service Mode - The terminal type is reset to dumb, the Diagnostic Operating
Instruction screen displays, and you are required to go through the DEFINE TERMINAL process again.
Online Diagnostics Modes of Operation
Notes:
1. When running online diagnostics on a partition in a partitioned system, diagnostics can be run only on
resources that are allocated to that partition.
2. Online diagnostics can only be run in a partition with AIX installed.
The online diagnostics can be run in the following modes:
v Service Mode
v Concurrent Mode
v Maintenance Mode
Service Mode
Service mode provides the most complete checkout of the system resources. This mode also requires that
no other programs be running on the partition or on a full system partition. All partition on a full system
partition resources, except the SCSI adapter and the disk drives used for paging, can be tested. However,
note that the memory and processor are only tested during POST, and the results of the POST tests are
reported by diagnostics.
Error-log analysis is done in service mode when you select Problem Determination on the DIAGNOSTIC
MODE SELECTION menu.
Running Online Diagnostics with an HMC Attached to the System
Consider the following when you run the online diagnostics from a server or a disk:
v The diagnostics cannot be loaded and run from a disk until the AIX operating system has been installed
and configured.
v The diagnostics cannot be loaded on a system (client) from a server if that system is not set up to boot
from a server over a network. When the system is set up to boot from a server, the diagnostics are run
in the same manner as they are from disk.
298 Service Guide
v On a full system partition, if the diagnostics are loaded from disk or a server, you must shut down the
AIX operating system before turning off the system unit to prevent possible damage to disk data. Do
this in either of the following ways:
– If the diagnostics were loaded in standalone mode, press the F3 key until DIAGNOSTIC OPERATING
INSTRUCTIONS displays. Then press the F3 key once again to shut down the AIX operating system.
– If the diagnostics were loaded in maintenance or concurrent mode, type the shutdown -F command.
v Under some conditions, the system might stop, with instructions displayed on attached displays and
terminals. Follow the instructions to select a console display.
Running Online Diagnostics in Service Mode with an HMC Attached to the System
To run the online diagnostics in service mode from the boot hard disk, do the following:
1. From the HMC, select Partition Manager.
2. Right-click on the mouse and select Open Terminal Window.
3. From the Service Processor Menu on the virtual terminal, select Option 2 System Power Control.
4. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot
automatically enables slow boot.
5. Select Option 98 to exit the system power control menu.
6. Use the HMC to power on the managed system in a full system partition by selecting the managed
system in the Contents area.
7. Right-click or select the desired system in the Contents area. Next, on the menu, choose Selected.
8. Select Power On.
9. Select the Power on Diagnostic Stored Boot list option. For more information about full system
partitions, see “Full System Management Tasks” in the Hardware Management Console Maintenance
Guide, order number SA38-0590.
10. Make sure that there are no media in the devices in the media subsystem.
11. Enter any passwords, if requested.
Concurrent Mode
Use concurrent mode to run online diagnostics on some of the system resources while the system is
running normal activity.
Because the system is running in normal operation, the following resources cannot be tested in concurrent
mode:
v SCSI adapters connected to paging devices
v Disk drive used for paging
v Some display adapters and graphics related devices
v Memory (tested during POST)
v Processor (tested during POST)
The following levels of testing exist in concurrent mode:
v The share-test level tests a resource while the resource is being shared by programs running in the
normal operation. This testing is mostly limited to normal commands that test for the presence of a
device or adapter.
v The sub-test level tests a portion of a resource while the remaining part of the resource is being used
in normal operation. For example, this test could test one port of a multiport device while the other ports
are being used in normal operation.
v The full-test level requires the device not be assigned to or used by any other operation. This level of
testing on a disk drive might require the use of the varyoff command. The diagnostics display menus to
allow you to vary off the needed resource.
Chapter 6. Using the Online and Standalone Diagnostics 299
Error-log analysis is done in concurrent mode when you select Problem Determination on the
DIAGNOSTIC MODE SELECTION menu.
To run the online diagnostics in concurrent mode, you must be logged in to the AIX operating system and
have proper authority to issue the commands (if help is needed, see the system operator).
The diag command loads the diagnostic controller and displays the online diagnostic menus.
Running the Online Diagnostics in Concurrent Mode
To run online diagnostics in concurrent mode, do the following:
1. Log in to the AIX operating system as root user or use CE Login.
2. Enter the diag command.
3. When the DIAGNOSTIC OPERATING INSTRUCTIONS display, follow the instructions to check out the
desired resources.
4. When testing is complete, press the F3 key to return to the DIAGNOSTIC OPERATING
INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. Be sure to
vary on any resource that you had varied to off.
5. Press Ctrl-D to log off from root user or CE Login.
Maintenance Mode
Maintenance mode runs the online diagnostics using the customer’s version of the AIX operating system.
This mode requires that all activity on the partition running the AIX operating system be stopped so that
the online diagnostics have most of the resources available to check. All of the system resources, except
the SCSI adapters, memory, processor, and the disk drive used for paging, can be checked.
Error log analysis is done in maintenance mode when you select Problem Determination on the
DIAGNOSTIC MODE SELECTION menu.
Use the shutdown -m command to stop all activity on the AIX operating system and put the AIX operating
system into maintenance mode. The diag command is then used to invoke the diagnostic controller so you
can run the diagnostics. After the diagnostic controller is loaded, follow the normal diagnostic instructions.
Running the Online Diagnostics in Maintenance Mode
To run the online diagnostics in maintenance mode, do the following:
1. Stop all programs running on the partition except the AIX operating system (if help is needed, see the
system operator).
2. Log in to the AIX operating system as root user or use CE Login.
3. Type the shutdown -m command.
4. When a message indicates the system is in maintenance mode, enter the diag command.
Note: It might be necessary to set TERM type again.
5. When DIAGNOSTIC OPERATING INSTRUCTIONS screen displays, follow the displayed instructions
to check out the desired resources.
6. When testing is complete, press the F3 key to return to DIAGNOSTIC OPERATING INSTRUCTIONS.
Press the F3 key again to return to the AIX operating system prompt.
7. Press Ctrl-D to log off from root user or CE Login.
300 Service Guide
Standalone Diagnostics Operations
Use standalone diagnostics to test the system when the online diagnostics are not installed and as a
method of testing the disk drives that cannot be tested by the online diagnostics.
Note: No error log analysis is done by the standalone diagnostics. The CD-ROM drive and the controller
that controls it cannot be tested by the standalone diagnostics. Standalone diagnostics:
v Are resident on CD-ROM or a Network Installation Management (NIM) server
v Provide a method to test the system when the online diagnostics are not installed or cannot be
loaded from the disk drive
v Allow testing of the disk drives and other resources that cannot be tested by the online
diagnostics
v Do not have access to the AIX configuration data
v Do not have access to the AIX error log
v Do not allow for running of error log analysis
Performing Slow Boot
To fully analyze all of the available information, perform the following steps before doing a hardware repair
or replacement:
1. Record the 8-character error code (and location code if present) in the operator panel or that was
reported by the customer.
2. Do a slow-mode boot in service mode. This boot can be specified using the System Power Control
Menu on the service processor main menu. (A fast-mode boot skips much of the built-in diagnostic
testing.) A slow-mode boot may yield a new 8-character error code on the operator panel and new
errors in the service processor error log. If a new error code is reported, use this code to continue
problem analysis.
Considerations for Running Standalone Diagnostics on a Partitioned
Systems
To run standalone diagnostics on a full system partition, you must reboot the entire system. However, for a
partition in a partitioned system, you can boot standalone diagnostics either in a given partition or on the
entire system (which is the same procedure as a full system partition). When running in a multiple
partitioned system, the device from which you are booting standalone diagnostics must be made available
to the partition dedicated to running standalone diagnostics. This might require moving the device from the
partition that currently contains the boot device (for example, the CD-ROM or network adapter connected
to the NIM server that has a standalone diagnostic image) to the partition used to run standalone
diagnostics. Reboot both partitions if you move a device. Devices on a partitioned system are moved on
an I/O-slot basis.
The CD-ROM is attached to an integrated adapter in the base system. All integrated devices must be
moved together from partition to partition.
Considerations for Running Standalone Diagnostics from CD-ROM
Consider the following when you run standalone diagnostics:
v The diagnostic CD must remain in the CD-ROM drive for the entire time that diagnostics are running.
v The diagnostic CD cannot be ejected from the CD-ROM drive after the diagnostics have loaded. The
CD can only be ejected after the system has been turned off and then turned on (standalone mode) or
after the diagnostics program has terminated (online concurrent mode).
v The CD-ROM drive from which diagnostics were loaded cannot be tested.
v The adapter (or circuitry) controlling the CD-ROM drive from which diagnostics were loaded cannot be
tested.
Chapter 6. Using the Online and Standalone Diagnostics 301
Loading the Standalone Diagnostics from CD-ROM Using an HMC
To run standalone diagnostics in service mode from CD-ROM, do the following:
1. Stop all programs, including the operating system (get help if needed).
2. Remove all tapes, diskettes, and CD-ROMs.
3. Power off the system. For more information about full system partitions, see “Full System Management
Tasks” in the Hardware Management Console Maintenance Guide, order number SA38-0590.
4. Right-click the mouse and select Open Terminal Window.
5. From the service processor menu on the virtual terminal, select option 2, System Power Control
Menu.
6. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot
automatically enables slow boot.
7. Select option 98 to exit the system power control menu.
8. Use the HMC to power on the managed server in a full system partition. Select the Power on
Diagnostic Default Boot List. For more information about full system partitions, see “Full System
Management Tasks” in the Hardware Management Console Maintenance Guide, order number
SA38-0590.
9. Insert the CD-ROM into the system’s CD-ROM drive (not into the HMC’s CD-ROM drive).
Running the Standalone Diagnostics with No HMC Attached
To load standalone diagnostics from the default boot list, perform the following procedure:
1. Verify with the system administrator and users that all programs may be stopped, then do so.
2. Turn off the system.
Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h
command.
3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the
system.
4. Immediately insert the diagnostic CD-ROM into the CD-ROM drive.
5. When or after the word keyboard appears during startup, press the numeric 5 key on the keyboard.
6. Enter any requested passwords.
Running Standalone Diagnostics from a Network Installation Management (NIM)
Server with an HMC Attached to the System
A client system connected to a network with a Network Installation Management (NIM) server can boot
standalone diagnostics from the NIM server if the client-specific settings on both the NIM server and client
are correct.
Notes:
1. All operations to configure the NIM server require root user authority.
2. If you replace the network adapter in the client, the network-adapter hardware-address settings for the
client must be updated on the NIM server.
3. The Cstate for each standalone diagnostics client on the NIM server should be kept in the diagnostic
boot has been enabled state.
4. On the client system, the NIM server network adapter should be put in the bootlist after the boot disk
drive. This allows the system to boot in standalone diagnostics from the NIM server if there is a
problem booting from the disk drive. For information about setting the bootlist, see the Multiboot
section under “SMS” in the client system’s service guide.
NIM Server Configuration
Refer to the “Advanced NIM Configuration Tasks” chapter of the AIX Installation Guide and Reference,
order number SC23-4389, for information on doing the following:
302 Service Guide
v Registering a client on the NIM server
v Enabling a client to run diagnostics from the NIM server
To verify that the client system is registered on the NIM server and the diagnostic boot is enabled, run the
lsnim -a Cstate -Z ClientName command from the command line on the NIM server. Refer to the
following table for system responses.
Note: The ClientName is the name of the system on which you want to run standalone diagnostics.
System Response Client Status
#name:Cstate:
ClientName:diagnostic boot has been
enabled:
The client system is registered on the NIM server and
enabled to run diagnostics from the NIM server.
#name:Cstate:
ClientName:ready for a NIM operation:
or
#name:Cstate:
ClientName:BOS installation has been
enabled:
The client is registered on the NIM server but not enabled
to run diagnostics from the NIM server.
Note: If the client system is registered on the NIM server
but Cstate has not been set, no data will be returned.
0042-053 lsnim: there is no NIM object
named "ClientName"
The client is not registered on the NIM server.
Client Configuration and Booting Standalone Diagnostics from the NIM
Server
To run standalone diagnostics on a client from the NIM server, do the following:
1. Remove any removable media (tape or CD-ROM disc).
2. Stop all programs including the operating system (get help if needed).
3. If you are running standalone diagnostics in a full system partition, verify with the system administrator
and system users that the system unit can shut down. Stop all programs, including the operating
system. Refer to the operating system documentation for shutdown command information.
In a partitioned system, make the CD-ROM drive available to the partition used to run standalone
diagnostics (refer to the Hardware Management Console Maintenance Guide for more information).
Verify with the system administrator and system users using that partition that all applications on that
partition must be stopped, and that the partition will be rebooted. Stop all programs on that partition,
including the operating system.
4. If you are in a full system partition, power on the system unit to run standalone diagnostics. In a
partitioned system, reboot the partition to run standalone diagnostics.
5. When the keyboard indicator is displayed (the word keyboard), press the number 1 key on the
keyboard to display the SMS menu.
6. Enter any requested passwords.
7. Select Setup Remote IPL (Initial Program Load).
8. Enter the client address, server address, gateway address (if applicable), and subnet mask. Exit to the
Network Parameters screen.
9. If the NIM server is set up to allow pinging from the client system, use the ping utility in the RIPL utility
to verify that the client system can ping the NIM server. Under the ping utility, choose the network
adapter that provides the attachment to the NIM server to do the ping operation. If the ping returns
with an OK prompt, the client is prepared to boot from the NIM server. If ping returns with a FAILED
prompt, the client cannot proceed with the NIM boot.
Chapter 6. Using the Online and Standalone Diagnostics 303
Note: If the ping fails, refer to “Boot Problems/Concerns” on page 173 and follow the steps for network
boot problems.
To do a one-time boot of the network adapter attached to the NIM server network, do the following:
1. Exit to the SMS Main screen.
2. Select Select Boot Options.
3. Select Install or Boot a Device.
4. On the Select Device Type screen, select Network.
5. Set the network parameters for the adapter from which you want to boot.
6. Exit completely from SMS. The system starts loading packets while doing a bootp from the network.
Follow the instructions on the screen to select the system console.
v If Diagnostics Operating Instructions Version x.x.x displays, standalone diagnostics have loaded
successfully.
v If the operating system login prompt displays, standalone diagnostics did not load. Check the following
items:
– The network parameters on the client may be incorrect.
– Cstate on the NIM server may be incorrect.
– Network problems might be preventing you from connecting to the NIM server.
Verify the settings and the status of the network. If you continue to have problems, refer to “Boot
Problems/Concerns” on page 173 and follow the steps for network boot problems.
Running the Online Diagnostics with No HMC Attached
Online diagnostics can be loaded by booting from a hard file in the custom boot list by pressing the
numeric 6 key on the keyboard.
To boot from the devices listed in the service mode boot list, perform the following procedure:
1. Verify with the system administrator and users that all programs may be stopped, then do so.
2. Turn off the system.
Note: If AIX is running, type the shutdown command.
3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn the
system on.
4. When the keyboard indicator appears during startup, press the numeric 6 key on the keyboard.
5. Enter any requested passwords.
After any requested passwords have been entered, the system attempts to boot from the first device of
each type found on the list. If no bootable image is found on the first device of each type on the list, the
system does not search through the other devices of that type for a bootable image; instead, it polls the
first device of the next type.
If all types of devices in the boot list have been polled without finding a bootable image, the system
restarts. This action gives the user the opportunity to start the System Management Services (by pressing
the number 1 key on a directly attached keyboard or the number 1 on an ASCII terminal) before the
system attempts to boot again.
304 Service Guide
Chapter 7. Using the Service Processor
Notes:
1. On some of the system management services (or service processor) screens, you will see the term
LPAR. LPAR is equivalent to the term logical partitioned system or partitioned system.
2. The information in this chapter regarding the configuring of serial ports (and modems attached to those
serial ports), applies only to the serial ports (S1 and S2) on the CEC backplane (location U0.1-P1).
This information is not applicable to the serial ports (or modems attached to those serial ports), on the
HMC.
The service processor runs on its own power boundary and continually monitors hardware attributes and
the environmental conditions within the system. The service processor is controlled by firmware and does
not require the operating system to be operational to perform its tasks.
The service processor menus allow you to configure service processor options, as well as enable and
disable functions.
Service processor menus are available using an ASCII terminal or an HMC virtual terminal window when
OK is displayed on the operator panel or when the service processor has detected a system problem (such
as a surveillance failure).
Service Processor Menus
The service processor menus are divided into the following groups:
v General user menu - the user must know the general-access password.
v Privileged user menus - the user must know the privileged-access password.
If the system is powered off, the service processor menus can be accessed locally or remotely on the
following:
v Serial port 1 (S1)
v Serial port 2 (S2)
v HMC
Accessing the Service Processor Menus Locally
Service processor menus can be accessed by opening a virtual terminal window on the HMC, or by
attaching an ASCII terminal to serial port 1 (S1) or serial port 2 (S2). After OK displays in the operator
panel, press any key on the keyboard to signal the service processor.
Note: The service processor menus cannot be accessed simultaneously on a virtual terminal window on
the HMC and on an ASCII terminal. Accessing the menus on the HMC prevents access to the
ASCII terminals and vice versa.
When you gain access, the service processor prompts you for a password (if one is set), and when
verified, displays the service processor menus.
The service processor menu prompt, represented by 0> or 1>, indicates the serial port to which the
terminal is connected.
v An ASCII terminal can have the following prompts:
– 0> indicates serial port 1 (S1)
– 1> indicates serial port 2 (S2)
v The HMC always displays a prompt of 0>
305
Accessing the Service Processor Menus Remotely
If your system has a modem connected to serial port 1 or serial port 2 and is configured for call-in (see
“Modem Configuration Menu” on page 326), the service processor menus can be accessed remotely as
follows:
1. With the system powered off, call in from a remote terminal.
2. The service processor detects ring-indicate and prompts you for a password (if one is set). When
verified, the service processor menus display remotely.
Saving and Restoring Service Processor Settings
All the settings that you make (except language) from the service processor menus can be backed up
either for recovering from a fault that may corrupt these settings, or for replicating these settings to other
servers that include a service processor.
The service aid ²Save or Restore Hardware Management Policies² can be used to save your settings after
initial setup or whenever the settings must be changed for system operation purposes.
It is strongly recommended that you use this service aid for backing up service processor settings to
protect the usefulness of the service processor and the availability of the system. Refer to “Save or
Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” section of the
Diagnostic Information for Multiple Bus Systems, order number SA23-2769.
Menu Inactivity
The service processor exits menu mode after ten minutes of inactivity and displays a message indicating
that it has done so. Pressing any key on the virtual terminal window causes the main menu to display.
General User Menu
The menu options presented to the general user are a subset of the options available to the privileged
user. The user must know the general-access password, if one is set, to access this menu.
GENERAL USER MENU
1. Power-on System
2. Power-off System
3. Read VPD Image from Last System Boot
4. Read Progress Indicators from Last System Boot
5. Read Service Processor Error Logs
6. Read System POST Errors
99. Exit from Menus
0>
v Power-on System
Allows the user to start the system using the current virtual terminal window as the active console.
v Power-off System
This option is not available on this system.
v Read VPD Image from Last System Boot
Displays manufacturer vital product data, such as serial numbers, part numbers, and so on, that were
stored from the system boot prior to the one in progress now, for the entire system.
306 Service Guide
v Read Progress Indicators from Last System Boot
Displays a number of the boot progress indicators, which may include service processor checkpoints,
IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be
useful in diagnosing system faults.
Note: If you are running one or more logical partitions, enter the partition ID (0-15) to display progress
indicators for that partition since the last system boot. If your system is running in Full System
Partition mode, this option automatically displays details from partition 0.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This information is not stored in nonvolatile storage. If the system is powered off using the power-on
button on the operator panel, this information is retained. If the ac power is disconnected from the
system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 340.
v Read Service Processor Error Logs
Displays the service processor error logs. For an example, refer to “Service Processor Error Logs” on
page 339.
v Read System POST Errors
Displays additional error log information (this option is only for service personnel).
v Exit from Menus
Selecting this option will exit the service processor menus. You can re-enter the menus by pressing any
key on the console.
Privileged User Menus
The following menus are available to privileged users only. The user must know the privileged-access
password, if one is set, to access these menus.
Main Menu
A listing at the top of the main menu contains the following:
v Your system’s current firmware version
v The firmware copyright notice
v The system name given to your system during setup
You need the firmware version for reference when you either update or repair the functions of your service
processor.
The system name, an optional field, is the name that your system reports in problem messages. This
name helps your support team (for example, your system administrator, network administrator, or service
representative) to more quickly identify the location, configuration, and history of your system. Set the
system name, from the main menu, using option 6.
Chapter 7. Using the Service Processor 307
Note: The information under the Service Processor Firmware heading in the following Main Menu
illustration is example information only.
Service Processor Firmware
VERSION: RH011007
Copyright 2001 IBM Corporation
SYSTEM NAME
MAIN MENU
1. Service Processor Setup Menu
2. System Power Control Menu
3. System Information Menu
4. Language Selection Menu
5. Call-In/Call-Out Setup Menu
6. Set System Name
99. Exit from Menus
0>
v Service Processor Setup Menu
See “Service Processor Setup Menu” on page 309 for more information.
v System Power Control Menu
See “System Power Control Menu” on page 314 for more information.
v System Information Menu
See “System Information Menu” on page 317 for more information.
v Language Selection Menu
See “Language Selection Menu” on page 324 for more information.
v Call-In/Call-Out Setup Menu
See “Call-In/Call-Out Setup Menu” on page 325 for more information.
v Set System Name
Allows setting of the system name.
308 Service Guide
Service Processor Setup Menu
The following Service Processor Setup Menu is accessed from the Main Menu:
SERVICE PROCESSOR SETUP MENU
1. Change Privileged Access Password
2. Change General Access Password
3. Enable/Disable Console Mirroring:
Currently Enabled
4. Start Talk Mode
5. OS Surveillance Setup Menu
NOT supported in LPAR mode
6. Reset Service Processor
7. Reprogram Flash EPROM Menu
8. Serial Port Snoop Setup Menu
NOT supported in LPAR mode
9. Scan Log Dump Setup Menu:
Currently As Needed
98. Return to Previous Menu
99. Exit from Menus
0>
Note: Unless otherwise stated in menu responses, settings become effective when a menu is exited using
option 98 or 99.
Passwords
Passwords can be any combination of up to eight alphanumeric characters. You can enter longer
passwords, but the entries are truncated to include only the first eight characters. The privileged-access
password can be set from service processor menus or from System Management Services (SMS) utilities
(see Chapter 8, “Using System Management Services”, on page 345). The general-access password can
be set only from service processor menus.
For security purposes, the service processor counts the number of attempts to enter passwords. The
results of not recognizing a password within this error threshold are different, depending on whether the
attempts are being made locally (at the system) or remotely (through a modem). The error threshold is
three attempts.
If the error threshold is reached by someone entering passwords at the system, the service processor
commands the system to resume the initial program load (IPL). This action is taken based on the
assumption that the system is in an adequately secure location with only authorized users having access.
Such users must still successfully enter a login password to access the operating system.
If the error threshold is reached by someone entering passwords remotely, the service processor
commands the system to power off to prevent potential security attacks on the system by unauthorized
remote users. The following table lists what you can access with the privileged-access password and the
general-access password.
Chapter 7. Using the Service Processor 309
Privileged
Access
Password
General
Access
Password
Resulting Menu
None None Service processor MAIN MENU displays.
Set None Users with the password see the service processor MAIN MENU.
Users without password cannot log in.
Set Set Users see menus associated with the entered password.
If you forget the password, you must remove the battery for at least 30 seconds to disable the password.
v Change Privileged-Access Password
Set or change the privileged-access password. It provides the user with the capability to access all
service processor functions. This password is usually used by the system administrator or root user.
v Change General-Access Password
Set or change the general-access password. It provides limited access to service processor menus, and
is usually available to all users who are allowed to power on the system, especially remotely.
Note: The general-access password can only be set or changed after the privileged-access password
is set.
v Enable/Disable Console Mirroring
Note: Console mirroring is disabled in partitioned systems.
Console mirroring is supported on serial port 1 (S1) and serial port 2 (S2). When console mirroring is
enabled, the service processor sends information to all serial ports. The serial port from which console
mirroring is enabled is referred to as the active port. The mirror port is determined when keyboard input
is detected from one of the other ports. From this point on, the service processor sends information only
to the active port and the mirror port. This capability can be enabled by local or remote users, providing
local users with the capability to monitor remote sessions. Console mirroring can be enabled for the
current session only. For more information, see “Console Mirroring” on page 338.
v Start Talk Mode
In a console-mirroring session, it is useful for those who are monitoring the session to be able to
communicate with each other. Selecting Start Talk Mode activates the keyboards and displays for such
communications while console mirroring is established. This is a full duplex link, so message
interference is possible. Alternating messages between users works best.
v OS Surveillance Setup Menu
Note: This option is disabled in partitioned systems.
This menu can be used to set up operating system (OS) surveillance.
OS Surveillance Setup Menu
1. Surveillance:
Currently Enabled
2. Surveillance Time Interval:
2 minutes
3. Surveillance Delay:
2 minutes
98. Return to Previous Menu
0>
310 Service Guide
– Surveillance
Can be set to Enabled or Disabled.
– Surveillance Time Interval
Can be set to any number from 2 through 255.
– Surveillance Delay
Can be set to any number from 0 through 255.
Refer to “Service Processor System Monitoring - Surveillance” on page 336 for more information about
surveillance.
v Reset Service Processor
If this option is selected, entering Y causes the service processor to reboot.
v Reprogram Flash EPROM Menu
This option requires a diskette drive to be installed in the system. If this option is selected, and no
diskette drive is present in the system, the service processor will indicate ²Not Supported² on the
screen.
This option updates the system EPROMs. After entering Y to indicate that you want to continue, you are
prompted to enter the update diskettes. Follow the instructions on the screen. When the update is
complete, the service processor reboots.
All system EPROMs that can be reprogrammed are updated at the same time and are as follows:
– System power control network programming
– Service processor programming
– System firmware programming
– Run-Time Abstraction Services
v Serial Port Snoop Setup Menu
Note: This option is disabled in partitioned systems.
This menu can be used to set up serial port snooping, in which the user can configure serial port 1 as a
²catch-all² reset device.
From the service processor main menu, select option 1, service processor setup menu, then select
option 8 (Serial Port Snoop Setup Menu).
SERIAL PORT SNOOP SETUP MENU
1. System reset string:
Currently Unassigned
2. Snoop Serial Port:
Currently Unassigned
98. Return to Previous Menu
1>
Use the Snoop Serial Port option to select the serial port to snoop.
Note: Only serial port 1 is supported.
Chapter 7. Using the Service Processor 311
Use the system reset string option to enter the system reset string, which resets the machine when it is
detected on the main console on Serial Port 1.
After serial port snooping is correctly configured, at any point after the system is booted to the operating
system, whenever the reset string is typed on the main console, the system uses the service processor
reboot policy to restart.
Because pressing Enter after the reset string is not required, make sure that the string is not common
or trivial. A mixed-case string is recommended.
v Scan Log Dump Policy
A scan dump is the collection of chip data that the service processor gathers after a system malfunction,
such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and
SCOM contents.
The scan dump data are stored in the system control store. The size of the scan dump area is
approximately 4 MB.
During the scan log dump, A8xx (in the range A810 to A8FF) displays in the operator panel. The xx
characters will change as the scan log dump progresses. If the xx characters do not change after five
minutes, the service processor is hung and must be reset.
When the scan log dump is complete, depending on how the reboot policy is set, the system will either:
– Go to the standby state (and the service processor menus will be available), indicated by OK or
STBY in the operator panel
OR
– Attempt to reboot.
Scan Log Dump Setup Menu
1. Scan Log Dump Policy:
Currently As Needed
2. Scan Log Dump Content:
Currently As Requested
3. Immediate Dump
98. Return to Previous Menu
0> 1
Select from the following options:
(As Needed=2, Always=3)
Enter New Option:
0>
The scan log dump policy can be set to the following:
2 = As Needed
The processor run-time diagnostics record the dump data based on the error type. This is the
default value.
3 = Always
Selecting this option allows the service processor to record a scan log dump for all error types.
The scan log dump policy can also be set from the Tasks menu in the AIX service aids.
312 Service Guide
Option 2 displays the following screen:
Scan Log Dump Setup Menu
1. Scan Log Dump Policy:
Currently As Needed
2. Scan Log Dump Content:
Currently As Requested
3. Immediate Dump
98. Return to Previous Menu
0> 2
Select from the following options:
(As Requested=1, Optimum=2, Complete=3, Minimum=4)
Enter New Option:
0>
The scan log dump content can be set to the following:
1 = As Requested
The processor run-time diagnostics will select the contents of the dump file based on the type of
error that occurs. This is the default.
2 = Optimum
The dump will include the smallest amount of information to diagnose a hardware error.
3 = Complete
The dump will include as much information as possible to allow the complete analysis of hardware
and software errors.
4 = Minimum
The dump will include the smallest amount of information possible (a minimum number of
hardware scan log rings).
The complete dump will take the longest time to finish; it may take as long as 1.5 hours on a fully
configured system.
Option 3, Immediate Dump, can only be used when the system is in the standby state with power on. It is
used to dump the system data after a checkstop or machine check occurs when the system firmware is
running, or when the operating system is booting or running.
The scan log dump content can also be set from the Tasks menu in the AIX diagnostic service aids.
If a valid dump file already exists, the dump control code will stop because the contents of the prior dump
must be protected.
Chapter 7. Using the Service Processor 313
System Power Control Menu
This menu is used to set power control options. Other menus that control boot options are available from
this menu.
SYSTEM POWER CONTROL MENU
1. Enable/Disable Unattended Start Mode:
Currently Enabled
2. Ring Indicate Power-On Menu
3. Reboot/Restart Policy Setup Menu
4. Power-On System
5. Power-Off System
6. Enable/Disable Fast System Boot
Currently Fast Boot
7. Boot Mode Menu
98. Return to Previous Menu
99. Exit from Menus
0>
v Enable/Disable Unattended Start Mode
Use this option to instruct the service processor to restore the power state of the system after a
temporary power failure. Unattended start mode can also be set through the System Management
Services (SMS) menus. This option is intended to be used on systems that require automatic power-on
after a power failure. For more information, see “System Power-On Methods” on page 330.
v Ring Indicate Power-On Menu
RING INDICATE POWER-ON MENU
1. Ring indicate power-on :
Currently Enabled
2. Number of rings:
Currently 3
30. Refresh Modem Settings
98. Return to Previous Menu
Ring indicate power-on is enabled by default on both serial port 1 (S1) and serial port 2 (S2). When ring
indicate power-on is enabled, call-in is disabled.
If ring indicate power-on is enabled and call-in is already enabled, you will be asked to confirm your
choice. Refer to the message displayed on your screen.
If the ring indicate power-on setting is changed, you must select option 30, Refresh Modem Settings to
update the modem settings. If Refresh Modem Settings is selected, and the modem(s) have not been
configured, you will be asked to configure the modems first. See “Call-In/Call-Out Setup Menu” on
page 325 for information on configuring modems.
Option 2 is used to set the number of rings.
314 Service Guide
v Reboot/Restart Policy Setup Menu
The following menu controls the Reboot/Restart Policy:
Reboot/Restart Policy Setup Menu
1. Number of reboot attempts:
Currently 1
2. Use OS-Defined restart policy?
Currently No
3. Enable supplemental restart policy?
Currently Yes
4. Call-Out before restart:
Currently Disabled
98. Return to Previous Menu
0>
Reboot is the process of bringing up the system hardware; for example, from a system reset or power
on. Restart is activating the operating system after the system hardware is reinitialized. Restart must
follow a successful reboot.
– Number of reboot attempts - If the system fails to successfully complete the boot process, it
attempts to reboot the number of times specified. Entry values equal to or greater than 0 are valid.
Only successive failed reboot/restart attempts are counted.
– Use OS-Defined restart policy - In a full system partition, this allows the service processor to react
in the same way that the operating system does to major system faults by reading the setting of the
operating system parameter Automatically Restart/Reboot After a System Crash. This parameter
might already be defined, depending on the operating system (or its version or level). If the operating
system automatic restart setting is defined, it can be set to respond to a major fault by restarting or
by not restarting. See your operating system documentation for details on setting up operating
system automatic restarts. The default value is No.
– Enable supplemental restart policy - The default setting is Yes. When set to Yes in a full system
partition, the service processor restarts the system when the system loses control as detected by
service processor surveillance, and either:
The Use OS-Defined restart policy is set to No.
OR
The Use OS-Defined restart policy is set to Yes, and the operating system has no automatic
restart policy.
– Call-Out before restart (Enabled/Disabled) - If a restart is necessary due to a system fault, and
you are running a full system partition, you can enable the service processor to call out and report
the event. This option can be valuable if the number of these events becomes excessive, which
might signal a bigger problem.
This setting is ignored on a partitioned system.
Chapter 7. Using the Service Processor 315
v Power-On System
Allows immediate power-on of the system.
v Power-Off System
This option is not available on this system.
v Enable/Disable Fast System Boot
Allows the user to select the IPL type, mode, and speed of the system boot.
Attention: Selecting the fast IPL results in several diagnostic tests being skipped and a shorter
memory test being run.
v Boot Mode Menu
Note: This option is disabled in partitioned systems.
The Boot Mode Menu allows you to select a boot mode.
Boot Mode Menu
1. Boot to SMS Menu:
Currently Disabled
2. Service Mode Boot from Saved List:
Currently Disabled
3. Service Mode Boot from Default List:
Currently Disabled
4. Boot to Open Firmware Prompt:
Currently Disabled
98. Return to Previous Menu
0>
To select a boot mode, select a number and press Enter. The item corresponding to the selected
number toggles between Disabled to Enabled. If a boot mode is Enabled, the boot mode selected is
performed, and the Disabled/Enabled selection is reset to Disabled. The following describes each boot
mode:
– Boot to SMS Menu
When this selection is enabled, the system boots to the System Management Services (SMS) Menu.
– Service Mode Boot from Saved List
This selection causes the system to perform a service mode boot using the service mode boot list
saved in NVRAM. If the system boots AIX from the disk drive and AIX diagnostics are loaded on the
disk drive, AIX boots to the diagnostics menu.
Using this option to boot the system is the preferred way to run online diagnostics.
– Service Mode Boot from Default List
This selection is similar to Service Mode Boot from Saved List, except the system boots using the
default boot list that is stored in the system firmware. This is normally used to try to boot customer
diagnostics from the CD-ROM drive.
Using this option to boot the system is the preferred way to run standalone diagnostics.
– Boot to Open Firmware
This option should only be used by service personnel to obtain additional debug information. When
this selection is enabled, the system boots to the open firmware prompt.
316 Service Guide
System Information Menu
This menu provides access to system configuration information, error logs, system resources, and
processor configuration.
SYSTEM INFORMATION MENU
1. Read VPD Image from Last System Boot
2. Read Progress Indicators from Last System Boot
3. Read Service Processor Error Logs
4. Read System POST Errors
5. Read NVRAM
6. Read Service Processor Configuration
7. Processor Configuration/Deconfiguration Menu
8. Memory Configuration/Deconfiguration Menu
9. Power Control Network Utilities Menu
10. LED Control Menu
11. MCM/L3 Interposer Plug Count Menu
Not Supported
12. Performance Mode Setup Menu
98. Return to Previous Menu
99. Exit from Menus
0>
v Read VPD Image from Last System Boot
Displays manufacturer’s vital product data (VPD), such as serial numbers, part numbers, and so on, that
was stored from the system boot prior to the one in progress now. VPD from all devices in the system is
displayed.
v Read Progress Indicators from Last System Boot
Displays a number of the boot progress indicators, which may include Service Processor checkpoints,
IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be
useful in diagnosing system faults.
Note: If you are running from a partitioned system, enter the partition ID (0-15) to display progress
indicators for that partition since the last system boot. In a full system partition, this option
automatically displays details from partition 0.
The progress indicator codes are listed from top (latest) to bottom (oldest).
This information is not stored in nonvolatile storage. If the system is powered off using the power-on
button on the operator panel, this information is retained. If the ac power is disconnected from the
system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 340.
v Read Service Processor Error Logs
Displays error conditions detected by the service processor. Refer to “Service Processor Error Logs” on
page 339 for an example of this error log.
Chapter 7. Using the Service Processor 317
v Read System POST Errors
This option should only be used by service personnel to obtain additional debug information.
v Read NVRAM
Displays Non Volatile Random Access Memory (NVRAM) content.
v Read Service Processor Configuration
Displays current service processor configuration.
v Processor Configuration/Deconfiguration Menu
Enable/Disable CPU Repeat Gard
CPU repeat gard will automatically deconfigure a CPU during a system boot if a processor has
failed BIST (power-on self-test), caused a machine check or check stop, or has reached a
threshold of recoverable errors. The processor will remain deconfigured until repeat gard is
disabled or the processor is replaced.
The default is enabled.
For more information, see “Configuring and Deconfiguring Processors or Memory” on page 335.
Processor Hot Sparing
This option is not available on this system.
This menu allows the user to change the system processor configuration. If it is necessary to take one
of the processors offline, use this menu to deconfigure a processor, and then reconfigure the processor
at a later time. An example of this menu follows:
PROCESSOR CONFIGURATION/DECONFIGURATION MENU
77. Enable/Disable CPU Repeat Gard: Currently Enabled
78. Enable/Disable Processor Hot Sparing (if available): Currently Disabled
1. 0 3.0 (00) Configured by system 2. 1 3.1 (00) Deconfigured by system
3. 24 4.0 (00) Configured by system 4. 25 4.1 (00) Configured by system
98. Return to Previous Menu
0>
Note: This table is built from vital product data collected during the last boot sequence. The first time
the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this
table may be empty. The table is rebuilt during the next boot into the operating system.
The fields of the previous table represent the following:
Column 1
(1.) Menu selection index.
Column 2
(0) Logical processor device number assigned by AIX. If the system is running AIX, you can
display these logical device numbers by issuing the following command on the AIX command
line:
lsdev -C | grep proc
Column 3
(3.0) Processor address list used by the service processor.
Column 4
(00) Error status of the processors.
318 Service Guide
The error status of each processor is indicated by AB, where B indicates the number of errors and A
indicates the type of error according to the following:
1. Bring-up failure
2. Run-time non-recoverable failure
3. Run-time recoverable failure
4. Group integrity failure
5. Non-repeat-gardable error. The resource may be reconfigured on the next boot.
A status of 00 indicates that the CPU has not had any errors logged against it by the service processor.
To enable or disable CPU repeat gard, use menu option 77. CPU repeat gard is enabled by default.
If CPU repeat gard is disabled, processors that are in the ²deconfigured by system² state will be
reconfigured. These reconfigured processors are then tested during the boot process, and if they pass,
they remain online. If they fail the boot testing, they are deconfigured even though CPU repeat gard is
disabled.
The failure history of each CPU is retained. If a processor with a history of failures is brought back
online by disabling repeat gard, it remains online if it passes testing during the boot process. However, if
repeat gard is enabled, the processor is taken offline again because of its history of failures.
Notes:
1. The processor numbering scheme used by the service processor is different from the numbering
scheme used by the operating system. Consult the operating system documentation before
configuring or deconfiguring a processor to ensure that the correct processor is selected.
2. To determine the number of processors available to AIX, run the following command on the AIX
command line: bindprocessor -q
Note: The number of processor available to Linux can be determined by issuing the following
command on the Linux command line:
cat /proc/cpuinfo | grep POWER
v Memory Configuration/Deconfiguration Menu
Enable/Disable Memory Repeat Gard
Memory repeat gard will automatically deconfigure a quad of memory during a system boot if a
memory DIMM has failed BIST (power-on self-test), caused a machine check or check stop, or
has reached a threshold of recoverable errors. The quad will remain deconfigured until repeat
gard is disabled or the memory is replaced.
The default is enabled.
For more information, see “Configuring and Deconfiguring Processors or Memory” on page 335.
Runtime Recoverable Error Repeat Gard
The runtime recoverable error repeat gard flag controls the deallocation of the memory if a
recoverable error occurs during runtime. If a recoverable memory error occurs, and runtime
recoverable error repeat gard is disabled, the system will continue running with no change in
the memory configuration. If a recoverable memory error occurs, and runtime recoverable error
repeat gard is enabled, the memory quad in which the error occurred will be garded out (taken
offline).
The default is disabled.
Chapter 7. Using the Service Processor 319
These menus allow the user to change the system memory configuration. If it is necessary to take one
of the memory DIMMs offline, this menu allows you to deconfigure a DIMM, and then reconfigure the
DIMM at a later time.
When this option is selected, a menu displays. The following is an example of this menu:
MEMORY CONFIGURATION/DECONFIGURATION MENU
77. Enable/Disable Memory Repeat Gard: Currently Enabled
78. Runtime Recoverable Error Repeat Gard: Currently Disabled
1. Memory card
98. Return to Previous Menu
After you select the memory card option by entering 1, a menu displays, allowing the selection of a
memory DIMM. The following is an example of this menu.
MEMORY CONFIGURATION/DECONFIGURATION MENU
1. 3.16(00) Configured by system 2. 3.17(00) Configured by system
3. 3.18(00) Configured by system 4. 3.19(00) Configured by system
5. 4.24(00) Configured by system 6. 4.25(00) Configured by system
7. 4.26(00) Configured by system 8. 4.27(00) Configured by system
98. Return to Previous Menu
Note: This table is built from vital product data collected during the last boot sequence. The first time
the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this
table may be empty. The table is rebuilt during the next boot into the operating system.
The fields in the previous table represent the following:
Column 1
1. Menu selection index/DIMM number
Column 2
xx.xx : DIMM address used by service processor
Column 3
(00) Error status
The error status of the each memory DIMM is indicated by AB, where B indicates the number of errors
and A indicates the type of error according to the following table:
1. Bring-up failure
2. Run-time non-recoverable failure
3. Run-time recoverable failure
4. Group integrity failure
5. Non-repeat-gardable error. The resource may be reconfigured on the next boot.
An error status of 00 (for example, 3.16(00)) indicates that the memory DIMM has not had any errors
logged against it by the service processor.
320 Service Guide
To change the memory configuration, select the number of the memory DIMM. The memory DIMM state
will change from configured to deconfigured or from deconfigured to configured.
In the previous example menu, each line shows two DIMMs and indicates whether they are configured.
To enable or disable Memory Repeat Gard, use menu option 77 of the Memory
Configuration/Deconfiguration Menu.
To enable or disable runtime recoverable error repeat gard, use option 78 of the memory
configuration/deconfiguration menu.
The failure history of each DIMM is retained. If a DIMM with a history of failures is brought back online
by disabling Repeat Gard, it remains online if it passes testing during the boot process. However, if
Repeat Gard is enabled, the DIMM is taken offline again because of its history of failures.
v Power Control Network Utilities Menu
POWER CONTROL NETWORK UTILITIES MENU
1. Lamp Test for all Operator Panels
2. Display I/O Type
3. Change I/O Type
98. Return to Previous Menu
0>
– Lamp Test for All Operator Panels
Selecting this option tests the media drawer operator panel’s indicators by causing them to blink on
and off for approximately 30 seconds.
– Display I/O Type
This option is not available on this system.
– Change I/O Type
Use this option to change the I/O type after a service action or configuration change, if the I/O type
is incorrect. If this option is chosen, you will be asked to make the following entries:
1. For the I/O drawer address, type 1.
2. If a 4-slot PCI riser has been replaced, or is present in the system, type A6 for the I/O type. If a
6-slot PCI riser has been replaced, or is present in the system, type A3 for the I/O type.
If either value is not valid, a failure message displays on the console. Press Enter to return to the
Power Control Network Utilities Menu.
Chapter 7. Using the Service Processor 321
v LED Indicator Menu
This menu displays the state of the processor subsystem disturbance/system attention LED and the
fault/identify LED on the I/O subsystem(s). Use this menu to toggle the attention/fault LEDs between
identify (blinking) and off. Option 1 is only available when the system is in the error state (the CEC is
powered on and the service processor menus are available). It is not available when the system is in
standby. An example of this menu follows:
LED Indicator
1. Set/Reset Identify LED state
2. Clear System Attention Indicator
98. Return to Previous Menu
Enter LED index
0 >
If Option 1 is selected, a list of location codes of the I/O subsystems and the system unit drawer is
shown. The screen is similar to the following:
1. U0.2-P1
2. U0.3-P1
Enter number corresponding to the location code, or
press Return to continue, or ’x’ to return to the menu
If one of the devices is selected using the index number, the present state of its LED is displayed, and
you are given the option to toggle it as shown in the following example screen. The final state of the
LED is then displayed whether or not its state was changed.
U0.2-P1 is currently in the off state
Select from the following (1=IDENTIFY ON, 2=IDENTIFY OFF)
0>2
Please wait
U0.2-P1 is currently in the OFF state
(Press Return to continue)
Option 2, Clear System Attention Indicator, clears the attention indicator on the operator panel.
The processor subsystem disturbance/LED is on the operator panel. The I/O drawer fault/identify LED is
located on the front of each I/O subsystem.
v MCM/L3 Interposer Plug Count Menu
This option is not available on this system.
322 Service Guide
v Performance Mode Setup Menu
If certain types of processor cards are installed in the system, this menu is not available. For other
types of processor cards, this menu is active after the first boot.
Note: The first time the system is booted after NVRAM is cleared, Not Applicable displays under
Performance Mode Setup Menu on the screen. This might also happen if the service processor
is replaced, or the processor cards are upgraded.
If Performance Mode Setup Menu is selected when Not Applicable displays the screen, the
system responds with Not Applicable and redisplays the system information menu. The setup
menu can be displayed after the performance mode is set, which occurs when the system is
rebooted for the first time.
The default performance mode is set by the firmware during IPL. The default mode provides the
optimum performance for the hardware configuration of the system. The performance mode is
systemwide; it cannot be set on a per-partition basis.
You can override the default setting by using the Default Performance Mode menu, which will be similar
to the following:
Default Performance Mode: Standard Operation
1. Current Performance Mode:
Standard Operation
98. Return to Previous Menu
0>1
Selecting option 1 displays the following performance modes:
Select from the following options:
1. Large Commercial System optimization
2. Standard Operation
3. Turbo Database Mode
0>
Notes:
1. Some processor card changes cause the default performance mode to change. This new setting is
not reflected in the menu until after the system is rebooted with the new configuration.
2. If the NVRAM has been cleared, the default performance mode and the current performance mode
will indicate uninitialized.
Chapter 7. Using the Service Processor 323
If you want to override the default setting, a brief description of each performance mode follows:
Large Commercial System Optimization
Is the setting for systems that do not fall into the other two categories, standard operation and
turbo database mode. This setting provides the optimum performance for most applications.
Standard Operation
Optimizes the system for large memory bandwidth applications where minimal sharing of data
occurs and the likelihood of significant hardware data-prefetching exists.
Turbo Database Mode
Optimizes system operation for environments where there is a large amount of data sharing
among processes running concurrently on the system.
Language Selection Menu
The service processor menus and messages are available in various languages. This menu allows
selecting languages in which the service processor and system firmware menus and messages are
displayed.
LANGUAGE SELECTION MENU
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
98. Return to Previous Menu
99. Exit from Menus
0>
Note: Your virtual terminal window must support the ISO-8859 character set to correctly display
languages other than English.
324 Service Guide
Call-In/Call-Out Setup Menu
Note: The information in this section regarding the configuring of serial ports, and modems attached to
those serial ports, applies only to the serial ports (S1 and S2) on the CEC backplane (location
U0.1-P1). These serial ports are normally used for call-in and call-out.
Call-out is disabled in partitioned systems. The call-out function is normally handled by the Service
Focal Point application running on the HMC.
None of these menus are applicable to the serial ports, or modems attached to those serial ports,
on the HMC.
CALL-IN/CALL-OUT SETUP MENU
1. Modem Configuration Menu
2. Serial Port Selection Menu
3. Serial Port Speed Setup Menu
4. Telephone Number Setup Menu
5. Call-Out Policy Setup Menu
6. Customer Account Setup Menu
7. Call-Out Test
NOT supported in LPAR mode
98. Return to Previous Menu
99. Exit from Menus
0>
v Modem Configuration Menu, see “Modem Configuration Menu” on page 326.
v Serial Port Selection Menu, see “Serial Port Selection Menu” on page 326.
v Serial Port Speed Setup Menu, see “Serial Port Speed Setup Menu” on page 327.
v Telephone Number Setup Menu, see “Telephone Number Setup Menu” on page 327.
v Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page 328.
v Customer Account Setup Menu, see “Customer Account Setup Menu” on page 329.
v Call-Out Test tests the configuration after the modem is installed and configured correctly (disabled in
partitioned systems).
Chapter 7. Using the Service Processor 325
Modem Configuration Menu
Note: This option applies only to a modem attached to serial port 1 (S1) or serial port 2 (S2) on the CEC
backplane.
The first two lines of the Modem Configuration Menu contain status of the current selections. Selections
are made in the sections labeled Modem Ports and Modem Configuration File Name. Select the serial port
that you want to activate and then select the modem configuration file for the modem on the port. If you
want to set up all of the serial ports with modems, make your selections one port at a time.
Modem Configuration Menu
Port 1 Modem Configuration File Name:
Port 2 Modem Configuration File Name:
To make changes, First select the port and then the configuration file
name
Modem Ports:
1. Serial port 1
2. Serial port 2
Modem Configuration File Name:
5. none9. modem_z_sp
6. modem_f_sp 10. modem_m1_sp
7. modem_f0_sp 11. modem_m0_sp
8. modem_f1_sp 12. modem_m1_sp
30. Save configuration to NVRAM and Configure modem
98. Return to Previous Menu
0>
For information on choosing a modem configuration file, see “Sample Modem Configuration Files” on
page 513 and “Transfer of a Modem Session” on page 517.
Serial Port Selection Menu
This menu allows you to enable or disable the call-in and call-out functions of each serial port in any
combination.
Note: Call-out is disabled in partitioned systems.
Serial Port Selection Menu
1. Serial Port 1 Call-Out:
Currently Disabled
NOT supported in LPAR mode
2. Serial Port 2 Call-Out:
Currently Disabled
NOT supported in LPAR mode
3. Serial Port 1 Call-In:
Currently Disabled
4. Serial Port 2 Call-In:
Currently Disabled
98. Return to Previous Menu
0>
326 Service Guide
Call-in and ring indicate power-on cannot be enabled at the same time. If ring-indicate power-on is already
enabled and you try to enable call-in, a message prompts you for confirmation. Refer to the message
displayed on the screen.
Serial Port Speed Setup Menu
This menu allows you to set serial port speed to enhance terminal performance or to accommodate
modem capabilities.
Serial Port Speed Setup Menu
1. Serial Port 1 Speed:
Currently 9600
2. Serial Port 2 Speed:
Currently 9600
98. Return to Previous Menu
0>
A serial port speed of 9600 baud or higher is recommended. The following are valid serial port speeds:
50 600 4800
75 1200 7200
110 1800 9600
134 2000 19200
150 2400 38000
300 3600 57600
115200
Telephone Number Setup Menu
Note: These menus are available, however, the function of calling out to report a system failure is
normally handled by the Service Focal Point application running on the HMC.
Use this menu to set or change the telephone numbers for reporting a system failure.
Telephone Number Setup Menu
1. Service Center Telephone Number:
Currently Unassigned
2. Customer Administration Center Telephone Number:
Currently Unassigned
3. Digital Pager Telephone Number:
Currently Unassigned
4. Customer Voice Telephone Number:
Currently Unassigned
5. Customer System Telephone Number:
Currently Unassigned
98. Return to Previous Menu
0>
Chapter 7. Using the Service Processor 327
v Service Center Telephone Number is the number of the service center computer. The service center
usually includes a computer that takes calls from servers with call-out capability. This computer is
referred to as the catcher. The catcher expects messages in a specific format to which the service
processor conforms.
For more information about the format and catcher computers, refer to the readme file in the AIX
/usr/samples/syscatch directory. Contact your service provider for the correct service center telephone
number to enter. Until you have that number, leave this field unassigned.
v Customer Administration Center Telephone Number is the number of the System Administration
Center computer (catcher) that receives problem calls from servers. Contact your system administrator
for the correct telephone number to enter here. Until you have that number, leave this field unassigned.
v Digital Pager Telephone Number is the number for a numeric pager carried by someone who
responds to problem calls from your system. Contact your administration center representative for the
correct telephone number to enter. For test purposes, use a test number, which you can change later
(see the note on page 337).
Note: If the system is running in partitioned mode, call-out is disabled. However, if the system is booted
in full system partition mode, at least one of the preceding three telephone numbers must be
assigned in order for the call-out test to complete successfully.
v Customer Voice Telephone Number is the telephone number of a phone near the system or answered
by someone responsible for the system. This is the telephone number left on the pager for callback. For
test purposes, use a test number, which you can change later.
v Customer System Telephone Number is the telephone number to which your system’s modem is
connected. The service or administrative center representatives need this number to make direct
contact with your system for problem investigation. This is also referred to as the call-in phone number.
Call-Out Policy Setup Menu
Note: Call-out is disabled on partitioned systems.
Call out settings can be set using the following menu:
CALL-OUT POLICY SETUP MENU
1. Call-Out policy (First/All):
Currently First
2. Remote timeout, (in seconds):
Currently 120
3. Remote latency, (in seconds):
Currently 2
4. Number of retries:
Currently 2
98. Return to Previous Menu
0>
328 Service Guide
v Call-Out policy can be set to first or all. If call-out policy is set to first, the service processor stops
at the first successful call-out to one of the following numbers in the order listed:
1. Service Center
2. Customer Administrative Center
3. Pager
If call-out policy is set to all, the service processor attempts a call-out to all of the following numbers in
the order listed:
1. Service Center
2. Customer Administrative Center
3. Pager
v Remote timeout and remote latency are functions of your service provider’s catcher computer. Either
use the defaults or contact your service provider for recommended settings.
v Number of retries is the number of times you want the system to retry calls that failed to complete.
Customer Account Setup Menu
This menu allows users to enter information that is specific to their account.
Customer Account Setup Menu
1. Customer Account Number:
Currently Unassigned
2. Customer RETAIN Login userid:
Currently Unassigned
3. Customer RETAIN login password:
Currently Unassigned
98. Return to Previous Menu
0>
v Customer Account Number is assigned by your service provider for record-keeping and billing. If you
have an account number, enter it. Otherwise, leave this field unassigned.
v Customer RETAIN Login User ID and Customer RETAIN Login Password apply to a service function
to which your service provider might have access. If your service provider does not use RETAIN, leave
these fields unassigned.
Call-out Test Menu
Note: Call-out test is disabled in partitioned systems.
Use this menu to test the configuration after the modem is installed and configured. To execute this test
successfully, at least one of the following numbers must be assigned:
v Service center telephone number
v Customer administration center telephone number
v Digital pager telephone number
Chapter 7. Using the Service Processor 329
Service Processor Parameters in Service Mode (Full System Partition)
When the system is in service mode, the following service-processor parameters are suspended:
v Unattended Start Mode
v Reboot/Restart Policy
v Call-Out
v Surveillance
When service mode is exited, the service processor functions are reactivated.
System Power-On Methods
This section discusses the following system power-on methods:
v Power-on Switch
v Service Processor Menus
Privileged users can power on the system by selecting the System Control Power Menu option from
the main menu and then selecting the Power-on System option from the System Power Control Menu.
General users should select Power-on System on the General User Menu.
v Remote Power-on via Ring-Indicate Signal
The system automatically powers on when it detects a ²ring indicate² signal from a modem attached to
serial port 1 (S1) or serial port 2 (S2).
A remote user can call the system to activate ring detection by the modem. Listen for a few more rings
than the threshold number for starting the system. The system powers on without answering the call.
v Unattended start mode - refer to Enable/Disable Unattended Start Mode on page 314.
The service processor can be enabled to recover from the loss of ac power (see Enable/Disable
Unattended Power-On Mode in the SYSTEM POWER CONTROL MENU). When ac power is restored,
the system returns to the power state at the time ac loss occurred. For example, if the system was
powered on when ac loss occurred, it reboots/restarts when power is restored. If the system was
powered off when ac loss occurred, it remains off when power is restored.
v Timed power-on - refer to the shutdown -t command on servers using AIX.
Working in conjunction with AIX, the service processor in your system can operate a timer, much like
the wake-up timer on your clock radio. You can set the timer so that your system powers on at a certain
time after shutting down. The timer is battery-operated, so power interruptions that occur while the
system is off do not affect its accuracy. Refer to the AIX shutdown -t command for details on setting
the timer.
Note: If an ac power loss is in progress when the timed power-on attempt occurs, the system cannot
power on when ac power is restored.
v Follow-up to a Failed Boot Attempt
The service processor initiates a power-on sequence if a failed boot attempt is detected (due to a
hardware or software failure).
v Fast or Slow Boot (IPL)
Using the service processor menus, you can select the IPL type, mode, and speed of your system.
Attention: Selecting fast IPL results in several diagnostic tests being skipped and a shorter memory
test being run.
330 Service Guide
Service Processor Reboot/Restart Recovery
Reboot describes bringing the system hardware back up; for example, from a system reset or power-on.
The boot process ends when control passes to the operating system process.
Restart describes activating the operating system after the system hardware is reinitialized. Restart must
follow a successful reboot.
Boot (IPL) Speed
When the system enters reboot recovery, slow IPL is automatically started, which gives the POST an
opportunity to locate and report any problems that might otherwise be unreported.
Failure During Boot Process
During the boot process, either initially after system power-on or upon reboot after a system failure, the
service processor monitors the boot progress. If progress stops, the service processor can reinitiate the
boot process (reboot) if enabled to do so. The service processor can re-attempt this process according to
the number of retries selected in the Reboot/Restart Policy Setup Menu.
Failure During Normal System Operation
When the boot process completes and control transfers to the operating system (OS), the service
processor can monitor operating system activity (see the Set Surveillance Parameters option in the
SERVICE PROCESSOR SETUP MENU). If OS activity stops due to a hardware- or software-induced
failure, the service processor can initiate a reboot/restart process based on the settings in the Service
Processor Reboot/Restart Policy Setup Menu and the OS automatic restart settings (see the operating
system documentation), if you are running in a full system partition. Operating system surveillance is
disabled in partitioned systems.
If you are using the AIX operating system, the menu item under SMIT for setting the restart policy is
Automatically Reboot After Crash. The default is false. When the setting is true, and if the service
processor parameter ²Use OS-Defined Restart Policy² is yes (the default), the service processor takes
over for AIX to reboot/restart after a hardware or surveillance failure.
Service Processor Reboot/Restart Policy Controls
The operating system’s automatic restart policy (see operating system documentation) indicates the
operating system response to a system crash. The service processor can be instructed to refer to that
policy by the Use OS-Defined Restart Policy setup menu.
If the operating system has no automatic restart policy, or if it is disabled, then the service
processor-restart policy can be controlled from the service processor menus. Use the Enable
Supplemental Restart Policy selection.
Use OS-Defined restart policy - The default setting is no. In a full-system partition, this causes the
service processor to refer to the OS Automatic Restart Policy setting and take action (the same action the
operating system would take if it could have responded to the problem causing the restart).
When this setting is no, or if the operating system did not set a policy, the service processor refers to
enable supplemental restart policy for its action.
Chapter 7. Using the Service Processor 331
Enable supplemental restart policy - The default setting is yes. When set to yes in a full system
partition, the service processor restarts the server when the operating system loses control and either:
v The Use OS-Defined restart policy is set to No.
OR
v The Use OS-Defined restart policy is set to Yes and the operating system has no automatic restart
policy.
If set to Yes on a full-system partition, the service processor restarts the system when the system loses
control and it is detected by service processor surveillance. Refer to “Service Processor Reboot/Restart
Recovery” on page 331.
The following table describes the relationship among the operating system and service processor restart
controls in a full system partition:
OS Automatic
reboot/restart
after crash setting
Service processor to use
OS-Defined
restart policy?
Service Processor Enable
supplemental restart
policy?
System response
None No1 No
None No1 Yes1 Restarts
None Yes No
None Yes Yes1 Restarts
False2 No1 No
False2 No1 Yes1 Restarts
False2 Yes No
False2 Yes Yes1
True No1 No
True No1 Yes1 Restarts
True Yes No Restarts
True Yes Yes1 Restarts
1 Service processor default
2 AIX default
Note: In a partitioned system, the service processor’s supplemental restart policy is the only setting that is
used.
System Firmware Updates
This section provides information and instructions for updating the system firmware. You may need to
perform these steps if you are installing an option or if your support representative has instructed you to
update your firmware.
332 Service Guide
If you cannot download from the Web, do the following:
v If the system cannot be powered on, but the service processor menus are available, see “Updating
System Firmware From the Service Processor Menus” on page 334.
v If the service processor programming has been corrupted, the service processor will automatically enter
recovery mode when power is applied to the system.
To check the level of firmware that is currently on the system, see “Determining the Level of Firmware on
the System” on page 334.
General Information on System Firmware Updates
All the system firmware types that can be reprogrammed are updated at the same time. They are:
v System power control network programming
v Service processor programming
v IPL programming
v Run-time abstraction services
Retain and store the latest firmware diskettes each time the firmware gets updated in the event that the
firmware becomes corrupted and must be reloaded.
Chapter 7. Using the Service Processor 333
Determining the Level of Firmware on the System
The firmware level is denoted by XXYYMMDD, where XX = model designation, YY = year, MM = month,
and DD = day of the release.
If your system is running AIX, the platform firmware level can be determined by either of the following
methods:
v On the AIX command line, by typing:
lscfg -vp|grep -p Platform
A line that begins with ROM level (alterable).. displays the firmware level that is currently on the
system.
v Looking at the top of the service processor main menu.
If the system is running Linux, the platform firmware level can be determined by either of the following
methods:
v On the Linux command line, type:
/usr/sbin/ibmras/nvram --print-vpd | grep RR
A line similar to Alterable ROM Level RR021114 displays the firmware level that is currently on the
system.
v Looking at the top of the service processor menu main menu.
Updating System Firmware From the Service Processor Menus
This procedure requires a diskette drive to be installed in the system. This procedure also requires a set of
firmware update diskettes in backup format.
The service processor menus are available while the system is powered off. As a privileged user, from the
service processor main menu, select Service Processor Setup, then select Reprogram Flash EPROM
Menu. The update process requests update diskettes as needed.
334 Service Guide
Updating System Firmware from a NIM Server
Refer to “Running Standalone Diagnostics from a Network Installation Management (NIM) Server with an
HMC Attached to the System” on page 302.
Recovery Mode
Contact your marketing representative for information about obtaining and installing system firmware
update packages.
Code Action
A1FD 0000 System firmware has been corrupted and must be reflashed.
A1FD 0001 Insert update diskette 1.
A1FD 0002 Insert update diskette 2.
A1FD 0003 Insert update diskette 3.
A1FD 000n Insert update diskette n.
Notes:
1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive after it has been read,
B1FD 001F is displayed, indicating that the wrong diskette is in the drive.
2. If B1FD 001A is displayed at any time during the process, the service processor must be reset by
activating the pinhole reset switch on the primary drawer operator panel.
Configuring and Deconfiguring Processors or Memory
All failures that crash the system with a machine check or check stop, even if intermittent, are reported as
a diagnostic callout for service repair. To prevent the recurrence of intermittent problems and improve the
availability of the system until a scheduled maintenance window, processors and memory DIMMs with a
failure history are marked ²bad² to prevent their being configured on subsequent boots. This function is
called repeat gard.
A processor or memory DIMM is marked ²bad² under the following circumstances:
v A processor or memory DIMM fails built-in self-test (BIST) or power-on self-test (POST) testing during
boot (as determined by the service processor).
v A processor or memory DIMM causes a machine check or check stop during runtime, and the failure
can be isolated specifically to that processor or memory DIMM (as determined by the processor runtime
diagnostics in the service processor).
v A processor or memory DIMM reaches a threshold of recovered failures that results in a predictive
callout (as determined by the processor run-time diagnostics in the service processor).
During boot time, the service processor does not configure processors or memory DIMMs that are marked
“bad.”
Chapter 7. Using the Service Processor 335
If a processor or memory DIMM is deconfigured, the processor or memory DIMM remains offline for
subsequent reboots until it is replaced or memory repeat gard is disabled. The repeat gard function also
provides the user with the option of manually deconfiguring a processor or memory DIMM, or re-enabling a
previously deconfigured processor or memory DIMM.
For information about configuring or deconfiguring a processor, see the Processor
Configuration/Deconfiguration Menu on page 318. For information on configuring or deconfiguring a
memory DIMM, see the Memory Configuration/Deconfiguration Menu on page 319. Both of these menus
are submenus under the System Information Menu. You can enable or disable CPU Repeat Gard or
Memory Repeat Gard using the Processor Configuration/Deconfiguration Menu.
Run-Time CPU Deconfiguration (CPU Repeat Gard)
L1 instruction cache recoverable errors, L1 data cache correctable errors, and L2 cache correctable errors
are monitored by the processor runtime diagnostics (PRD) code running in the service processor. When a
predefined error threshold is met, an error log with warning severity and threshold exceeded status is
returned to AIX. At the same time, PRD marks the CPU for deconfiguration at the next boot. AIX will
attempt to migrate all resources associated with that processor to another processor and then stop the
defective processor.
Service Processor System Monitoring - Surveillance
Surveillance is a function in which the service processor monitors the system, and the system monitors the
service processor. This monitoring is accomplished by periodic samplings called heartbeats.
Surveillance is available during the following phases:
v System firmware bringup (automatic)
v Operating system runtime (optional)
Note: Operating system surveillance is disabled in partitioned systems.
System Firmware Surveillance
System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by
the user, and the surveillance interval and surveillance delay cannot be changed by the user.
If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the
system power to attempt a reboot. The maximum number of retries is set from the service processor
menus. If the fail condition persists, the service processor leaves the machine powered on, logs an error,
and displays menus to the user. If Call-out is enabled, the service processor calls to report the failure and
displays the operating-system surveillance failure code on the operator panel.
Operating System Surveillance
Note: Operating system surveillance is disabled on partitioned systems.
Operating system surveillance provides the service processor with a means to detect hang conditions, as
well as hardware or software failures, while the operating system is running. It also provides the operating
system with a means to detect a service processor failure caused by the lack of a return heartbeat.
Operating system surveillance is not enabled by default, allowing you to run operating systems that do not
support this service processor option.
336 Service Guide
For operating system surveillance to work correctly, you must set these parameters:
v Surveillance enable/disable
v Surveillance interval
The maximum time the service processor should wait for a heartbeat from the operating system before
timeout.
v Surveillance delay
The length of time to wait from the time the operating system is started to when the first heartbeat is
expected.
Surveillance does not take effect until the next time the operating system is started after the parameters
have been set.
If desired, you can initiate surveillance mode immediately from service aids. In addition to the previously
discussed options, another option allows you to select immediate surveillance, and rebooting of the system
is not necessarily required.
If operating system surveillance is enabled (and system firmware has passed control to the operating
system), and the service processor does not detect any heartbeats from the operating system, the service
processor assumes the system is hung and takes action according to the reboot/restart policy settings.
See “Service Processor Reboot/Restart Recovery” on page 331.
If surveillance is selected from the service processor menus that are only available at bootup, surveillance
is enabled by default as soon as the system boots. From service aids, the selection is optional.
Call-Out (Call-Home)
Note: Call-out is disabled on partitioned systems. The call-out function is handled by the Service Focal
Point application on the Hardware Management Console (HMC).
The service processor can call out (call-home) when it detects one of the following conditions:
v System firmware surveillance failure
v Operating system surveillance failure (if supported by operating system)
v Restarts
v Critical hardware failure
v Abnormal operating system termination
To enable the call-out feature, do the following:
1. Connect a modem to any serial port.
2. Set up the following using the service processor menus or diagnostic service aids:
v Enable call-out for the serial port where the modem is connected.
v Enter the modem configuration file name.
v Set up site-specific parameters (phone numbers for call-out, call-out policy, number of call-out
retries, and so on).
Chapter 7. Using the Service Processor 337
3. To call out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy
Setup menu.
Note: Some modems, such as IBM 7857-017, are not designed for the paging function. Although they can
be used for paging, they will return an error message when they do not get the expected response
from another modem. Therefore, even though the paging was successful, the error message will
cause the service processor to retry, continuing to place pager calls for the number of retries
specified in the call-out policy setup menu. These retries result in redundant pages.
Console Mirroring
Console mirroring allows a user on one serial port to monitor the service processor activities on another
serial port. This can be done on the locally attached HMC virtual terminal window or windows remotely
attached through modems. Console mirroring is supported on any combination of locally or remotely
attached monitors.
The serial port from which console mirroring is enabled is called the primary port. The mirror port is
determined when keyboard input is detected from one of the other two serial ports. From this point
forward, the service processor sends information to the active port and the mirror port only. Console
mirroring ends when the service processor releases control of the serial ports to the system firmware.
Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial port 3 (S3). Remote
attachment through modem is supported on serial port 1 (S1) and serial port 2 (S2).
System Configuration
The following describes the configuration for console mirroring:
v Service processor
v Modem connected to one serial port and enabled for incoming calls
v Local HMC virtual terminal window connected to the other serial port. This local terminal can be
connected directly to your system or connected through another modem.
Console mirroring can be started by either of the following methods:
v Remote session first, then local session added:
1. Remote session is already in progress.
2. Remote user uses service processor menus to enable console mirroring, allowing both consoles to
be active.
v Local session first, then remote session added:
1. Local session is already in progress.
2. The service processor receives a call from the remote user.
3. The local user selects the option to enable console mirroring. The service processor immediately
begins mirroring service processor menus.
338 Service Guide
Service Processor Error Logs
The service processor error logs, an example of which follows, contain error conditions detected by the
service processor.
Error Log
1. 11/30/99 19:41:56 Service Processor Firmware Failure
B1004999
Enter error number for more details.
Press Return to continue, or ’x’ to return to menu.
Press "C" to clear error log, any other key to continue. >
Note: The time stamp in this error log is coordinated universal time (UTC), which is also referred to as
Greenwich mean time (GMT). Operating system error logs have additional information available and
can time stamp with local time.
Entering an error number provides nine words of system reference code (SRC) data; an example menu
follows.
Detail: 6005
SRC
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
word11:B1004999 word12:0110005D word13:00000000
word14:00000000 word15:00001111 word16:00000000
word17:B1004AAA word18:0114005D word19:A4F1E909
B1004999
Press Return to continue, or ’x’ to return to menu.
If Return is pressed, the contents of NVRAM will be dumped 320 bytes at a time, starting at address 0000.
Chapter 7. Using the Service Processor 339
LCD Progress Indicator Log
The following is an example of the LCD progress indicator log. It shows the types of entries that can
appear in the log, and is for example purposes only.
The progress indicator codes are listed from top (latest) to bottom (oldest).
LCD Progress Indicator Log
B0FF
0539..17
0538..17
0539..17
0538..17
0539..17
0581
0538..17
0539..12
0538..12
0539..
0821..01-K1-00
0539..
0728..01-R1-00-00
0539..
0664..40-60-00-1,0
0539..
0777..U0.1-P2-I1/E1
0539..
0742..U0.1-P2-I2/E1
0539..
0776..U0.1-P2-I3/T1
E139
E1FB
E139
E183
Press Return to continue, or ’x’ to return to menu. >
EAA1..U0.1-P1-I4
E172..U0.1-P1
E172..U0.1-P1-I4
E172..U0.1-P1
94BB
9109
9380
9108
9107
9106
9105
9118
9104
9103
9102
90FD
340 Service Guide
Service Processor Operational Phases
This section provides a high-level flow of the phases of the service processor.
Pre-Standby Phase
This phase is entered when the system is connected to a power source. The system may or may not be
fully powered on. This phase is exited when the power-on self-tests (POSTs) and configuration tasks are
completed.
The pre-standby phase components are:
v Service Processor Initialization - Service processor performs any necessary hardware and software
initialization.
v Service Processor POST - Service processor conducts Power-on self-tests on its various work and code
areas.
v Service Processor Unattended Start Mode Checks - To assist fault recovery. If unattended start mode is
set, the service processor automatically reboots the system. The service processor does not wait for
user input or power-on command, but moves through the phase and into the bring-up phase. Access
the SMS menus or the service processor menus to reset the unattended start mode.
Standby Phase
The standby phase can be reached in either of the following ways:
v With the system off and power connected (the normal path), recognized by OK in the LCD display.
OR
v With the system on after an operating system fault, recognized by an 8-digit code in the LCD display.
In the standby phase, the service processor takes care of some automatic duties and is available for
menus operation. The service processor remains in the standby phase until a power-on request is
detected.
Chapter 7. Using the Service Processor 341
The standby phase components are as follows:
v Modem Configuration
The service processor configures the modem (if installed) so that incoming calls can be received, or
outgoing calls can be placed.
v Dial In
Monitor incoming phone line to answer calls, prompt for a password, verify the password, and remotely
display the standby menu. The remote session can be mirrored on the local HMC virtual terminal
window if the system is so equipped and if the user enables this function.
v Menus
The service processor menus are password-protected. Before you can access them, you need either
the general user-password or privileged-user password.
Service processor menus are available on ASCII terminals attached to the serial ports on the system,
and on terminal emulators. On HMC-managed systems, service processor menus are also available on
the HMC interface.
Bring-Up Phase
On a system that is powered-on to full system partition, this phase is entered upon power-on, and exited
upon loading of the operating system.
On a system that is powered-on to partition standby, this phase is entered upon power-on, and exited
when the partition manager and hypervisor have been loaded and become fully operational in system
memory. The end of this phase is indicated by LPAR on the operator panel. At this point, the server can
start multiple logical partitions that would be activated through the HMC interface.
The bring-up phase components are as follows:
v Retry Request Check
The service processor checks to see if the previous boot attempt failed. If the specified number of
failures are detected, the service processor displays an error code and places an outgoing call to notify
an external party if the user has enabled this option.
v Dial Out
The service processor can dial a preprogrammed telephone number in the event of an IPL failure. The
service processor issues an error report with the last reported IPL status indicated and any other
available error information.
v Update Operator Panel
The service processor displays operator panel data on the HMC virtual terminal window if a remote
connection is active.
v Environmental Monitoring
The service processor provides expanded error recording and reporting.
v System Firmware Surveillance (Heartbeat Monitoring)
The service processor monitors and times the interval between system firmware heartbeats.
v Responding to System Processor Commands
The service processor responds to any command issued by the system processor.
342 Service Guide
Runtime Phase
This phase includes the tasks that the service processor performs during steady-state execution of the
operating system.
v Environmental Monitoring
The service processor monitors voltages, temperatures, and fan speeds (on some servers).
v Responding to System Processor Commands
The service processor responds to any command issued by the system processor.
v Run-Time Surveillance (not supported on partitioned systems)
If the device driver is installed and surveillance enabled, the service processor monitors the system
heartbeat. If the heartbeat times out, the service processor places an outgoing call. This is different from
the bring-up phase scenario, where the specified number of reboot attempts are made before placing an
outgoing call.
v HMC surveillance
On an HMC-managed system, the service processor monitors the communication link between the
managed system and the HMC. If the service processor detects that this communication link has been
broken, it will post an error to the operating system running on the managed system.
Chapter 7. Using the Service Processor 343
344 Service Guide
Chapter 8. Using System Management Services
Use the system management services menus to view information about your system or partition, and to
perform tasks such as setting a password, changing the boot list, and setting the network parameters.
Notes:
1. On some of the system management services (or service processor) screens, you will see the term
LPAR. LPAR is equivalent to the term logically partitioned system or partitioned system.
2. In a partitioned system, only those devices that are assigned to the partition that is being booted
display in the SMS menus. In a full system partition, all devices in the system display in the SMS
menus.
To start the system management services, do the following:
1. For a partitioned system, use the Hardware Management Console (HMC) to restart the partition.
For a full system partition, restart the system.
2. For a partitioned system, watch the virtual terminal window on the HMC.
For a full system partition, watch the firmware console.
3. Look for the POST indicators memory, keyboard, network, scsi, speaker, which appear across the
bottom of the screen. Press the numeric 1 key after the word keyboard appears, and before the word
speaker appears.
For more information on the POST indicator words, refer to “POST Indicators” on page 7.
After the system management services starts, the following screen displays:
Main Menu
1 Select Language
2 Change Password Options NOT available in LPAR mode
3 View Error Log
4 Setup Remote IPL (Initial Program Load)
5 Change SCSI Settings
6 Select Console NOT available in LPAR mode
7 Select Boot Options
--------------------------------------------------------------------------------------------------
Navigation keys:
X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Note: The system management services can also be started using the Service Processor Boot Mode
Menu. See page 316.
345
On all menus except the Main Menu, there are several navigation keys:
M Return to the main menu.
ESC Return to the previous menu.
X Exit the system management services and start the operating system.
If X is entered, you are asked to confirm your choice to exit the SMS menus and start the
operating system.
When there is more than one page of information to display, there are two additional navigation keys:
N Display the next page of the list.
P Display the previous page of the list.
Note: The lowercase navigation key has the same effect as the uppercase key that is shown on the
screen. For example, m or M returns you to the main menu.
On each menu screen, you are given the option of choosing a menu item and pressing Enter (if
applicable), or selecting a navigation key.
Select Language
Note: Your TTY must support the ISO-8859 character set to properly display languages other than
English.
This option allows you to change the language used by the text-based System Management Services
menus.
SELECT LANGUAGE
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
346 Service Guide
Change Password Options
The Change Password Options menu enables you to select from password utilities.
Note: This option is not available in partitioned systems.
Password Utilities
1 Set Privileged-Access Password
2 Remove Privileged-Access Password
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
The privileged-access password protects against the unauthorized starting of the system programs.
Note: If the privileged-access password has been enabled, you are asked for the privileged-access
password at startup every time you boot your system.
If you previously had set a privileged-access password and want to remove it, select Remove
Privileged-Access Password.
View Error Log
Use this option to view or clear your system’s error log. A menu similar to the following displays when you
select this option.
Error Log
Date Time Error Code Location
Entry 1. 01/04/96 12:13:22 25A80011 00-00
Entry 2. no error logged
1. Clear error log
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Note: The time stamp in his error log is coordinated universal time (UTC), which is also referred to as
Greenwich mean time (GMT). The operating system error logs have more information available and
can time stamp with your local time.
Chapter 8. Using System Management Services 347
Setup Remote IPL (Initial Program Load)
This option allows you to enable and set up the remote startup capability of your system or partition. You
must first specify the network parameters.
Network Parameters
1. IP Parameters
2. Adapter Parameters
3. Ping Test
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Selecting IP Parameters displays the following menu.
IP Parameters
1. Client IP Address [000.000.000.000]
2. Server IP Address [000.000.000.000]
3. Gateway IP Address [000.000.000.000]
4. Subnet Mask [255.255.255.000]
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
To change IP (Internet Protocol) parameters, type the number of the parameters for which you want to
change the value.
Attention: If the client system and the server are on the same subnet, set the gateway IP address to
[0.0.0.0].
348 Service Guide
Selecting Adapter Parameters allows you to view an adapter’s hardware address, as well as configure
network adapters that require setup. A menu similar to the following displays.
Attention: In a partitioned system, only those network adapters that have been assigned to the partition
being booted display in the Adapters Parameters menu. In a full system partition, all network adapters in
the system are listed in the adapter parameters menu.
Device Slot Hardware Address
1. 10/100 Mbps Ethernet PCI Adapter II 4:U0.1-P1-I4/E1 000629aca72d
2. 10/100 Ethernet Adapter Integrated:U0.1-P1/E1 0020357A0530
3. High Speed Token-Ring Adapter 1:U0.1-P1-I1/T1 000629be04e1
4. Gigabit Ethernet PCI Adapter 3:U0.1-P1-I3/E1 0004ac7c9ec7
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Entering adapter parameters on these screens will automatically update the parameters on the ping test
screen.
Selecting an adapter on this menu displays configuration options for that adapter. The current setting is
indicated by <===.
Adapter Parameters
10/100 Mbps Ethernet PCI Adapter II
1. ethernet, 10, rj45, full
2. ethernet, 10, rj45, half
3. ethernet, 100, rj45, full
4. ethernet, 100, rj45, half
5. ethernet, auto, rj45, auto <===
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
To change the adapter parameters, select an option from the list.
Chapter 8. Using System Management Services 349
To test a connection to a remote system unit, select Ping Test from the Network Parameters Menu. After
selecting the Ping Test option, you must select which adapter communicates with the remote system.
Adapter Parameters
Device Slot Hardware Address
1. 10/100 Mbps Ethernet PCI Adapter II 4:U0.1-P1-I4/E1 000629aca72d
2. 10/100 Ethernet Adapter Integrated:U0.1-P1/E1 0020357A0530
3. High Speed Token-Ring Adapter 1:U0.1-P1-I1/T1 000629be04e1
4. Gigabit Ethernet PCI Adapter 3:U0.1-P1-I3/E1 0004ac7c9ec7
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
After selecting an adapter, you will be asked to set the parameters for the adapter; a menu similar to the
following displays:
Adapter Parameters
10/100 Mbps Ethernet PCI Adapter II
1. ethernet, 10, rj45, full
2. ethernet, 10, rj45, half
3. ethernet, 100, rj45, full
4. ethernet, 100, rj45, half <===
5. ethernet, auto, rj45, auto
6. Execute Ping Test
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
350 Service Guide
After selecting which adapter and its parameters to use to ping the remote system, you must provide the
addresses needed to communicate with the remote system.
Ping Test
1. Client IP Address [129.132.4.20]
2. Server IP Address [129.132.4.10]
3. Gateway IP Address [129.132.4.30]
4. Subnet Mask [255.255.255.0]
5. Execute Ping Test
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Notes:
1. After the ping test is initiated, it may take up to 60 seconds to return a result.
2. If the ping test passes or fails, the firmware will stop and wait for a key to be pressed before
continuing.
Change SCSI Settings
This option allows you to view and change the addresses of the SCSI controllers attached to your system.
SCSI Utilities
1. Hardware Spin Up Delay
2. Change SCSI Id
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Select Console
Note: This option is not available on partitioned systems.
The Select Console utility allows the user to select which console to use to display the SMS menus. This
selection is only for the SMS menus and does not affect the display used by the operating system.
Follow the instructions that display on the screen. The firmware automatically returns to the SMS main
menu.
Chapter 8. Using System Management Services 351
Select Boot Options
Use this menu to view and set various options regarding the installation devices and boot devices.
1. Select Install or Boot a Device
2. Select Boot Devices
3. Multiboot Startup
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Select Install or Boot a Device
Allows you to select a device from which to boot or install the operating system. This option is for
the current boot only.
Select Boot Devices
Allows you to set the boot list.
Multiboot Startup
Toggles the multiboot startup flag, which controls whether the multiboot menu is invoked
automatically on startup.
If Select Install or Boot a Device is selected, the following menu displays:
Select Device Type
1. Diskette
2. Tape
3. CD/DVD
4. IDE
5. Hard Drive
6. Network
7. None
8. List All Devices
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
If a device is selected that is not in the system, a menu with the following message displays:
.---------------------------------------------------------.
| THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM ! |
| Press any key to continue. |
Á---------------------------------------------------------’
352 Service Guide
If Hard Drive is selected, the following menu displays:
Select Hard Drive Type
1. SCSI
2. SSA
3. SAN
4. None
5. List All Devices
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
If List All Devices is selected, a menu similar to the following displays, depending on the devices that are
installed in the system:
Select Device
Device Current Device
Number Position Name
1. 1 SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0)
2. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A9,0)
3. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-Aa,0)
4. None
5. List all devices
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
The appropriate device can then be selected for this boot or installation.
When a device is selected for installing the operating system, or to boot from, the Select Task menu allows
you to get more information about the device, or to boot from that device in normal mode or service mode.
The following is an example of this menu.
Select Task
SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0)
1. Information
2. Normal Mode Boot
3. Service Mode Boot
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
If either Normal Mode Boot or Service Mode Boot is selected, the next screen will ask, Are you sure?. If
you answer yes, the device will be booted in the appropriate mode. If you answer no, the firmware will
return to the Select Task menu.
Chapter 8. Using System Management Services 353
Select Boot Devices
Attention: In a partitioned system, only those devices from which an operating system can be booted
that are assigned to the partition that is being booted display on the select boot devices menu. In a full
system partition, devices from which an operating system can be booted display on the Select Boot
Devices menu.
Select this option to view and change the customized boot list, which is the sequence of devices read at
startup.
Configure Boot Device Order
1. Select 1st Boot Device
2. Select 2nd Boot Device
3. Select 3rd Boot Device
4. Select 4th Boot Device
5. Select 5th Boot Device
6. Display Current Setting
7. Restore Default Setting
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
When any of the options 1-5 is selected, the Select Device Type screen will be displayed, which is similar
to the following:
Select Device Type
1. Diskette
2. Tape
3. CD/DVD
4. IDE
5. Hard Drive
6. Network
7. None
8. List All Devices
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
When a device type is selected, such as item 5, a Select Task menu is displayed. The following is an
example of that menu for a hard disk.
354 Service Guide
Select Task
SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0)
1. Information
2. Set Boot Sequence: Configure as 1st Boot Device
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Selecting Information displays a menu similar to the following for a hard disk.
Device Information
/pci@3fffdf0a000/pci@2,4/scsi@1/sd@8,0
: (Integrated) (Bootable)
DEVICE : SCSI 18200 MB Harddisk ( loc=U0.1-P2/Z1-A8,0)
NAME : sd
DEVICE-TYPE : block
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
The Set Boot Sequence option allows you to set the location of the device in the boot list.
Display Current Settings
This option displays the current setting of the customized boot list. An example of this menu, with one
device in the boot list, follows.
Current Boot Sequence
1. SCSI 18200 MB Harddisk (loc=U0.1-P1/Z1-A8,0)
2. None
3. None
4. None
5. None
--------------------------------------------------------------------------------------------------
Navigation keys:
M = return to main menu
ESC key = return to previous screen X = eXit System Management Services
--------------------------------------------------------------------------------------------------
Type the number of the menu item and press Enter or Select a Navigation key: _
Chapter 8. Using System Management Services 355
Restore Default Settings
This option restores the boot list to the default boot list. The default boot list will vary depending on the
devices that are installed in the system.
The default boot list is as follows:
1. Primary diskette drive (if installed)
2. CD-ROM drive (if installed)
3. Tape drive (if installed)
4. Hard disk drive (if installed)
5. Network adapter
Multiboot Startup
Multiboot Startup toggles the multiboot startup flag, which controls whether the multiboot menu is invoked
automatically on startup.
Exiting System Management Services
After you have finished using the system management services, type x (for exit) to boot your system or
partition.
356 Service Guide
Chapter 9. Removal and Replacement Procedures
Notes:
1. Servicing of the 286/E4 can be performed with the system in either the vertical or horizontal position.
Servicing of the 286/C4 is performed with the system in the rack and placed into the service position.
The service position for the 286/C4 is described in “Placing the 286/C4 into the Service Position” on
page 364.
2. Before performing any of the removal or replacement procedures in this chapter, read the following
danger and caution notices.
In the system you are about to set up or service:
v The ac power interface connector is considered the main power disconnect device.
v This system has redundant power supply capabilities, meaning that it has the ability to have two power
supplies running simultaneously in the same system unit. When instructed to disconnect the power
source, ensure that all power cables have been unplugged.
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for the system unit
and all attached devices are unplugged.
When adding or removing any additional devices to or from the system, ensure that the power
cables for those devices are unplugged before the signal cables are connected. If possible,
disconnect all power cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communications lines.
D05
CAUTION:
This product is equipped with a three-wire power cable and plug for the user’s safety. Use this
power cable with a properly grounded electrical outlet to avoid electrical shock.
C01
CAUTION:
This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect
two power supply cords before servicing.
C21
357
Procedure List
Procedure Procedure Name and Page Location
Adapters “PCI Adapters” on page 379
Battery Replacement “Replacing the Battery” on page 432
Bezels “Bezels” on page 377
Bezel Door “Bezel Door (286/E4)” on page 376
Cable Management Arm “Installing the Cable-Management Arm” on page 501
CD-ROM Drive “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401
CEC Backplane “Central Electronics Complex (CEC) Backplane” on page 443
Covers “Covers” on page 366
Disk Drive “Hot-Plug Disk Drives” on page 405
Disk Drive Backplane “Disk Drive Backplane” on page 415
Disk Drive Configuration or
Deconfiguration
“Deconfiguring (Removing) or Configuring a Disk Drive” on page 405
Diskette Drive “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401
Fans “Fans” on page 395
HMC Appendix E, “Installing a Hardware Management Console (HMC)”, on page 503
I/O Card (Refer to PCI Riser Card)
Media Drives “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401
Memory DIMMs “Memory DIMMs” on page 427
Operating Position “Returning the 286/C4 to the Operating Position” on page 365
Operator Panel “Operator Panel” on page 419
PCI Riser Card “PCI Riser Card” on page 409
Power Supply “Power Supplies” on page 436
Processor Card “Processor Card” on page 423
Rack Installation “286/C4 Removal and Installation into a Rack” on page 496
Release Latch and Brackets “286/C4 Drawer-Release Latch and Brackets” on page 362
Service Position “Placing the 286/C4 into the Service Position” on page 364
Starting the System “Starting the System without an HMC Attached” on page 360
Static-Sensitive Devices “Handling Static-Sensitive Devices” on page 359
Stopping the System “Stopping the System without an HMC Attached” on page 359
System Board (Refer to CEC Backplane)
Rack Hardware “Attaching the Mounting Hardware to the Rack Enclosure” on page 485
VPD Update “System Vital Product Data (VPD) Update Procedure” on page 421
358 Service Guide
Handling Static-Sensitive Devices
Attention: Electronic boards, diskette drives, and disk drives are sensitive to static electricity discharge.
These devices are wrapped in antistatic bags to prevent this damage.
Take the following precautions:
v If you have an antistatic wrist strap available, use it while handling the device.
v Do not remove the device from the antistatic bag until you are ready to install the device in the system.
v With the device still in its antistatic bag, touch it to a metal frame of the system.
v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or
pins.
v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag. Before
picking it up again, touch the antistatic bag and the metal frame of the system at the same time.
v Handle the devices carefully to prevent permanent damage.
Stopping the System without an HMC Attached
Attention: When shutting down your system to install options, shut down all applications first and then
shut down the operating system. The system power turns off and the system goes into standby mode
when the operating system is shut down. Before removing power from the system, ensure that the
shutdown process is complete. Failure to do so can result in the loss of data. Some option-installation
procedures do not require the system to be stopped for installation. The option-installation procedures in
this chapter will direct you here if stopping the system is required.
1. Log in to the system as root user.
2. Have your system administrator stop all applications that are running on the system.
3. At the command line, ask the administrator to type one of the following commands:
v If your system is running AIX, type shutdown
v If your system is running Linux, type shutdown now -h
4. After you shut down the operating system, set the power switches of any attached devices to Off.
Stopping the System with an HMC Attached and AIX Installed
The HMC user interface provides a power-off function to turn off the power to the system. If the system is
logically partitioned, AIX in each partition may be stopped individually to make repairs. If the system power
must be turned off, all of the logical partitions must be stopped.
AIX provides hot-swap procedures for adapters and devices that support hot-swap removal and
installation. You do not have to shut down a partition that controls an adapter or device that supports
hot-swap. Individual power components and fans can be serviced with the power on for power systems
equipped with redundant power and cooling. Before servicing the system, check removal and installation
procedures.
If the system is operating in a full system partition under AIX, typing the shutdown command causes the
system to shut down and power off. The -F flag shuts down the system without warning system users.
Check with the system administrator before using this command. If you cannot use this method, you can
power off the system by pressing the power button on the operator panel or typing the appropriate
command at the HMC.
Attention: Using the power button on the operator panel or commands at the HMC to power off the
system can cause unpredictable results in the data files. The next IPL will also take longer to complete if
all applications in all partitions are not stopped beforehand.
Chapter 9. Removal and Replacement Procedures 359
Stopping the System with an HMC Attached and Linux Installed
The HMC user interface provides a power-off function to turn off the power to the system. If the system is
logically partitioned, Linux in each partition may be stopped individually to make repairs. If the system
power must be turned off, all of the logical partitions must be stopped.
Linux does not provide hot-swap procedures for adapter and device removal and installation. You must
shut down all partitions and the system power must be turned off to service an adapter or device.
Individual power components and fans may be serviced with power on for power systems equipped with
redundant power and cooling. Before servicing this system, check removal and installation procedures.
If the system is operating in a full system partition under Linux, typing the shutdown now -h command
causes the system to shut down and power off. The -F flag shuts down the system without warning
system users. Check with the system administrator before using this command. If you cannot use this
method, you can power off the system by pressing the power button on the operator panel or typing the
appropriate command at the HMC.
Attention: Using the power-on button on the operator panel or commands at the HMC to power off the
system can cause unpredictable results in the data files. The next IPL will also take longer to complete if
all applications in all partitions are not stopped beforehand.
Starting the System without an HMC Attached
Power on the system only after all of the following steps are completed:
v All I/O drawer cables are connected
v Base system cables are connected
v All PCI cables to supported subsystems are connected
v The hardware management console (HMC) is connected (if required)
v Power is connected to the system
To power on the system, do the following:
1. Open any system or rack doors necessary so that you can view the operator panel.
2. Connect the power source to the system unit.
Before you press the power-on button on your operator panel, observe the following:
v The power LED is slowly blinking.
v An OK prompt is visible in the operator panel display.
3. Press the power-on button on the operator panel, and observe the following:
a. The power LED begins to visibly blink faster.
b. The system cooling fans are activated and begin to accelerate to operating speed.
Note: There is approximately a 30-second transition period between the time the power button is
pressed and the power LED remains on solid (no longer blinking).
360 Service Guide
c. The power LED stays on solid and progress indicators, also referred to as checkpoints, are visible
on the operator panel display.
1 Power-On Button 6 Operator Panel Display
2 Power LED 7 (FS1) Front Serial Connector (RJ-48 Connector)
3 Attention LED 8 Service Processor Reset Switch (Pinhole)
4 SCSI Port Activity 9 System Reset Button
5 Ethernet Port Activity
Starting the System with an HMC Attached
Power on the system only after all of the following steps are completed:
v All I/O drawer cables are connected
v Base system cables are connected
v All PCI cables to supported subsystems are connected
v The hardware management console (HMC) is connected (if required)
v Power is connected to the system
After the required cables are installed and the power cables are connected, the HMC user interface
provides a power-on function to turn on the power to the system. If a system operator panel is available,
the power-on button on the operator panel can be pushed to initialize the system but the preferred method
is to use the HMC if you are booting logical partitions. Progress indicators, also referred to as checkpoints,
are visible on the operator panel display (if available) as the system power is turned on. The power LED
on the base system stops blinking and stays on, indicating the system power is on.
The base system and I/O subsystems are powered on through the system power control network (SPCN).
When power is applied, the power LEDs on the base system go from blinking to continuous, and the
power LEDs on the I/O subsystem(s) come on and stay on, which indicates that power levels are
satisfactory in the subsystems.
After the system is powered on, all logical partitions will be in a standby or powered on state. Each logical
partition must then be booted through the HMC. The state of all logical partitions that have been booted
will be running.
Chapter 9. Removal and Replacement Procedures 361
286/C4 Drawer-Release Latch and Brackets
The following procedure covers the removal and replacement of the system’s drawer-release latch.
Drawer-Release Latch Removal
To remove one of the 286/C4 drawer-release latches, do the following:
1. Open the front rack door and place the system into the service position as described in “Placing the
286/C4 into the Service Position” on page 364.
2. Remove the retaining screw located on the side of the release latch.
3. Pull the release latch off the latch bracket post.
1 System Drawer-Release Latch 5 System Drawer-Release Latch
2 System Drawer-Release Latch
Bracket (Right)
6 Retaining Screws
3 286/C4 7 Release Latch Retaining Screw
4 System Drawer-Release Latch
Bracket (Left)
8 Latch Bracket Post
362 Service Guide
Drawer-Release Latch Replacement
To replace one of the 286/C4 drawer-release latches, do the following:
1. If you have not already done so, open the front rack door and place the system into the service
position as described in “Placing the 286/C4 into the Service Position” on page 364.
2. Push the release latch onto the latch bracket post.
3. Secure the drawer-release latch onto the latch bracket post with a release latch retaining screw.
4. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the
Operating Position” on page 365.
Drawer-Release Latch Bracket Removal
To remove the 286/C4 drawer-release latches brackets, do the following:
1. Open the front rack door and place the system into the service position as described in “Placing the
286/C4 into the Service Position” on page 364.
2. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377.
3. Remove the release latch as described in “Drawer-Release Latch Removal” on page 362.
4. Remove the retaining screws that secure the latch bracket to the side of the 286/C4.
1 System Drawer-Release Latch 5 System Drawer-Release Latch
2 System Drawer-Release Latch
Bracket (Right)
6 Retaining Screws
3 286/C4 7 Release Latch Retaining Screw
4 System Drawer-Release Latch
Bracket (Left)
8 Latch Bracket Post
Chapter 9. Removal and Replacement Procedures 363
Drawer-Release Latch Bracket Replacement
To replace the 286/C4 drawer-release latches brackets, do the following:
1. If you have not already done so, open the front rack door and place the system into the service
position as described in “Placing the 286/C4 into the Service Position”.
2. Using the supplied retaining screws, secure the latch brackets to the side of the 286/C4.
Note: The latch brackets are right-side and left-side dependent.
3. Replace the release latch as described in “Drawer-Release Latch Replacement” on page 363.
4. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377.
5. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the
Operating Position” on page 365.
Placing the 286/C4 into the Service Position
Attention: When placing the 286/C4 into the service position, it is essential that all stability plates must
be firmly in position to prevent the rack from toppling. Ensure that only one system drawer at a time is in
the service position.
Before doing any service actions inside the 286/C4, you must put the system into the service position. To
place the system into the service position, do the following:
1. Open the front rack door.
2. If your system is equipped with two thumbscrews securing it to the rack, remove the thumbscrews. The
screws are located on the bezel, just above each system-release latch.
1 286/C4 Front Bezel
2 M5 16-mm Retaining Screw
3. Release the system-release latches on both the left and right sides.
4. Pull the system drawer out from the rack until the rails are fully extended.
Note: When the system rails are fully extended, safety latches on the slide rails lock into place. This
action prevents the system from being accidentally pulled out too far and dropped. The following
illustration shows a system drawer in the service position.
364 Service Guide
1 286/C4 4 Stabilizing Bar
2 19-Inch Rack 5 Front Rack Door
3 Extension Rail Safety Release Tab 6 System Drawer-Release Latch
After completing the service actions, return the system to the operating position.
Returning the 286/C4 to the Operating Position
To return the 286/C4 to the operating position, do the following:
1. Press in on both spring-loaded release tabs, located on each rail. See the illustration on page 365 for
the location of the release tabs.
2. Ensure the system drawer-release latches, located on the system drawer, are in the open position. See
the illustration on page 365 for the location of the system drawer-release latches.
3. Push the system drawer straight back into the rack until both system drawer-release latches have
locked into position.
4. If you removed two thumbscrews from the bezel front, replace them at this time.
5. Close the front rack door.
Chapter 9. Removal and Replacement Procedures 365
Covers
Before performing the following procedures, read the “Safety Notices” on page xi.
Service Access Cover Removal (286/C4)
To remove the service access cover, do the following:
1. Ensure that the rack has been correctly stabilized before opening the front and rear rack doors.
2. Open the front rack door and place the 286/C4 into the service position as described in “Placing the
286/C4 into the Service Position” on page 364.
3. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration
for thumbscrew locations.
4. Placing both hands on each side of the cover, slide the cover toward the rear of the system drawer.
After the front of the service access cover has cleared the upper chassis ledge, lift the cover up and
off the system drawer.
Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the
system for extended periods of time (over 30 minutes) with the cover removed might damage the system
components.
1 Thumbscrews 4 Access Cover Locking Tab
2 Service Access Cover 5 Upper Chassis Ledge
3 286/C4
Service Access Cover Replacement (286/C4)
To replace the service access cover, do the following:
1. Align the service access cover with the top of the system, about 25 mm (1 inch) from the front of the
system. The flanges on the left and right sides of the cover should be on the outside of the system
chassis.
2. Hold the service access cover against the system drawer, slide it toward the front of the system. The
front edge of the service access cover slides beneath the upper chassis ledge.
3. Tighten the three thumbscrews located on the rear of the cover.
366 Service Guide
Top Cap Removal (286/E4)
To remove the top cap, do the following:
1. Remove the bezel door as described in “Bezel Door Removal” on page 376.
2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378.
3. From the front of the top cap, grasp the two lower side edges and pry out to release the lower front
tabs.
4. Pivot the cover up to release the two back tabs.
5. Lift the top cap up and off the system.
1 Top Cap 4 Bezel Door
2 Top Cap Rear Locking Tabs 5 Bezel
3 Top Cap Front Locking Tabs 6 286/E4
Top Cap Replacement (286/E4)
To replace the top cap, do the following:
1. Align, and then insert the two tabs located on the back-bottom edge of the top cap with the two mating
slots located on the top cover.
2. Rotate the cover down, snapping the two lower front tabs into place on the operator panel bracket.
3. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378.
4. Replace the bezel door as described in “Bezel Door Replacement” on page 376.
Chapter 9. Removal and Replacement Procedures 367
Top Cover Removal (286/E4)
To remove the top cover, do the following:
1. Remove the bezel door as described in “Bezel Door Removal” on page 376.
2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378.
3. Remove the top cap as described in “Top Cap Removal (286/E4)” on page 367.
4. Remove the operator panel as described in “Operator Panel Removal” on page 419.
5. Remove the two retaining screws that secure the operator-panel EMC bracket to the operator-panel
bracket.
6. Remove the four retaining screws that secure the operator-panel bracket to the top cover.
7. Remove the four retaining screws that secure the top cover to the system chassis.
8. Remove the front screw that secures the top cover to the right panel assembly. This screw is located
on the front inside edge of the system, just below the top cover.
9. Remove the back cover as described in “Back Cover Removal (286/E4)” on page 372.
10. From the back of the system, remove the rear screw that secures the top cover to the right panel
assembly. This screw is located on the rear inside edge of the system, just below the top cover.
11. From the back of the system, remove the two retaining screws that secure the top cover to the
system top-cover rear bracket.
12. Lift the top cover up and off the system chassis.
1 Top Cap 6 286/E4
2 Operator Panel EMC Bracket 7 Bezel Door
3 Operator Panel 8 Bezel
4 Operator Panel Bracket 9 Rear Top Cover Mounting Bracket
5 Top Cover 10 Rear Cover
368 Service Guide
Top Cover Replacement (286/E4)
To replace the top cover, do the following:
1. Ensure that the rear top cover bracket is in place. If the rear top cover bracket has not been installed,
install it now with two retaining screws.
2. Mount, and then align the top cover onto the system chassis.
3. Replace the four retaining screws that secure the top cover to the system chassis.
Note: Do not tighten the screws at this time
4. Replace the front screw that secures the top cover to the right panel assembly. This screw will be
located on the front inside edge of the system, just below the top cover.
5. At the back of the system, replace the rear screw that secures the top cover to the right panel
assembly. This screw will be located on the rear inside edge of the system, just below the top cover.
6. At the back of the system, replace the two retaining screws that secure the top cover to the top-cover
rear bracket.
7. Tighten all replaced screws in step 3, then tighten the screws replaced in steps 4 through 6.
8. Replace the back cover as described in “Back Cover Replacement (286/E4)” on page 373.
9. Mount, and then align the operator-panel bracket, ensuring that the operator-panel bracket
screw-clearance holes are exactly centered over their mating screw holes, located on the top cover.
10. With a retaining screw, secure one of the operator-panel bracket corners . Repeat this step, securing
a second corner on the operator-panel bracket, diagonally across from the first.
Note: Before going to the next step, ensure that the four remaining clearance holes on the
operator-panel bracket are exactly centered over their mating screw holes. If not, loosen one
of the screws and realign the clearance holes to the screw holes.
11. With two retaining screws, secure the two remaining corners of the operator-panel bracket.
12. Mount, then secure with two retaining screws, the operator-panel EMC bracket onto the
operator-panel bracket.
13. Replace the operator panel as described “Operator Panel Replacement” on page 420.
14. Replace the top cap as described in “Top Cap Replacement (286/E4)” on page 367.
15. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378.
16. Replace the bezel door as described in “Bezel Door Replacement” on page 376.
Chapter 9. Removal and Replacement Procedures 369
Outer Side Cover Removal (286/E4)
To remove the outer side cover, do the following:
1. Loosen the thumbscrew on the rear of the cover. See the following illustration for the thumbscrew
location.
2. Slide the cover toward the rear of the system to release the four locking tab slots located on the top
flange of the outer side cover.
3. Grasp the top back corner of the outer side cover.
4. Pull out the top edge of the outer side cover.
Note: The top edge of the outer side cover must be clear of the top cover.
5. Pull up to release the bottom edge of the outer side cover.
6. Put the cover in a safe place.
1 286/E4 4 Cover Retaining Slot
2 Locking Tab Slot for Outer Side Cover 5 Thumbscrew
3 Outer Side Cover
Outer Side Cover Replacement (286/E4)
To replace the outer side cover, do the following:
1. Place the bottom edge retaining slot of the outer side cover over the outer edge of the system base.
2. Pivot the outer side cover up toward the top of the system.
3. Align the four top locking tab slots, located on the outer side cover, with the locking tabs, located on
the bottom side of the top cover.
4. Push the cover in. When correctly seated, push the outer side cover forward, toward the front of the
chassis. The four top locking tabs are now locked in place.
5. Align the thumbscrew, located on the rear of the outer side cover, with the screw hole located on the
back of the chassis.
6. Tighten the thumbscrew.
370 Service Guide
Service Access Cover Removal (286/E4)
To remove the service access cover, do the following:
1. Remove the 286/E4’s outer side cover as described in “Outer Side Cover Removal (286/E4)” on
page 370.
2. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration
for thumbscrew locations.
3. Slide the cover toward the rear of the system drawer. After the front of the service access cover has
cleared the front chassis ledge, lift the cover out of the system.
Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the
system for extended periods of time (over 30 minutes) with the cover removed might damage the system
components.
1 286/E4 4 Service Access Cover
2 Service Access Cover Locking Tab 5 Outer Side Cover
3 Front Chassis Ledge 6 Thumbscrews
Chapter 9. Removal and Replacement Procedures 371
Service Access Cover Replacement (286/E4)
To replace the service access cover, do the following:
1. Align the service access cover with the side of the system, about 25 mm (1 inch) from the front of the
system. The flanges on the top and bottom of the cover wrap around the system chassis.
2. Hold the service access cover against the system drawer and slide it toward the front of the system.
The front edge of the service access cover slides beneath the chassis ledge.
3. Tighten the three thumbscrews located on the rear of the cover.
Back Cover Removal (286/E4)
To remove the back cover, do the following:
1. Grasp each side of the back cover and pull straight up and then away from the system. This action
releases the two top and two bottom tabs that secure the cover to the back of the system. See the
following illustration for the tab locations.
2. Put the back cover in a safe place.
1 Back Cover Top Retaining Tab (Quantity 2) 3 Back Cover Bottom Locking Slot (Quantity
2)
2 Back Cover Top Locking Slot (Quantity 2) 4 Back Cover Bottom Retaining Tab
(Quantity 2)
372 Service Guide
Back Cover Replacement (286/E4)
To replace the back cover, do the following:
1. Ensure all cables have been connected and are correctly seated.
2. Align the back cover side edges with the side edges of the system chassis back.
3. Raise the back cover about 1 inch higher than the system, ensuring tab to slot clearance.
4. Push the back cover straight down. This action seats the tabs into their retaining slots.
Disk Drive Access Cover Removal (286/E4)
To remove the disk drive access cover, do the following:
1. Open the bezel door.
2. Locate the disk drive access cover on the front bezel of the system, as shown in the following
illustration.
3. Press down on the release tab located at the top of the access cover.
4. Gently pull the top of the access cover out and away from the system.
Note: The two alignments tabs that are located on the bottom front edge of the access cover must be
lifted up and then out of their slots.
5. Put the cover in a safe place.
1 286/E4 4 Disk Drive Access Cover
2 Bezel Door 5 Alignment Tabs
3 Disk Drive Access Cover Release Tab
Disk Drive Access Cover Replacement (286/E4)
To replace the disk drive access cover, do the following:
1. Insert the two alignment tabs, located on the bottom front edge of the access cover, into the two slots
located on the front bezel.
2. After inserting the alignment tabs into the slots, pivot the access cover up until the release tab clicks
into the closed position.
3. Close the bezel door.
Chapter 9. Removal and Replacement Procedures 373
Right Side Cover (System Base) Removal (286/E4)
To remove the right side cover (system base), do the following:
1. Remove the bezel door as described in “Bezel Door Removal” on page 376.
2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378.
3. Remove the top cap as described in “Top Cap Removal (286/E4)” on page 367.
4. Remove the operator panel as described in “Operator Panel Removal” on page 419.
5. Remove the back cover as described in “Back Cover Removal (286/E4)” on page 372.
6. Remove the outer service access cover as described in “Outer Side Cover Removal (286/E4)” on
page 370.
7. Remove the top cover as described in “Top Cover Removal (286/E4)” on page 368.
8. Remove the four retaining screws that secure the system chassis to the system base.
9. Use three people to lift the system up and off the alignment pins.
10. To remove the system base plate, lay the system onto its side and remove the four retaining screws.
1 Top Cap 8 Bezel
2 Operator Panel EMC Bracket 9 Rear Top Cover Mounting Bracket
3 Operator Panel 10 Rear Cover
4 Operator Panel Bracket 11 System Base Plate
5 Top Cover 12 Operator Panel Extension Cable
6 286/E4 13 Right Side Cover (System Base)
7 Bezel Door
374 Service Guide
Right Side Cover (System Base) Replacement (286/E4)
To replace the right side cover (system base), do the following:
1. Ensure that the system base plate is secured to the system chassis. If the system base plate has not
been secured to the system chassis, lay the system chassis onto its side and secure the system base
plate with four retaining screws.
2. Using three people, lift the system up and seat the system chassis and the attached mounting plate
over the two alignment pins. The alignment pins are located at each end of the system base.
3. Using four retaining screws, secure the system chassis to the system base .
4. Replace the top cover as described in “Top Cover Replacement (286/E4)” on page 369.
5. Replace the outer service access cover as described in “Outer Side Cover Replacement (286/E4)” on
page 370.
6. Replace the back cover as described in “Back Cover Replacement (286/E4)” on page 373.
7. Replace the top cap as described in “Top Cap Replacement (286/E4)” on page 367.
8. Replace the operator panel as described in “Operator Panel Replacement” on page 420.
9. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378.
10. Replace the bezel door as described in “Bezel Door Replacement” on page 376.
Chapter 9. Removal and Replacement Procedures 375
Bezel Door (286/E4)
The following procedure covers removal and replacement of the bezel door.
Bezel Door Removal
Refer to the following illustration while you perform the steps in this procedure.
To remove the bezel door, do the following:
1. Open the bezel door by grasping the bezel door handle. Pull the bezel door out and away from the
system.
2. Press down on the top back edge of the door.
3. Gently swivel the top back edge of the door forward, out past the top of the system.
4. Lift the door up to release it from the lower retaining post.
1 286/E4 4 Bezel Door
2 System Handle Assembly 5 Bezel Door Handle
3 Bezel Door Top Retaining Post 6 Bezel Door Lock Tab
7 Bezel Door Lower Retaining Post
Bezel Door Replacement
To replace the bezel door, do the following:
1. Place the bezel door on the lower retaining post. See the preceding illustration.
2. Rotate the bezel door toward the top of the system.
3. Press down on the back edge of the bezel door, and seat the top post into its mating hole.
Note: The mating hole for the top post is located on the bottom side of the system handle assembly.
4. Close the bezel door.
376 Service Guide
Bezels
Before performing the following procedures, read the “Safety Notices” on page xi.
Front Bezel Removal (286/C4)
To remove the bezel, do the following:
1. Open the front rack door and place the system into the service position as described in “Placing the
286/C4 into the Service Position” on page 364.
2. Press in on both bezel-release tabs at the same time. See the following illustration for bezel-release
tab locations.
3. Gently pull the bezel away from the system.
4. Put the bezel in a safe place.
1 Rack Door 3 Bezel-Release Tab (Located on each side of bezel)
2 Bezel 4 19-Inch Rack
Front Bezel Replacement (286/C4)
To replace the front bezel, do the following:
1. Open the front rack door and place the system into the service position as described in “Placing the
286/C4 into the Service Position” on page 364.
2. Seat the two tabs located on the bottom edge of the bezel into their mating slots located on the bottom
edge of the system.
3. Pivot the bezel up, aligning the release tabs to the mating slots located on the front of the system.
4. Gently push the tabs into the slots until the bezel seats against the front of the system.
5. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the
Operating Position” on page 365.
6. Close the rack door.
Chapter 9. Removal and Replacement Procedures 377
Front Bezel Removal (286/E4)
Refer to the following illustrations while you perform the steps in this procedure.
To remove the front bezel, do the following:
1. Remove the bezel door as described in “Bezel Door (286/E4)” on page 376.
2. Press down on the two bezel-release levers.
3. Pull the top of the bezel out and away from the system chassis.
4. Gently pull the bezel up off the base, releasing the lower bezel locking tabs.
5. Put the bezel and bezel door in a safe place.
1 286/E4 4 Front Door
2 Bezel-release lever 5 Lower Bezel Locking Tab (Quantity 2)
3 Front Bezel 6 Lower Bezel Retaining Slots (Quantity 2)
Front Bezel Replacement (286/E4)
To replace the front bezel, do the following:
1. Place the two lower bezel locking tabs into the retaining slots located on the chassis base.
2. Rotate the bezel up toward the top of the chassis, ensuring that the aligning pins are aligned with their
mating holes located on the chassis.
3. Gently push the bezel in until the two bezel release levers are seated in their respective slots.
4. Replace the bezel door as described in “Bezel Door (286/E4)” on page 376.
378 Service Guide
PCI Adapters
Before performing this procedure, read “Safety Notices” on page xi.
Notes:
1. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support along the
back edge of the card. To use in this system unit, you must remove the blue handle or support from
the card.
2. With this system, you can install PCI adapters with the power on. These adapters are referred to as
hot-pluggable PCI adapters. Some adapters are not hot-pluggable, and the system must be powered
off for this type of adapter installation.
If you are removing a PCI adapter that is:
v Non-hot-pluggable, go to “Non-Hot-Pluggable PCI Adapter Removal” on page 381.
v Hot-pluggable, go to “Hot-Pluggable PCI Adapter Removal” on page 385.
PCI Hot-Plug Manager Access
The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is
appropriate to perform them.
Attention: The Linux operating system does not support any hot-pluggable procedures, nor does it
support the hot-plugging of any hot-pluggable PCI adapters. A system with the Linux operating system
installed on one or more partitions must be shut down and powered off before replacing any PCI adapter
assigned to a Linux partition. When replacing a PCI adapter in any partition with the Linux operating
system installed, follow the non-hot-pluggable adapter procedures.
Note: A PCI adapter is only hot-pluggable if the PCI adapter is supported for hot-plug applications. See
the PCI Adapter Placement Reference, order number SA38-0538, for complete information
regarding your PCI adapter.
Accessing Hot-Plug Management Functions
Note: Removing or installing a hot-pluggable PCI adapter requires the system administrator to take the
PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices
attached to the adapter must be taken offline as well. This action prevents a service representative
or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the
CD-ROM drive.
Chapter 9. Removal and Replacement Procedures 379
To access the hot-plug menus, do the following:
1. Log in as root user.
2. At the command line, type smitty.
3. Select Devices.
4. Select PCI Hot Plug Manager and press Enter.
5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The
following section describes the menu options.
PCI Hot-Plug Manager Menu
The following options are available from the PCI Hot Plug Manager menu:
Note: For information about the PCI slot LED states, see “Component LEDs” on page 466.
List PCI Hot-Plug Slots
Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it
holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the
adapter in that slot.
Add a PCI Hot-Plug Adapter
Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will
be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot
will go into the Action state and finally into the On state.
Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure
Devices Added After IPL option to configure the adapter.
Replace/Remove a PCI Hot-Plug Adapter
Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For
this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option).
You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the
Action state.
Identify a PCI Hot-Plug Slot
Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “Component
LEDs” on page 466.
Unconfigure a Device
Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use.
This step must be completed successfully before starting any removal or replacement operation. If this
step fails, the customer must take action to release the device.
380 Service Guide
Configure a Defined Device
Allows a new PCI adapter to be configured into the system if software support is already available for the
adapter. The selected PCI slot will go into the On state.
Install/Configure Devices Added After IPL
The system attempts to configure any new devices and tries to find and install any required software from
a user-selected source.
The add, remove, and replace functions return information to the user indicating whether the operation
was successful. If additional instructions are provided on the screen, complete the recommended actions.
If the instructions do not resolve the problem, do the following:
v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to
configure the adapter.
v If you receive a warning indicating that needed device packages are not installed, the system
administrator must install the specified packages before you can configure or diagnose the adapter.
v If you receive a failure message indicating a hardware error, the problem might be either the adapter or
the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different
adapter in the slot. If you determine that you have failing hardware, call your service representative.
v Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP
clustering. Consult with your system administrator or software support to determine the correct method
to configure the replacement device.
Non-Hot-Pluggable PCI Adapter Removal
To remove a PCI adapter, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source to the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the system power source has been completely disconnected.
4. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371. See the following illustration.
5. Determine which adapters you plan to remove, then label and disconnect all cables attached to that
adapter.
6. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system unit.
7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
8. If you are removing a long adapter, make note of how the back edge of the adapter slides into the
molded grooves located on the back wall of the PCI cooling fan bracket.
Chapter 9. Removal and Replacement Procedures 381
1 Service Access Cover 3 System Unit
1a Outer Access Cover 4 PCI Cooling Fan Bracket
2 PCI Adapters
9. Rotate the adapter retainer clip counterclockwise, as shown in the following illustration.
10. The tab resting on the adapter retention bracket should spring to a vertical position, coming to rest
next to the adapter retention latch.
11. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system.
Store the adapter in a safe place.
12. If you do not plan to install another adapter into the vacated slot, seal the expansion slot using an
expansion-slot cover. If you are installing another adapter, go to “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 383.
13. If you have other options to install, refer to “Procedure List” on page 358. If you do not have other
options to install, continue on to the next step.
14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
15. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
16. Connect the adapter cables.
17. Reconnect the power source to the system.
18. On a 286/C4, route the cables through the cable-management arm.
19. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. If necessary,
refer to “Starting the System without an HMC Attached” on page 360. After you have completed
checking the system close, the rack or system doors.
382 Service Guide
1 286/C4 or 286/E4 5 PCI adapter faceplate (dotted lines)
2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked
position)
3 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the locked
position)
7 PCI Adapter
4 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the unlocked
position)
Adding or Replacing a Non-Hot-Pluggable PCI Adapter
To add or install an adapter, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source to the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the system has been completely disconnected from its power source.
4. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
5. Refer to the PCI Adapter Placement Reference for information regarding slot restrictions for adapters
that can be used in this system.
6. If necessary, remove the adapter expansion slot shield.
7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
8. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
Chapter 9. Removal and Replacement Procedures 383
9. Place the adapter, component-side up, on a flat, static-protective surface.
10. Set any jumpers or switches as instructed by the adapter’s manufacturer.
11. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its
connector on the PCI riser card.
12. Press the adapter firmly into its connector.
Attention: When you install an adapter into the system, be sure that it is completely and correctly
seated in its connector located on the PCI riser card. Inserting and seating the adapter
card improperly might cause damage to the PCI riser card or the adapter.
13. Lower the tab onto the PCI adapter EMC shield/connector faceplate. Rotate the adapter retainer clip
clockwise until it covers the tab at approximately a 45-degree angle. See the following illustration.
1 System Unit
2 Adapter Retainer Assembly (in the unlocked position)
3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position)
4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position)
5 PCI adapter face plate (dotted lines)
14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
15. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
16. Connect the adapter cables.
17. Reconnect the power source to the system.
18. If you are servicing a 286/C4, route the cables through the cable-management arm.
19. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. If necessary,
refer to “Starting the System without an HMC Attached” on page 360. After you have completed
checking the system, close the rack or system doors.
384 Service Guide
Hot-Pluggable PCI Adapter Removal
Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter
offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the
devices attached to the adapter must also be taken offline. This action prevents a service
representative or user from causing an unexpected outage for system users.
For additional information about taking an adapter offline or removing it from the system
configuration, see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the
CD-ROM drive.
To remove a hot-pluggable PCI adapter, do the following:
1. On a 286/C4, open the front and rear rack doors, and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
3. Determine which adapters you plan to remove.
4. Record the slot number and location of each adapter being removed.
Note: Adapter slots are numbered on the rear of the system unit.
5. Ensure that any processes or applications that might use the adapter are stopped.
6. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
7. Select Unconfigure a Device and press Enter.
8. Press F4 to display the Device Names menu.
9. Select the adapter you are removing.
10. Use the Tab key to answer NO to Keep Definition and YES to Unconfigure Child Devices. Press
Enter.
11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful
unconfiguration is indicated by the OK message displayed next to the Command field at the top of the
screen.
12. Label and disconnect all cables attached to that adapter.
13. Press F3 to return to the PCI Hot-Plug Manager menu.
14. Select List PCI Hot-Plug Slots.
15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to step 16.
Chapter 9. Removal and Replacement Procedures 385
16. If the List PCI Hot-Plug Slots menu indicates that the slot is empty, do the following:
a. Select F3 to cancel.
b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI
slot is empty.
c. On the display screen, select the indicated empty slot. Look at the adapter you intend to remove.
A slow blinking amber LED located next to the adapter on the PCI riser card indicates that the slot
has been identified. Press Enter. The adapter has been put into the action state.
d. Disconnect any cables that are connected to the adapter being removed.
e. Completely remove the hot-plug adapter from the slot. The LED goes off.
f. Go to step 24 on page 387.
17. Move the cursor to select the adapter that you are removing and press Enter. (The description entry
displays as unknown).
18. Press the Tab key until the entry field displays the remove operation and then press the Enter key.
Follow the instructions that display on the screen until you are instructed to remove the adapter.
19. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
20. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
21. If you are removing a long adapter, make note of how the back edge of the adapter slides into the
molded grooves located on the back wall of the PCI cooling fan bracket.
1 Service Access Cover 3 System Unit
1a Outer Side Cover 4 PCI Cooling Fan Bracket
2 PCI Adapters
22. Turn the locking latch, and lift the plastic retainer seat off the adapter.
386 Service Guide
1 System Unit 5 PCI adapter face plate (dotted lines)
2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked
position)
3 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the locked
position)
7 PCI Adapter
4 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the unlocked
position)
23. Carefully grasp the adapter by the edges, and pull it straight out from the PCI riser card.
24. If you are not installing another adapter in this slot, place an expansion slot cover in the adapter slot
opening.
25. Lower the plastic retainer seat over the PCI adapter faceplate.
26. Rotate the locking latch clockwise until it clicks into the locked position.
27. Continue to follow the screen instructions until you receive a message that the adapter removal is
successful. Successful removal is indicated by the OK message displayed next to the Command field
at the top of the screen.
28. If you do not have other adapters to remove, continue with the next step.
OR
If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 14 on page 385.
29. Press F10 to exit the Hot-Plug Manager.
30. If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
31. If you have other options to install, refer to “Procedure List” on page 358. If you do not have other
options to install, continue on to the next step.
32. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
33. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
34. Connect the adapter cables.
35. On a 286/C4, route the cables through the cable-management arm.
36. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
Chapter 9. Removal and Replacement Procedures 387
Replacing a Hot-Pluggable PCI Adapter
Notes:
1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are
replacing an adapter with an adapter that is not identical to the adapter removed, go to “Hot-Pluggable
PCI Adapter Removal” on page 385 and “Installing a Hot-Pluggable PCI Adapter” on page 392.
2. Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter
offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the
devices attached to the adapter must be taken offline as well. This action prevents an unexpected
outage for system users.
For additional information about taking an adapter offline or removing it from the system configuration,
see the AIX System Management Guide: Operating System and Devices.
This publication is also contained on the AIX Documentation CD. The documentation is made
accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM
drive.
To replace a hot-plug PCI adapter, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
3. Refer to the PCI Adapter Placement Reference, order number SA38-0538, for information regarding
slot restrictions for adapters that can be used in this system.
4. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
5. Determine the slot from which you are removing the adapter.
6. Ensure that any processes or applications that might use the adapter are stopped.
7. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
8. Select Unconfigure a Device and press Enter.
Note: If the adapter or device has a problem that prevents it from being replaced by hot-plug, it must
be replaced by shutting down the system. Go to “Adding or Replacing a Non-Hot-Pluggable
PCI Adapter” on page 383.
9. Press F4 to display the Device Names menu.
388 Service Guide
10. From the menu, select the adapter you are removing.
11. Use the tab key to answer YES to Keep Definition and YES to set Unconfigure Child Devices.
Press Enter.
12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful
unconfiguration is indicated by the OK message displayed next to the Command field at the top of the
screen.
13. Press F3 to return to the PCI Hot-Plug Manager menu.
14. Select List PCI Hot-Plug Slots and press Enter.
15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to substep 17. If the List
PCI Hot-Plug Slots menu indicates that the slot is empty, do the following:
a. Select F3 to cancel.
b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI
slot is empty.
c. On the display screen, select the indicated empty slot. Look at the adapter you intend to replace
or reseat. A slow blinking amber LED located on the PCI riser card is indicating that the slot has
been identified. Press Enter. The adapter is now in the action state.
d. Completely remove the hot-plug adapter from the slot. The LED goes off.
e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter.
f. Select F3 to return to the PCI Hot-Plug Adapter Menu.
g. Select Install/Configure Devices Added after IPL.
h. Select F3 to return to the PCI Hot-Plug Adapter Menu.
i. Connect the appropriate cables and devices to the adapter.
j. Select List PCI Hot-Plug Slots, and press Enter.
k. Return to the error code or procedure that sent you here, if the List PCI Hot-Plug Slots menu
indicates that the slot for the just replaced or reseated adapter is empty.
16. Select F3 to cancel.
17. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI
Hot-Plug Adapter menu displays.
18. Move the cursor to select the adapter that you are removing, and press Enter.
19. Press the Tab key until the entry field displays the replace operation, and then press Enter. Follow the
instructions that display on the screen until you are instructed to remove the adapter.
20. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are
connected to the adapter being removed.
Chapter 9. Removal and Replacement Procedures 389
21. Turn the locking latch, and lift the plastic stop for the adapter.
1 System Unit 5 PCI adapter faceplate (dotted lines)
2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked
position)
3 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the locked
position)
7 PCI Adapter
4 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the unlocked
position)
22. Carefully grasp the adapter by the edges, and pull it straight out from the PCI riser card.
23. If necessary, remove the replacement adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
24. Place the adapter, component-side up, on a flat, static-protective surface.
25. Set any jumpers or switches as instructed by the adapter manufacturer.
26. Replace the adapter into the adapter slot. Carefully grasp the adapter by the edges and align the
adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing
a full-length adapter, ensure that both ends of the adapter engage the card guides.
27. Lower the plastic retaining seat over the PCI adapter face plate (dotted lines). Rotate the locking latch
clockwise until it clicks into the locked position. Some full-length cards can be supported by rotating
the blue adapter latch on the right end of the adapter counterclockwise.
390 Service Guide
1 System Unit
2 Adapter Retainer Assembly (in the unlocked position)
3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position)
4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position)
5 PCI adapter faceplate (dotted lines)
28. Connect appropriate cables and devices to the adapter.
29. Continue to follow the screen instructions until you receive a message that the replacement is
successful. Successful replacement is indicated by the OK message displayed next to the Command
field at the top of the screen.
30. Press the F3 key to return to the PCI Hot-Plug Manager menu.
31. Select Install/Configure Devices Added After IPL and press Enter. Follow the instructions on the
screen. Successful replacement is indicated by the OK message displayed next to the Command field
at the top of the screen.
Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP
clustering. Consult with your system administrator or software support to determine the correct
method to configure the replacement device.
32. If you do not have other adapters to replace, continue with the next step.
OR
If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu
and then return to step 14 on page 389.
33. Press F10 to exit the Hot-Plug Manager.
If you have added, removed, or replaced any adapters, run the diag -a command. If the system
responds with a menu or prompt, follow the instructions to complete the device configuration.
Chapter 9. Removal and Replacement Procedures 391
34. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
35. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
36. Connect the adapter cables.
37. On a 286/C4, route the cables through the cable-management arm.
38. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
Installing a Hot-Pluggable PCI Adapter
Note: This system supports PCI hot-pluggable adapters, so the system does not need to be shut down
and the power cables removed before adding or replacing an adapter.
To add or install an adapter, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
3. Refer to the PCI Adapter Placement Reference, order number SA38-0538, for information regarding
slot restrictions for adapters that can be used in this system.
4. If necessary, remove the adapter expansion slot shield.
5. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
6. If necessary, remove the adapter from the antistatic package.
Attention: Avoid touching the components and gold-edge connectors on the adapter.
7. Place the adapter, component-side up, on a flat, static-protective surface.
8. Set any jumpers or switches as described by the adapter’s manufacturer.
9. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure
to select PCI Hot Plug Manager. Then return here to continue.
10. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The
Add a Hot-Plug Adapter window displays.
11. See the PCI Adapter Placement Reference, order number SA38-0538, for adapter placement
information. Then select an empty PCI slot for the adapter.
12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter.
392 Service Guide
13. Rotate the locking retainer clip, lift the plastic stop, and remove the expansion slot cover.
1 System Unit 5 PCI adapter faceplate (dotted lines)
2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked
position)
3 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the locked
position)
7 PCI Adapter
4 Adapter Retainer Assembly (retainer seat
down and the retainer clip in the unlocked
position)
14. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is
set to the Action state. See “Component LEDs” on page 466.
15. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the
edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector.
If you are installing a full-length adapter, ensure that both ends of the adapter engage the card
guides.
Chapter 9. Removal and Replacement Procedures 393
16. Lower the tab onto the PCI adapter faceplate. Rotate the adapter retainer clip clockwise until it covers
the tab at approximately a 45-degree angle. See the following illustration.
1 System Unit
2 Adapter Retainer Assembly (in the unlocked position)
3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the
unlocked position)
4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the
locked position)
5 PCI adapter faceplate (dotted lines)
17. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
18. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
19. Connect the adapter cables.
20. On a 286/C4, route the cables through the cable-management arm.
21. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
394 Service Guide
Fans
Before performing the following procedures, read “Safety Notices” on page xi.
Removing a Processor Fan in a Non-Hot-Swap Environment
Note: A processor fan in a non-hot-swap environment means that your system is configured at the factory
with one processor fan installed. To place your system into an environment that enables the
hot-swapping of the processor fans, you must purchase a second processor fan and insert it into
the unused docking connector. It is not necessary to shut down the system when adding the
second processor fan.
To remove a processor fan in a non-hot-swap environment, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
4. Continue with the procedure in “Hot-Swap Fan Assembly Removal” on page 397.
1 286/C4 4 Processor Fan Amber LED
2 Service Access Cover 5 Finger Holes
3 Processor Card Cooling Fan (Fan #1) 6 Processor Fan Bottom Connector
7 Processor Cooling Fans Access Cover
Chapter 9. Removal and Replacement Procedures 395
1
2
8
3
4
7
5
6
1 286/E4 5 Finger Holes
2 Outer Side Cover 6 Processor Fan Bottom Connector
3 Service Access Cover 7 Processor Cooling Fans Access Cover
4 Processor Fan Amber LED 8 Processor Card Cooling Fan (Fan #1)
Replacing a Processor Fan in a Non-Hot-Swap Environment
To replace a processor fan in a non-hot-swap environment, do the following:
1. Lift the processor fan access cover.
2. Place your thumb and index finger into the two holes located on the top of the replacement fan.
3. Align the connector located on the fan assembly with the docking connector located on the CEC
backplane.
4. Insert and firmly press the replacement fan into the docking connector closest to the back of the
chassis.
5. Ensure the cooling fan access cover springs back to the closed position.
6. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
7. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
8. Reconnect the power cables.
9. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
10. After the 286/C4 has successfully powered on, close the rack doors.
After the 286/E4 has successfully powered on, close the bezel door.
396 Service Guide
Hot-Swap Fan Assembly Removal
This section contains procedures for removing processor cooling fans and PCI adapter cooling fans. Select
the removal procedure according to the hot-swap fan assembly that you are servicing.
To remove a hot-swap fan, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
3. Locate the cooling fan assemblies, as shown in the following illustration.
4. The failing fan assembly has the amber-colored LED that is lit.
Note: All of the hot-swap fan assemblies used in your system have an amber-colored LED located on
the top of the fan assembly. When lit, this LED indicates that its fan assembly has a problem.
The fan might not be spinning or is not spinning fast enough to help cool the system unit.
Hot-Swap Processor Fan Removal
If you are servicing one of the processor cooling fan assemblies, do the following:
1. Lift the processor fan access cover.
2. Place your thumb and index finger into the two finger holes, located on the top of the faulty fan.
3. Lift the faulty fan out of the system.
Notes:
a. The processor cooling fans have a docking connector, located on the bottom of the fan assembly.
The docking connectors automatically disconnect as the fan is being removed from the chassis.
b. Unless you are performing a hot-swap removal or replacement of the processor fans, keep the
processor fan access cover closed.
Hot-Swap PCI Adapter Fan Removal
Attention: The system will power off if the failing PCI cooling fan has not been exchanged within five
minutes.
If you are servicing one of the PCI adapter cooling fan assemblies, do the following:
1. Disconnect the cooling fan’s power cable from the connector, located on the front of the fan assembly.
2. Place your index finger below the front connector and slide the fan out of the fan cage.
Note: The adapter cooling fan assemblies are arranged in stacked pairs. You cannot replace a single
fan in the stacked pair.
Chapter 9. Removal and Replacement Procedures 397
286/C4
1 286/C4 7 Processor Fan Bottom Connector
2 Service Access Cover 8 Processor Cooling Fans Access Cover
3 Processor Fan Amber LED 9 Adapter Cooling Fan Retaining Bracket
4 Finger Holes 10 PCI Adapter Cooling Fan Assembly (Fan
#4)
5 Processor Card Cooling Fan (Fan #1) 11 PCI Adapter Cooling Fan Assembly (Fan
#3)
6 Processor Card Cooling Fan (Fan #2) 12 PCI Adapter Fan Amber LED
13 Adapter Cooling Fan Front Connector
398 Service Guide
286/E4
1 286/E4 8 Processor Cooling Fans Access Cover
2 Outer Side Cover 9 Processor Card Cooling Fan (Fan #2)
3 Service Access Cover 10 Adapter Cooling Fan Assembly (Fan #4)
4 Processor Fan Amber LED 11 Adapter Cooling Fan Assembly (Fan #3)
5 Finger Holes 12 PCI Adapter Fan Amber LED
6 Processor Card Cooling Fan (Fan #1) 13 PCI Adapter Cooling Fan Front Connector
7 Processor Fan Bottom Connector
Hot-Swap Fan Assembly Replacement
Note: This note applies only to systems running the AIX operating system.
If a fan assembly is being replaced for a redundant failure, after the service repair action is
completed, ask the customer to check the crontab file for any power/cooling warning messages.
When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a
warning message every 12 hours, to alert or remind the customer of the problem. Replacing the
faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this
entry, the customer continues to be reminded of the failure despite its having been repaired. Use
the crontab -l command to read the crontab file to determine if an entry exists. Use the crontab -e
command to edit the file.
The hot-swap fan removal section has a procedure for processor cooling fans and PCI adapter cooling
fans. Select the removal procedure according to the hot-swap fan assembly you are servicing.
To replace your system’s hot-swap cooling fans, do the following:
Chapter 9. Removal and Replacement Procedures 399
Hot-Swap Processor Fan Replacement
When installing a hot-swap processor cooling fan, do the following:
1. Lift the processor fan access cover.
2. Place your thumb and index finger into the two holes located on the top of the replacement fan.
3. Align the connector located on the fan assembly with the docking connector located on the CEC
backplane.
4. Insert the replacement fan and firmly press it into its docking connector.
5. Verify that the replaced cooling fan’s LED is not lit. If the fan LED is lit, reseat the fan or the fan cable.
6. For proper cooling of the processor, ensure the processor cooling fan access cover springs back to the
closed position.
7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
8. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365 and close the rack doors.
On a 286/E4, close the bezel door.
Hot-Swap PCI Adapter Fan Replacement
Attention: The system will power off if the failing PCI cooling fan has not been exchanged within five
minutes.
When installing a hot-swap PCI adapter cooling fan, do the following:
1. Before inserting the fan assembly into the fan cage, ensure that the fan connector located on the fan
housing is facing the front of the system.
2. Insert the replacement adapter cooling fan assembly into the cooling fan cage.
Note: When seated correctly, the top edge of the fan assembly and the fan cage are flush with each
other.
3. Reconnect the power cable to the connector located on the front of the fan assembly.
4. Verify that the replaced cooling fan’s LED is not lit. If the fan LED is lit, reseat the fan or the fan cable.
5. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
6. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365 and close the rack doors.
On a 286/E4, close the bezel door.
400 Service Guide
Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)
Note: The two media bays can be used to house an IDE CD-ROM, IDE DVDROM, SCSI DVDRAM, tape
drive, or diskette drive. See the illustration on 402 and 403 for the location of the two media bays.
However, if your system has an IDE CD-ROM, it should be placed in media bay 1.
Before performing this procedure, read the following:
v “Safety Notices” on page xi
v “Laser Safety Information” on page xiii
Media Removal
To remove any of the media drives located in either media bay 1 or media bay 2, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. If you are servicing a 286/E4, open the bezel door.
3. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
4. Disconnect the power source to the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
5. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
6. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
7. Disconnect the power and signal cables from the media device that you are servicing.
8. Grasping the top front of the media device, press the release tab located on the bottom center of the
media drive, and slide the media drive out of the media bay.
9. Remove the four retaining screws holding the carrier tray assembly to the media drive.
10. Put the media drive and carrier tray assembly in a safe place.
Chapter 9. Removal and Replacement Procedures 401
286/C4
1 Service Access Cover 6 Disk Drive (4 Pack)
2 286/C4 7 Optional Media Device (for example: Tape
Drive, DVD, CD-ROM, Diskette Drive)
3 Media Carrier Tray 8 IDE CD-ROM
4 Diskette Drive (Optional) 9 Media Carrier Trays
5 Diskette Drive Cage (Optional) 10 Front Bezel
402 Service Guide
286/E4
1 286/E4 7 IDE CD-ROM
2 Disk Drive (4 Pack) 8 Diskette Drive Carrier (Optional)
3 Media Carrier Trays 9 Diskette Drive Cage (Optional)
4 Front Bezel 10 Diskette Drive (Optional)
5 Bezel Door 11 Outer Side Cover
6 Optional Media Device (for example: Tape
Drive, DVD, CD-ROM, Diskette Drive)
12 Service Access Cover
Chapter 9. Removal and Replacement Procedures 403
Media Replacement
To replace a media drive, do the following:
1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
2. Set any jumpers or switches on the device according to the documentation provided with the drive.
3. Using four retaining screws, secure the media carrier tray to the bottom of the media drive.
4. Insert the media drive into the media bay, pushing the drive in until the release tab clicks into the
locked position.
5. Connect the media signal and power cables to the drive.
6. Replace the bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel
Replacement (286/E4)” on page 378.
7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
8. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
9. Reconnect the power source to the system.
10. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
11. After the 286/C4 has successfully powered on, close the rack doors.
After the 286/E4 has successfully powered on, close the bezel door.
404 Service Guide
Hot-Plug Disk Drives
Before performing the following procedure, read “Safety Notices” on page xi.
Attention: The Linux operating system does not support hot-pluggable procedures, nor does it support
hot-plugging of any disk drives. A system with the Linux operating system installed on one or more
partitions must be shut down and powered off before replacing any disk drives assigned to a Linux
partition.
Note: You do not have to power off the system to remove a hot-plug disk drive. Before you perform these
procedures, ensure that the customer has taken appropriate actions to back up the data for the
drive you are removing, and that the drive has been removed from the configuration. Physically
removing a hot-plug disk drive from the system before it has been removed from the system
configuration can cause unrecoverable data corruption.
Deconfiguring (Removing) or Configuring a Disk Drive
Use this procedure if you are removing or installing a drive while the system power is turned on.
Deconfiguring (Removing)
Before removing a disk drive from a volume group, do the following:
1. Log in as root user.
2. Before a disk can be removed from a volume group, all data must be backed up and the data
removed from the disk to be replaced. Use AIX Logical Volume Manager to perform this task.
3. At the command line, type smitty.
4. Select System Storage Manager (Physical and Logical Storage) and press Enter.
5. Select Logical Volume Manager and press Enter.
6. Select Volume Groups and press Enter.
7. Select Set Characteristics of a Volume Group and press Enter.
8. Select Remove a Physical Volume from a Volume Group.
9. Press F4 to list the available volume groups, then select the volume group name and press Enter.
10. Press F4 to select a physical volume, and follow the instructions on the screen to select the physical
volume. Then press Enter.
11. Press F3, then select Cancel, to go back to the first menu.
12. Select System Storage Management (Physical and Logical Storage).
13. Select Removable Disk Management.
14. Select Remove a Disk.
15. Select the desired disk from the list on the screen and press Enter.
16. Follow the instructions on the screen to remove the drive.
17. When you are asked ²Are you sure?,² press Enter. The power LED on the drive that you selected will
remain on.
18. Remove the disk drive by pulling the disk drive lever toward you until it is completely open. Then
remove the disk drive from the slot. The LED on the top of the slot will turn off when the disk drive is
removed.
19. Press F10 to exit smitty.
Chapter 9. Removal and Replacement Procedures 405
Configuring (Replacing)
1. Remove the disk drive from its protective packaging, and open the drive latch handle.
2. Install the disk drive in the drive slot. Align the disk drive with the drive slot rails, and slide the disk
drive into the slot until it contacts the backplane at the rear of the drive bay. The drive should be in
far enough for the latch handle to engage the latch. Push the disk drive lever up and to the rear to
lock the disk drive. The LED above the slot will turn on.
3. Log in as root user.
4. At the command line, type smitty.
5. Select Devices.
6. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is
indicated by the OK message displayed next to the Command field at the top of the screen.
7. Press F3s, Cancel, to go back to the first menu and select System Storage Management (Physical
and Logical Storage) and press Enter.
8. Select Logical Volume Manager and press Enter.
9. Select Volume Groups and press Enter.
10. Select Set Characteristics of a Volume Group and press Enter.
11. Select Add a Physical Volume to a Volume Group.
12. Fill in the fields for the drive you are adding to the system. Press F4 for a list of selections.
13. To finish the drive configuration, see the AIX System Management Guide: Operating System and
Devices.
14. Press F10 to exit smitty.
Hot-Plug Disk Drive Removal
Attention: Before you perform the following procedure, ensure that the customer has taken appropriate
actions to back up the data for the drive you are removing, and that the drive has been removed from the
configuration. Physically removing a hot-plug drive from the system before it has been removed from the
system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 405.
To remove a hot-plug disk drive, do the following:
1. Before removing the disk, ensure all data has been saved.
2. Deconfigure the drive you are removing from the system. See “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 405.
3. On a 286/C4, open the rack front door.
On a 286/E4, open the front bezel door.
4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or the disk drive
access cover as described in “Disk Drive Access Cover Removal (286/E4)” on page 373.
5. Locate the hot-plug disk drive you plan to remove. If you are removing a faulty disk drive, a lit amber
LED located at the front right side of each disk drive will help you isolate the faulty drive.
6. Log in as root user.
7. At the command line, type smitty.
406 Service Guide
8. Select System Storage Management (Physical and Logical Storage).
9. Select Removable Disk Management.
10. Select Remove a Disk.
11. Select the desired disk from the list on the screen and press Enter.
12. Remove the disk drive by placing the handle on the disk drive into the open position (perpendicular to
the drive) and pulling the hot-plug disk drive away from the bay. The lit LED on the disk drive will turn
off when the disk drive is removed.
1 286/C4 3 Disk Drive Release Handle
2 Disk Drive 4 Front Bezel
1 286/E4 3 Disk Drive Release Handle
2 Disk Drive 4 Disk Drive Access Cover
13. Press F10 to exit smitty.
Chapter 9. Removal and Replacement Procedures 407
Hot-Plug Disk Drive Replacement
Attention: Before you perform these procedures, ensure that the customer has taken appropriate
actions to back up the data for the drive you are removing, and that the drive has been removed from the
configuration. Physically removing a hot-plug drive from the system before it has been removed from the
system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or
Configuring a Disk Drive” on page 405.
Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering.
Consult with your system administrator or software support to determine the correct method to configure
the replacement disk.
To install a hot-plug disk drive, do the following:
1. Ensure the tray handle is open (perpendicular to the drive).
2. Align the drive/tray assembly so that it engages the guide rails in the bay.
3. Push the drive assembly into the bay until the tray handle engages the lock mechanism.
4. Push the tray handle in until it locks. The lit LED on the disk drive will turn on.
5. Log in as root user.
6. At the command line, type smitty.
7. Select Devices.
8. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is
indicated by the OK message displayed next to the Command field at the top of the screen.
9. Press F10 to exit smitty.
10. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Disk
Drive Access Cover Replacement (286/E4)” on page 373.
11. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
408 Service Guide
PCI Riser Card
Note: To perform this procedure requires a considerable amount of force when removing or installing the
riser card from its docking connectors. If you are servicing a 286/C4, it is strongly recommended
that the system drawer be removed from the rack. If you are servicing a 286/E4, it is strongly
recommended that the system be placed on its side, on a flat and stable surface. To avoid potential
breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting on any
surface.
Before performing the following procedure, read “Safety Notices” on page xi.
PCI Riser Card Removal
To remove the PCI riser card, do the following:
1. If you are servicing a 286/E4, open the bezel door.
If you are servicing a 286/C4 rack-mounted drawer, open the front and rear rack doors.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
4. Label and disconnect all adapter cables, located at the rear of the system unit.
5. If you are servicing a 286/C4, place the system drawer into the service position as described in
“Placing the 286/C4 into the Service Position” on page 364.
6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
7. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay
2. Then remove the media drives as described in “Media Removal” on page 401.
8. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
9. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of
the shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated, cuts off power to
many of the components and connectors located on the CEC backplane. This support shelf
must be correctly seated and secured in position after any new FRU is added to the system and
you perform a system boot.
10. Record the slot number and location of each adapter.
11. Remove the processor assembly #2 as described in “Processor Card Removal” on page 423. If your
system is equipped with a processor filler panel instead of a second processor card assembly,
remove the processor filler panel.
12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
13. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381.
14. Remove all power, signal, and SCSI cables from the PCI riser card.
Chapter 9. Removal and Replacement Procedures 409
15. Remove all screws that secure the PCI riser card to the CEC backplane and chassis.
1 PCI Riser Card (The six slotted riser card shown above. The four slotted riser
card mounts the same.)
2 Removal Sling
3 CEC Backplane
4 CEC Backplane Connectors to PCI Riser Card Connectors
5 ²Push² Labels
16. Grasp each end of the removal sling located toward the back of the system chassis. Slowly lift up the
back end of the PCI riser card. You may need to rock the sling back and forth several times to aid in
the removal of the PCI riser card connectors from their docking connectors located on the CEC
backplane. As the connectors begin to separate, grasp the front end of the PCI riser card and pull it
up and out of the system. Put the PCI riser card in a safe place.
Notes:
a. The PCI riser card may require up to 60 pounds of pulling force to free it from the docking
connectors. Depending on the system drawer’s location in the rack, it is strongly recommended
that the system drawer be removed from the rack to perform the PCI riser card removal and
replacement procedures.
b. When placing the 286/E4 on its side, ensure that the stabilizer foot closest to the desktop or floor
surface is free from resting on any surface.
410 Service Guide
286/C4
1 Service Access Cover 6 Media Support Tray
2 PCI Adapters 7 Front Bezel
3 PCI Riser Card 8 IDE CD-ROM
4 286/C4 9 Processor Card Assembly #2
5 Media Device 10 Processor Filler Panel
Chapter 9. Removal and Replacement Procedures 411
286/E4
1 Outer Side Cover 8 Media Device
2 Service Access Cover 9 IDE CD-ROM
3 PCI Adapters 10 Front Bezel
4 PCI Riser Card 11 Bezel Door
5 Back Cover 12 System Stabilizer Foot
6 286/E4 13 Processor Card Assembly #2
7 Media Device Support Tray 14 Processor Filler Panel
412 Service Guide
PCI Riser Card Replacement
To replace the PCI riser card, do the following:
1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
2. If necessary, remove the PCI riser card from the antistatic package.
3. Carefully grasp the PCI riser card along two edges, and align the four connectors located on the
bottom side of the PCI riser card with the four standoff connectors located on the CEC backplane.
4. Ensure you have proper alignment of the PCI riser card connectors. Seat the PCI riser card into the
CEC backplane connectors.
a. To perform this procedure requires a considerable amount of force when removing or installing the
PCI riser card from its docking connectors. If you are servicing a 286/C4, it is strongly
recommended that the system drawer be removed from the rack. If you are servicing a 286/E4, it
is strongly recommended that the system be placed on its side, on a flat and stable surface. To
avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from
resting on any surface.
1 PCI Riser Card 3 CEC Backplane Connectors to PCI
Riser Card Connectors
2 CEC Backplane 4 ²Push² Labels
b. To ensure the PCI riser card docking connectors are fully seated, using your thumbs, alternately
push down on the two ²push² labels that are mounted on the PCI riser card. Push until no more
movement is felt and then push down hard again on both ²push² labels. This alternate pushing
technique is allowed only for this card.
Note: Mounted below the CEC backplane is an aluminum plate. The plate aids in stiffening the
CEC backplane and keeps it from flexing or cracking when the PCI riser card is inserted.
c. If you experience any problem when you boot the system, repeat substep 4, but the second time,
you must push harder.
Chapter 9. Removal and Replacement Procedures 413
5. Replace the screws that secure the PCI riser card to the CEC backplane and chassis.
Note: For proper alignment, insert all screws before tightening.
6. Reconnect the power, signal, and SCSI cables.
7. Replace all adapters to their proper locations as described in “Adding or Replacing a
Non-Hot-Pluggable PCI Adapter” on page 383.
8. Reconnect all of the labeled PCI adapter cables to their respective PCI adapters.
9. Replace the processor #2 assembly as described in “Adding or Replacing a Processor Card” on
page 426. If you removed a processor filler panel, replace the processor filler panel now.
10. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf.
Note: When replacing the support shelf, ensure that the lower support leg, located on the bottom
side of the support shelf, activates the microswitch located on the CEC backplane. Failure to
do so results in the system not being able to power on.
11. Replace all removed media drives, ensuring that the media drives removed are returned to the same
media bay locations from which they were removed. Refer to “Media Replacement” on page 404.
12. Reconnect all power and signal cables to their respective media drives.
13. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
15. Reconnect the power cables.
16. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
17. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
18. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
Note: If you experience any problem when you boot the system, repeat substep 4, but the second
time, you must push harder.
19. If you are servicing a 286/C4, close the rack doors.
If you are servicing a 286/E4, close the bezel door.
414 Service Guide
Disk Drive Backplane
To perform these procedures requires you to remove and install the PCI riser card. Removing or Installing
a PCI riser card from its docking connectors, requires a considerable amount of force. If you are servicing
a 286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are
servicing a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable
surface. To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from
resting on any surface.
Before performing the following procedure, read “Safety Notices” on page xi.
Disk Drive Backplane Removal
To remove the disk drive backplane, do the following:
1. If you are servicing a 286/E4, open the bezel door.
If you are servicing a 286/C4 rack-mounted drawer, open the front and rear rack doors.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
4. Label and disconnect all adapter cables, located at the rear of the system unit.
5. If you are servicing a 286/C4, place the system drawer into the service position as described in
“Placing the 286/C4 into the Service Position” on page 364.
6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
7. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
8. Disconnect and label all cables from the media drives, located in media bay 1 and media bay 2, then
remove the media drives as described in “Media Removal” on page 401. Note the location of each
removed media drive.
9. Remove the media device support shelf by loosening the captive thumbscrew. Grasp the back edge
of the media device support shelf, slide the media device support shelf back, while lifting it up and out
of the system chassis.
10. Remove the processor assembly #2 as described in “Processor Card Removal” on page 423. If your
system is equipped with a processor filler panel instead of a second processor card assembly,
remove the processor filler panel.
11. Record the slot number and location of each adapter.
12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
13. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381.
14. Remove the PCI riser card as described in “PCI Riser Card Removal” on page 409.
15. Remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on page 406.
16. Remove the PCI adapter cooling fans as described in “Hot-Swap Fan Assembly Removal” on
page 397.
17. Remove the PCI adapter cooling fan stabilizing cage by loosening the retaining screw located along
the bottom back edge of the stabilizing cage.
Chapter 9. Removal and Replacement Procedures 415
18. Disconnect all power and signal cables connected to the disk drive backplane.
19. Remove the four retaining screws that secure the disk drive cage to the front of the system chassis.
20. Slide the disk drive cage back until it clears the upper chassis ledge.
21. Pull the disk drive cage assembly out of the system.
22. Remove the six retaining screws attached to the disk drive backplane.
286/C4
1 Service Access Cover (drawer) 9 286/C4 Chassis
2 PCI Adapters 10 Disk Drive (4 Pack)
3 PCI Riser Card 11 Media Device
4 PCI Cooling Fan 12 Front Bezel
5 Cable-Management Bracket 13 IDE CD-ROM Media
6 PCI Cooling Fan 14 Media Device Support Shelf
7 Disk Drive Backplane 15 Processor Assembly #2
8 Disk Drive Cage 16 Processor Filler
416 Service Guide
286/E4
1 Outer Side Cover 11 Disk Drive (4 Pack)
2 Service Access Cover 12 Media Device Support Shelf
3 PCI Adapters 13 Media Device
4 PCI Cooling Fan 14 Front Bezel
5 Cable-Management Bracket 15 Bezel Door
6 Back Cover 16 IDE CD-ROM Media
7 PCI Riser Card 17 System Stabilizer Foot
8 286/E4 18 Processor Assembly #2
9 Disk Drive Backplane 19 Processor Filler
10 Disk Drive Cage
Chapter 9. Removal and Replacement Procedures 417
Disk Drive Backplane Replacement
To replace the disk drive backplane, do the following:
1. Using the six supplied retaining screws, secure the backplane to the disk drive cage.
2. Carefully place the disk drive cage assembly into the system.
3. Slide the disk drive cage assembly toward the front of the system.
4. Ensure the six screw holes located on the cage assembly correctly align with the mating screw holes
located on the system chassis.
5. Insert and finger-tighten the six retaining screws. After all screws have been started, finish tightening
them.
6. Reconnect all power and signal cables to the disk drive backplane.
7. Replace the PCI adapter cooling fan stabilizing cage by installing the two tabs located on the bottom
edge of the cage into their mating slots located on the chassis. Secure the cage to the chassis with a
retaining screw.
8. Replace the PCI adapter cooling fans as described in “Hot-Swap Fan Assembly Replacement” on
page 399.
9. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 408.
10. Replace the media device support shelf.
11. Replace all previously removed media drives, ensuring that the media drives are returned to the same
media bay from which they were removed. Refer to “Media Replacement” on page 404, if necessary.
12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
13. Replace the PCI riser card as described in “PCI Riser Card Replacement” on page 413.
14. Replace the PCI adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on
page 383.
15. Reconnect all adapter cables, located at the rear of the system unit.
16. Reconnect the power source to the system.
17. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
18. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
19. Replace the service access cover as described in “Covers” on page 366.
20. If you are servicing a 286/C4, place the system drawer into the operating position as described in
“Returning the 286/C4 to the Operating Position” on page 365.
21. If you are servicing a 286/C4, close the front and rear rack doors.
22. If you are servicing a 286/E4, close the bezel door.
418 Service Guide
Operator Panel
Before performing the following procedure, read “Safety Notices” on page xi.
Operator Panel Removal
To remove the operator panel, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
4. If necessary, on a 286/E4, remove the bezel door as described in “Bezel Door Removal” on
page 376.
5. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
6. If you are servicing a 286/E4, remove the top cap as described in “Top Cap Removal (286/E4)” on
page 367.
7. If you are servicing a 286/C4, remove the service access cover as described in “Service Access
Cover Removal (286/C4)” on page 366.
8. Locate the operator panel, as shown in the following illustrations.
9. Remove the two retaining screws, located along the bottom edge of the operator panel.
10. Disconnect the power / signal cable from the back of the operator panel.
11. Remove the operator panel by grasping the edges of the panel and pulling it out of its bay.
286/C4
1 Service Access Cover (drawer) 4 Operator Panel Retaining Screws
2 System Unit 5 Operator Panel
3 Front Bezel
Chapter 9. Removal and Replacement Procedures 419
286/E4
1 Top Cap 4 Front Bezel
2 Operator Panel 5 Bezel Door
3 Operator Panel Retaining Screws 6 286/E4
Operator Panel Replacement
Attention: If you are replacing the operator panel with a new FRU, you must write the system VPD
information (machine type, model number, and serial number) into the VPD module. You must contact the
support center to obtain instructions and the necessary password to enter the information into the VPD
module.
Note: Refer to “System Vital Product Data (VPD) Update Procedure” on page 421 for the correct
procedure.
To replace the operator panel, do the following:
1. Carefully slide the operator panel back into the system bay.
2. Connect the signal and power cable to the connector on the new operator panel.
3. Secure the operator panel with the two retaining screws previously removed.
4. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on
page 366.
5. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
6. Reconnect all system power cables.
7. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
8. If necessary replace the bezel door as described in “Bezel Door Replacement” on page 376.
9. If you are servicing a 286/E4, replace the top cap as described in “Top Cap Replacement (286/E4)”
on page 367.
10. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
11. If the 286/C4 has successfully powered on, close the rack doors.
If the 286/E4 has successfully powered on, close the bezel door.
420 Service Guide
System Vital Product Data (VPD) Update Procedure
If you replaced the operator panel with a new FRU, perform the following procedure to ensure that the
VPD is correct.
1. After you install the new operator panel, wait for the OK message to display in the operator panel
display.
2. Start the Service Processor Firmware menu.
1. Service Processor Setup
2. System Power Control Menu
3. System Information Menu
4. Language Selection
5. Call In/Call Out Setup
6. Set System Name
99. Exit from Menu
Main Menu
Service Processor Firmware
Firmware Level: xxxxxxxx
Copyright 1999, IBM Corporation
1>
Command Prompt
3. At the command prompt, type the code that accesses the hidden menus. If necessary, call your local
support center to obtain the code.
This menu is for IBM Authorized use only. If you
have not been authorized to use this menu, please
discontinue use immediately.
Press Return to continue, or X to return to menu 1.
4. Press Return to continue.
VPD Serial Number is not programmed.
Enter the VPD Serial Number (7 ASCII digits): xxxxxxx
Chapter 9. Removal and Replacement Procedures 421
5. Type the VPD serial number.
Note: The serial number must be entered correctly. Enter the last seven digits only. Do not include the
dash (-) in the serial number as a digit. If the serial number is not entered correctly, a new
operator panel must be ordered and installed.
VPD Serial Number has been programmed
successfully.
The current TM field is: xxxx-xxx
Do you want to change the TM field (y/n)?
6. Type y (yes) if the system unit’s type/model (TM) you are working on is different from the one listed on
the screen.
7. Type the machine type and model number of the system unit.
Enter the TM data (8 ASCII digits): xxxx-xxx
TM has been programmed successfully
The current MN field is 1980
Do you want to change the MN field (y/n)?
8. The MN field is for manufacturing use only. Enter n (no) in this field.
9. Enter 99 at the command line of the Main Menu to exit.
422 Service Guide
Processor Card
Before performing the following procedure, read “Safety Notices” on page xi.
Processor Card Removal
To remove the processor card, do the following:
1. On a 286/C4, open the front and rear rack doors and put the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
5. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay
2. Then remove the media drives as described in “Media Removal” on page 401.
6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of
the shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated, cuts off power to
many of the components and connectors located on the CEC backplane. This support shelf
must be correctly seated and secured in position after any new FRU is added to the system
and you perform a system boot.
7. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one
hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on
page 359.
8. Grasp the ends of both processor assembly release handles. Lift each handle until it is perpendicular
(90 degrees) to the top of the processor assembly. By placing the handle perpendicular to the top of
the processor assembly, the base or hinged portion of each handle acts as a cam and will gently pry
the processor card connector from the docking connector located on the CEC backplane.
9. Remove the processor assembly from the system unit.
Attention: To prevent damage to the card and the card connectors, open or close the retention
latches at the same time.
10. Lay the processor assembly, cover-side up, on a flat, static-protective surface.
Chapter 9. Removal and Replacement Procedures 423
11. From the bottom of the processor assembly, lift the side access cover. Four hinges hold the cover to
the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM
connectors.
12. Remove all memory DIMMs as described in “Memory DIMM Removal” on page 427. Put the DIMMs
in a safe place (you will need to install these DIMMs onto the new processor card).
286/C4
1 Service Access Cover 7 286/C4
2 Processor Access Cover 8 Optional Media Device (for example:
Diskette Drive or Tape Drive)
3 Processor Assembly Lifting and
Release Handles
9 IDE CD-ROM Drive
4 Processor Assembly 10 Front Bezel
5 Processor Card 11 Media Device Support Shelf
6 Memory DIMMs 12 Processor Filler
424 Service Guide
286/E4
1 286/E4 9 Service Access Cover
2 Media Device Support Shelf 10 Outer Side Cover
3 Media Carrier Trays 11 Processor Access Cover
4 Optional Media Device (for example:
Diskette Drive or Tape Drive)
12 Processor Assembly Lifting and
Release Handles
5 IDE CD-ROM Drive 13 Processor Card
6 Front Bezel 14 Processor Assembly
7 Bezel Door 15 Memory DIMMs
8 Processor Filler
Chapter 9. Removal and Replacement Procedures 425
Adding or Replacing a Processor Card
Notes:
1. Adding an additional processor card or installing a new processor card into your system may require
the updating of the system firmware.
2. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one
hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
3. If your system is configured with one processor card, place the card into location U0.1-P1-C1 on the
CEC backplane. The remaining processor connector must have the processor filler panel in place. The
filler panel protects the connector located on the CEC backplane from dust or damage and assists in
airflow and cooling.
4. If a second processor is to be added to your system, remove the protective filler panel from the
system, and install the new processor assembly into the secondary processor bay location
U0.1-P1-C2.
To add or replace a processor card, do the following:
1. From the bottom of the processor assembly, lift the processor access cover. Four hinges hold the
cover to the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM
connectors. Install the memory DIMMs as described in “Memory DIMMs” on page 427.
2. Pivot the processor access cover back into the closed position.
3. Grasp the release handles located on top of the processor assembly. Pivot the handle upward to 90
degrees. This action puts the handle cams into the correct position to help seat the processor card
connector into the docking connector located on the CEC backplane.
Attention: To prevent damage to the processor assembly and the processor card connector, open or
close both release handles at the same time.
4. Align the processor card connector with the docking connector located on the CEC backplane.
5. Carefully insert the processor assembly into the system.
6. As the processor connector is seated into its CEC backplane docking connector, begin lowering the
processor assembly release handles. This action correctly seats and locks the processor assembly to
the CEC backplane.
7. If necessary, replace the processor filler panel.
8. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf.
Note: When replacing the media device support shelf, ensure that the lower support leg, located on
the bottom side of the support shelf, activates the microswitch located on the CEC backplane.
Failure to do so results in the system not being able to power on.
9. Replace all removed media drives, ensuring that the media drives removed are returned to the same
media bay locations from which they were removed. Refer to “Media Replacement” on page 404.
10. Reconnect all power and signal cables to their respective media drives.
11. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
12. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
13. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
14. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
15. If the 286/C4 has successfully powered on, close the rack doors.
If the 286/E4 has successfully powered on, close the bezel door.
426 Service Guide
Memory DIMMs
Before performing the following procedures, read “Safety Notices” on page xi.
Memory DIMM Removal
To remove the memory DIMMs, do the following:
1. On a 286/C4, open the front and rear rack doors and put the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
5. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay
2. Remove the media drives as described in “Media Removal” on page 401.
6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of
the shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated, cuts off power to
many of the components and connectors located on the CEC backplane. This support shelf
must be correctly seated and secured in position after any new FRU is added to the system
and you perform a system boot.
7. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one
hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on
page 359.
8. Grasp the end of both processor assembly release handles. Lift each handle until it is perpendicular
(90 degrees) to the top of the processor assembly. By placing the handle perpendicular to the top of
the processor assembly, the base or hinged portion of each handle acts as a cam and will gently pry
the processor card connector from the docking connector located on the CEC backplane.
9. Remove the processor assembly from the system unit.
10. Lay the processor assembly, cover-side up, on a flat, static-protective surface.
Chapter 9. Removal and Replacement Procedures 427
11. From the bottom of the processor assembly, lift the side access cover. Four hinges hold the cover to
the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM
connectors.
286/C4
1 Service Access Cover 7 286/C4
2 Processor Access Cover 8 Optional Media Device (for example:
Diskette Drive, Tape Drive)
3 Processor Assembly Lifting and
Release Handle
9 IDE CD-ROM Drive
4 Processor Assembly 10 Front Bezel
5 Processor Card 11 Media Device Support Shelf
6 Memory DIMMs
428 Service Guide
286/E4
1 286/E4 9 Service Access Cover
2 Media Device Support Shelf 10 Outer Side Cover
3 Media Carrier Trays 11 Processor Access Cover
4 Optional Media Device (for example:
Diskette Drive or Tape Drive)
12 Processor Assembly Lifting and
Release Handles
5 IDE CD-ROM Drive 13 Processor Card
6 Front Bezel 14 Processor Assembly
7 Bezel Door 15 Memory DIMMs
8 Processor Filler
Chapter 9. Removal and Replacement Procedures 429
12. Locate the memory DIMMs, and determine which DIMM you want to remove.
Note: Memory DIMMs must be installed in quads (groups of 4) and in the correct slot. The only two
valid slot combinations that you can use are 1, 3, 6, and 8 OR 2, 4, 5, and 7. See the
following illustration.
Number Description Location Code
M1 Memory slot 1 (J2A) U0.1-P1-C1-M1
M2 Memory slot 2 (J2B) U0.1-P1-C1-M2
M3 Memory slot 3(J3A) U0.1-P1-C1-M3
M4 Memory slot 4(J3B) U0.1-P1-C1-M4
M5 Memory slot 5(J1B) U0.1-P1-C1-M5
M6 Memory slot 6(J1A) U0.1-P1-C1-M6
M7 Memory slot 7(J0B) U0.1-P1-C1-M7
M8 Memory slot 8(J0A) U0.1-P1-C1-M8
Note: The location code column indicates the location codes for processor card 1. If your system is configured for
processor card 2, substitute C2 in place of C1.
Attention: To prevent damage to the memory DIMM and the memory DIMM connectors, open or
close the retention latches at the same time.
13. Remove the memory DIMM by pushing the tabs out and then down. The tabs’ camming action forces
the memory DIMM out of the connector.
14. Pull the memory DIMM out of the connector as shown in the following illustration.
1
2
1
1 Connector Tabs (unlocked position)
2 Memory DIMM
430 Service Guide
Memory DIMM Replacement
To replace a memory DIMM, do the following:
1. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one
hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on
page 359.
2. If necessary, remove the memory DIMM from its antistatic package.
3. Ensure that the connector locking tabs are pushed out in the unlocked position before installing a new
memory DIMM, as shown in the following illustration.
4. Carefully grasp the memory DIMM along two edges and align the connector.
5. Insert the memory card firmly into the connector.
6. Secure the memory DIMM with the locking tabs located at each end of the DIMM connector, as
shown in the following illustration.
Note: Memory DIMMs must be installed in groups of 4 (quads).
1 Locking Tabs (unlocked position)
2 Locking Tabs (locked position)
7. Pivot the processor access cover back into the closed position.
8. Grasp the release handles located on top of the processor assembly. Pivot the handle upward to 90
degrees. This action puts the handle cams into the correct position to help seat the processor card
connector into the docking connector located on the CEC backplane.
Attention: To prevent damage to the processor assembly and the processor card connector, open or
close both release handles at the same time.
9. Align the processor card connector with the docking connector located on the CEC backplane.
10. Carefully insert the processor assembly into the system.
11. As the processor connector is seated into its CEC backplane docking connector, begin lowering the
processor assembly release handles. This action correctly seats and locks the processor assembly to
the CEC backplane.
12. If necessary, replace the processor filler panel.
13. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf.
Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch
located on the CEC backplane. Failure to do so results in the system not being able to power
on.
14. Replace all removed media drives, ensuring that the media drives removed are returned to the same
media bay locations from which they were removed. Refer to “Media Replacement” on page 404.
15. Reconnect all power and signal cables to their respective media drives.
16. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
17. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
18. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
19. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
20. If the 286/C4 has successfully powered on, close the rack doors.
If the 286/E4 has successfully powered on, close the bezel door.
Chapter 9. Removal and Replacement Procedures 431
Replacing the Battery
Before performing the following procedure, read “Safety Notices” on page xi.
CAUTION:
A lithium battery can cause fire, explosion, or severe burn. Do not recharge, disassemble, heat
above 100°C (212°F), solder directly to the cell, incinerate, or expose cell contents to water. Keep
away from children. Replace only with the part number specified for your system. Use of another
battery may present a risk of fire or explosion.
The battery connector is polarized; do not attempt to reverse polarity.
Dispose of the battery according to local regulations.
To replace the system’s battery, do the following:
1. On a 286/C4, open the front and rear rack doors and put the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
5. Disconnect and label all power and signal cables from the media drives, located in media bay 1 and
media bay 2. Then remove the media drives as described in “Media Removal” on page 401. Note the
location of each media drive removed.
6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of
the shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated, cuts off power to
many of the components and connectors located on the CEC backplane. This support shelf
must be correctly seated and secured in position after any new FRU is added to the system
and you perform a system boot.
432 Service Guide
7. Remove processor assembly 1 as described in “Processor Card Removal” on page 423.
Note: Processor assembly 1 is the processor assembly that is closest to the power supplies. See the
following illustration.
286/C4
1 Service Access Cover 6 Front Bezel
2 Processor Assembly 1 7 Battery (positive (+) side facing up)
3 286/C4 8 Media Device Support Shelf
4 Media Device (for example: Tape Drive
or Diskette Drive)
9 Captive Thumbscrew
5 IDE CD-ROM Drive
Chapter 9. Removal and Replacement Procedures 433
286/E4
1 286/E4 7 Media Device Support Shelf
2 Media Carrier Trays 8 Captive Thumbscrew
3 Media Device (for example: Tape Drive
or Diskette Drive)
9 Battery (positive (+) side facing up)
4 IDE CD-ROM Drive 10 Service Access Cover
5 Front Bezel 11 Outer Side Cover
6 Bezel Door 12 Processor Assembly
434 Service Guide
8. Locate the battery on the CEC backplane as shown in the following illustration.
9. Use your fingernail to pry the battery out of its plastic mount. After the bottom of the battery has
cleared the top edge of the plastic mount, pull it up and away from the CEC backplane.
Note: The battery’s plastic mount connections are soldered to the CEC backplane. If the battery
mount is damaged or removed, permanent damage can result to the CEC backplane.
1 CEC backplane
2 Battery
3 Battery Plastic Mount
10. Replace the battery with (FRU) part number 00P3903 or equivalent.
11. When installing the new battery, ensure that the battery polarity is correct. For this system, the
positive (+) side of the battery should be facing up in its plastic mount.
12. Gently insert the new battery into its plastic mount.
13. Replace processor assembly 1 as described in “Adding or Replacing a Processor Card” on page 426.
14. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf.
Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch
located on the CEC backplane. Failure to do so results in the system not being able to power
on.
15. Replace all media drives, ensuring that the media drives removed are returned to the same media
bay from which they were removed as described in “Media Replacement” on page 404.
16. Reconnect all power and signal cables to their respective media drives.
17. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
18. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
19. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
20. Turn on the system power.
21. If you are running AIX, the bootlist command may need to be run while logged in as root user. This
action rebuilds the customized boot list. IP (Internet protocol) parameters may also need to be
reentered.
Chapter 9. Removal and Replacement Procedures 435
Power Supplies
DANGER
Do not attempt to open the covers of the power supply. Power supplies are not serviceable and
are to be replaced as a unit.
D02
This system drawer has power supplies that have hot-swap capabilities. To use this feature, you must
have two power supplies in place.
Before performing any of the following procedures, read and understand all of the safety notices beginning
with “Safety Notices” on page xi.
Non-Hot-Swap Power Supply Removal
If you are servicing a system that has only one power supply installed, you must follow the non-hot-swap
power supply removal procedure. Systems that were configured with one power supply at the factory have
a power supply filler panel. The filler panel protects the vacant power supply connector from dust or
damage.
To remove a non-hot-swap power supply, do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
4. Remove the power cable from the rear of the power supply.
5. Wait at least 10 seconds before performing the next step.
6. Grasp the ends of both power supply handles, and slide the retractable spring-activated portion of
each handle toward its hinged base. This action releases the locking tab located on the bottom side of
each release handle. See the following illustration.
1 Power Supply
2 Locking Tab Receiver Hole
3 Retractable Spring-Activated Portion of the Handle
7. Lift the handle until it is perpendicular (90 degrees) to the top of the power supply. By placing the
handle perpendicular to the top of the power supply, the base or hinged portion of each handle acts as
a cam and will gently pry the power supply from its connector located on the CEC backplane.
436 Service Guide
1a Outer Side Cover (286/E4 Only) 4 Power Supply Filler Panel
1 Service Access Cover 5a 286/C4
2 Power Supply #1 (Primary) 5b 286/E4
3 Power Supply Release Handle
8. After the power supply is released from its connector, pull the power supply out of the system.
Note: During normal operation, each power-supply bay must have either a power supply or filler panel
installed for proper cooling.
Chapter 9. Removal and Replacement Procedures 437
Non-Hot-Swap Power Supply Replacement
To replace a non-hot-swap power supply, do the following:
1. Grasp the handles located on top of the new power supply. Pivot the handles upward to 90 degrees.
This action puts the handle cams into the correct position to help seat the power supply into its
connector.
2. Carefully insert the new power supply into the bay originally used by the faulty power supply.
3. Lower the power supply release handles, carefully pressing the power supply into the connector. The
spring latch located on the underside of each handle will snap into place, indicating that the power
supply is seated correctly and locked into position.
Note: Improper operation of the spring-activated portion of the power supply handles can result in
the following:
v The retractable spring remains compressed - This situation keeps the handle from locking
the power supply in its seated position. To eliminate this problem, lower the handle into the
locked position, and push the spring activated handle end toward the locking tab. This
action forces the spring latch to mat with the locking tab in the spring-latch receiver hole.
v The locking tab receiver is bent - Before lowering the handle to lock the power supply into
the seated position, ensure that the locking tab located on top of the power supply in the
receiver hole is not bent. Before the spring latch will seat onto the locking tab, the locking
tab, if bent, must be straightened.
1 Spring Latch 4 Power Supply
2 Spring-Activated Handle End 5 Spring-Latch Receiver Hole
3 Power Supply Release Handle 6 Locking Tab
4. Plug the power cable into the back of the power supply.
5. Reconnect the power source to the system.
6. The green AC Good LED located on the power supply comes on solid.
7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372. If you are servicing a
286/E4, go to step 10.
If you are servicing a 286/C4, go to step 8.
8. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the
Operating Position” on page 365.
9. Route the cables through the cable-management arm.
10. Power on the system drawer as described in “Starting the System without an HMC Attached” on
page 360.
11. After the 286/C4 has successfully powered on, close the rack doors.
After the 286/E4 has successfully powered on, close the bezel door.
438 Service Guide
Hot-Swap Power Supply Removal
Before performing this procedure, read the following “Safety Notices” on page xi.
If you are servicing a system that has two power supplies installed, you can replace each power supply
(one at a time) without shutting down the system.
To remove a hot-swap power supply, do the following:
Note: You do not need to turn off the power to the system to install hot-swap power supplies.
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Visually check the power supply LEDs. If the amber LED is lit, suspect that power supply to be faulty.
3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
4. Remove the power cable from the rear of the faulty power supply.
5. Wait at least 10 seconds before performing the next step.
6. Grasp the ends of both power supply handles, and slide the retractable spring-activated portion of
each handle toward its hinged base. This action releases the locking tab located on the bottom side
of each release handle. See the following illustration.
1 Power Supply
2 Locking Tab Receiver Hole
3 Retractable Spring-Activated Portion of the Handle
7. Lift the handle until it is perpendicular (90 degrees) to the top of the power supply. By placing the
handle perpendicular to the top of the power supply, the base or hinged portion of each handle acts
as a cam and will gently pry the power supply from its connector located on the CEC backplane.
8. After the power supply is released from its connector, pull the power supply out of the system.
9. If you are not replacing the power supply, do the following:
a. Install a power-supply filler panel.
Note: During normal operation, each power-supply bay must have either a power supply or filler
panel installed for proper cooling.
b. Disconnect the removed power supply’s power cable from either the power distribution panel or
electrical outlet.
10. Verify that the green DC Good LED on the remaining power supply is lit. The green LED indicates
that the power supply is operating correctly.
11. Replace the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
12. On a 286/C4, push the system drawer back into the operating position as described in “Returning the
286/C4 to the Operating Position” on page 365.
13. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
Chapter 9. Removal and Replacement Procedures 439
1a Outer Side Cover (286/E4 Only) 4 Power Supply Filler Panel
1 Service Access Cover 5a 286/C4
2 Power Supply #1 (Primary) 5b 286/E4
3 Power Supply Release Handle
440 Service Guide
Hot-Swap Power Supply Replacement
Note: This note applies only to systems running the AIX operating system.
If a power supply is being replaced for a redundant failure, after the service repair action is
completed, ask the customer to check the crontab file for any power/cooling warning messages.
When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a
warning message every 12 hours, to alert or remind the customer of the problem. Replacing the
faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this
entry, the customer continues to be reminded of the failure despite its having been repaired. Use
the crontab -l command to read the crontab file to determine if an entry exists. Use the crontab -e
command to edit the file.
To replace a hot-swap power supply, do the following:
1. If you are adding a new power supply to a system that was configured with only one power supply
installed, remove the power-supply filler panel from power supply #2 bay.
2. Grasp the handle located on top of the new power supply. Pivot the handle upward to 90 degrees.
This action puts the handle cams into the correct position to help seat the power supply into its
connector.
3. Carefully insert the power supply into the same power supply bay from which the faulty power supply
was just removed.
4. Lower the power supply handles, carefully pressing the power supply into the connector. The spring
latch located on the underside of each handle will snap into place, indicating that the power supply is
seated correctly and locked into position.
Note: Improper operation of the spring-activated portion of the power supply handles can result in
the following:
v The retractable spring remains compressed - This situation keeps the handle from locking
the power supply in its seated position. To eliminate this problem, lower the handle into the
locked position, and push the spring activated handle end toward the locking tab. This
action forces the spring latch to mat with the locking tab in the spring-latch receiver hole.
v The locking tab receiver is bent - Before lowering the handle to lock the power supply into
the seated position, ensure that the locking tab located on top of the power supply in the
receiver hole is not bent. Before the spring latch will seat onto the locking tab, the locking
tab, if bent, must be straightened.
Chapter 9. Removal and Replacement Procedures 441
1 Spring Latch 4 Power Supply
2 Spring-Activated Handle End 5 Spring-Latch Receiver Hole
3 Power Supply Release Handle 6 Locking Tab
5. Plug the power cable for the added power supply into the power cable connector located on the back
of the power supply.
6. Reconnect the power source to the system. The green AC Good LED located on the power supply
should come on solid.
7. Verify that the green DC Good LED on the power supply is lit. The green LED indicates that the
power supply is operating correctly.
8. Replace the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
9. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
10. On a 286/C4, close the rack doors.
On a 286/E4, close the bezel door.
442 Service Guide
Central Electronics Complex (CEC) Backplane
Before performing the following procedure, read the “Safety Notices” on page xi.
To perform this procedure, you must remove and install the PCI riser card. To remove or install the PCI
riser card from its docking connectors, requires a considerable amount of force. If you are servicing a
286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are servicing
a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable surface.
To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting
on any surface.
CEC Backplane Removal
Attention: Before you remove or disconnect any components, record their location. Before removing the
CEC backplane from the system for servicing or replacement, record the settings of any jumpers located
on the CEC backplane.
To remove the CEC backplane (system board), do the following:
1. On a 286/C4, open the front and rear rack doors and place the system into the service position as
described in “Placing the 286/C4 into the Service Position” on page 364.
On a 286/E4, open the bezel door.
2. Shut down the system as described in either “Stopping the System without an HMC Attached” on
page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359.
3. Disconnect the power source from the system.
Note: This system may be equipped with a second power supply. Before continuing with this
procedure, ensure that the power source to the system has been completely disconnected.
4. Label and disconnect all other cables located at the rear of the system unit.
5. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel
Removal (286/E4)” on page 378.
6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on
page 366 or “Service Access Cover Removal (286/E4)” on page 371.
7. Record the slot number and location of each adapter being removed.
8. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
9. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381.
10. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay
2. Then remove the media drives as described in “Media Removal” on page 401.
11. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of
the shelf, sliding it back, then lifting it up and out of the system chassis.
Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch.
The microswitch is located on the CEC backplane and when deactivated, cuts off power to
many of the components and connectors located on the CEC backplane. This support shelf
must be correctly seated and secured in position after any new FRU is added to the system
and you perform a system boot.
12. Remove all processor cards or the processor filler panel as described in “Processor Card Removal”
on page 423.
Chapter 9. Removal and Replacement Procedures 443
13. Remove all power supplies or the power supply filler panel as described in “Non-Hot-Swap Power
Supply Removal” on page 436 or “Hot-Swap Power Supply Removal” on page 439.
14. Remove the PCI riser card as described in “PCI Riser Card Removal” on page 409.
15. Remove the PCI cooling fans as described in “Hot-Swap Fan Assembly Removal” on page 397.
16. Remove the PCI cooling fan’s stabilizing bracket.
Note: The bracket is secured to the CEC backplane with one hex-head screw and two alignment
tabs.
17. Remove and label all remaining power, signal, and SCSI cables.
18. Remove the processor cooling fans as described in “Hot-Swap Fan Assembly Removal” on page 397.
19. Remove the processor cooling fan cage assembly by removing the two hex-head screws located at
the base of the cage and the eight hex-head screws that screw in from the back of the system
chassis.
20. Remove the 16 threaded standoffs that secure the rear connectors located on the CEC backplane to
the back of the system chassis.
21. Remove the power supply and processor assembly front support bracket.
Note: The bracket is secured to the system chassis with four hex-head screws.
22. Remove the plastic membrane that covers the CEC backplane.
23. Remove the CEC backplane retaining screws.
Note: For the locations of the retaining screws, see the following CEC backplane illustration.
Note: Use care when removing the CEC backplane. Standoffs attached to the chassis base may
damage the components attached to the bottom of the CEC backplane.
444 Service Guide
24. Slide the CEC backplane toward the front of the system (about 7 mm). A strip of sponge rubber
attached to the back (bottom) of the CEC backplane will compress against an aluminum plate that is
secured to the system chassis, preventing the CEC backplane from traveling forward any farther. This
action allows the connectors located on the back of the CEC backplane to have enough room to clear
the rear chassis bulkhead.
1 CEC Backplane
2 286/EC or 286/E4
25. To remove the CEC from the chassis, lift the front edge of the CEC backplane and pull it out and
away from the system chassis.
26. Put the CEC backplane in a safe place.
Chapter 9. Removal and Replacement Procedures 445
1 Service Access Cover (drawer) 11 Front Bezel
2 Primary and Redundant PCI Cooling Fans 12 IDE CD-ROM Drive
3 PCI Cooling Fan and Cable Management
Bracket
13 Operator Panel
4 Processor Cooling Fans 14 Front Power Supply and Processor Support
Bracket
5 Processor Cooling Fan Cage 15 Media Device Support Shelf
6 PCI Riser Card 16 Processor Assemblies 1 and 2
7 PCI Adapters 16a Processor Filler Panel
8 CEC Backplane 17 Power Supply #1 (Primary)
Power Supply #2 (Redundant)
9 286/C4 Chassis 17a Power Supply Filler Panel
10 Optional Media Device (for example diskette
drive or tape drive)
18 Plastic Membrane
446 Service Guide
1 Outer Side cover 13 Optional Media Device (for example diskette
drive or tape drive)
2 Service Access Cover 14 IDE CD-ROM Drive
3 PCI Cooling Fans 15 Bezel
4 PCI Cooling Fan and Cable Management
Bracket
16 Bezel Door
5 Processor Cooling Fans 17 System Front Stabilizer Foot
6 Processor Cooling Fan Cage 18 Front Power Supply and Processor Support
Bracket
7 PCI Riser Card 19 Power Supply #1 (Primary)
Power Supply #2 (Redundant)
8 PCI Adapters 19a Power Supply Filler Panel
9 Back Cover 20 Processor Assemblies 1 and 2
10 CEC Backplane 20a Processor Filler Panel
11 286/E4 21 Plastic Membrane
12 Media Device Support Shelf
Chapter 9. Removal and Replacement Procedures 447
CEC Backplane Replacement
To replace the CEC backplane (system board), do the following:
1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap
to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359.
2. If necessary, remove the CEC backplane from the antistatic package.
3. Carefully grasp the CEC backplane along two edges, and align the retaining screw holes with its
mating screw holes located on the system chassis.
Note: Use care when replacing the CEC backplane. Standoffs attached to the chassis base may
damage the components attached to the bottom of the CEC backplane.
4. Replace the CEC backplane and slide it toward the back of the system. Ensure that all of the back
connectors located on the CEC backplane are positioned correctly through the rear chassis bulkhead.
5. Replace the retaining screws that secure the CEC backplane to the system chassis.
Note: For proper alignment, insert all screws and finger-tighten only. Ensure the CEC backplane is
not binding.
6. Tighten all CEC backplane retaining screws, including the 16 threaded standoffs that secure each
connector to the back of the system chassis.
7. Reconnect all power, signal, and SCSI cables that attach to the CEC backplane.
8. Replace the plastic membrane over the CEC backplane.
9. Replace the front power supply and processor support bracket.
10. Replace the processor cooling fan cage assembly.
11. Replace the eight screws that secure the processor cooling fan cage assembly to the rear chassis.
12. Replace the 16 threaded standoffs that secure the CEC backplane rear connectors to the rear
chassis.
13. Replace the processor cooling fans as described in “Hot-Swap Fan Assembly Replacement” on
page 399.
14. Replace the PCI adapter cooling fan stabilizing cage by installing the two tabs located on the bottom
edge of the cage into their mating slots located on the chassis. Secure the cage to the chassis with a
retaining screw.
15. Replace the PCI cooling fans as described in “Hot-Swap Fan Assembly Replacement” on page 399.
16. Replace the PCI riser card as described in “PCI Riser Card Replacement” on page 413.
17. Replace all PCI Adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on
page 383.
18. Replace the processor assembly and, if applicable, the processor assembly filler panel.
19. Replace the media device support shelf. Tighten the thumbscrew located on the media device support
shelf.
Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch
located on the CEC backplane. Failure to do so results in the system not being able to power
on.
20. Replace all removed media drives, ensuring that the media drives removed are returned to the same
media bay from which they were removed. Refer to “Media Replacement” on page 404, if necessary.
21. Reconnect all power and signal cables to their respective media drives.
22. Replace the power supplies and, if applicable, the power supply filler panel as described in
“Non-Hot-Swap Power Supply Replacement” on page 438 or “Hot-Swap Power Supply Replacement”
on page 441.
448 Service Guide
23. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front
Bezel Replacement (286/E4)” on page 378.
24. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on
page 366 or “Service Access Cover Replacement (286/E4)” on page 372.
25. If you are servicing a 286/C4, push the system drawer back into the operating position as described
in “Returning the 286/C4 to the Operating Position” on page 365.
26. Reconnect all system power cables.
27. The service processor settings should have been set to their default values. If the service processor
settings have not been set to their default values, the default values must be restored. To restore the
default values settings, do one of the following:
v Manually reset all settings.
v If the service processor settings had previously been saved, The settings may be restored using
the procedure in “Saving and Restoring Service Processor Settings” on page 306.
28. Check the firmware level that is on the new CEC backplane. Look at the top of the service processor
main menu. If the firmware is not at the correct level, refer to “System Firmware Updates” on
page 332.
29. Power on the system as described in “Starting the System without an HMC Attached” on page 360.
30. Reset the time and date.
31. If you are running AIX, the bootlist command may need to be run while logged in as root user. This
rebuilds the customized boot list. Internet protocol (IP) parameters may also need to be reentered.
32. If you are servicing a 286/C4 and the system has successfully powered on, close the rack doors.
If you are servicing a 286/E4 and the system has successfully powered on, close the bezel door.
33. Check the firmware level on the new backplane; see “Determining the Level of Firmware on the
System” on page 334. If the firmware is back-level, download the latest firmware from CORE or the
Web. (See “System Firmware Updates” on page 332.)
Chapter 9. Removal and Replacement Procedures 449
450 Service Guide
Chapter 10. Parts Information
This chapter contains parts information for the 286/C4 and 286/E4.
451
System Parts
452 Service Guide
Index Number FRU Part
Number
Units Per
Assembly
Description
1 N/A 1 Service Access Cover (drawer)
2 09P5865 Up to 2 Processor Cooling Fans
3 09P5887 (Up to 6) EMC PCI-Slot Shield
4 N/A 1 Processor Cooling Fan Cage and Rear Bulk Head
Assembly
5 See Note Up to 6 PCI Adapters
6 09P5290 1 Six Slotted PCI Riser Card
09P2702 1 Four Slotted PCI Riser Card (not shown)
7 N/A 1 Chassis
8 00P4988 1 Right Rack Extension Rail (286/C4 Only)
9 09P5902 1 Diskette Drive Cage (Optional)
10 76H4091
93F2361
02K3488
1 Diskette Drive (Optional)
11 See Note 1 Media Device (example: Tape Drive)
12 09P5874 Up to 2 Bezel Media Slot Filler
13 09P5873 1 Front Bezel (286/C4 Only)
14 See Note 1 IDE CD-ROM (optional)
15 00P3210 1 Operator Panel
16 00P2797 2 Media Carrier Trays
17 00P2801 Up to 2 Media Bay Filler
18 00P4989 1 Left Rack Extension Rail (286/C4 Only)
19 00P4133 1 Media Device Support Shelf
20 See Note 4 Disk Drives (4 Pack)
21 00P4132 1 Disk Drive Cage/Backplane Assembly
22 00P2799 Up to 6 PCI Plastic Light Pipe and Air Flow Separators
23 00P2800 Up to 6 PCI Card Lock Assemblies
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
Chapter 10. Parts Information 453
System Parts (continued)
454 Service Guide
Index Number FRU Part
Number
Units Per
Assembly
Description
1 21P4312 1 Cable Management Arm Assembly (286/C4 Only)
2 00P2806 1 Cable Carrier Support Bracket (286/C4 Only)
3 00P2807 1 Rack Flange Mounting Bracket (286/C4 Only)
4
00P2733 1 Processor Assembly, 1.45 GHz, 1-way
00P2736 Up to 2 Processor Assembly, 1.45 GHz, 2-way
5
53P3224 Up to 16 256 MB DIMMs
53P3226 Up to 16 512 MB DIMMs
53P3230 Up to 16 1 GB DIMMs
53P3232 Up to 16 2 GB DIMMs
6
00P3203 1 per RIO
connector
RIO Connector Cover (only available on 286/C4 that has
not been upgraded to support an I/O subsystem)
7 00P4488 1 CEC Backplane
8 09P5866 2 Units PCI Cooling Fans (2 Fans per Unit)
9 N/A 1 Internal Cable Management bracket
10 N/A 1 Chassis
11 00P4134 1 Right Release Latch Bracket (286/C4 Only)
12 00P2804 2 Release Latch (286/C4 Only)
13 00P4135 1 Left Release Latch Bracket (286/C4 Only)
14 00P2805 2 Latch Mounting Bracket (286/C4 Only)
15 00P4342 Up to 2 Power Supply (Redundant Power Supply is Optional)
16 N/A 1 Front Power Supply and Processor Assembly Support
Bracket
17 N/A 1 Plastic separator
18 09P5893 1 Processor Filler
19 09P5892 1 Power Supply Filler
Not Shown 42F6841 Up to 2 AC Power Cable, (Power distribution bus to AC power
supply, 286/C4 only)
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
Chapter 10. Parts Information 455
286/E4 Cover Set and Brackets
Index Number FRU Part
Number
Units Per
Assembly
Description
1 00P3789 1 Top Cover Assembly (286/E4 Only)
2 00P3791 1 Operator Panel Extension Cable (286/E4 Only)
3 N/A 1 286/E4
4 00P3695 1 Top Cover Rear Mounting Bracket (286/E4 Only)
5 00P2810 1 System Bottom Mounting Plate (286/E4 Only)
6 00P3790 1 System Base and Right Cover Assembly (286/E4 Only)
7 00P3788 1 Power Cable Bracket (286/E4 Only)
456 Service Guide
Chapter 10. Parts Information 457
Index Number FRU Part
Number
Units Per
Assembly
Description
1 00P3787 1 Operator Panel EMC Bracket (286/E4 Only)
2 00P2811 1 Operator Panel Bracket (286/E4 Only)
3 00P4637 1 Operator Panel EMC Filler (286/E4 Only)
4 00P2871 1 Top Cap (286/E4 Only)
5 00P2869 1 Disk Drive Access Cover (286/E4 Only)
6 00P2868 1 Front Door (286/E4 Only)
7 09P5885 1 Front Bezel (286/E4 Only)
8 N/A 1 286/E4
9 00P2593 1 Left Side Outer Access Cover (286/E4 Only)
10 00P2594 1 Rear Cover (286/E4 Only)
458 Service Guide
System Internal Cables
The following diagrams show the system cable connections. The first diagram illustrates the routing of the
internal power cables. The second diagram illustrates the routing of the signal cables.
Internal Power Cable Routing Diagram
Index Number FRU Part
Number
Units Per
Assembly
Description
1 00P3210 1 Operator Panel
2 See Note 1 IDE CD-ROM
3 See Note Up to 2 SCSI Media Device
4 09P5870 1 CEC Backplane to Operator Panel Signal / Power Cable
5 09P5867 1 2-drop Power Cable from CEC Backplane to IDE
CD-ROM and other SCSI Media Devices
6 09P5888 1 CEC Backplane to Disk Drive Backplane Power Cable
7 00P4132 1 Disk Drive Backplane
8 09P5864 1 Diskette Drive to CEC Backplane Power Cable
9 09P5900 1 PCI Cooling Fans #3 and #4 Power Cable
10 See Note 1 Diskette Drive
11 00P4488 1 CEC Backplane
Notes:
1. For more information on power cables, go to “External AC Power Cables” on page 43.
2. See Diagnostic Information for Multiple Bus Systems for part numbers.
Note:
Chapter 10. Parts Information 459
Internal Signal Cable Routing Diagram
460 Service Guide
Index Number FRU Part
Number
Units Per
Assembly
Description
1 00P3210 1 Operator Panel
2 See Note 1 SCSI Media Device (50-Pin)
3 See Note 1 SCSI Media Device (68-Pin)
4 See Note 1 SCSI Media Device (68-Pin)
5 09P5870 1 CEC Backplane to Operator Panel Signal / Power Cable
5 00P3791 1 Operator Panel Extension Cable (286/E4 Only)
5 00P3822 1 Operator Panel Interposer (286/E4 Only)
6 1 SCSI Media 50-pin to 68-pin Converter Cable
7 09P5862 1 Integrated SCSI Media Device Signal Cable
8 09P5863 1 CEC Backplane Connector to Diskette Drive Signal Cable
9 09P5868 1 CEC Backplane to IDE CD-ROM Signal Cable
10 See Note 1 Diskette Drive
11 See Note 1 IDE CD-ROM
12 00P4132 1 Disk Drive Backplane
13 09P5895 1 PCI Riser Card to Disk Drive Backplane Signal Cable (50
pin)
14 09P5889 1 PCI Riser Card to Disk Drive Backplane Signal Cable (68
pin)
15 09P5290 1 6 Slotted PCI Riser Card
15a 09P2702 1 4 Slotted PCI Riser Card
16 00P4488 1 CEC Backplane
17 09P5869 1 PCI Riser Card to External SCSI Signal Cable
18 N/A 1 External SCSI Port
19 09P5899 1 CEC Backplane to Rack Indicator Signal Cable
20 N/A 1 Rack Indicator Port
21 00P3200 1 CEC Backplane to Debug Port Signal Cable
22 N/A 1 Debug Parallel Port
Note: See Diagnostic Information for Multiple Bus Systems for part numbers.
Chapter 10. Parts Information 461
Power Cable
Index Part Number Country or Region
1 1838574 Bahamas, Barbados, Bolivia, Brazil, Canada, Costa Rica, Dominican Republic, El
Salvador, Ecuador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Netherlands
Antilles, Panama, Peru, Philippines, Taiwan, Thailand, Trinidad, Tobago, U.S.A. (except
Chicago), Venezuela
2 86G7648 Bahamas, Barbados, Bermuda, Bolivia, Brazil, Canada, Cayman Islands, Colombia, Costa
Rica, Dominican Republic, Ecuador, El Salvador, Guatemala, Guyana, Haiti, Honduras,
Jamaica, Japan, Korea (South), Mexico, Netherlands Antilles, Nicaragua, Panama, Peru,
Philippines, Puerto Rico, Saudi Arabia, Suriname, Trinidad, Taiwan, U.S.A. (except
Chicago), Venezuela
2 62X1045 Chicago, U.S.A.
3 6952311 Argentina, Australia, New Zealand
4 13F9979 Abu Dhabi, Austria, Belgium, Bulgaria, Botswana, China (Macau S.A.R), Egypt, Finland,
France, Germany, Greece, Iceland, Indonesia, Korea (South), Lebanon, Luxembourg,
Netherlands, Norway, Portugal, Saudi Arabia, Spain, Sudan, Sweden, Turkey, Yugoslavia
5 13F9997 Denmark
6 14F0015 Bangladesh, Burma, Pakistan, South Africa, Sri Lanka
7 14F0033 Bahrain, Bermuda, Brunei, Channel Islands, China (Hong Kong S.A.R), Cyprus, Ghana,
India, Iraq, Ireland, Jordan, Kenya, Kuwait, Malawi, Malaysia, Nigeria, Oman, People’s
Republic of China, Qatar, Sierra Leone, Singapore, Tanzania, Uganda, United Arab
Emirates (Dubai), United Kingdom, Zambia
8 14F0051 Liechtenstein, Switzerland
9 14F0069 Chile, Ethiopia, Italy
10 14F0087 Israel
11 6952291 Paraguay, Colombia, Uruguay
42F6841 AC Power Cable, (Power distribution bus to AC power supply, 286/C4 only)
1 2 3 4
5 6 7
8 9 10 11
462 Service Guide
Keyboards and Mouse (White)
Index Number FRU Part Number Units Per
Assembly
Description
1 93H8120 1 Keyboard, 101 United States English (ID 103P)
93H8123 1 Keyboard, 102 Spanish (ID 171)
93H8125 1 Keyboard, 102 Arabic (ID 238)
08L0904 1 Keyboard, 102 Belgium French (ID 120)
08L0905 1 Keyboard, 102 Belgium (ID 120)
93H8134 1 Keyboard, 102 Greek (ID 319)
93H8135 1 Keyboard, 101 Hebrew (ID 212)
08L0911 1 Keyboard, 102 Italy (ID 142)
08L0912 1 Keyboard, 102 Norwegian (ID 155)
93H8143 1 Keyboard, 101 Russian (ID 443)
08L0914 1 Keyboard, 102 Spanish (ID 172)
08L0915 1 Keyboard, 102 Sweden/Finland (ID 153)
08L0916 1 Keyboard, 105 Swiss F/G (ID 150)
08L0917 1 Keyboard, 102 UK English (ID 166)
93H8153 1 Keyboard, US English ISO9995 (ID 103P)
93H8154 1 Keyboard, 106 Japan (ID 194)
93H8155 1 Keyboard, 101 Chinese/US (ID 467)
93H8156 1 Keyboard, 103 Korea (ID 413)
2 76H5084 1 Three Button Mouse
Chapter 10. Parts Information 463
464 Service Guide
Chapter 11. Fault and Attention LEDs
This chapter contains information about fault and attention LEDs, which assist in identifying failing
components in your system.
Operator Panel Display
If a failing component is detected in your system, an amber-colored attention LED is turned on solid (not
blinking).
1 Power-On Button 6 Operator Panel Display
2 Power LED 7 (FS1) Front Serial Connector
(RJ-48 Connector)
3 Attention LED 8 Service Processor Reset Switch
(Pinhole)
4 SCSI Port Activity 9 System Reset Button
5 Ethernet Port Activity
Number Component Name Component Description
1 Power-On Button Turns the system power on and off.
2 Power LED Blinking - When connected to the power source (System is
in standby mode).
Solid - When power button has been pressed.
Note: There is approximately a 30-second transition period
from the time the power button is pressed until the power-on
LED goes from blinking to on solid. During the transition
period, you may observe the blinking intervals accelerate.
3 Attention LED Normal State - LED is off.
4 SCSI Port Activity Normal State - LED is on when there is SCSI activity.
5 Ethernet Port Activity Normal State - LED is on when there is Ethernet activity.
6 Operator Panel Display Displays current status of system startup, or diagnostic
information in the event of a hardware problem.
7 Front Serial Connector (FS1) Serial port uses RJ-48 connector. Use to plug in external
devices at the front of the system unit.
8 Service Processor Reset Switch
(Pinhole)
Service Personnel Use
9 System Reset Button Resets the system
465
Component LEDs
Individual LEDs are located on or near the failing components. The LEDs are located either on the
component itself or on the carrier of the component (for example, memory card, fan, memory module, or
processor). LEDs are either green or amber.
Green LEDs indicate either of the following:
v Electrical power is present.
v Activity is occurring on a link. (The system could be sending or receiving information.)
Amber LEDs indicate a fault or identify condition. If your system or one of the components on your system
has an amber LED turned on or blinking, identify the problem and take the appropriate action to restore
the system to normal.
The following table identifies the location, color and status of your system component LEDs.
466 Service Guide
Unit
(FRU)
LED
Function
Location LED Color OFF ON Blink
Power and Packaging LEDs
Rack
Indicator
Drawer Sum
Top of Rack
Amber (1x) Normal Fault Identify
AC Input Green (1x) No Input AC Good N/A
System Unit
Attention
Operator
Panel + Rear
Amber (2x) Normal Fault Identify
Power
Operator
Panel + Rear
Green (2x)
No AC System
Powered On
v Standby
v Transition of Power from
Standby to Power-On
(Blinking intervals may
accelerat before
remaining on solid)
Fans Identify Fan Top Amber (4x) Normal Fault N/A
Power
Supply (AC)
Input Source
Power
Supply Front
+ Top
Green (1x) No Input AC Input
Good
N/A
Fault Power
Supply Front
+ Top
Amber (1x) Normal Fault Fast Blink - Identify
Slow Blink - SPCN
communication failure .25
Hz
DC Output
Good
Power
Supply Front
+ Top
Green (1x)
DC Off DC Good Standby
Disk Drives
(DASD)
Activity DASD Board Green (4x) No Activity Activity N/A
Remove
Identify
DASD Board Amber (4x) Normal Action Identify
Optional
Media
No LED
Electronics Component LEDs
PCI Adapter
Connectors
(Slots)
Activity Chassis
Rear, Inside
System Next
to Adapter
Green Refer to the PCI Adapter Table on page 469.
Identify Chassis
Rear, Inside
System Next
to Adapter
Amber Refer to the PCI Adapter Table on page 469.
Memory
DIMMs
Identify Processor
Housing
(Top)
Amber (8x) Normal Identify
CEC
Backplane
Identify CEC
Backplane
Amber (1x) Normal Identify
Processor
Board
Identify Processor
Housing
(Top)
Amber (1x) Normal Identify
Chapter 11. Fault and Attention LEDs 467
Unit
(FRU)
LED
Function
Location LED Color OFF ON Blink
Unit
(FRU)
LED
Function
Location LED Color OFF ON Blink
PCI Riser
Card
Identify PCI Riser
Card
Amber (1x) Normal Identify
Disk Drive
Backplane
No LED
Port LED Indicators
RIO Port Identify CEC
Backplane
Amber (2x) Normal N/A Identify
SCSI Port Activity Operator
Panel
Green (1x) No Activity Activity
Ethernet Port
Ethernet 1,
Ethernet 2
Mode
Chassis Rear Green (2x) 10 Mbit
Mode
100 Mbit
Mode
Activity Chassis Rear Green (2x) No Activity Activity
Activity
Operator
Panel
Green (1x) No Activity Activity
See the following illustration for Ethernet connector LED locations on the rear of the 286/C4 and 286/E4.
1 Ethernet 1 Connector and LED
2 Ethernet 2 Connector and LED
3 Mode LED
10 Mbit - Off
100 Mbit - On
4 Activity LED
On - Receiving Information
Off - Not Receiving Information
468 Service Guide
See the following illustration for RIO connector LED locations on the rear of the 286/C4 and 286/E4.
1 RIO-G 0 Identify LED
2 System Drawer Rear Power LED
3 System Drawer Rear Attention LED
4 RIO-G 1 Identify LED
The following table explains the meaning of the on and off status of the PCI-X adapter LEDs.
PCI Adapter Table
LED Inactivity Active Identify Action
Green + Amber (2
LEDs)
Green - Off
Amber - Off
Green - On
Amber - Off
Green - On
Amber - Blinking
Green - Off
Amber - Blinking
Resetting the LEDs
After the repair action is completed, do the following:
1. Log in as root user.
2. At the command line, type diag.
3. Select Task Selection.
4. Select Log Repair Action.
5. Select the device that was repaired.
6. Press F10 to exit diagnostics.
If the Attention LED remains on after you have completed the repair action and reset the LEDs, call for
service support.
Chapter 11. Fault and Attention LEDs 469
470 Service Guide
Appendix A. Environmental Notices
Product Recycling and Disposal
Components of the system unit, such as structural parts and circuit boards, can be recycled where
recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic
products. Contact your account representative for more information. This system unit contains parts such
as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead
and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is
disposed, these materials must be removed and recycled or discarded according to applicable regulations.
This book contains specific information on each battery type where applicable.
This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or
lithium ion battery. Consult your user manual or service manual for specific battery information.
The battery must be recycled or disposed of properly. Recycling facilities may not be available in
your area. For information on disposal of batteries, contact your local waste disposal facility.
471
Acoustical Noise Emissions
The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure
level at the 1-meter bystander positions) does not exceed 70 dB(A).
Declared Acoustical Noise Emissions
Product
Configuration
Declared A-Weighted Sound Power Level,
LWAd (B)
Declared A-Weighted Sound Pressure
Level, (dB) at 1 meter Bystander
Position
Operating Idling Operating Idling
6E4 (Deskside) 6.0 5.9 42 41
6C4 (Rack Drawer) 6.1 6.0 44 43
Notes:
1. LWAd is the declared (upper limit) sound power level for a random sample of machines. (1B = 10dB)
2. LpAm is the mean value of the A-weighted sound pressure level at the 1-meter bystander positions for a random
sample of machines.
3. 286/E4 levels apply to the following system configurations: 2 way 1 gigahertz processor, 2 disk drives, 2048 GB
of RAM, non-redundant power supply (One power supply, 1 processor fan).
4. 286/C4 levels apply to the following hardware configurations: 2 way 1 gigahertz processor, 2 disk drives, 2048
GB of RAM, redundant power supply (Two 645 watt power supplies, 2 processor fans).
5. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296.
472 Service Guide
Appendix B. Notices
This information was developed for products and services offered in the U.S.A.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer’s representative for information on the products and services currently
available in your area. Any reference to the manufacturer’s product, program, or service is not intended to
state or imply that only that product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any intellectual property right of the manufacturer may be used
instead. However, it is the user’s responsibility to evaluate and verify the operation of any product,
program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not give you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any country where such
provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ²AS IS² WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions; therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication. The
manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described
in this publication at any time without notice.
Information concerning products made by other than the manufacturer was obtained from the suppliers of
those products, their published announcements, or other publicly available sources. The manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products made by other than the manufacturer. Questions on the capabilities of products
made by other than the manufacturer should be addressed to the suppliers of those products.
473
474 Service Guide
Appendix C. Service Processor Setup and Test
Attention: This procedure applies to modems attached to the serial ports (S1, S2, and S3) on the
286/C4 and 286/E4 systems. It does not affect the operation of the modem attached to the Hardware
Management Console (HMC).
The call-out function is normally handled by the Service Focal Point application running on the HMC.
For your convenience, an example of a basic service processor setup checklist is included here. Your
setup may include more or fewer of the available features, so you can adjust this checklist for your own
application so that it is helpful to you.
Service Processor Setup Checklist
1. Unplug the power cord from the system.
2. Attach a local terminal for this setup procedure.
3. Plug in the system and power on the local terminal.
Attention: Make sure that system power remains off.
4. Start the service processor menus. See Chapter 7, “Using the Service Processor”, on page 305.
5. Set the system name. See “Privileged User Menus” on page 307.
6. Enable surveillance. See “Service Processor System Monitoring - Surveillance” on page 336.
7. Configure Call-In/Call-Out. See “Call-In/Call-Out Setup Menu” on page 325.
8. Exit the service processor menus.
9. Unplug the power cord from the system.
10. Attach modems (if needed). See “Accessing the Service Processor Menus Remotely” on page 306.
11. Plug in the system.
Attention: Make sure that the system power remains off.
12. Test both of the following:
v Call-In, see “Testing Call-In” on page 476
v Call-Out, see “Testing Call-Out” on page 476
13. Use the “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service
Aids” section of the Diagnostic Information for Multiple Bus Systems to back up the service processor
settings.
Note: This step is strongly recommended to protect the usefulness of the service processor and the
availability of the system.
Your service processor is now ready for use.
475
Testing the Setup
This section contains sample testing procedures to help ensure that your setup is working.
These tests include communicating with the system operating system. Before you start, ensure that the
necessary serial port(s) is configured. If you need assistance, refer to “Serial Port Configuration” on
page 477.
The system should be powered off as a result of the “Service Processor Setup Checklist” on page 475.
Testing Call-In
1. At your remote terminal, call in to your system. Your system answers and offers you the Service
Processor Main Menu after requesting your privileged access password.
2. Select System Power Control.
3. Select Power-On System.
When you are asked if you wish to continue powering on the system, type Y.
4. After the system firmware and operating system have initialized the system, the login prompt displays
at your remote terminal if you set up Seamless Modem Transfer (refer to “Transfer of a Modem
Session” on page 517 for more information). This may take several minutes. When the login prompt
displays, you have successfully called the service processor.
5. Type logout to disconnect from the operating system. The message No Carrier displays on your
remote terminal.
6. Call your system again. The operating system answers and offers you the login prompt. If these tests
are successful, call-in is working.
7. Log in and type shutdown to shut down your system.
8. The message No Carrier displays on your remote terminal.
Testing Call-Out
During the setup, you entered your phone numbers for the pager (on page 328) and customer voice (on
page 328). These numbers are used for this test.
1. Your remote terminal is disconnected as a result of the Call-In test.
2. Call your system again.
3. At the service processor main menu, select Call-In/Call-Out Setup menu, then select Call-Out test.
This action causes a simulated error condition for the purposes of this test.
4. After a few moments, a message displays, regarding an illegal entry. Press Enter to clear the message
and return to the main menu.
5. When your telephone rings, answer the call. You should hear the sound of a telephone being dialed.
Your computer is trying to page you.
If this test is successful, call-out is working correctly.
Return to the “Telephone Number Setup Menu” on page 327 to enter the actual telephone numbers your
system will use for reporting problems.
476 Service Guide
Serial Port Configuration
To configure the serial port on an AIX system, run the following from an AIX console:
1. Log in as root user.
2. To determine if you have any serial ports already configured, type:
lsdev -Cc tty
If no serial ports are configured, none are listed. If you want to configure serial ports that are not
listed, continue with the remaining steps.
3. Identify the serial port(s) with the modem(s).
4. Type smit tty
5. Select add tty
6. Select RS232
7. Select Baud rate 9600 or higher.
8. Select login enable and set the flow control to RTS.
9. Commit the selections and set up any other needed serial ports.
10. Exit SMIT.
Appendix C. Service Processor Setup and Test 477
478 Service Guide
Appendix D. Setting Up the System Rack
Before setting up a rack check through the following specifications and clearances, then go to “286/C4
Rack Installation Inventory” on page 481.
Rack Specifications
This section contains specifications for the system rack.
Model 14T/0 and 14T/4 Rack
The mechanical packaging, cooling, power supply, and environmental specifications for the rack are shown
in the following table:
Dimensions
Height 1804 mm 71.0 in.
Capacity 36 EIA Units
With PDP - DC only 1926 mm 75.8 in.
Width without side panels 623 mm 24.5 in.
With side panels 644 mm 25.4 in.
Depth with rear door 1042 mm 41.0 in.
RS/6000 T00 with both doors 1098 mm 43.3 in.
pSeries T00 with both doors 1147 mm 45.2 in.
Weight
Base Rack 244 kg 535 lbs
Full Rack 816 kg 1795 lbs.
Electricalæ
(sum specified values for drawers or enclosures in rack)
AC Rack Power source loading maximum in
kVA (per PDB)4
4.8
Voltage range (V ac) 200 to 240
Frequency (hertz) 50 or 60
Temperature Requirements
(see specifications for drawers or enclosures)
This table continues on the following page.
479
This table continued from the previous page.
Humidity
Requirements
(see specifications for drawers or enclosures)
Noise Emissions (see specifications for drawers or enclosures)
Clearances Front Back Left Right
Install/Air Flow Maintenance of a proper service clearance should allow proper airflow.
Service 915mm(36 in) 915mm(36 in) 915mm(36 in) 915mm(36 in)
1. Configuration dependent, base rack weight plus the weight of the drawers mounted in the rack. The rack can
support up to a maximum weight of 35 lbs/EIA Unit.
2. The total rack power should be derived from the sum of the power used by the drawers in the rack.
3. Each AC Power Distribution Bus (PDB) can supply 4.8 kVA. A rack can have up to four PDB’s as required by the
drawers mounted in the rack.
Noise Emission Notes
1. LWAd is the declared (upper limit) sound power emission level for a production series of machines.
2. LpAm is the mean value of the A-weighted sound pressure emission levels at the operator position (if
any) for a production series of machines.
3. m is the mean value of the space-averaged A-weighted sound pressure emission levels at the
one-meter positions for a production series of machines.
4. N/A = Not Applicable (no operator position).
5. All measurements are made in accordance with ISO DIS 779 and reported in conformance with ISO
DIS 7574/4.
480 Service Guide
System Service Clearances
The following illustration shows the minimum space required. The amount of space needed by the units
during service is indicated by the dotted line in the following figure.
For multiple racks placed side by side, the left and right clearances apply only to the leftmost and
rightmost rack.
Rack Configuration
2474 mm (97.4 in.)
3564 mm
915 mm (141 in.)
(36 in.)
915 mm
(36 in.)
610 mm (24 in.)
610 mm (24 in.)
Front
Caster
Location
121(4.8)
(3.1)
80
Note: Rack units are large and heavy and are not easily moved. Because maintenance activities require
access at both the front and back, allow for extra room. The footprint shows the radius of the
swinging doors on the rack.
286/C4 Rack Installation Inventory
h Books, CD-ROM and Other Media h ²About Your Machine² Document
h Power Cables (1 standard, 2 optional) h 9-Pin to 25-Pin Serial Converters (2) (optional)
Appendix D. Setting Up the System 481
h ASCII Terminal (optional) h Display, Cable (optional), and Cable Toroid (optional)
h Mouse (optional) h Keyboard (optional), Wrist/Palm Rest (optional)
h286/C4 h Rack-Mounting Template
h 2 Slide Rail Assemblies h Cable Management Arm
482 Service Guide
h Rack-Mounting Kit contains:
8 system to rail screws
8 rail to rack screws
2 nut clips
2 latch-mounting brackets
2 system-retaining thumbscrews
alignment pin bracket kit
8 assorted alignment pins
1 strip of self-adhesive placement dots
h RJ-48 to 9-Pin Converter Cable (1)
Appendix D. Setting Up the System 483
Read the Rack Safety Instructions
Review the following instructions for mounting the system drawer into the rack.
Rack Safety Instructions
v Do not install this unit in a rack where the ambient temperatures will exceed 35 degrees C.
v Do not install this unit in a rack where the airflow is compromised. Any side, front or back of the unit
used for air flow through the unit must not be in indirect contact with the rack.
v Ensure that a hazardous condition is not created due to uneven mechanical loading when installing this
unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removed this unit.
v Consider the connection of the equipment to the supply circuit so that the overloaded circuits do not
compromise the supply wiring or over-current protection.
v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the
system or the devices that attach to the system. It is the responsibility of the customer to ensure that
the outlet is correctly wired and grounded to prevent an electrical shock.
When installing the 286/C4, you will need the following items:
v Rack-Mounting Template
v 2 Slide Rail Assemblies
v Cable Management Arm
v Rack-Mounting Kit Envelope
v Screwdriver or similar tool, if available
484 Service Guide
Attaching the Mounting Hardware to the Rack Enclosure
Before performing this procedure, read through each step and study the illustrations.
Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read,
and then perform each procedure step in the order given. Failure to perform each step in the order given
may cause rail failure.
To install the rack-mounting hardware into the rack, do the following:
1. Locate the rack-mounting hardware kit, rack-mounting template, and the system rails that were shipped
with your system.
1 Rack-Mounting Template
2 Rack-Mounting Hardware Kit
3 System Rail Assemblies
Notes:
a. The system rails are front-to-back and left-to-right side dependent when you are standing in the
front of and facing the rack. The rails are labeled on the front lower corner, left or right. The back of
the rails can be identified by the two large alignment pins and the rail-length adjusting plate. The
top of the rails can be identified by the screws located on the top of the front and rear rail flange.
See the following illustration.
b. The rack-mounting hardware kit has four changeable rail-alignment brackets and two sets of
alignment pins. If the alignment pins that came adjoined to the rails do not fit your rack, remove the
brackets from the rails. To remove the alignment-pin bracket from each of the rails, do the
following:
1) Remove the screw located on top of the rail-bracket assembly.
2) Slide the top of the alignment pin bracket from beneath the front-rail bracket.
3) Lift the bottom tab out of the front-rail bracket slot.
After determining the correct size of the alignment pin to use, mount the alignment pins onto the
replacement brackets. Install the replacement brackets onto the rails. Rail bracket assembly
procedure is the reverse of the removal procedure.
c. Always mount the brackets that contain the short alignment pins onto the front of the rail.
Appendix D. Setting Up the System 485
1 Left and Right Inner Rails 5 Top-Rail Flange Screw (Rear)
2 Left Rail Label 6 Rear-Rail Alignment Pins
3 Left Rail Assembly 7 Rail-Length Adjusting Plate
4 Right Rail Assembly 8 Inner-Rail Safety Release Latch
2. Remove the front rack door, and if necessary, remove the rear rack door. For 14T/0 and 14T/4 racks,
refer to , order number SA23-2544, for information about removing the rack doors.
3. If you do not have a rack-mounting template, go to step 4. Using the rack-mounting template,
determine where in the rack to place the system. Make note of the Electronics Industries Association
(EIA) location number.
Note: The rack-mounting template has printed illustrations located on the front of the template. Each
illustration is designed to aid in identifying the EIA location holes used when planning to
populate your rack. Do not use the rack-mounting template without reading and understanding
the following substeps. Each step must be completed in its entirety.
Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must
read, and then perform each procedure step in the order given. Failure to perform each step in the
order given may cause rail failure.
4. Note the following when using the rack-mounting template:
v Each black or white unit on the template is equal to 1 EIA unit.
v Each EIA unit consists of three holes.
v The EIA units illustrated on the template must be aligned with an EIA unit located on the rack.
v It is not necessary to align like-colored EIA units. For example, a black EIA unit illustrated on the
rack-mounting template does not have to be aligned with a black EIA unit located on the rack. A
black EIA unit on the rack-mounting template can be aligned with a white EIA unit located on the
rack. See the following illustration.
486 Service Guide
To use the rack mounting template, do the following:
a. Remove the protective coating from each adhesive strip located on the back of the rack-mounting
template. Lightly press the template into position on the rack. Ensure that both the left and right
sides are at the corresponding EIA locations.
b. Install the nut clips into the holes as indicated by the template. The nut clips aid in securing the
system drawer to the rack while in transit.
Note: Whenever a populated rack is being moved, secure your system drawers with two retaining
thumbscrews threaded through the nut clips. This action secures the system front bezel and
system chassis to the rack.
1 Nut Clip 2 Self-Adhesive Dots
c. Locate the four dots, two printed on each side of the template. Place a self-adhesive dot directly
across from the template’s printed dots on or near the rack’s EIA numbering strip. You will be using
these dots to aid in correctly positioning the rail-alignment pins located on the front of each rail.
d. Remove the rack-mounting template from the front of the rack. The front of your rack should now
contain two nut clips and four self-adhesive dots.
e. Mount the rack-mounting template to the rack’s rear EIA flange. Place the rack-mounting template
at the same EIA numbered location that was used on the front of the rack.
f. Wrap a self-adhesive dot directly across from the template’s printed dots. Ensure that a portion of
the self-adhesive dot wraps around the rack’s flange so that it can be seen from the front of the
rack.
g. Do not install any nut clips on the rack’s rear flange as indicated by the rack mounting template.
h. Remove the rack-mounting template from the rear of the rack. The rear of your rack should now
contain four self-adhesive dots that have been partially wrapped around the rack’s rear flanges.
i. Continue to step 5.
Appendix D. Setting Up the System 487
The following illustration shows one EIA unit and four EIA units. Depending on the rack manufacturer,
the EIA units may be separated either by color or by a line. Note the holes along the rail are not evenly
spaced. If your rack has no color or line separation between EIA units, assume that each EIA unit
begins where the hole spacing is closest together.
1 Self-Adhesive Dot Placement 3 Nut Clip
2 EIA Unit Hole Spacing
5. If you do not have a rack-mounting template, do the following:
a. Determine where in the rack to place the system. The system you are about to install measures 4
EIA units high. Make note of the EIA location number.
Note: An EIA unit on your rack consists of a grouping of three holes.
b. Facing the front of the rack and working from the right side, place a supplied self-adhesive dot next
to the top hole of the bottom EIA unit.
Note: The self-adhesive dots are used to aid in identifying locations on the rack. If you no longer
have any of the dots, use some other form of marking tool to aid you in identifying hole
locations (for example, tape, a marker, or pencil).
c. Place another self-adhesive dot next to the top hole of the above EIA unit.
Note: If you are counting the holes, begin with the hole identified by the first dot and count up four
holes. Place the second dot next to the fourth hole.
d. Secure a nut clip to the rack, four holes up from the top dot (bottom hole of the top EIA unit). The
nut clip aids in securing your system drawer to the rack while in transit.
Notes:
1) If you are counting the holes, begin with the hole identified by the top dot and count up four
holes. Place the nut clip next to the fourth hole.
2) Whenever a populated rack is being moved, secure the system drawers with two retaining
thumbscrews threaded through the nut clips. This action secures the system front bezel and
system chassis to the rack.
3) When counting from the nut clip to the bottom dot, there is an 8-hole span.
e. Repeat substeps a through d to the corresponding holes located on the left side of the rack.
488 Service Guide
6. Before installing the system rail assemblies into your rack, you must first determine the type of system
rail assembly shipped with your 286/C4. To determine the system rail assembly type, do the following:
a. Unpack the system rail assemblies.
b. Fully extend the inner and middle rail sections of each rail assembly.
c. Does the middle rail section of each system rail assembly have a retaining screw clearance hole
drilled through it as shown in the following illustration? If so, go to “System Rail Assembly (Type 1)”
on page 490.
1 System Rail Assembly (Type 1) 3 Middle Rail Section
2 Rail to System Retaining Screw
Clearance Hole
d. If your system rail assemblies do not have a clearance hole drilled through the middle rail section,
as shown in the following illustration, go to “System Rail Assembly (Type 2)” on page 493.
1 System Rail Assembly (Type 2) 3 Middle Rail Section
2 Outer Rail Section 4 Inner Rail Section
Appendix D. Setting Up the System 489
System Rail Assembly (Type 1)
Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read,
and then perform each procedure step in the order given. Failure to perform each step in the order given
may cause rail failure.
To install the system rail assembly (type 1), do the following:
1. Select the right rail assembly.
1 Right Rail Assembly (Type 1) 4 Middle Rail Section
2 Outer Rail Section 5 Inner Rail Section
3 Rail to System Retaining Screw
Clearance Hole
Note: The system rails are front-to-back and left-to-right side dependent when you are standing in
the front of and facing the rack. The rails are labeled on the front lower corner, left or right.
The back of the rails can be identified by the two large alignment pins and the rail-length
adjusting plate. The top of the rails can be identified by the screws located on the top of the
front and rear rail flange.
2. Position yourself at the front of the rack.
3. Facing the rack, insert the rail’s rear-alignment pins into the rack’s rear flange holes identified by the
self-adhesive placement dots previously installed.
4. Insert the rail’s front-alignment pins into the rack front flange holes identified by the self-adhesive
placement dots previously installed. Ensure that the EIA location is the same from front to back and
the rails are level from front to back.
5. Loosely thread one of the rail-retaining screws into the rail’s bottom screw hole. The screw hole is
located one position below the bottom alignment pin.
6. Locate one of the latch brackets, and place the bottom slot of the latch bracket over the rail-retaining
screw.
490 Service Guide
1 Front Slide Rail Alignment Pin
2 Retaining Screws
3 Latch Bracket
7. Finger-tighten a second retaining screw through the top screw hole of the latch bracket. After the
latch bracket is in place, ensure that both screws are finger-tight.
Attention: Do not tighten any of the screws more than finger-tight until instructed to do so.
Tightening the screws prevents the rails from self-aligning when the system drawer is attached.
8. Go to the back of the rack.
9. Loosen, but do not remove, the six retaining screws that secure the rail-length adjusting plate to the
rail.
Note: The rail-length adjusting plate should not move freely when loosened.
1 Rail-Length Adjusting Plate
2 Retaining Screws for Rail-Length Adjusting Plate
10. Facing the back of the rack, ensure that the right rail’s two alignment pins are positioned in the
correct EIA unit’s holes. Use two rack-retaining screws to attach the rail brackets to the rack flanges.
Finger-tighten both screws.
Appendix D. Setting Up the System 491
11. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate.
1 Rail Assembly (Labeled ²Left² in the Front Lower Corner)
2 Rack to Cable Management Arm Bracket
3 Top Rear Alignment Pin
4 Bottom Screw
5 Top Screw
12. Locate the bracket that attaches the cable management arm to the rack flange.
13. Repeat substeps 1 through 11 when installing the left rail.
14. Position the bracket’s bottom hole over the rail’s top alignment pin.
Note: The bracket’s bottom slot should now align with the top screw hole of the rail flange.
15. Attach a second retaining screw through the screw hole located below the top alignment pin.
Finger-tighten the screw.
16. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate.
17. After both rails have been installed, ensure that none of the rail’s front retaining screws are more than
finger-tight. The rails must be level from front to back and from left to right.
18. The rails are now completely installed. All screws used to attach the rails to the rack must be
finger-tight only.
Notes:
a. Both slide rails must be completely retracted into the rack.
b. The rails must be level from front to back and from left to right.
19. Read and understand the following caution notices, then go to “Installing the 286/C4 into the Rack
Enclosure” on page 497.
CAUTION:
The stabilizer must be firmly attached to the bottom rear of the rack to prevent the rack from
turning over when the drawers are pulled out of the rack. Do not pull out or install any drawer
or feature if the stabilizer is not attached to the rack.
C17
CAUTION:
This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are
required to safely move it. Using less than three persons to move it can result in injury.
C05
492 Service Guide
System Rail Assembly (Type 2)
Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read,
and then perform each procedure step in the order given. Failure to perform each step in the order given
may cause rail failure.
To install the system rail assembly (type 2), do the following:
1. Beginning with the right slide rail, remove the right inner rail, as shown in the following illustration.
1 Right Outer Rail Assembly
2 Right Inner Rail
2. Position yourself at the front of the rack.
3. Facing the rack, note the EIA number of the previously installed self-adhesive dots. Insert the rail’s
rear-alignment pins into the corresponding EIA unit holes located on the rear flange.
4. Insert the rail’s front-alignment pins into the rack front flange holes that have been identified by the
self-adhesive placement dots. Ensure that the EIA number is the same from front to back.
5. Loosely thread one of the rail-retaining screws into the rail’s bottom screw hole. The screw hole is
located one position below the bottom alignment pin.
6. Locate one of the latch brackets, and place the bottom slot of the latch bracket over the rail-retaining
screw. Finger-tighten the screw.
1 Front Slide Rail Alignment Pin
2 Retaining Screws
3 Latch Bracket
Appendix D. Setting Up the System 493
7. Finger-tighten a second retaining screw through the top screw hole of the latch bracket. After the
latch bracket is in place, ensure that both screws are finger-tight.
Attention: Do not tighten any of the screws more than finger-tight until instructed to do so.
Tightening the screws prevents the rails from self-aligning when the system drawer is attached.
8. Go to the back of the rack.
9. Loosen, but do not remove, the six retaining screws that secure the rail-length adjusting plate to the
rail.
Note: The rail-length adjusting plate should not move freely when loosened.
1 Rail-Length Adjusting Plate
2 Retaining Screws for Rail-Length Adjusting Plate
10. Facing the back of the rack, ensure that the right rail’s two alignment pins are positioned in the
correct EIA unit’s holes. Use two rack-retaining screws to attach the rail brackets to the rack flanges.
Finger-tighten both screws.
11. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate.
12. Locate the bracket that attaches the cable management arm to the rack flange.
13. Repeat substeps 1 through 11 when installing the left rail.
1 Rail Assembly (Labeled ²Left² in the Front Lower Corner)
2 Rack to Cable Management Arm Bracket
3 Top Rear Alignment Pin
4 Bottom Screw
5 Top Screw
14. Position the bracket’s bottom hole over the rail’s top alignment pin.
Note: The bracket’s bottom slot should now align with the top screw hole of the rail flange.
494 Service Guide
15. Attach a second retaining screw through the screw hole located below the top alignment pin.
Finger-tighten the screw.
16. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate.
17. After both rails have been installed, ensure that none of the rail’s front retaining screws are more than
finger-tight. The rails must be level from front to back and from left to right.
18. The rails are now completely installed. All screws used to attach the rails to the rack must be
finger-tight only.
Notes:
a. Both slide rails must be completely retracted into the rack.
b. The rails must be level from front to back and from left to right.
19. Read and understand the following caution notices, then go to “Installing the 286/C4 into the Rack
Enclosure” on page 497.
CAUTION:
The stabilizer must be firmly attached to the bottom rear of the rack to prevent the rack from
turning over when the drawers are pulled out of the rack. Do not pull out or install any drawer
or feature if the stabilizer is not attached to the rack.
C17
CAUTION:
This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are
required to safely move it. Using less than three persons to move it can result in injury.
C05
Appendix D. Setting Up the System 495
286/C4 Removal and Installation into a Rack
Before performing these procedures, read “Safety Notices” on page xi.
System Drawer Removal
CAUTION:
This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are
required to safely move it. Using fewer than three persons to move it can result in injury.
CAUTION:
v Removing the system drawer requires three people.
v Do not place more than one drawer in the service position at one time.
v Ensure that the system cables are not tangled and will not bind when placing the system drawer
into the service or operating positions.
To remove the system drawer from the rack, do the following:
1. If you have not already done so, shut down the system as described in “Stopping the System without
an HMC Attached” on page 359.
2. Open the front and rear rack doors.
3. Disconnect all cables from the rear of the system drawer.
4. Disconnect the cable-management arm by pulling out the two captive snap buttons from the rear of
the system drawer.
5. From the front of the system drawer, lift the lower section of the system drawer release latches, which
are located on each side of the system drawer. The system drawer release latch clicks as it is put
into the release position.
6. Grasp the top section of both latches, and slowly pull the system drawer out into the service position.
7. Depress both safety release tabs located on each inner rail, and pull the system drawer out about 2
inches.
8. Put the system drawer into the service position.
9. Using three persons, grasp the system drawer handles located on each side of the system.
10. Depress the safety latches located on each rail.
11. Remove the system by pulling it completely out and away from the rack and outer rack rails.
12. Push the extended outer rails carefully back into the rack.
Note: There is a gravity-activated locking tab at the back of each extended outer rail. The locking tab
locks the outer rails into their fully extended position and is unlocked by the action of the inner
rail being pushed back into the rack. To compensate for the separation of the inner rail from
the outer rail, center the lock tab with your hand, then push the outer rail back into its retracted
position.
13. Close the front and rear rack doors.
496 Service Guide
Installing the 286/C4 into the Rack Enclosure
Attention: When installing this system drawer into a rack, ensure that a hazardous condition is not
created due to uneven mechanical loading. If your rack uses a stabilizer bar, it must be firmly attached
before installing or removing this system.
Choose one of the following system installation procedures. The installation procedure you choose
depends on the type of system rail assembly that was installed into your rack.
v If your system rail assemblies are type 1, go to “Install the 286/C4 onto a System Rail Assembly (Type
1)”.
v If your system rail assemblies are type 2, go to “Install the 286/C4 onto a System Rail Assembly (Type
2)” on page 499.
Install the 286/C4 onto a System Rail Assembly (Type 1)
Attention: When installing this unit in a rack, ensure that a hazardous condition is not created due to
uneven mechanical loading. If your rack uses a stabilizer bar, it must be firmly attached before installing or
removing this system.
To install the 286/C4 onto a system rail assembly (type 1), do the following:
1. Ensure both rails are in the retracted position in the rack.
2. Using three persons, grasp the two handles located on each side of the system drawer, and lift the
system drawer.
3. Place the system drawer on top of the retracted slide rails.
Note: Built-in ledges located on each side of the system prevent the system from falling past the
slide rails.
4. Push the system all the way into the rack.
5. Pull the system out of the rack until you can see the second screw hole located on the inner rail.
Notice that the inner rails followed the system drawer out of the rack.
6. Align the second screw hole, located on the inner rail, with the second threaded screw hole, located
on the side of the system drawer. Secure the system drawer to the inner rail with M3 screws.
7. Align the front screw holes of each rail with the threaded screw holes on the system drawer. Secure
each screw hole with M3 screws.
8. To secure the third and fourth M3 retaining screws to the system, do the following:
a. Pull the system drawer from the rack.
Attention: Do not pull the system drawer out past the safety latches.
b. Align the rail to system retaining-screw access hole located on the middle rail to the third position
retaining screw hole located on the inner rail.
c. Secure an M3 screw into each rail’s third position retaining screw hole.
d. Align the rail to system retaining-screw access hole located on the middle rail to the fourth
position retaining screw hole located on the inner rail.
e. Secure an M3 screw into each rail’s fourth position retaining screw hole.
Appendix D. Setting Up the System 497
1 Rack Enclosure 3 Safety Release Latch
2 System Retaining-Screw Access Hole 4 286/C4
9. Tighten all eight M3 screws, securing the inner rail to the system drawer side.
10. Push the system drawer about halfway back into the rack. Use a screwdriver or similar tool to tighten
the front four M7 16-mm screws that secure the system rail assemblies to the rack’s front flange.
11. Push the system drawer completely into the rack. Use a screwdriver or similar tool to tighten the rear
four M7 16-mm screws that secure the system rail assemblies to the rack’s rear flange.
Note: After the system rail assemblies are installed, do not extend each rail section past its safety
release latch. The safety release latches stop the rails from overextending and separating. This
action prevents the system drawer from being accidentally pulled out too far and dropped.
12. Tighten the two M3 screws that secure each rail-length adjusting plate.
13. Slowly slide the system drawer back and forth on the rails, watching for any signs of binding.
14. For additional stability needed when transporting the rack, fasten the system drawer to the rack
enclosure by inserting an M8 16-mm thumbscrew through the bezel and chassis bracket, and
screwing it to the rack flange with a screw clip nut.
1 286/C4 Front Bezel
2 M8 16-mm Thumbscrew
3 System Drawer Release Latch
498 Service Guide
Install the 286/C4 onto a System Rail Assembly (Type 2)
To install the 286/C4 onto a system rail assembly (type 2), do the following:
1. Using eight M3 rail to system retaining screws, four on each side, mount the inner rails to each side of
the system drawer. Tighten each of the screws with a screwdriver or similar tool.
1 Outer Rail Section 4 M3 Rail to System Retaining Screws
2 Middle Rail Section 5 Safety Latch
3 Inner Rail Section 6 286/C4
Note: The safety release latch, located on each inner rail, is mounted onto the side of the system
facing away from the system drawer. See the illustration on page 486.
2. Using three persons, grasp the two handles located on each side of the system drawer, and lift the
system drawer.
3. Insert the inner rails located on the system drawer into the rails mounted in the rack.
Attention: When matching the inner rails to the outer rails, ensure that both rail faceplates glide past
each other. Do not, under any circumstances, force the rails. If the system drawer does not glide freely
into the rack, completely remove the drawer from the outer rails. After the drawer is clear of the outer
rails, reposition the system, then reinsert the inner rails. Repeat this process until the system drawer
glides freely into the rack.
4. Simultaneously press the safety latches, located on each side of the system drawer, and push the
system into the rack.
Appendix D. Setting Up the System 499
1 Rack Enclosure 3 286/C4
2 Safety Release Latch 4
5. Slide the system drawer in and out of the rack two or three times. This action aligns the system drawer
to the rails. The system drawer should glide on the rails.
6. Push the system drawer back into the rack. Using a screwdriver or similar tool, tighten the rear four
retaining screws that secure the slide rails to the rack’s rear flange.
7. Slide the system drawer about halfway out of the rack.
8. Using a screwdriver or similar tool, tighten the front four retaining screws that secure the slide rails to
the rack’s front flange.
Note: After the system rails are installed, do not extend them past their safety latches. The safety
release latches stop the rails from overextending and separating. This action prevents the
system drawer from being accidentally pulled out too far and dropped.
Attention: If any binding is detected, loosen the eight rail-retaining screws (front and rear), and
repeat substeps 4 through 7.
9. For additional stability needed when transporting the rack, fasten the system drawer to the rack
enclosure by inserting an M8 16-mm thumbscrew through the bezel and chassis bracket, and screwing
it to the rack flange with a screw clip nut.
1 286/C4 Front Bezel
2 Thumbscrew
3 System Drawer Release Latch
500 Service Guide
Installing the Cable-Management Arm
To install the 286/C4 cable-management arm, do the following:
1. Attach the cable carrier support bracket to the back of the chassis with two screws.
2. Place the two captive snap buttons, located on the cable-management arm, into the unlocked (pulled
out) position.
3. Align and insert the two captive snap buttons into the holes located on the cable carrier support
bracket. Push in on the head of the snap buttons to lock them into position.
1 286/C4 4 Rack Flange Mounting Bracket
2 Cable Carrier Support Bracket 5 Cable Management Arm
3 Captive Snap Button (Qty. 2)
4. Open the cable-management arm so that the hinged mounting plate lies flat against the
flange-mounting bracket.
5. Align the holes, located on the cable-management arm mounting plate, with the threaded holes on the
rack-flange mounting bracket. Secure the bracket with two screws.
Note: To avoid any binding of the cable-management arm, ensure that the cable-management arm is
level.
Appendix D. Setting Up the System 501
502 Service Guide
Appendix E. Installing a Hardware Management Console
(HMC)
The Hardware Management Console (HMC) user interface provides the functions needed to create and
maintain a single or multiple-partitioned environment.
Position the HMC and Monitor
Position the HMC and monitor at or near their desired location.
v The HMC weighs between 34 kg (75 pounds) and 41 kg (90 pounds). Do not try to lift the HMC by
yourself.
v Monitors can weigh as much as 35 kg (77 pounds). Use caution when lifting or moving the monitor.
v Leave enough space around the HMC to safely and easily complete the setup procedures.
v Observe standard ergonomic guidelines while arranging your system unit so that you can work
comfortably and safely. For more information on arranging your workstation, visit the Healthy Computing
Web address at: http://www.pc.ibm.com/us/healthycomputing.
v Be sure to maintain at least 51 mm (2 inches) of space on the sides of the system unit and 152 mm (6
inches) at the rear of the system unit to allow the system unit to cool properly. The front of the system
requires a minimum of 76 mm (3 inches) of space. Blocking the air vents can cause overheating, which
might result in a malfunction or permanent damage to the system unit.
v Place the HMC in a location where all necessary power outlets and network connections can safely be
reached.
v Place the display in a stable and sturdy location.
503
Connect the Cables
Use the following steps to connect the cables to your HMC. Look for the small icons on the back of your
HMC that show where to attach the keyboard, mouse and display cables.
1. Attach the monitor cable to the monitor connector and tighten the screws.
2. If a label for the monitor was shipped with your system, attach the label to the bottom-right corner of
the monitor.
3. Attach the power cables to the monitor as shown in the following illustration. If the HMC has a voltage
switch, ensure that it is in the correct position for the supply voltage.
Attention: Do not plug the power cables into an electrical outlet at this point.
4. Connect the mouse and keyboard to their connectors, as shown in the following illustration.
504 Service Guide
5. Connect the HMC serial cable into the HMC1 connector located on the back of your system unit. For
two HMCs, connect the redundant HMC serial cable into the HMC2 connector located on the back of
your system unit. The following illustration shows the location of the serial ports on the back of the
HMC, as well as HMC1 and HMC2 connectors located on back of your system unit.
1 286/C4 or 286/E4
2 HMC1 Connector
3 HMC2 Connector
4 Hardware Management Console (HMC)
The cable part numbers used for connecting the 286/C4 and 286/E4 to the HMC serial cables are as
follows:
11P3955 6m cable 9Pos to 9Pos
11P3956 15m cable 9Pos to 9Pos
31L7196 15m cable 9Pos to 25Pos
Appendix E. Install a HMC 505
Connect the 8-Port Adapter Cables
If you are using any optional 8-port adapters, connect the cables to the appropriate connectors in slots 1
through 4.
Connect the External Modem
To connect the external HMC modem, do the following:
1. Connect the modem cable to the external HMC modem (1).
2. Connect the other end of the modem cable to serial port 2.
3. Connect the phone cable line port of the external modem (2).
4. Connect the other end of the phone cable to the analog jack on your wall.
1 External HMC Modem
2 External Modem Connector
506 Service Guide
Check the Microswitch Setting on the Modem
Check the microswitch settings on the modem. The settings should be set as follows:
Switch Position Function
1 Up Force DTR
2 Up Flow Control &E4
3 Down Result Codes Enabled
4 Up Modem Emulation Disabled
5 *Down Auto Answer Enabled
6 Up Maximum Throughput Enabled
7 Up RTS Normal Functions
8 Down Enable Command Mode
9 Down Remote Digital Loopback Test Enabled
10 Up Dial-Up Line Enabled
11 Down AT Responses Enabled (Extended
Responses Disabled)
12 *Down Asynchronous Operation
13 UP 28.8KB Line Speed
14 Up
15 Up CD and DSR Normal Functions
16 Up 2-Wire Leased Line Enabled
Note: * Only switches 5 and 12 are changed from the factory default settings.
Appendix E. Install a HMC 507
Connect the LAN Cable (If Applicable)
The LAN cable is recommended because an active partition reports errors to the HMC’s Service Focal
Point application through the LAN network. The HMC must be attached to the system’s LAN, and a
partition should have a LAN adapter assigned as a resource.
Connect the LAN to the adapters assigned to the partition and to the LAN connection on the HMC system
hardware.
Plug in the HMC Power Cable
1. Plug in the power cable as shown in the following illustration.
2. Plug in the power cables for the monitor, HMC, and modem into electrical outlets.
Configure the Network
To configure the network, do the following:
v Log in to the (HMC) using the user ID hscroot and password abc123.
Note: User IDs and passwords are case-sensitive. Enter the user ID and password exactly as shown.
v Select the System Configuration Application in the Navigation area.
v Select Customize Network Settings in the Contents area.
v Type the appropriate network information in the IP Address, Name Services, and Routing tabs on the
Network Configuration window.
v Select OK.
For more information on configuring the network, refer to the Hardware Management Console
Maintenance Guide, order number SA23-1251.
Configure Inventory Scout Services
Inventory Scout Services is an AIX tool that surveys managed systems for hardware and software
information.
To set up Inventory Scout Services for each system managed by the HMC, and for the partition running an
instance of AIX, the partition must be configured in order to conduct microcode surveys or collect Vital
Product Data (VPD).
To set up Inventory Scout Services for each managed system and partition, you must be a member of one
of the following roles:
v System Administrator
v Advanced Operator
v Service Representative
508 Service Guide
Refer to the Hardware Management Console Maintenance Guide, order number SA23-1251 for more
information on the previous roles.
Note:
v The Inventory Scout listening port is set by starting the invscoutd daemon on the AIX images.
This port will default to 808.
v If a system has been powered on using the Full System Partition power-on option, you must
configure the Full System Partition to use Inventory Scout Services.
To set up Inventory Scout Services, do the following:
1. In the Navigation area, double-click the Inventory Scout Services icon.
2. In the Contents area, select Inventory Scout Profile Configuration.
3. From the list, select a managed system.
4. Select Next.
5. From the list, select the partition you want to configure.
6. Select Next.
7. The next window identifies the selected partition. Type the following:
v Partition password (Use the Inventory Scout user ID password)
v Inventory Scout listening port
v IP address of the AIX partition
Click Finish if you have completed configuration of all partitions.
OR
Click Back to continue to configure additional partitions until all are configured.
Collect Vital Product Data Information
Use this task to collect the Vital Product Data (VPD) for the specified managed system into a file.
To collect the managed system’s VPD, you must be a member of one of the following roles:
v System Administrator
v Advanced Operator
v Service Representative
To collect the managed systems VPD, do the following:
1. In the Navigation area, double-click the Inventory Scout Services icon.
2. In the Contents area, select Collect VPD Information.
3. From the list, select the name of the managed system for which you want to collect the Vital Product
Data.
4. Select Next.
5. The wizard requests confirmation about the managed system, and then prompts you to insert a blank,
DOS-formatted diskette into the HMC diskette drive.
6. Click Finish. This file is then copied to the diskette in the specified drive.
To conduct a microcode survey, refer to the Hardware Management Console Maintenance Guide, order
number SA23-1251.
Appendix E. Install a HMC 509
Configure the Service Agent
Note: You must configure the network before you configure the Service Agent.
To start the Service Agent, click on the Service Agent icon on the HMC virtual terminal window.
If you are configuring Service Agent for the first time, select Start Service Agent Processes to start the
service agent processes. Select Service Agent UI. You are prompted to type specific information or
parameters pertinent to Service Agent.
To configure the Service Agent, do the following:
1. Read and accept the License Agreement.
2. For the initial password prompt, type:
password
The word password is the default password that has been supplied.
Note: User IDs and passwords are case-sensitive. Enter the user ID and password exactly as
shown.
3. Type the information as follows:
Customer Contact Name
The person that we can reach when an error is reported to us by the Service Agent.
Customer Phone Number
The phone number associated with the person whose name is placed in the Customer
Contact Name field.
Customer Email
The e-mail address of the person whose name is placed in the Customer Contact Name
field.
Queue Country/Region
The country or region in which the HMC is located. Click the drop-down list to expand the
country or region choices. Type the first letter of the desired country or region to move to the
first occurrence of a country or region whose first letter matches your choice.
Gateway Type
The machine type of the processor subsystem.
Gateway Serial Number
The serial number of the processor subsystem that you designate to be the machine to send
Service Agent information to us.
Gateway Model Number
The model number of the processor subsystem that you designate to be the machine to send
Service Agent information to us.
4. Select Continue. The Service Agent interface displays.
5. Select Network. Fill in the appropriate information.
Note: Some of the fields are prefilled with information taken from the Service Agent initial
configuration panel.
6. Select OK.
7. Expand the contents of the Gateway category by clicking the key next to your Gateway machine.
8. Select Dialer.
9. Select Location. Select the location phone number closest to your physical location. Several of the
fields are then completed by Service Agent based on the location phone number you selected.
510 Service Guide
10. Select Modem. Select the modem that you will use for Service Agent communications. After your
selection, several of the remaining fields are filled in by Service Agent.
11. Select OK to save the Dialer configuration.
12. Expand the contents of the Administration category by clicking the key to its left.
13. Select Register.
14. Select your Gateway machine by clicking it.
15. Select Register. The ²Would you like to IGN now² window displays.
16. Select Yes. The Service Agent is now attempting to connect to the Service Agent server, using the
modem you have selected.
17. Select CallLog. Check the details pane to view the status of the just-attempted call. To determine the
success or failure of the connection attempt, examine the Description column.
For more information on the Service Agent, refer to the Hardware Management Console Maintenance
Guide, order number SA23-1251.
Appendix E. Install a HMC 511
512 Service Guide
Appendix F. Modem Configurations
Attention: This appendix applies only to modems attached to serial ports S1 and S2 located on the CEC
backplane (location U0.1-P1).
The service processor is designed to place little demand on an attached modem, thereby increasing the
setup and connection success rates.
Sample Modem Configuration Files
This appendix contains information about several sample modem configuration files that either work
directly with your modem or provide a good starting point for a custom setup.
The sample modem configuration files are located in your service processor firmware in the
/usr/share/modems subdirectory (if your system / server is running AIX) with the following names. A listing
of each specific file is included in this appendix.
Generic Modem Configuration Files
AIX File Name Service Processor Firmware File Name
modem_z.cfg modem_z_sp
modem_z0.cfg modem_z0_sp
modem_f.cfg modem_f_sp
modem_f0.cfg modem_f0_sp
modem_f1.cfg modem_f1_sp
Specific Modem Configuration Files
AIX File Name Service Processor Firmware File Name
modem_m0.cfg modem_m0_sp
modem_m1.cfg modem_m1_sp
Use the following selection procedures and your modem manual to determine which of the configuration
files is suitable for your use.
Configuration File Selection
Use the following steps to select a configuration file:
1. Is your modem an IBM 7852-400?
If Yes, use modem configuration file modem_m0.cfg and go to step 7 on page 514.
Note: The IBM 7852-400 modem has DIP switches on the right side of the unit. See “IBM 7852-400
DIP Switch Settings” on page 516 for the correct switch settings.
If No, continue with step 2 on page 514.
513
2. Is your modem an IBM 7857-017?
If Yes, use modem configuration file modem_m1.cfg and go to step 7.
Note: The IBM 7857-017 modem has two telephone line connections on the back of the unit. One is
marked LL (for Leased Line), and the other is marked PTSN (for Public Telephone Switched
Network). The service processor expects to use the modem on the public network, so the
telephone line should attach to the PTSN connector.
If No, continue with step 3.
3. Does your modem respond to the extended command set (prefixed with &)?
If Yes, go to step 5.
If No, continue with step 4.
4. Does your modem respond to:
v ATZ reset command
or
v ATZn reset commands, where n can be 0, 1, and so on?
If ATZ, configuration file modem_z.cfg is recommended.
If ATZn, configuration file modem_z0.cfg is recommended.
Go to step 7.
5. Does your modem command set include a test for V.42 error correction at the remote modem (often
called “Auto-Reliable Mode”)?
If Yes, disable this test. You can use sample configuration files /usr/share/modem_m0.cfg or
/usr/share/modem_m1.cfg as models to help you create a file for your particular modem. See
“Customizing the Modem Configuration Files” on page 515 for more information. Go to step 7.
If No, go to step 6.
6. Does your modem respond to:
v AT&F reset command
or
v AT&Fn reset commands, where n can be 0, 1, and so on?
If AT&F, configuration file modem_f.cfg is recommended.
If AT&Fn, configuration file modem_f0.cfg or modem_f1.cfg is recommended, depending on which
provides the hardware flow control profile.
7. You have completed selection of the configuration file.
If your modem configuration selection is not available in the Service Processor Modem Configuration
Menu, you must access it through the Configure Remote Maintenance Policy Service Aid.
If you find it necessary to adjust any of these configuration files, use the manual provided with your
modem to accomplish that task. It is recommended you select settings that enable hardware flow
control and respond to DTR.
Note: Some older modems do not respond to the X0 or &R1 commands. Edit out these commands
from the modem configuration file if yours is such a modem. See your modem manual for more
information.
Some modems, such as the IBM 7857-017, are not designed for the paging function. Although they can be
used for paging, they return an error message when they do not get the expected response from another
modem. Therefore, even though the paging was successful, the error message causes the service
processor to retry, continuing to place pager calls for the number of retries specified in the Call-Out Policy
Setup Menu. These retries result in redundant pages.
514 Service Guide
Examples for Using the Generic Sample Modem Configuration Files
The following table contains information to help you determine which modem configuration file to use with
various modems.
Modem Setup Z Setup Z0
(Rare)
Setup F Setup F0 Setup F1
AT&T DataPort 2001
(Ring interrupt only on first
ring)
X
Bocamodem 1440E X
Hayes Smart Modem 300 X
IBM 5841 X
IBM 5843 X
IBM 7851 X
IBM 7852-10 X
IBM 7855 X
USRobotics
36.6K Sportster
X
Zoom V.32 X
Customizing the Modem Configuration Files
You can create your own modem configuration files or modify the samples provided. After you customize
your modem configuration files, you must access them through the Configure Remote Maintenance Policy
Service Aid rather than from the service processor menus.
Note: If you have already set up your serial ports, line speeds, authorizations, and telephone numbers
from the service processor menus, use the service aid to specify your customized modem
configuration files.
If you have not already set up your serial ports, line speeds, authorizations, and telephone numbers
from the service processor menus, use the service aids to set them while you specify your
customized modem configuration files.
To disable Auto-Reliable Mode testing of the remote modem, use the sample modem configuration file
/usr/share/modems/modem_f.cfg as a model that you can modify, as follows:
1. Find the necessary command in your modem manual.
2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name (for example,
modem_fx.cfg).
3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send "ATcccE0T\r" where ccc is
the added command as specified in your modem manual, as follows:
Change the third line of each of the following stanzas:
v condout
v condin
v ripo
4. Save the changes.
Appendix F. Modem Configurations 515
IBM 7852-400 DIP Switch Settings
If you are using a 7852-400 modem to enable service processor communications, for proper operation, the
dual inline package (DIP) switches must be set according to the following table:
Switch Position Function
1 Up Force DTR
2 Up Flow Control &E4
3 Down Result Codes Enabled
4 Down Modem Emulation Disabled
5 Up Auto Answer Enabled
6 Up Maximum Throughput Enabled
7 Up RTS Normal Functions
8 Down Enable Command Mode
9 Down Remote Digital Loopback Test Enabled
10 Up Dial-Up Line Enabled
11 *Up AT Responses Enabled (Extended Responses Disabled)
12 *Down Asynchronous Operation
13 Up 28.8KB Line Speed
14 Up
15 Up CD and DSR Normal Functions
16 Up 2-Wire Leased Line Enabled
* Only switches 11 and 12 are changed from the factory default settings.
Xon/Xoff Modems
Some early modems assume software flow control (Xon/Xoff) between the computer and the modem.
Modems with this design send extra characters during and after the transmitted data. The service
processor cannot accept these extra characters. If your configuration includes such a modem, your
functional results may be unpredictable.
The sample modem configuration files included in this appendix do not support these modems, so custom
configuration files are necessary. Anchor Automation 2400E is an example of such a modem.
If you experience unexplainable performance problems that may be due to Xon/Xoff characters, it is
recommended that you upgrade your modem.
Ring Detection
Most modems produce an interrupt request each time they detect a ring signal. Some modems generate
an interrupt only on the first ring signal that they receive. AT&T DataPort 2001 is an example of such a
modem.
The service processor uses the ring interrupt request to count the number of rings when Ring Indicate
Power-On (RIPO) is enabled. If your modem produces an interrupt on only the first ring, set Ring Indicate
Power-On to start on the first ring. Otherwise, you can choose to start Ring Indicate Power-On on any ring
count.
516 Service Guide
Terminal Emulators
The service processor is compatible with simple ASCII terminals, and therefore compatible with most
emulators. When a remote session is handed off from the service processor to the operating system,
agreement between terminal emulators becomes important.
The server’s operating system will have some built-in terminal emulators. You may also have a
commercially available terminal emulation. It is important that the local and host computers select the
same or compatible terminal emulators so that the key assignments and responses match, ensuring
successful communications and control.
For best formatting, choose line wrap in your terminal emulator setup.
Recovery Procedures
Situations such as line noises and power surges can sometimes cause your modem to enter an undefined
state. When it is being used for dial-in, dial-out or ring indicate power-on, your modem is initialized each
time one of these actions is expected. If one of these environmental conditions occur after your modem
has been initialized, it might be necessary to recover your modem to a known state.
If your modem communicates correctly with remote users, it is probably in control. It may be wise to
occasionally change some of the functional settings and then change them back, just for the sense of
security that the modem is communicating, and to ensure it has been initialized recently.
If your system is particularly difficult to access physically, another strategy is to protect it with an
Uninterruptible Power Source (UPS) and a phone-line surge protector.
In case recovery becomes necessary, shut down your system using established procedures. Disconnect
the power cable and press the power button to drain capacitance while power is disconnected. Disconnect
and reconnect modem power, and then reconnect system power to completely reinitialize your system.
Transfer of a Modem Session
Because many modem command variations exist, the sample modem configuration files located at the end
of this appendix have been written to capture the largest number of workable modem settings.
The modem command &Dn (where n is a number) generally sets the modem response to the Data
Terminal Ready (DTR) signal from the server’s serial port. The desired response is that the modem will
hold a connection while DTR is enabled, and drop the connection when DTR is released. Using this
mechanism, the server hangs up on a connection under normal conditions.
Consult your modem manual for its specific response scheme for the &Dn command.
Two strategies are available for dealing with the modem’s response to DTR:
v Recovery
v Prevention
Before you use one of these strategies, determine if your server’s modem is set up correctly to respond to
DTR.
With the remote terminal connected to serial port 1 and defined as the Primary Console Device, there are
two tests you can perform:
1. Will the modem drop the connection after the System initialization complete message displays at
the remote terminal?
If Yes, the modem is set up correctly.
Appendix F. Modem Configurations 517
If No, try another &Dn setting for your server’s modem. See your modem manual for this information.
The &Dn command appears in three places each in three of the sample modem configuration files, as
follows:
2. Will the server’s modem disconnect when the power drops? You can make this observation at the
remote terminal by commanding your server to shut down and power off. (Use the AIX command
shutdown -F.) Watch for the message NO CARRIER on your remote terminal.
If Yes, this is the correct response. The modem is set up correctly.
If No, try another &Dn setting for your server’s modem. See your model manual for this information.
The &Dn command appears in three places each in three of the sample modem configuration files.
Only the following sample modem configuration files contain the &Dn command (in three places each):
v modem_f.cfg
v modem_f0.cfg
v modem_f1.cfg
If you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response. If your remote
terminal does not disconnect after logging off, you must command the remote terminal emulator to
hang up. This then breaks the connection.
Recovery Strategy
The recovery strategy consists of making two calls to establish a remote session. This solution is the
easiest to implement and allows more freedom for configuring your server’s serial ports.
To set up a remote terminal session, dial into the service processor and start the system. After the
operating system is loaded and initialized, the connection will be dropped. At this point, call the server
back and the operating system will answer and offer you the login prompt.
Prevention Strategy
The disconnect is caused by the operating system when it initializes the Primary Console. The tests listed
in “Transfer of a Modem Session” on page 517 are conducted with the remote terminal selected as the
primary console to manifest the modem’s response to DTR transitions.
v If a local ASCII terminal or a graphics console is to be a permanent part of your server, then make one
of them the primary console. Your remote terminal will no longer experience the connection loss.
v If a local console is not a permanent part of your server, you can still assign either the unused graphics
console or the unused serial port as the primary console. This gives you the desired seamless
connection at your remote terminal.
v If you choose to use the unused serial port as the primary console, some initialization traffic will be sent
to any serial device attached to that port. As a result, that serial device’s connection and function could
be affected. These impacts may make that port unattractive for devices other than a temporary local
ASCII terminal.
518 Service Guide
Modem Configuration Sample Files
Sample File modem_m0.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP: modem_m0
##
FUNCTIONS: Modem configuration file specifically for IBM 7852-400
# modem with Auto-Reliable feature. This feature must be turned off
# for Catcher calls. This example uses the AT&F reset command to
# choose the factory defaults.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
###
The modem has configuration switches. They should be set to the
# factory default settings, except switches 11 and 12. These must be
# to UP ("AT" responses) and DOWN (Asynchronous operation), respectively.
ICDelay 1
DefaultTO 10
CallDelay 120
##
%N Call-Out phone number %R Return phone number
###
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "AT&F&E2E0T\r" # Reset to factory defaults
# Reliable mode
# Echo off
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT&E12&E14\r" # Disable pacing
# Disable data compression
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
send "AT&SF1&S0S9=1\r" # DSR independent of CD
# Force DSR on.
# CD respond time=100ms
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
send "ATV0S0=0\r" # Numeric response code
# Auto-Answer off
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or
"19\r" or "13\r" or "12\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or
Appendix F. Modem Configurations 519
"19\r" or "13\r" or "12\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "AT&F&E2E0T\r" # Reset to factory defaults.
# Reliable mode
# Echo off
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT&E12&E14\r" # Disable pacing
# Disable data compression
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful
send "AT&SF1&S0S9=1\r" # DSR independent of CD.
# Force DSR on.
# CD respond time=100ms
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
send "ATV0S0=2\r" # Numberic response code
# Answer on 2nd ring
expect "0\r" timeout 2 # Confirm commands successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pickup 2nd ring or timeout
# Expect a connection response.
expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or
"19\r" or "13\r" or "12\r" or "1\r" busy "7\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
expect "0\r" timeout 60 # Confirm successful command.
delay 2 # Wait before hanging up.
send "ATH0\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "AT&F&E2E0T\r" # Reset to factory defaults.
# Reliable mode
# Echo off
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT&E12&E14\r" # Disable pacing
# Disable data compression
expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command.
send "AT&SF1&S0S9=1\r" # DSR independent of CD.
# Force DSR on.
# CD respond time=100ms
520 Service Guide
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
send "ATV0S0=0\r" # Numeric response code
# Auto Answer OFF
expect "0\r" timeout 2 # Confirm commands successful.
done #
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
Sample File modem_m1.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1
##
FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with
# Auto-Reliable feature. This feature must be turned off for Catcher calls.
# This example uses the AT&F reset command to choose the factory defaults.
##
To allow dial commands for digital pagers, it is necessary to reduce
# the number of characters in the dial command. Each comma (delay) has
# been set to 6 seconds (S8=6) for that reason.
###
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
#
ICDelay 1
DefaultTO 10
CallDelay 120
##
%N Call-Out phone number %R Return phone number
###
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "AT&F*E0E0\r" # Reset to factory defaults.
# *E0=data compression disabled
# E0=echo disabled
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT#F0*Q2S8=6\r" # Trellis modulation disabled
# Retrain with adaptive rate
# Set ,=6second
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful
send "ATV0X0S0=0\r" # Numeric response code
# AT compatible messages
# Auto-Answer disabled
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful.
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
expect "1\r" busy "7\r" timeout 60 # Expect a connection response.
done
Appendix F. Modem Configurations 521
retry: send "A/" # Repeat the previous command.
expect "1\r" busy "7\r" timeout 60 # Expect a connection response.
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "AT&F*E0E0\r" # Reset to factory defaults.
# *E0=data compression disabled
# E0=echo disabled
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT#F0*Q2\r" # Trellis modulation disabled
# Retrain with adaptive rate
expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful
send "ATV0X0S0=2\r" # Numeric response code
# AT compatible messages
# Answer on 2nd ring
expect "0\r" timeout 2 # Confirm commands successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "1\r" timeout 60
done
page: send "ATD%N,%R\r" # %N = pager call center number
# commas=6sec wait time to
# enter paging number.
# %R = return number
expect "0\r" or "3\r" timeout 30 # Confirm successful command.
delay 2 # Wait before hanging up.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "AT&F*E0E0\r" # Reset to factory defaults.
# *E0=data compression disabled
# E0=echo disabled
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "AT#F0*Q2\r" # Trellis modulation disabled
# Retrain with adaptive rate
expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command.
send "ATV0X0S0=0\r" # Numeric response code
# AT compatible messages
# Auto-Answer disabled
expect "0\r" timeout 2 # Confirm commands successful.
522 Service Guide
done #
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "4\r" or "3\r"
delay 2
done
Sample File modem_z.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z
##
FUNCTIONS: Modem configuration file for many early Hayes* compatible modems.
# This example uses the ATZ reset command to choose the factory defaults.
# This setup will work for many modems, but it is required for early vintage
# modems which respond to neither the ATZ0 reset command nor the extended (&)
# commands. Refer to your modem manual.
##
* Trademark of Hayes Microcomputer Products, Inc.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
###
If the modem has configuration switches, they should be set to the
# factory default settings.
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# Z Reset to factory defaults Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
##
%N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
##
Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
##
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "ATZQ0T\r" # Reset to factory defaults.
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
Appendix F. Modem Configurations 523
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "ATZQ0T\r" # Reset to factory defaults.
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=2\r" # Set AutoAnswer ON
expect "0\r" timeout 2 # Confirm command successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
# Confirm successful command.
expect "0\r" timeout 60
delay 2 # Wait before hanging up.
send "ATH0T\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "ATZQ0T\r" # Reset to factory defaults.
524 Service Guide
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
done # RI Power On enabled.
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
Sample File modem_z0.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z0
##
FUNCTIONS: Modem configuration file for some early Hayes* compatible modems.
# This example uses the ATZ0 reset command to choose the factory defaults.
# This setup is recommended for modems that will respond to the ATZ0 command
# and which do not respond to the extended (&) commands. Refer to your modem
# manual.
##
* Trademark of Hayes Microcomputer Products, Inc.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
###
If the modem has configuration switches, they should be set to the
# factory default settings.
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# Z0 Reset. Restore Profile 0 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
##
%N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
##
Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
##
PROGRAMMING NOTE: No blanks between double quote marks (").
Appendix F. Modem Configurations 525
condout: send "ATZ0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "ATZ0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=2\r" # Set AutoAnswer ON
expect "0\r" timeout 2 # Confirm command successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
526 Service Guide
# Confirm successful command.
expect "0\r" timeout 60
delay 2 # Wait before hanging up.
send "ATH0T\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "ATZ0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
done # RI Power On enabled.
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
Sample File modem_f.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F
##
FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems.
# This example uses the AT&F reset command to choose the factory defaults.
# This set up is preferred for modems with extended (&) commands. For early
# vintage modems, setup Z or Z0 is recommended. If your modem responds to
# the extended (&) commands and to factory default choices (&Fn), setup file
# F0 or F1 is recommended.
##
* Trademark of Hayes Microcomputer Products, Inc.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
###
If the modem has configuration switches, they should be set to the
# factory default settings.
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F Reset to default profile Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
Appendix F. Modem Configurations 527
# &C1 Detect CD &D2 Respond to DTR (often the default)
##
%N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
##
Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
##
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "AT&FQ0T\r" # Reset to factory defaults.
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2\r" # Detect carrier and DTR.
expect "0\r" timeout 2 # Confirm command successful.
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "AT&FQ0T\r" # Reset to factory defaults.
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=2\r" # Set AutoAnswer ON
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2\r" # Detect carrier and DTR.
expect "0\r" timeout 2 # Confirm command successful.
528 Service Guide
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
# Confirm successful command.
expect "0\r" timeout 60
delay 2 # Wait before hanging up.
send "ATH0T\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "AT&FQ0T\r" # Reset to factory defaults.
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2\r" # Detect carrier and DTR.
expect "0\r" timeout 2 # Confirm command successful.
done # RI Power On enabled.
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
Sample File modem_f0.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F0
##
FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems.
# This example uses the AT&F0 reset command to choose the factory defaults.
# This set up is preferred for modems with extended (&) commands. For early
# vintage modems, setup Z or Z0 is recommended. If your modem responds to
# the extended (&) commands and to factory default choices (&Fn), but doesn’t
# work properly with this setup file, setup F1 is recommended.
##
* Trademark of Hayes Microcomputer Products, Inc.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##US Government Users Restricted Rights -
Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
#
Appendix F. Modem Configurations 529
##
If the modem has configuration switches, they should be set to the
# factory default settings.
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F0 Reset. Restore profile 0 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
##
&C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
##
%N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
##
Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
##
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "AT&F0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
530 Service Guide
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "AT&F0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=2\r" # Set AutoAnswer ON
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
# Confirm successful command.
expect "0\r" timeout 60
delay 2 # Wait before hanging up.
send "ATH0T\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
ripo: send "AT&F0Q0T\r" # Reset modem. Select profile 0
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done # RI Power On enabled.
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
Appendix F. Modem Configurations 531
Sample File modem_f1.cfg
##
COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F1
##
FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems.
# This example uses the AT&F1 reset command to choose the factory defaults.
# This set up is for modems with extended (&) commands and which do not work
# properly with setup F0. For early vintage modems, setup Z or Z0 is
# recommended.
##
* Trademark of Hayes Microcomputer Products, Inc.
##
(C) COPYRIGHT International Business Machines Corp. 1996
# All Rights Reserved
# Licensed Materials - Property of IBM
##
US Government Users Restricted Rights - Use, duplication or
# disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
###
If the modem has configuration switches, they should be set to the
# factory default settings.
ICDelay 1
DefaultTO 10
CallDelay 120
# AT Attention Code , Inserts delay in dialing commands
# &F1 Reset. Restore profile 1 Q0 Turn on responses
# E0 Turn echo off Q1 Turn off responses
# V0 Use numeric responses S0=0 Automatic answer inhibit
# +++ Escape to command mode S0=2 Answer on second ring
# H0 Hang-up X0=0 Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
##
&C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
##
%N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
##
Following are common responses from a wide range of modems:
# 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK
##
PROGRAMMING NOTE: No blanks between double quote marks (").
condout: send "AT&F1Q0T\r" # Reset modem. Select profile 1
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done
532 Service Guide
connect: send "ATDT%N\r" # Tone dialing command.
# %N from Call Home setup.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
retry: send "A/" # Repeat the previous command.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r"
timeout 60
done
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATH0T\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "0\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATE0Q1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "0\r" timeout 1
done
condin: send "AT&F1Q0T\r" # Reset modem. Select profile 1
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=2\r" # Set AutoAnswer ON
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done
waitcall: ignore "2\r" timeout 1 # Ignore first ring.
expect "2\r" timeout 10 # Pick up second ring
# or timeout.
# Expect a connection response.
expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r"
timeout 60
done
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number
# Add enough commas to wait for
# time to enter paging number.
# %R = paging number
# Confirm successful command.
expect "0\r" timeout 60
delay 2 # Wait before hanging up.
send "ATH0T\r" # Hang up.
expect "0\r" timeout 2 # Confirm successful command.
done
Appendix F. Modem Configurations 533
ripo: send "AT&F1Q0T\r" # Reset modem. Select profile 1
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response.
send "ATE0T\r" # Initialize modem: Echo OFF,
expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric),
send "ATQ0V0X0T\r" # Limit response codes.
expect "0\r" timeout 2 # Confirm commands successful.
send "ATS0=0\r" # Set AutoAnswer OFF
expect "0\r" timeout 2 # Confirm command successful.
send "AT&C1&D2&R1\r" # Detect carrier and DTR,
# Ignore RTS.
expect "0\r" timeout 2 # Confirm command successful.
done # RI Power On enabled.
error: # Handle unexpected modem
# responses.
expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r"
delay 2
done
534 Service Guide
Index
A
access cover (rack) 366
replacement 366
acoustical
noise
emissions 472
adapters 379
removal 381, 385
replacement 383, 392
adding
FRUs 53
serviceable event comments 52
AIX codes 28
AIX location codes 24
attention LED 56
HMC 56
attention LEDs 465
automatic call home
overview 49
B
base system drawer will not power on and no error
codes are available 45
battery 432
disposal, recycling 471
bezel (286/C4) 377
removal 377
replacement 377
bezels (286/E4) 378
removal 378
replacement 378
boot list 304
boot mode menu 316
boot problems 173
C
cable management arm 501
cabling
HMC 504
LAN 508
power
HMC 508
call-in
testing 476
call-in/call-out setup menu 325
call-out
testing 476
call-out (call-home) 337
call-out policy setup menu 328
cannot power on the I/O drawer, power present LED
does not come on 45
checklist, inventory 481
checkpoints 46, 141
boot problems 173
firmware 150
checkpoints (continued)
service processor 143
clearances, service 481
code, error 179
common firmware error codes 286
component LEDs 46
configuration
client 303
network 508
NIM server 302
configure
memory 335
processor 335
configuring
service agent 510
console mirroring 338
enable/disable 310
console strategy 7
covers 366
access cover (rack) 366
removal 371
replacement 372
customer account setup menu 329
D
deconfigure
memory 335
processor 335
deconfiguring
disk drives 405
description, ipl flow 141
devices 510
attached device setup 510
setup 510
diagnostics
considerations 297
loading 302
online 298
concurrent mode 299, 300
maintenance mode 300
running
service mode 299
running online 298
service mode 298
standalone 301, 302
considerations 301
NIM server 302
partition considerations 301
using 297
diagnostics overview 45
diagram
system logic flow 21
disk drive backplane 415
disk drives
deconfiguring 405
documentation
AIX 300
535
E
electrical safety xi
laser compliance statement xiii
rack 484
emissions
noise 472
end of call MAP for systems with service focal point 45
entry MAP 45, 57
error code to FRU index 179
error codes
common firmware 286
firmware 192
operator panel 182
problem determination generated 295
service processor 211
SPCN 183
system firmware 286
error logs
service processor 339
extended error data
managing 52
F
fan assemblies 395
removal 397
replacement 399
fan locations 14
fan removal, hot-swap 397
fan removal, non-hot-swap 395
fan replace, hot-swap 399
fan replace, non-hot-swap 396
features, system 2
architecture, bus 2
keyboard 4
media drives 3
memory 3
mouse 4
operator panel 5
ports, input/output 5
ports, serial 5
power supply 4
processors 3
security 6
firmware 334
error codes 192
level 334
update 334
firmware checkpoints 150
firmware updates
system 332
flow, IPL 141
front door
removal 376
replacement 376
FRU
index 179
memory DIMM present bits 285
updating information 52
FRU isolation 47
FRU LEDs
activating 53
deactivating 53
FRUs
adding 53
replacing 53
G
general user menu 306
H
handling static–sensitive devices 359
hardware management console 6
hardware service functions 53
activating and deactivating FRU LEDs 53
highlighting xvii
HMC 6, 508
attention LED 56
cabling 504, 505
external
modem 506
installation 503
keyboard
connection 504
monitor
connection 504
mouse
connection 504
power cords 508
resetting the system attention LED 56
HMC (hardware management console)
system error interrogation 54
HMC attached system error interrogation 54
HMC system
resetting attention LED 56
hot-plug disk drives 405
removal 406
replacement 408
hot-plug management functions 379
hot-plug manager 379
hot-plug manager menu 380
hot-pluggable PCI adapter installation overview 379
I
I/O problem isolation 45
index, error code to FRU 179
indicators, POST 7
information, reference 1
install rack 497
installation
HMC 503
instructions, multiple FRU 26
instructions, rack safety 484
inventory 481
inventory scout 508
IPL
flow 141
phases 141
536 Service Guide
IPL flow without an HMC 142
isolation, FRU 47
J
JTAG problem isolation 45
K
keyboards 463
keys
numeric 1 key 8
L
LAN
cable 508
language selection menu 324
laser compliance statement xiii
laser safety information xiii
LCD progress indicator log 340
LEDs 465
attention LEDs 465
component 46, 466
fault LEDs 465
indicator menu 322
operator panel display LEDs 465
resetting 469
location codes 23
AIX 24
format 23
physical 23
locations
fans 14
memory DIMMs 18
power supply 12
system board 15, 16
system unit 9
system unit front view 10
system unit rear view 9
logic diagram, GP 21
logic diagram, GQ 22
M
maintenance analysis procedures 45
maintenance analysis procedures (MAPs) 57
minimum configuration Map 111
problem determination MAP 64
managing
extended error data 52
MAP 1520, power 89
MAP 1521, power 91
MAP 1548, minimum configuration 123
MAP rules 57
quick entry MAP 58
media drives 401
removal 401
replacement 404
memory
configure 335
deconfigure 335
module present bits 285
memory configuration/deconfiguration menu 320
memory DIMMs 427
memory DIMMs location 18
memory problem resolution 45
menu
boot mode 316
call-in/call-out 325
call-out policy setup 328
customer account setup 329
general
user 306
language selection 324
LED indicator 322
main 307
memory configuration/deconfiguration 320
modem configuration 326
OS surveillance
setup 310
power control network utilities 321
privileged
user 307
processor configuration/deconfiguration 318
reboot/restart policy setup 315
ring indicate 314
serial port selection 326
serial port speed setup 327
setup 309
system information 317
system power control 314
telephone setup 327
messages, service processor checkpoints 143
minimum configuration MAP 111
modem
configuration file selection 513
configurations 513
external
connection 506
transfer 517
modem_f.cfg, sample file 527
modem_f0.cfg, sample file 529
modem_f1.cfg, sample file 532
modem_m0.cfg, sample file 519
modem_m1.cfg, sample file 521
modem_z.cfg, sample file 523
modem_z0.cfg, sample file 525
N
network
configuration 508
NIM server
configuration 302
client 303
standalone diagnostics 302
noise
emissions 472
noise emission 480
Index 537
non-HMC attached system error interrogation 55
O
online diagnostics
concurrent mode 299, 300
maintenance mode 300
modes of operation 298
running 298
service mode 298
operating considerations
online and standalone diagnostics 297
operational phases
service processor 341
operator panel 19, 419
removal 419
replacement 420
operator panel display 465
operator panel error codes 182
overview
hot-pluggable PCI adapter 379
service focal point 48
overview, diagnostics 45
P
parameters
service processor
service mode 330
partitions
considerations
standalone 301
parts
keyboard 463
passwords 309
general
change 310
privileged
change 310
phases, IPL 141
physical location codes 23, 28
POST indicators 7
POST keys 8
1 key 8
5 key 8
6 key 8
numeric 1 8
power and lightpath LEDs 46
power cables 43
power control network utilities menu 321
power cords 508
power MAP 45
power supply 439
removal 439
replacement 441
power supply locations 12
power-on
methods 330
Power-On Self-Test 7
primary console 518
privileged user menus 307
problem determination generated error codes 295
problem determination MAP 45, 64
problem isolation procedures 45
processor
configure 335
deconfigure 335
processor card 423
removal 423
replacement 426
processor configuration/deconfiguration menu 318
processor subsystem problem isolation 45
product disposal 471
Q
quick entry MAP 45, 58
quick entry MAP for systems with service focal
point 45
R
rack setup 479
cable management arm 501
inventory 481
rack hardware 485
rack install 497
rack installation 479
rack safety 484
rack, configure 481
reboot/restart policy setup menu 315
recycling 471
reference information 1
related publications xvii
removal 371
adapters 381, 385
bezel (286/C4) 377
bezel (286/E4) 378
caution and danger 357
front door 376
HMC installation 503
hot-plug disk drives 406
hot-swap fan assembly 397
operator panel 419
power supply 439
processor card 423
service access cover 371
static–sensitive devices 359
system board 443
removal and replacement procedures 357
access cover removal 286/E4 371
adapter removal, hot-plug 385
adapter removal, non-hot-plug 381
adapter, adding or replacing a non-hot-plug 383
adapter, hot-plug management functions 379
adapter, hot-plug manager 379
adapter, hot-plug manager menu 380
adapter, install hot-plug 392
adapter, replace hot-plug 388
adapters, PCI 379
back cover removal, 286/E4 372
back cover replacement, 286/E4 373
538 Service Guide
removal and replacement procedures (continued)
battery, replacing 432
bezel door removal, 286/E4 376
bezel door replacement, 286/E4 376
bezel door, 286/E4 376
bezel removal, 286/C4 377
bezel removal, 286/E4 378
bezel replacement, 286/C4 377
bezel replacement, 286/E4 378
bezels 377
bracket, release latch 363
CD-ROM drive 401
cover removal, disk drive access 373
cover replacement, disk drive access 373
covers 366
devices, handling static sensitive 359
disk drive drive 401
disk drive, configuring 405
disk drive, deconfiguring 405
disk drive, hot-plug removal 406
disk drive, hot-plug replacement 408
disk drives, hot-plug 405
fan removal, hot-swap 397
fan removal, non-hot-swap 395
fan replace, hot-swap 399
fan replace, non-hot-swap 396
fans 395
HMC adapter cabling 506
HMC installation 503
HMC modem connections 506
HMC network, configure 508
HMC power cabling 508
HMC to LAN cabling 508
HMC to modem switch settings 507
HMC, cabling 504
HMC, monitor position 503
inventory scout, configure 508
media drives 401
media removal 401
media replacement 404
memory DIMM removal 427
memory DIMM replacement 431
memory DIMMs 427
operating position, 286/C4 365
outer side cover removal, 286/E4 370
outer side cover replacement, 286/E4 370, 372
PCI adapters 379
power supplies 436
power supply, hot-swap removal 439
power supply, non-hot-swap removal 436
power supply, non-hot-swap replacement 438, 441
processor card 423
processor card removal 423
processor card replacing 426
processor card, adding 426
release latch 362
removal and installation procedure, 286/C4 496
removal, system drawer from rack 496
service access cover removal, 286/C4 366
service access cover replacement, 286/C4 366
service agent, configure 510
removal and replacement procedures (continued)
service position, 286/C4 364
starting the system with an HMC 361
starting the system without an HMC 360
stopping the system with an HMC 359
stopping the system with an HMC and Linux 360
stopping the system without an HMC 359
vital product data, collect 509
replacement 372
adapters 383, 392
bezel (286/C4) 377, 378
front door 376
hot-plug disk drives 408
hot-swap fan assembly 399
media drives 401, 404
operator panel 420
power supply 441
processor card 426
service access cover 372
system board 448
top cover (rack) 366
replacing
FRUs 53
resetting the system attention LED 56
HMC 56
restart recovery 331
RETAIN 329
retries 329
ring indicate power-on menu 314
S
safety notices xi
saving service processor settings 475
Self-Test, Power-On 7
serial port
selection menu 326
snoop setup menu 311
speed setup menu 327
serial port configurations 5
service agent
configuring 510
service clearance 481
service focal point
activating and deactivating FRU LEDs 53
adding FRUs 53
call home overview 49
closing a serviceable event 52
enabling surveillance notifications 50
hardware service functions 53
overview 48
replacing FRUs 53
saving and managing extended error data 52
serviceable events 51
setting up surveillance 50
settings 49
testing error reporting 49
updating FRU information 52
viewing and adding serviceable event comments 52
viewing error details 51
viewing service processor error details 52
Index 539
service focal point (continued)
viewing serviceable event details 51
viewing serviceable event partition information 53
viewing serviceable events 51
service inspection guide 44
service position 364
service processor 305
access
local 305
remote 306
backup settings 475
boot mode menu 316
call-in/call-out setup menu 325
call-out call-home 337
call-out policy setup menu 328
checklist 475
checkpoints 143
console mirroring 338
customer account setup menu 329
error codes 211
error logs 339
general user menu 306
language selection menu 324
main menu 307
memory configuration/deconfiguration menu 320
menus 305
modem configuration menu 326
monitoring - surveillance 336
operational phases 341
OS surveillance setup menu 310
parameters
service mode 330
passwords 309
power control network utilities menu 321
privileged user menus 307
procedures in service mode 330
processor configuration/deconfiguration menu 318
reboot/restart policy setup menu 315
reboot/restart recovery 331
ring indicate power-on menu 314
serial port selection menu 326
serial port snoop setup menu 311
serial port speed setup menu 327
settings
saving and restoring 306
setup 475
setup checklist 475
setup menu 309
system information menu 317
system power control menu 314
telephone setup menu 327
test 475
service processor checkpoints 143
serviceable event
viewing partition information 53
serviceable events
adding comments 52
selecting and viewing 51
updating FRU information 52
viewing comments 52
viewing details 51
serviceable events (continued)
viewing error details 51
viewing service processor error details 52
working with 51
setup menu 309
slow boot 301
SPCN error codes 183
spcn error codes table directed you here and the error
code is 1011 1B0x 45
specifications 42, 479
noise emission notes 480
specifications, power cables 43
standalone diagnostics 301, 302
NIM server 302, 303
partition considerations 301
stopping the system with an HMC 359
stopping the system with an HMC and Linux 360
stopping the system without an HMC 359
strategy, console 7
surveillance
monitoring 336
system
attention LED 46
HMC attached system error interrogation 54
non-HMC attached system error interrogation 55
resetting attention LED 56
system board 443
removal 443
replacement 448
system board locations 15, 16
system cables 37, 459
system firmware update messages 286
system firmware updates 332
system information menu 317
system logic flow diagram 21
system management services 345
boot devices, select 354
boot options, select 352
error log, view 347
exiting SMS 356
IPL, remote setup 348
IPL, setup 348
multiboot startup 356
password, change options 347
password, privileged-access 347
SCSI settings, change 351
select console 351
select language 346
settings, display current 355
settings, restore default 356
system power-on methods 330
system specifications 479
system VPD update 421
system,
stopping with an HMC 359
stopping with an HMC and Linux 360
stopping without an HMC 359
system, starting with an HMC 361
system, starting without an HMC 360
540 Service Guide
T
telephone setup menu 327
terminal type
identifying 298
testing the setup
call-in 476
call-out 476
trademarks xviii
transfer of a modem session 517
U
updating
FRU information 52
V
viewing
serviceable event comments 52
serviceable event details 51
serviceable event error details 51, 52
serviceable event partition information 53
vital product data (VPD) 509
VPD update procedure 421
W
web sites
firmware yupdates 334
microcode updates 334
Index 541
542 Service Guide
Vos remarques sur ce document / Technical publication remark form
Titre / Title : Bull ESCALA PL420T and PL420R Service Guide
Nº Reférence / Reference Nº : 86 A1 42EG 00 Daté / Dated : February 2003
ERREURS DETECTEES / ERRORS IN PUBLICATION
AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION
Vos remarques et suggestions seront examinées attentivement.
Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète.
Your comments will be promptly investigated by qualified technical personnel and action will be taken as required.
If you require a written reply, please furnish your complete mailing address below.
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
Remettez cet imprimé à un responsable BULL ou envoyez-le directement à :
Please give this technical publication remark form to your BULL representative or mail to:
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Technical Publications Ordering Form
Bon de Commande de Documents Techniques
To order additional publications, please fill up a copy of this form and send it via mail to:
Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à :
BULL CEDOC
ATTN / Mr. L. CHERUBIN
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
Phone / Téléphone : +33 (0) 2 41 73 63 96
FAX / Télécopie +33 (0) 2 41 73 60 19
E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr
Or visit our web sites at: / Ou visitez nos sites web à:
http://www.logistics.bull.net/cedoc
http://www–frec.bull.com http://www.bull.com
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
CEDOC Reference #
No Référence CEDOC
Qty
Qté
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
_ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ]
[ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente
NOM / NAME : Date :
SOCIETE / COMPANY :
ADRESSE / ADDRESS :
PHONE / TELEPHONE : FAX :
E–MAIL :
For Bull Subsidiaries / Pour les Filiales Bull :
Identification:
For Bull Affiliated Customers / Pour les Clients Affiliés Bull :
Customer Code / Code Client :
For Bull Internal Customers / Pour les Clients Internes Bull :
Budgetary Section / Section Budgétaire :
For Others / Pour les Autres :
Please ask your Bull representative. / Merci de demander à votre contact Bull.
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
86 A1 42EG 00
ORDER REFERENCE
PLACE BAR CODE IN LOWER
LEFT CORNER
Utiliser les marques de découpe pour obtenir les étiquettes.
Use the cut marks to get the labels.
86 A1 42EG 00
Service Guide
ESCALA
PL420T and
PL420R
86 A1 42EG 00
Service Guide
ESCALA
PL420T and
PL420R
86 A1 42EG 00
|